Photo tourism

ACM Transactions on Graphics - Tập 25 Số 3 - Trang 835-846 - 2006
Noah Snavely1, Steven M. Seitz1, Richard Szeliski2
1University of Washington
2Microsoft Research

Tóm tắt

We present a system for interactively browsing and exploring large unstructured collections of photographs of a scene using a novel 3D interface. Our system consists of an image-based modeling front end that automatically computes the viewpoint of each photograph as well as a sparse 3D model of the scene and image to model correspondences. Our photo explorer uses image-based rendering techniques to smoothly transition between photographs, while also enabling full 3D navigation and exploration of the set of images and world geometry, along with auxiliary information such as overhead maps. Our system also makes it easy to construct photo tours of scenic or historic locations, and to annotate image details, which are automatically transferred to other relevant images. We demonstrate our system on several large personal photo collections as well as images gathered from Internet photo sharing sites.

Từ khóa


Tài liệu tham khảo

10.1109/MCG.2003.1242379

10.1145/641480.641511

10.1145/293347.293348

10.1109/3DIM.2005.81

10.1145/383259.383309

10.1145/166117.166153

10.1145/41958.41981

10.1145/957013.957093

10.1145/237170.237191

10.1023/B:VISI.0000029665.07652.61

Feiner , S. , MacIntyre , B. , Hollerer , T. , and Webster , A . 1997. A touring machine: Prototyping 3d mobile augmented reality systems for exploring the urban environment . In Proc. IEEE Int. Sym. on Wearable Computers, 74--81 . Feiner, S., MacIntyre, B., Hollerer, T., and Webster, A. 1997. A touring machine: Prototyping 3d mobile augmented reality systems for exploring the urban environment. In Proc. IEEE Int. Sym. on Wearable Computers, 74--81.

Fischler M. and Bolles R. 1987. Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography. Readings in computer vision: issues problems principles and paradigms 726--740. Fischler M. and Bolles R. 1987. Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography. Readings in computer vision: issues problems principles and paradigms 726--740.

10.1145/237170.237200

Grzeszczuk , R. 2002 . Course 44: Image-based modeling . In SIGGRAPH 2002. Grzeszczuk, R. 2002. Course 44: Image-based modeling. In SIGGRAPH 2002.

Hartley R. I. and Zisserman A. 2004. Multiple View Geometry. Cambridge University Press Cambridge UK. Hartley R. I. and Zisserman A. 2004. Multiple View Geometry. Cambridge University Press Cambridge UK.

Irani , M. , and Anandan , P. 1998 . Video indexing based on mosaic representation . IEEE Trans. on Pattern Analysis and Machine Intelligence 86 , 5, 905 -- 921 . Irani, M., and Anandan, P. 1998. Video indexing based on mosaic representation. IEEE Trans. on Pattern Analysis and Machine Intelligence 86, 5, 905--921.

Johansson , B. , and Cipolla , R . 2002. A system for automatic pose-estimation from a single image in a city scene . In Proc. IASTED Int. Conf. Signal Processing, Pattern Recognition and Applications. Johansson, B., and Cipolla, R. 2002. A system for automatic pose-estimation from a single image in a city scene. In Proc. IASTED Int. Conf. Signal Processing, Pattern Recognition and Applications.

10.1145/1076034.1076158

10.1145/237170.237199

10.1145/800250.807465

Lourakis , M. , and Argyros , A . 2004 . The de sign and implementation of a generic sparse bundle adjustment software package based on the levenberg-marquardt algorithm. Tech. Rep. 340, Inst. of Computer Science-FORTH, Heraklion, Crete, Greece. Available from www.ics.forth.gr/~lourakis/sba. Lourakis, M., and Argyros, A. 2004. The design and implementation of a generic sparse bundle adjustment software package based on the levenberg-marquardt algorithm. Tech. Rep. 340, Inst. of Computer Science-FORTH, Heraklion, Crete, Greece. Available from www.ics.forth.gr/~lourakis/sba.

10.1023/B:VISI.0000029664.99615.94

10.1145/1067170.1067172

10.1145/218380.218398

10.1007/s11263-005-3848-x

Naaman , M. , Paepcke , A. , and Garcia-Molina , H . 2003. From where to what: Metadata sharing for digital photographs with geographic coordinates . In Proc. Int. Conf. on Cooperative Information Systems, 196--217 . Naaman, M., Paepcke, A., and Garcia-Molina, H. 2003. From where to what: Metadata sharing for digital photographs with geographic coordinates. In Proc. Int. Conf. on Cooperative Information Systems, 196--217.

10.1145/996350.996366

Nocedal J. and Wright S. J. 1999. Numerical Optimization. Springer Series in Operations Research. Springer-Verlag New York NY. Nocedal J. and Wright S. J. 1999. Numerical Optimization. Springer Series in Operations Research. Springer-Verlag New York NY.

10.1023/B:VISI.0000025798.50602.3a

Robertson , D. P. , and Cipolla , R . 2002. Building architectural models from many views using map constraints . In Proc. European Conf. on Computer Vision , vol. II , 155--169. Robertson, D. P., and Cipolla, R. 2002. Building architectural models from many views using map constraints. In Proc. European Conf. on Computer Vision, vol. II, 155--169.

10.1145/642611.642682

10.1109/VISUAL.2004.50

Russell , B. C. , Torralba , A. , Murphy , K. P. , and Freeman , W. T . 2005 . Labelme: A database and web-based tool for image annotation. Tech. Rep. MIT-CSAIL-TR-2005-056 , Massachusetts Institute of Technology . Russell, B. C., Torralba, A., Murphy, K. P., and Freeman, W. T. 2005. Labelme: A database and web-based tool for image annotation. Tech. Rep. MIT-CSAIL-TR-2005-056, Massachusetts Institute of Technology.

Schaffalitzky , F. , and Zisserman , A . 2002. Multi-view matching for unordered image sets, or "How do I organize my holiday snaps? ". In Proc. European Conf. on Computer Vision , vol. 1 , 414--431. Schaffalitzky, F., and Zisserman, A. 2002. Multi-view matching for unordered image sets, or "How do I organize my holiday snaps?". In Proc. European Conf. on Computer Vision, vol. 1, 414--431.

Schmid , C. , and Zisserman , A . 1997. Automatic line matching across views . In Proc. IEEE Conf. on Computer Vision and Pattern Recognition, 666--671 . Schmid, C., and Zisserman, A. 1997. Automatic line matching across views. In Proc. IEEE Conf. on Computer Vision and Pattern Recognition, 666--671.

10.1145/237170.237196

Sivic , J. , and Zisserman , A . 2003. Video Google: A text retrieval approach to object matching in videos . In Proc. Int. Conf. on Computer Vision, 1470--1477 . Sivic, J., and Zisserman, A. 2003. Video Google: A text retrieval approach to object matching in videos. In Proc. Int. Conf. on Computer Vision, 1470--1477.

Steedly , D. , Essa , I. , and Delleart , F . 2003. Spectral partitioning for structure from motion . In Proc. Int. Conf. on Computer Vision, 996--103 . Steedly, D., Essa, I., and Delleart, F. 2003. Spectral partitioning for structure from motion. In Proc. Int. Conf. on Computer Vision, 996--103.

10.1023/A:1023035826052

10.1145/957013.957046

10.1145/985692.985733

10.1145/1186562.1015766