skip to main content
research-article
Free Access

Scene completion using millions of photographs

Published:01 October 2008Publication History
Skip Abstract Section

Abstract

What can you do with a million images? In this paper, we present a new image completion algorithm powered by a huge database of photographs gathered from the Web. The algorithm patches up holes in images by finding similar image regions in the database that are not only seamless, but also semantically valid. Our chief insight is that while the space of images is effectively infinite, the space of semantically differentiable scenes is actually not that large. For many image completion tasks, we are able to find similar scenes which contain image fragments that will convincingly complete the image. Our algorithm is entirely data driven, requiring no annotations or labeling by the user. Unlike existing image completion methods, our algorithm can generate a diverse set of image completions and we allow users to select among them. We demonstrate the superiority of our algorithm over existing image completion approaches.

References

  1. Agarwala, A., Dontcheva, M., Agrawala, M., Drucker, S., Colburn, A., Curless, B., Salesin, D., and Cohen, M. Interactive digital photomontage. ACM Trans. Graph., 23(3):294--302, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Agrawal, A., Raskar, R., and Chellappa, R. What is the range of surface reconstructions from a gradient field? European Conference on Computer Vision (ECCV), May 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Boykov, Y., Veksler, O., and Zabih, R. Fast approximate energy minimization via graph cuts. IEEE Trans. Pattern Anal. Mach. Intell., 23(11):1222--1239, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Criminisi, A., Perez, P., and Toyama, K. Object removal by exemplar-based inpainting. CVPR, 02:721, 2003.Google ScholarGoogle Scholar
  5. Diakopoulos, N., Essa, I., and Jain, R. Content based image synthesis. Conference on Image and Video Retrieval (CIVR), 2004.Google ScholarGoogle ScholarCross RefCross Ref
  6. Drori, I., Cohen-Or, D., and Yeshurun, H. Fragment-based image completion. ACM Trans. Graph., 22(3):303--312, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Efros, A. A. and Freeman, W. T. Image quilting for texture synthesis and transfer. Proceedings of SIGGRAPH 2001, pages 341--346, August 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Efros, A. A. and Leung, T. K. Texture synthesis by non-parametric sampling. ICCV, pages 1033--1038, Corfu, Greece, September 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Hays, J. and Efros, A. A. Scene completion using millions of photographs. ACM Trans. Graph. (SIGGRAPH 2007), 26(3): 4, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Hays, J. and Efros, A. A. im2gps: estimating geographic information from a single image. CVPR, 2008.Google ScholarGoogle ScholarCross RefCross Ref
  11. Jia, J., Sun, J., Tang, C.-K., and Shum H.-Y Drag-and-drop pasting. ACM Trans. Graph., 25(3): 631--637, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Johnson, M., Brostow, G. J., Shotton, J.: Arandjelovik, O., Kwatra, V., and Cipolla, R. Semantic photo synthesis. Comput. Graph. Forum (Proa Eurographics), 25(3):407--413, 2006.Google ScholarGoogle Scholar
  13. King, D. The Commissar Vanishes. Henry Holt and Company, London, 1997.Google ScholarGoogle Scholar
  14. Kwatra, V., Essa, I., Bobick, A., and Kwatra, N. Texture optimization for example-based synthesis. ACM Trans. Graph., 24: 795--802, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Kwatra, V., Schodl, A., Essa, I., Turk, G., and Bobick, A. Graphcut textures: Image and video synthesis using graph cuts. ACM Trans. Graph., 22(3):277--286,2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Oliva, A. and Torralba, A. Building the gist of a scene: The role of global image features in recognition. Visual perception. Progr. Brain Res., 155:23--36, 2006.Google ScholarGoogle ScholarCross RefCross Ref
  17. Perez, P., Gangnet, M., and Blake, A. Poisson image editing. ACM Trans. Graph., 22(3):313--318, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Russell, B. C., Torralba, A., Murphy K. P., and Freeman, W. T. LabelMe: A database and web-based tool for image annotation. Technical Report, MIT, 2005, 2005.Google ScholarGoogle Scholar
  19. Snavely, N., Seitz, S. M., and Szeliskr R. Photo tourism: exploring photo collections in 3d. ACM Trans. Graph., 25(3):835--846, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Sun, J., Yuan, L., Jia, J., and Shum H.-Y. Image completion with structure propagation. ACM Trans. Graph., 24(3):861--868, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Torralba, A., Fergus, R., and Freeman, W. T. Tiny images. Technical Report MIT-CSAIL-TR-2007-024, 2007Google ScholarGoogle Scholar
  22. Torralba, A., Murphy, K. P., Freeman, W. T., and Rubin, M. A. Context-based vision system for place and object recognition. ICCV, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Wertheimer, M. Laws of organization in perceptual forms (partial translation). In: W. Ellis, editor, A Sourcebook of Gestatt Psychology, pages 71--88. Harcourt Brace and Company, London, 1938.Google ScholarGoogle Scholar
  24. Wexler, Y., Shechtman, E., and Irani, M. Space-time video completion. CVPR. 01:120--127, 2004.Google ScholarGoogle Scholar
  25. Wilczkowiak, M., Brostow, G. J., Tordoff, B., and Cipolla, R. Hole filling through photomontage. BMVC, 492--501, July 2005.Google ScholarGoogle Scholar

Index Terms

  1. Scene completion using millions of photographs

              Recommendations

              Comments

              Login options

              Check if you have access through your login credentials or your institution to get full access on this article.

              Sign in

              Full Access

              • Published in

                cover image Communications of the ACM
                Communications of the ACM  Volume 51, Issue 10
                October 2008
                130 pages
                ISSN:0001-0782
                EISSN:1557-7317
                DOI:10.1145/1400181
                Issue’s Table of Contents

                Copyright © 2008 ACM

                Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

                Publisher

                Association for Computing Machinery

                New York, NY, United States

                Publication History

                • Published: 1 October 2008

                Permissions

                Request permissions about this article.

                Request Permissions

                Check for updates

                Qualifiers

                • research-article
                • Popular
                • Refereed

              PDF Format

              View or Download as a PDF file.

              PDF

              eReader

              View online with eReader.

              eReader

              HTML Format

              View this article in HTML Format .

              View HTML Format