DOI: 10.1145/2072298.2071930
short-paper

Combining image and text features: a hybrid approach to mobile book spine recognition

Published: 28 November 2011

ABSTRACT

Local image features have been used successfully for large-scale object recognition, but they are not effective at recognizing book spines on bookshelves. This is because some book spines contain only text, which does not yield distinguishing image features. To overcome this issue, we develop a new approach that combines a text-based spine recognition pipeline with an image feature-based spine recognition pipeline. The text within the book spine image is recognized and used as keywords to search a book spine text database. The image features of the book spine image are used to search a book spine image database. The search results of the two pipelines are then combined to form the final result. We implement the proposed hybrid book recognition pipeline in a book inventory management system and conduct extensive experiments to evaluate its performance. The experimental results show that, whereas a text-based or an image feature-based system alone achieves a recall of only 72%, the proposed hybrid system achieves a recall of ~91%.
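The abstract does not say how the two result lists are combined, so the following is only an illustrative sketch of one common option, weighted score fusion. It is not the authors' method. The function names (`normalize`, `fuse`), the `text_weight` parameter, the book identifiers, and the example scores are all assumptions made for illustration.

```python
from typing import Dict, List, Tuple


def normalize(scores: Dict[str, float]) -> Dict[str, float]:
    """Scale scores so the best candidate gets 1.0 (empty input stays empty)."""
    top = max(scores.values(), default=0.0)
    if top <= 0.0:
        return {book: 0.0 for book in scores}
    return {book: score / top for book, score in scores.items()}


def fuse(
    text_scores: Dict[str, float],
    image_scores: Dict[str, float],
    text_weight: float = 0.5,  # hypothetical weight, not taken from the paper
) -> List[Tuple[str, float]]:
    """Merge two candidate lists by a weighted sum of normalized scores.

    A book missing from one pipeline contributes 0.0 for that pipeline,
    so a spine with text only can still be ranked by its text score.
    """
    text_n = normalize(text_scores)
    image_n = normalize(image_scores)
    combined = {
        book: text_weight * text_n.get(book, 0.0)
        + (1.0 - text_weight) * image_n.get(book, 0.0)
        for book in set(text_n) | set(image_n)
    }
    # Highest combined score first; ties broken by identifier for stable output.
    return sorted(combined.items(), key=lambda item: (-item[1], item[0]))


# Hypothetical scores: keyword-match scores from the text database and
# feature-match counts from the image database.
text_hits = {"book_a": 3.0, "book_b": 1.5}
image_hits = {"book_b": 40.0, "book_c": 10.0}
ranking = fuse(text_hits, image_hits)
```

In this made-up example, `book_b` ranks first because both pipelines return it, while `book_a` and `book_c` each rely on a single pipeline. A real system would need to choose the weight and the normalization from validation data.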


    • Published in

      MM '11: Proceedings of the 19th ACM international conference on Multimedia
      November 2011
      944 pages
      ISBN:9781450306164
      DOI:10.1145/2072298

      Copyright © 2011 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from permissions@acm.org.

      Publisher

      Association for Computing Machinery

      New York, NY, United States


      Acceptance Rates

      Overall Acceptance Rate: 995 of 4,171 submissions, 24%
