skip to main content
10.1145/2484028.2484115acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
short-paper

Interpretation of coordinations, compound generation, and result fusion for query variants

Published:28 July 2013Publication History

ABSTRACT

We investigate interpreting coordinations (e.g. word sequences connected with coordinating conjunctions such as "and" and "or") as logical disjunctions of terms to generate a set of disjunctionfree query variants for information retrieval (IR) queries. In addition, so-called hyphen coordinations are resolved by generating full compound forms and rephrasing the original query, e.g. "rice im-and export" is transformed into "rice import and export". Query variants are then processed separately and retrieval results are merged using a standard data fusion technique. We evaluate the approach on German standard IR benchmarking data. The results show that: i) Our proposed approach to generate compounds from hyphen coordinations produces the correct results for all test topics. ii) Our proposed heuristics to identify coordinations and generate query variants based on shallow natural language processing (NLP) techniques is highly accurate on the topics and does not rely on parsing or part-of-speech tagging. iii) Using query variants to produce multiple retrieval results and merging the results decreases precision at top ranks. However, in combination with blind relevance feedback (BRF), this approach can show significant improvement over the standard BRF baseline using the original queries.

References

  1. E. Airio. Word normalization and decompounding in monoand bilingual IR. Inf. Retr., pages 249--271, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. M. Braschler and B. Ripplinger. How effective is stemming and decompounding for German text retrieval? Inf. Retr., 7(3-4):291--316, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. A. Chen and F. C. Gey. Multilingual information retrieval using machine translation, relevance feedback and decompounding. Inf. Retr., 7(1--2):149--182, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. W. B. Croft. Combining approaches to information retrieval. In Advances Information Retrieval: Recent Research from the CIIR, chapter 1, pages 1--36. Kluwer Academic, 2000.Google ScholarGoogle Scholar
  5. J. A. Fox and E. A. Shaw. Combination of multiple searches. In TREC-2, pages 243--252, Gaithersburg, MD, 1994. NISTGoogle ScholarGoogle Scholar
  6. S. Hartrumpf and J. Leveling. Recursive question decomposition for answering complex geographic questions. In CLEF 2009, volume 6241 of LNCS, pages 310--317. Springer, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. S. Huston and W. B. Croft. Evaluating verbose query processing techniques. In SIGIR 2010, pages 291--298. ACM, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. R. Jones, B. Rey, O. Madani, and W. Greiner. Generating query substitutions. In WWW'06, pages 387--396, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. M. Kluck. The domain-specific track in CLEF 2004: Overview of the results and remarks on the assessment process. In CLEF 2004, volume 3491 of LNCS, pages 260--270. Springer, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. P. Koehn and K. Knight. Empirical methods for compound splitting. In EACL '03, pages 187--193. ACL, 2003 Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. G. Neumann and J. Piskorski. A shallow text processing core engine. Computational Intelligence, 18(3):451--476, 2002.Google ScholarGoogle ScholarCross RefCross Ref
  12. S. E. Robertson, S. Walker, S. Jones, M. M. Hancock-Beaulieu, and M. Gatford. Okapi at TREC-3. In TREC-3, pages 109--126, Gaithersburg, MD, 1995. NIST.Google ScholarGoogle Scholar
  13. J. Savoy. Report on CLEF-2003 monolingual tracks: fusion of probabilistic models for effective monolingual retrieval. In CLEF 2003, volume 3237 of LNCS, pages 322--336. Springer, 2004.Google ScholarGoogle ScholarCross RefCross Ref
  14. X. Xue and W. B. Croft. Representing queries as distributions. In Query representation and understanding workshop at SIGIR 2010, pages 9--12, 2010.Google ScholarGoogle Scholar

Index Terms

  1. Interpretation of coordinations, compound generation, and result fusion for query variants

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          SIGIR '13: Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
          July 2013
          1188 pages
          ISBN:9781450320344
          DOI:10.1145/2484028

          Copyright © 2013 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 28 July 2013

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • short-paper

          Acceptance Rates

          SIGIR '13 Paper Acceptance Rate73of366submissions,20%Overall Acceptance Rate792of3,983submissions,20%

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader