Cross-Language Information Retrieval with Incorrect Query Translations

Rajendra Prasath, Sudeshna Sarkar

Abstract


In this paper, we present a Cross Language Information Retrieval (CLIR) approach using corpus driven query suggestion. We have used corpus statistics to gather a clue on selecting the right query terms when the translation of a specific query is missing or incorrect. The derived set of queries are ranked to select the top ranked queries. These top ranked queries are further used to perform query formulation. Using the re-formulated weighted query, we perform cross language information retrieval. The results are compared with the results of CLIR system with Google translation of user queries and CLIR with the proposed query suggestion approach. We have English and Tamil corpus of FIRE 2012 dataset and analyzed the effects of the proposed approach. The experimental results show that the proposed approach performs well with the incorrect translation of the queries.

Keywords


Cross-language information retrieval; incorrect query translations; corpus-driven query suggestion; query representation; retrieval performance

Full Text: PDF

Refbacks

  • There are currently no refbacks.