Design and Implementation of a Practical Query Translation Interface for Cross-language Information Retrieval

  • Gao Yingfan,
  • Xu Hongjiao
  • Institute of Scientific and Technical Information of China, Beijing 100038

Received date: 2013-08-18

Revised date: 2013-09-13

Online published: 2013-10-20

Abstract

Given the vast volume of multilingual information resources, existing information retrieval platforms face an obvious language barrier when providing document organization and management services. This paper presents a practical query translation interface for Cross-Language Information Retrieval (CLIR). The interface can be embedded easily in any information retrieval platform, extending the multilingual information processing capability of the existing system. The paper also proposes a hybrid corpus-based method for translation disambiguation: given an English-Chinese bilingual dictionary, parallel corpora are used to attach a translation probability to each dictionary entry. Forward maximum matching of phrases and a query classification method are also used. The test collection has been used successfully to test CLIR systems, and the experimental results demonstrate that the query translation and translation disambiguation methods proposed in the paper are feasible.
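The dictionary-based part of the approach described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the toy dictionary, its probability values, the `MAX_PHRASE_LEN` limit, and the function names `segment_forward_max` and `translate_query` are all assumptions made for illustration. In the paper the probabilities are derived from a parallel corpus, and the query classification step is omitted here.

```python
from typing import Dict, List, Tuple

# Hypothetical toy dictionary: English term -> [(Chinese translation, probability)].
# The probabilities are invented for illustration; the paper estimates them
# from a parallel corpus.
BILINGUAL_DICT: Dict[str, List[Tuple[str, float]]] = {
    "information retrieval": [("信息检索", 0.9), ("情报检索", 0.1)],
    "information": [("信息", 0.7), ("情报", 0.3)],
    "retrieval": [("检索", 0.8), ("取回", 0.2)],
    "cross language": [("跨语言", 1.0)],
    "language": [("语言", 1.0)],
}

MAX_PHRASE_LEN = 3  # assumed upper bound on phrase length, in tokens


def segment_forward_max(
    tokens: List[str],
    dictionary: Dict[str, List[Tuple[str, float]]],
    max_len: int = MAX_PHRASE_LEN,
) -> List[str]:
    """Scan left to right, always taking the longest phrase found in the dictionary."""
    units: List[str] = []
    i = 0
    while i < len(tokens):
        matched = False
        # Try the longest candidate first, then shrink the window.
        for n in range(min(max_len, len(tokens) - i), 0, -1):
            candidate = " ".join(tokens[i:i + n])
            if candidate in dictionary:
                units.append(candidate)
                i += n
                matched = True
                break
        if not matched:
            # Out-of-vocabulary token: keep it as its own unit.
            units.append(tokens[i])
            i += 1
    return units


def translate_query(
    query: str,
    dictionary: Dict[str, List[Tuple[str, float]]] = BILINGUAL_DICT,
) -> List[str]:
    """Translate each matched unit with its highest-probability dictionary entry."""
    translated: List[str] = []
    for unit in segment_forward_max(query.lower().split(), dictionary):
        candidates = dictionary.get(unit)
        if candidates:
            best, _ = max(candidates, key=lambda pair: pair[1])
            translated.append(best)
        else:
            translated.append(unit)  # leave unknown terms untranslated
    return translated
```

In this sketch, `translate_query("cross language information retrieval")` matches the two-word phrases before their component words, so "information retrieval" is translated as a unit instead of word by word. Picking the single most probable entry is the simplest possible disambiguation rule and stands in for the hybrid method the paper proposes.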

Cite this article

Gao Yingfan, Xu Hongjiao. Design and Implementation of a Practical Query Translation Interface for Cross-language Information Retrieval[J]. Library and Information Service, 2013, 57(20): 123-126. DOI: 10.7536/j.issn.0252-3116.2013.20.021

