Library and Information Service
A Query Expansion Method for Micro-blog
Received date: 2013-11-28
Revised date: 2013-12-18
Online published: 2014-01-05
This paper proposes a query expansion method for micro-blogs that combines local query expansion with pseudo-relevance feedback. Candidate words are divided into three levels: the topic-word layer, the document-word layer, and the word-word layer. For each level, a similarity calculation method and a weighting scheme are proposed. Finally, the method is evaluated experimentally. The results show that expansion words obtained by considering both topic-word weights and document-word weights better satisfy users.
Key words: micro-blog; query expansion; latent semantic analysis; TF-IDF; similarity
Tang Xiaobo, Fang Xiaoke. A Query Expansion Method for Micro-blog[J]. Library and Information Service, 2014, 58(01): 130-135. DOI: 10.13266/j.issn.0252-3116.2014.01.019