Library and Information Service >
Detection and Analysis of the Topic Based on the Social Q & A Website:A Case Study of“the Elderly”on Zhihu Website
Received date: 2016-01-04
Revised date: 2016-01-28
Online published: 2016-03-05
[Purpose/significance] This paper aims at the limitations of current methods and proposing a topic detection and analysis method for the social Q & A website, which provides a reference for this website in topic detection and analysis.[Method/process] This method is based on the improved Chinese word segmentation techniques, using the Linear Weighted to determine the weight of words, combined with keyword extraction method for determining topic keywords to extract the topic focus and measure the distributions. According to the proposed method, based on data from Zhihu, we carry out the network topic detection and analysis of the topic "the elderly" from three angles:the focus of topic, topic distribution and hot subtopics.[Result/conclusion] This study has shown that this method is scientific and practical. It extends the analytical data source of social problems and provides a basis of decision-makingfor the "Actively Deal with the Population Ageing" activity.
Key words: the elderly; Zhihu website; topics; Chinese character segmentation; measurement
Huang Lucheng , Jiang Linshan , Miao Hong , Wu Feifei . Detection and Analysis of the Topic Based on the Social Q & A Website:A Case Study of“the Elderly”on Zhihu Website[J]. Library and Information Service, 2016 , 60(5) : 93 -100 . DOI: 10.13266/j.issn.0252-3116.2016.05.014
[1] 王晰巍, 邢云菲, 赵丹, 等. 基于社会网络分析的移动环境下网络舆情信息传播研究——以新浪微博"雾霾"话题为例[J]. 图书情报工作,2015, 59(7):14-22.
[2] 曹学艳, 段飞飞, 方宽, 等. 网络论坛视角下突发事件舆情的关键节点识别及分类研究[J].图书情报工作, 2014, 58(4):65-70.
[3] 张玉亮. 基于多案例事件的突发事件网络舆情信息流传导特征研究[J]. 现代情报, 2014, 34(12):44-49.
[4] 黄梦婷, 张鹏翼. 社会化问答社区的协作方式与效果研究:以知乎为例[J]. 图书情报工作, 2015,59(12):85-92.
[5] 孙晓宁, 赵宇翔, 朱庆华. 基于SQA系统的社会化搜索答案质量评价指标构建[J]. 中国图书馆学报, 2015, 41(4):65-82.
[6] 苏素. 从知识传播的角度看问答型社交网站"知乎"[J]. 传播与版权, 2013(1):28.
[7] 吴娱. 网络舆情分析关键技术研究与实现[D].成都:电子科技大学, 2011.
[8] WITTEN I H, PAYNTER G W, FRANK E, et al. KEA:practical automatic keyphrase extraction[C]//Proceedings of the fourth ACM conference on digital libraries.New York:ACM, 1999:254-255.
[9] 方俊, 郭雷, 王晓东. 基于语义的关键词提取算法[J]. 计算机科学, 2008, 35(6):148-151.
[10] SALTON G, YU C T. On the construction of effective vocabularies for information retrieval[C]//ACM SIGIR forum.New York:ACM, 1973, 9(3):48-60.
[11] 张伟男. 社区型问答中问句检索关键技术研究[D].哈尔滨:哈尔滨工业大学, 2014.
[12] 姜雯, 许鑫. 在线问答社区信息质量评价研究综述[J]. 现代图书情报技术, 2014(6):41-50.
[13] 刘宏明, 柴梓淇. 论垂直型问答网站的发展[J]. 计算机光盘软件与应用, 2012(21):84.
[14] 吴丹, 刘媛, 王少成. 中英文网络问答社区比较研究与评价实验[J]. 现代图书情报技术, 2011, 27(1):74-82.
[15] 孙晓宁, 赵宇翔, 朱庆华. 基于SQA系统的社会化搜索答案质量评价指标构建[J]. 中国图书馆学报, 2015, 41(4):65-82.
[16] 宋文丹. 社会化问答社区的社会网络分析[D].西南交通大学,2015.
[17] HAN W, XIRONG Q, SIQI S, et al. Ranking potential reply-providers in community question answering system[J].China communications, 2013,10(10):125-136.
[18] CHIANG C L, CHEN S Y, CHENG P J. Summarizing search results with community-based question answering[C]//Proceedings of the 2014 IEEE/WIC/ACM international joint conferences on Web intelligence(WI) and intelligent agent technologies(IAT)-volume 01.Washington DC:IEEE Computer Society, 2014:254-261.
[19] WANG R, LI S. Constructing a question corpus for textual semantic relations[C]//Istanbul:LREC, 2012:4092-4097.
[20] 余素华. 社会化问答社区的内容抽取研究[D].武汉:华中师范大学, 2014.
[21] 钱爱兵, 江岚. 基于改进TF-IDF的中文网页关键词抽取——以新闻网页为例[J]. 情报理论与实践, 2008,31(6):945-950.
[22] 苏祥坤, 吾守尔, 斯拉木. 基于词序统计组合的中文文本关键词提取技术[J]. 计算机工程与设计, 2015, 36(6):1647-1651.
[23] 张红鹰. 基于模糊处理的中文文本关键词提取算法[J]. 现代图书情报技术, 2009, 25(5):39-43.
[24] 夏火松, 甄化春. 大数据环境下舆情分析与决策支持研究文献综述[J]. 情报杂志, 2015, 34(2):1-6.
[25] 田久乐, 赵蔚. 基于同义词词林的词语相似度计算方法[J]. 吉林大学学报(信息科学版), 2010(6):602-608.
[26] 贾佳, 宋恩梅, 苏环. 社会化问答平台的答案质量评估[J]. 信息资源管理学报, 2013,3(2):19-28.
[27] 曹雨骋, 李浩鸣. 科普网站的社交功能对科技传播的影响研究——以知乎网为例[J]. 出版广角, 2015(6):79-81.
/
〈 | 〉 |