Research on Evolution of Knowledge Category Structure in Wikipedia
Received date: 2014-03-05
Revised date: 2014-03-20
Online published: 2014-04-05
The category pages of Wikipedia form a new kind of knowledge category structure: it is self-organized and built through the spontaneous, collaborative editing of users, and it has drawn the attention of many researchers. Based on the characteristics of the Wikipedia category structure, we treat it as a category structure of domain knowledge in which each category page is a domain knowledge node and the inclusion relationships between nodes are edges. Using self-avoiding random walks, we build a model that measures the diversity entropy of each node. We use it to identify the central and the frontier (peripheral) nodes of the domain knowledge structure and to study how nodes evolve over time. The results show that, in most cases, the higher a node's diversity entropy, the more likely it is to be a central node of the knowledge structure and the longer ago it was created. Nodes in the same community structure are related to one another, so their diversity entropy values show similar trends of change. This research can help in understanding the development and hot areas of domain knowledge and in improving the knowledge categories in Wikipedia, which may in turn promote knowledge innovation.
Xu Shengguo, Liu Xu. Research on Evolution of Knowledge Category Structure in Wikipedia[J]. Library and Information Service, 2014, 58(07): 119-124. DOI: 10.13266/j.issn.0252-3116.2014.07.020
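The diversity entropy measure named in the abstract can be illustrated with a minimal sketch. The abstract does not give the formula, so everything here is an assumption rather than the authors' implementation: the toy category graph, the function names `saw_distribution` and `diversity_entropy`, the choice to discard walks that get stuck, and the normalisation by log(N - 1). The sketch enumerates all self-avoiding walks of a fixed length from a start node and takes the Shannon entropy of the distribution over end nodes.

```python
import math
from collections import defaultdict


def saw_distribution(graph, start, steps):
    """Exact end-node distribution of self-avoiding walks of `steps` steps.

    At every step the walker picks uniformly among unvisited neighbours.
    Assumption: walks that get stuck early are discarded and the remaining
    probability mass is renormalised.
    """
    dist = defaultdict(float)

    def walk(node, visited, remaining, prob):
        if remaining == 0:
            dist[node] += prob
            return
        choices = [n for n in graph[node] if n not in visited]
        if not choices:
            return  # stuck walk, discarded
        for nxt in choices:
            walk(nxt, visited | {nxt}, remaining - 1, prob / len(choices))

    walk(start, frozenset([start]), steps, 1.0)
    total = sum(dist.values())
    return {n: p / total for n, p in dist.items()} if total > 0 else {}


def diversity_entropy(graph, start, steps):
    """Shannon entropy of the end-node distribution, scaled by log(N - 1)."""
    dist = saw_distribution(graph, start, steps)
    n = len(graph)
    if n <= 2 or not dist:
        return 0.0
    entropy = sum(p * math.log(1.0 / p) for p in dist.values() if p > 0)
    return entropy / math.log(n - 1)


# Hypothetical toy category graph (undirected adjacency lists).
graph = {
    "Science": ["Physics", "Biology", "Chemistry"],
    "Physics": ["Science", "Optics"],
    "Biology": ["Science"],
    "Chemistry": ["Science"],
    "Optics": ["Physics"],
}

scores = {node: diversity_entropy(graph, node, steps=1) for node in graph}
ranking = sorted(scores, key=scores.get, reverse=True)
```

In this toy graph the hub category "Science" ranks first and the leaf "Optics" last, which matches the abstract's reading of high-entropy nodes as central and low-entropy nodes as frontier knowledge. For graphs of realistic size, exact enumeration becomes expensive, and sampling many random self-avoiding walks would be the usual substitute.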