Learning Non-taxonomic Relationships Based on Focused Crawler

Qiao Jianzhong

Library and Information Service ›› 2010, Vol. 54 ›› Issue (18) : 120-125,129.

PDF(612 KB)
PDF(612 KB)
Library and Information Service ›› 2010, Vol. 54 ›› Issue (18) : 120-125,129.

Learning Non-taxonomic Relationships Based on Focused Crawler

Author information +
History +

Abstract

  • In this paper, a novel framework and methodology for learning non-taxonomic relationships based focused crawler is presented. According to the characteristics of ontology learning from the Web, the main methods used in this paper are word frequency, co-occurrence statistics and, K-Means one of partitioning clustering algorithm, without the complex syntax analysis and semi-supervised clustering algorithm such as EM, BIRCH and SOM, and therefore achieves a high degree of automation and efficiency. Study results will be used to analyze and judge the relevance of the topic for focused crawling. The quality of relations learning will be evaluated objectively by the performance in the practical application of the focused crawler.

Key words

ontology learning / non-taxonomic relation / focused crawler / partitioning clustering algorithm / relevance

Cite this article

Download Citations
Qiao Jianzhong. Learning Non-taxonomic Relationships Based on Focused Crawler[J]. Library and Information Service, 2010, 54(18): 120-125,129

References

[1]Hiep Phuc Luong, Susan Gauch, Qiang Wang. Ontology-Based Focused Crawling[J]. International Conference on Information, Process, and Knowledge Management. 2009:123-128. [2]Gomez A. Perez, Manzano D. Mancho. OntoWeb Deliverable 1.5: A survey of ontology learning methods and techniques [R].IST Programme of the Commission of the European Communities,IST-2000-29243, 2003. [3]Lucas Drumond, Rosario Girardi. A Survey of Ontology Learning Procedures[C]. Proceedings of the 3rd Workshop on Ontologies and their Applications, Brazil, 2008-10-26. [4]Paul Buitelaar, P., Cimiano, P., Magnini B. (Eds.). Ontology Learning from Text:An Overview.Ontology Learning from Text:Methods, Evaluation and Applications[B],2005:3-12. [5]David Sánchez,Antonio Moreno. Discovering non-taxonomic relations from the Web[J].Lecture Notes in Computer Science.IDEAL 2006.Springer Berlin,2006 Vol. 4224. [6]M. Baroni and S. Bisi. Using cooccurrence statistics & the web to discover synonyms in a technical language[J]. In Proceedings of the 4th International Conference on Language Resources and Evaluation, vol. 5, 2004:1725–1728. [7]Peter D. Turney. Mining the web for synonyms: Pmiir versus lsa on toe?[J]. In Proceedings of the 12th European Conference on Machine Learning, 2001: 491 – 502. [8]David Sánchez, Antonio Moreno. Learning non-taxonomic relationships from web documents for domain ontology construction[J]. Data & Knowledge Engineering, Vol. 64, No. 3. 2008:600-623. [9]Dou Shen, Jian-Tao Sun, Qiang Yang, Zheng Chen. A Comparison of Implicit and Explicit Links for Web Page Classification[C]. Proceedings of the 15th international conference on World Wide Web. Banff, Canada, 2006:643 - 650. [10]Jun Li, Kazutaka Furuse, Kazunori Yamaguchi. Focused Crawling by Exploiting Anchor Text Using Decision Tree[C]. Proceedings of the 14th international World Wild Web Conference. Chiba, Japan, 2005:1190 - 1191. [11]Missikoff M., Navigli R., Velardi P..Integrated Approach for Web Ontology Learning and Engineering[J]. IEEE Computer 35 (11), 2002: 60–63. [12]HowNet[EB/OL][2009-10-10].http://www.keenage.com/zhiwang/c_zhiwang.html. [13]刘群,李素建.基于《知网》的词汇语义相似度计算[C].第三届中文词汇语义学研讨会论文集,2002-5.http://www.keenage.com/html/paper.html. [14]Jing Kong. Ontology Learning for Chinese Information Organization and Knowledge Discovery in Ethnology and Anthropology[J]. Data Science Journal 6,2007: 500-510. [15]Hartigan, J. A.. Clustering algorithms[B]. New York: John Wiley,1975. [16]K-means Algorithm [EB/OL] [2009-10-10]. Wikipedia. http://en.wikipedia.org/wiki/K-means_algorithm. [17]XindongWu, Vipin Kumar, Ross Quinlan J..Top 10 algorithms in data mining[J].Knowl Inf Syst 2008, 14:1–37. [18]Miha Grcar. D2.2.2 Ontology learning services library[R].EU-IST Project IST-2004-026460 TAO,2008-4-1. [19]Agirre, E., Ansa, O., Hovy, E., and Martinez, D.. Enriching very large ontologies using the WWW[C]. In Proceedings of the Workshop on Ontology Construction of the European Conference of AI (ECAI-00), 2000. [20]Su, C., Gao, Y., Yang, J., Luo, B.. An efficient adaptive focused crawler based on ontology learning[C]. Fifth International Conference of Hybrid Intelligent Systems, 2005:6-9. [21]Tian Zhang,Raghu Ramakrishnan,Miron Livny. BIRCH: A New Data Clustering Algorithm and Its Applications[J].Data Mining and Knowledge Discovery.1997,1(2):141 - 182. [22]Haibo Jia, Julian Newman, Huaglory Tianfield. A new Formal Concept Analysis based learning approach to Ontology Building[J].Metadata and Semantics.Springer US. 2007:433-444. [23]Claire Nedellec, Adeline Nazarenko. Ontologies and Information Extraction[R], LIPN Internal Report (2005), CoRR abs/cs/0609137,2006. [24]Marie Chagnoux,Nathalie Hernandez, Nathalie Aussenac-Gilles. An interactive pattern based approach for extracting non-taxonomic relations from texts[C].IRIT - University of Toulouse, France, 2008. [25]Alexander Maedche, Steffen Staab. Mining Non-Taxonomic Conceptual Relations from Text[J].In: R.DIENG & O CORBY. EKAW-00 – EUROPEAN KNOWLEDGE ACQUISITION WORKSHOP, 2000,10: 2-6. [26]Athanasios Tegos, Vangelis Karkaletsis, Alexandros Potamianos. Learning of Semantic Relations between Ontology Concepts using Statistical Techniques[C].HLIE08, ECML-PKDD 2008. [27]Faure, D., Nedellec, C..A Corpus-Based Conceptual Clustering Method for Verb Frames and Ontology Acquisition[C]. In: LREC, Spain, 1998. [28]Kavalec, M., Maedche, A., Svatek, V..Discovery of Lexical Entries for Non-taxonomic Relations in Ontology Learning[C]. In: SOFSEM ,2004. [29]Punuru, J., Chen, J..Extraction of Non-hierarchical Relation from Domain Texts[C].In: IEEE Symposium on Computational Intelligence and Data Mining (CIDM),2007. [30]Ciaramita, M., Gangemi, A., Ratsch, E.,etc..Unsupervised Learning of Semantic Relations between Concepts of Molecular Biology Ontology[C]. In:International Joint Conference on Arti?cial Intelligence, 2005 [31]Aurawan Imsombut, Asanee Kawtrakul. Automatic Building of An Ontology on the Basis of Txt Corpora in Thai[J]. Language Resources and Evaluation Journal special issue on Asian Language technology, December, 2007,12:137-149. [32]Schutz A., Buitelaar P., RelExt. A Tool for Relation Extraction in Ontology Extension[C]. In: Proc. of the Fourth International Semantic Web Conference, 2005: 593–606. [33]Kavalec M., Maedche A., Skátek V.. Discovery of Lexical Entries for Non-Taxonomic Relations in Ontology Learning[C]. In: Proc. of SOFSEM 2004, Lecture Notes in Computer Science, 2004, vol. 2932:249–256. [34]Wiemer Hastings P., Graesser A., Wiemer Hastings K.. Inferring the Meaning of Verbs from Context[C]. In: Proc. of the 20th Annual Conference of the Cognitive Science Society, 1998: 1142–1147. [35]Ian H. Witten, Eibe Frank: Data Mining: Practical machine learning tools and techniques[EB/OL]. 2nd Edition, Morgan Kaufmann, San Francisco, 2005. http://www.cs.waikato.ac.nz/ml/weka/. [36]中国科学院计算技术研究所.ICTCLAS [EB/OL] (2009-2-29)[2009-10-10].http://www.ictclas.org/.
PDF(612 KB)

997

Accesses

0

Citation

Detail

Sections
Recommended

/