Volume 35 Issue 10
Oct.  2009
Turn off MathJax
Article Contents
Yu Yang, Lin Zhangxi, Xia Guopinget al. Extracting thematic communities from Wikipedia[J]. Journal of Beijing University of Aeronautics and Astronautics, 2009, 35(10): 1283-1286. (in Chinese)
Citation: Yu Yang, Lin Zhangxi, Xia Guopinget al. Extracting thematic communities from Wikipedia[J]. Journal of Beijing University of Aeronautics and Astronautics, 2009, 35(10): 1283-1286. (in Chinese)

Extracting thematic communities from Wikipedia

  • Received Date: 30 Nov 2008
  • Publish Date: 31 Oct 2009
  • The current search module in Wikipedia has low search efficiency due to the search method, which is built on simple keywords matching. To improve the efficiency of knowledge retrieval from the Wikipedia spheres with more accurate links among them, the algorithm named term distance based on linkage (TDL) was proposed. TDL defines a new measure of distance between two keywords, which reorients and organizes those keywords into clusters. It is based on link structure analysis underpinned by computational models. The mechanism of ranking and recommending was imported. The experiment, which based on the snapshot of Wikipedia (May 2009), indicates that TDL would significantly increase the accuracy of knowledge retrieval in Wikipedia and this new algorithm can improve the users- satisfaction by 7% compared with the present one.

     

  • loading
  • [1] Markus K, Denny V, Max V. Wikipedia and the semantic web-the missing links Walter V. Wikimania 2005. Frankfurt am Main, Germany: Association for Computing Machinery Press(ACM),2005:117-125 [2] Max V, Markus K, Denny V, et al. Semantic Wikipedia Leslie C. WWW2006. Edinburgh, Scotland: Association for Computing Machinery Press(ACM),2005:265-274 [3] Shawn D A. Structure helps a Wiki navigate Mohammad A. WebDB 2005. Arlington,VA: AAAI Press,2005:97-108 [4] Natalia K. Automatic ontology extraction for document classification . Saarbrücken: Computer Science Department,Saarland University, 2006 [5] Daniel K. WikiSense-mining the Wiki Walter V. Wikimania 2005. Frankfurt am Main, Germany: Association for Computing Machinery Press (ACM),2005:254-276  [6] Chakrabarti S. Data mining for hypertext: A tutorial survey Usama M F. SIGKDD Explorations.Cambridge,Massachusetts:MIT Press,2000:113-125 [7] Jakob V. Measuring Wikipedia Peter I. ISSI 2005. Stockholm,Sweden:Karolinska University Press,2005:21-36 [8] Francesco B, Roberto B. Network analisis for Wikipedia Walter V. Wikimania 2005. Frankfurt am Main, Germany: Association for Computing Machinery Press (ACM),2005:334-367 [9] Sergey B, Lawrence P. The anatomy of a large-scale hypertextual web search engine[J]. Computer Networks and ISDN Systems,1998,30(1-7):107-117 [10] Jon K. Authoritative sources in a hyperlinked environment . Technical Report RJ 10076, IBM, 1997 [11] Fernanda B, Martin W, Kushal D. Studying cooperation and conflict between authors with history flow visualizations Brian B. SIGCHI 2004. Vienna:Association for Computing Machinery Press (ACM),2004:575-582 [12] Salton G. Automatic text processing: the transformation, analysis, and retrieval of information by computer[M]. New York:Addison-Wesley,1989:11-17 [13] Broder, Henzinger M. Information retrieval on the web: Tools and algorithmic issues [M]. Austin:Addison-Wesley,1998:112-145
  • 加载中

Catalog

    通讯作者: 陈斌, bchen63@163.com
    • 1. 

      沈阳化工大学材料科学与工程学院 沈阳 110142

    1. 本站搜索
    2. 百度学术搜索
    3. 万方数据库搜索
    4. CNKI搜索

    Article Metrics

    Article views(2791) PDF downloads(1761) Cited by()
    Proportional views
    Related

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return