正在加载图片...
tudy /reference B/..So by analyzing the words between [4] Marshakova, I. V 1973. System of document two references it is often possible to automatically analyze connections based on references Nauchno- the exact relationship between these two references and Tekhnicheskaya Informatsiya, vol. 2, no 6, pp. 3-8 how they compare to each other [5] Beel, J. Gipp, B. 2008, The Potential of Oftentimes it is possible by knowing the position of each Collaborative Document Evaluation for Science. the citation within a document. to draw conclusions about the 1 1th International Conference on Digital Asian document type e.g. state-of-the art publications, etc. The Libraries (ICadl 2008), December 2-5, Kuta, gathered information can be used to classify further Indonesia, published in G. Buchanan, M. Masoodian documents and to develop a more sophisticated 'Web of S. Cunningham(Eds ) Digital Libraries: Universal and Science. We believe that these technologies. in Ubiquitous Access to Information of Lecture Notes in combination with collaborative filtering, will be the future Computer Science, vol 5362, DOI 10.1007/978-3-540 for identifying related work and will open the doors for 895336,1SSN0302-9743,pp.375-378, Springer powerful research paper recommender systems Verlag Berlin Heidelberg. [6] Small, H. 1973. Co-citation in the scientific literature V. discussion Conclusion a new measure of the relationship between two As shown, the CPa and Coa offer substantial advantages documents, Journal of the American Society for in identifying related documents in comparison to existing Information Science, vol 24, pp, 265-269 approaches. However, it should also be taken into account [7] Klavans,R, Boyack, K.(2006). Identifying a better that the effort is considerable. it is not sufficient to evaluate measure of relatedness for mapping science, Journal of the bibliography of documents, but it is necessary to the American Society for Information Science and process the complete document, identify each reference and Technology, Vol. 57, No. 2, pp. 251-263 is in practice not always possible, and leads in ca. 3% of [8] Sternitzke C. Bergmann, I(2009), Similarity map it to the corresponding entry in the bibliography, which measures for document mapping: A comparative study cases to mismatches. This is because sometimes only an on the level of an individual scientist. scientometrics abstract and the bibliography can be accessed, documents Vol.78,No.1,pp.113-1 cannot be parsed as OCR fails, or a reference style is used that makes it unfeasible to automatically link references to 9] Garfield, E(2001, November 27, 2001). From the corresponding items in the bibliography. This leads to Bibliographic Coupling to Co-Citation Analysis Via the conclusion that although these new approaches deliver Algorithmic Historio-Bibliography: A Citationist's superior results, they cannot completely replace the already Tribute to BelverC. Griffith. Paper presented at the existing approaches, but should be used in combination Drexel University, Philadelphia, PA [10] Giles, C L Bollacker, K D. And Lawrence, S. 1998 CiteSeer: an automatic citation indexing system, In REFERENCES Digital Libraries 98- The Third ACM Conference on [1] Gipp, B. Beel, J. (2009). Citation Proximity Digital Libraries, pp 89-98 Analysis(CPA)-A new approach for identifying [11] Gipp, B. Beel, J. Hentschel, C(2009), Scienstein related work based on Co-Citation Analysis. In A Research Paper Recommender System, in Proceedings of the 12th International Conference on Proceedings of IEEE International Conference on Scientometrics and Informetrics, pp 571-575 Emerging Trends in Computing. Tamil Nadu, India [2] Rip, A, Courtial, J(1984). Co-Word Maps of Biotechnology: An Example of Cognitive Scientometrics. Scientometrics, 6(6), 381-400 3] Fano, R. M. 1956. Information theory and the retrieval of recorded information in documentation in Action Shera, J. H. Kent, A. Perry, J. w.(Edts), New York Reinhold Publ. Co., pp 238-244study [reference B]...” So by analyzing the words between two references it is often possible to automatically analyze the exact relationship between these two references and how they compare to each other. Oftentimes it is possible by knowing the position of each citation within a document, to draw conclusions about the document type e.g. state-of-the art publications, etc. The gathered information can be used to classify further documents and to develop a more sophisticated „Web of Science‟. We believe that these technologies, in combination with collaborative filtering, will be the future for identifying related work and will open the doors for powerful research paper recommender systems. V. DISCUSSION & CONCLUSION As shown, the CPA and COA offer substantial advantages in identifying related documents in comparison to existing approaches. However, it should also be taken into account that the effort is considerable. It is not sufficient to evaluate the bibliography of documents, but it is necessary to process the complete document, identify each reference and map it to the corresponding entry in the bibliography, which is in practice not always possible, and leads in ca. 3% of cases to mismatches. This is because sometimes only an abstract and the bibliography can be accessed, documents cannot be parsed as OCR fails, or a reference style is used that makes it unfeasible to automatically link references to the corresponding items in the bibliography. This leads to the conclusion that although these new approaches deliver superior results, they cannot completely replace the already existing approaches, but should be used in combination. REFERENCES [1] Gipp, B. & Beel, J. (2009). Citation Proximity Analysis (CPA) - A new approach for identifying related work based on Co-Citation Analysis. In Proceedings of the 12th International Conference on Scientometrics and Informetrics, pp. 571-575. [2] Rip, A., & Courtial, J. (1984). Co-Word Maps of Biotechnology: An Example of Cognitive Scientometrics. Scientometrics, 6(6), 381-400. [3] Fano, R. M. 1956. Information theory and the retrieval of recorded information, in Documentation in Action, Shera, J. H. Kent, A. Perry, J. W. (Edts), New York: Reinhold Publ. Co., pp. 238–244. [4] Marshakova, I. V. 1973. System of document connections based on references, Nauchno￾Tekhnicheskaya Informatsiya, vol. 2, no. 6, pp. 3–8. [5] Beel, J. & Gipp, B. 2008, The Potential of Collaborative Document Evaluation for Science, the 11th International Conference on Digital Asian Libraries (ICADL 2008), December 2 - 5, Kuta, Indonesia, published in G. Buchanan, M. Masoodian & S. Cunningham (Eds.), Digital Libraries: Universal and Ubiquitous Access to Information of Lecture Notes in Computer Science, vol. 5362, DOI 10.1007/978-3-540- 89533-6, ISSN 0302-9743, pp. 375-378, Springer￾Verlag Berlin Heidelberg. [6] Small, H. 1973. Co-citation in the scientific literature: a new measure of the relationship between two documents, Journal of the American Society for Information Science, vol. 24, pp. 265–269. [7] Klavans, R., & Boyack, K. (2006). Identifying a better measure of relatedness for mapping science, Journal of the American Society for Information Science and Technology, Vol. 57, No. 2, pp. 251-263. [8] Sternitzke, C. Bergmann, I. (2009), Similarity measures for document mapping: A comparative study on the level of an individual scientist, Scientometrics, Vol. 78, No. 1, pp. 113-130. [9] Garfield, E. (2001, November 27, 2001). From Bibliographic Coupling to Co-CitationAnalysis Via Algorithmic Historio-Bibliography: A Citationist‟s Tribute to BelverC. Griffith. Paper presented at the Drexel University, Philadelphia, PA. [10] Giles, C. L. Bollacker, K. D. And Lawrence, S. 1998. CiteSeer: an automatic citation indexing system, In Digital Libraries 98 - The Third ACM Conference on Digital Libraries, pp. 89-98. [11] Gipp, B. Beel, J. & Hentschel, C. (2009), Scienstein - A Research Paper Recommender System, in Proceedings of IEEE International Conference on Emerging Trends in Computing. Tamil Nadu, India
<<向上翻页
©2008-现在 cucdc.com 高等教育资讯网 版权所有