正在加载图片...
Recommendation of Web pages based on Concept association Mingyu Lu Qiang Zhou Fan Li Yuchang Lu Lizhu Zhou Department of Computer Science and Technology, Tsinghua University, Beijing, 100084) ( Department of Computer Science and Engineering, Yantai University, Shandong, 264005) E-mail:my/u99@mails.tsinghua.edu.cn Abstract done in the area. They mainly fall into three categories (1)Recommendation based on the similarity between The precision and recall are two main criteria used to web pages evaluate the performance of search engines. For general (2)Recommendation based on the preferences and queries, the precision is most important, but for specific behaviors of group users/4) ses, e.g. scientific researchers and applicants for patent ()Recommendation based on the preference and rights, the recall is most important and is often ignored. In behavior of individual user 'e have noticed an interesting phenomenon which based on concept association is introduced. This approach can be called concept association. For example, when we on the information that associates strongly with users'"Microsoft"or"Operating System, because we take it for queries, and therefore improve the recall of a search granted that " Windows"is associated tightly with engine. In the paper, we discuss the meaning, effect and "Microsoft"or"Operating System"in the nature of things variety of concept association, and describe how to Therefore, when a user input"windows"as a keyword generate associational information related with users' of his query, the search engine can provide information queries and how to realize the association-based not only about"Windows also about“ Microsoft recommendation of web pages. We also present relevant and " Operating System", according to the same reason experimental results and give idea about our further work. Another example is that, if users use 9. 11 Event"as a ke word, the traditional information retrieval technique 1. ntroduction which relies on key word matching will just fetch back web pages with9 11 Event"in them, while the search With the explosive growth of the Internet, information engine based on concept association can also get pages including"terrorist attacks","World Trade Center"."Bin on World Wide Web(www) is swiftly becoming a Laden" and etc, because they have tight relation with the new huge information source But in reality, when a web 9. 11 Even". In these cases the recall will be significantly user attempts to search information by using a search ngine, the traditional information retrieval techniques improved. Such examples are too numerous to be ually return those web pages including the key words in enumerated one by one. the query. The result page set consists of too much Different from the above three main methods association-based approach we present for web page In such situations, precision and recall are poor and recommendation parses users query from which the key disappointing.How to carry out effective information retrieval has become a very important and knotty problem. to web pages associated strongly with the query through conce The precision and recall are two main criteria for pt association. It can help web users to get more evaluatingtheperformanceofsearchenginesForgeneralusefulandinterestinginformationfromthewww.and improve the efficiency and quality of information retrieval queries, precision is most important, but for specific uses, of a web search engine, especially its recall. e.g. scientific researchers and applicants for patent rights, recall is most concerned about and is often ignored esearches on improving the recall in interesting 2. Taxonomy of concept association Web information recommendation is an effec measure to improve the recall rate. Many works have been In the paper, we define five kinds of associations of 973 Program of China under Grant No. G199803041 the National (1)Superclass Association Natural Science Foundation of China un Proceedings of the 4th IEEE Int'l Workshop on Advanced Issues of E-Commerce and Web-Based Information Systems(WECWIS 2002) 1530-1354/02$1700e2002EE SOCIERecommendation of Web Pages Based on Concept Association Mingyu Lu1,2 Qiang Zhou1 Fan Li1 Yuchang Lu1 Lizhu Zhou1 (Department of Computer Science and Technology, Tsinghua University, Beijing, 100084) (Department of Computer Science and Engineering, Yantai University, Shandong, 264005) E-mail: mylu99@mails.tsinghua.edu.cn Abstract The precision and recall are two main criteria used to evaluate the performance of search engines. For general queries, the precision is most important, but for specific uses, e.g. scientific researchers and applicants for patent rights, the recall is most important and is often ignored. In this paper, an approach to web pages recommendation based on concept association is introduced. This approach can produce recommended web page links for users based on the information that associates strongly with users’ queries, and therefore improve the recall of a search engine. In the paper, we discuss the meaning, effect and variety of concept association, and describe how to generate associational information related with users’ queries and how to realize the association-based recommendation of web pages. We also present relevant experimental results and give idea about our further work. 1. Introduction With the explosive growth of the Internet, information on World Wide WebWWW is swiftly becoming a new huge information source[1]. But in reality, when a web user attempts to search information by using a search engine, the traditional information retrieval techniques usually return those web pages including the key words in the query. The result page set consists of too much irrelevant information and may lose some relevant ones. In such situations, precision and recall are poor and disappointing[1,2]. How to carry out effective information retrieval has become a very important and knotty problem. The precision and recall are two main criteria for evaluating the performance of search engines. For general queries, precision is most important, but for specific uses, e.g. scientific researchers and applicants for patent rights, recall is most concerned about and is often ignored --- researches on improving the recall in interesting. Web information recommendation is an effective measure to improve the recall rate. Many works have been The research has been supported by the National Grand Fundamental 973 Program of China under Grant No.G1998030414 and the National Natural Science Foundation of China under Grant No.79990580. done in the area. They mainly fall into three categories: (1) Recommendation based on the similarity between web pages [3]. (2) Recommendation based on the preferences and behaviors of group users[4]. (3) Recommendation based on the preference and behavior of individual user[5]. We have noticed an interesting phenomenon which can be called concept association. For example, when we talk about “Windows”, naturally we will think of “Microsoft” or “Operating System”, because we take it for granted that “Windows” is associated tightly with “Microsoft” or “Operating System” in the nature of things. Therefore, when a user input “windows” as a keyword of his query, the search engine can provide information not only about “Windows”, but also about “Microsoft” and “Operating System”, according to the same reason. Another example is that, if users use “9.11 Event” as a key word, the traditional information retrieval technique which relies on key word matching will just fetch back web pages with “9.11 Event” in them, while the search engine based on concept association can also get pages including “terrorist attacks”, “World Trade Center”, “Bin Laden” and etc, because they have tight relation with the “9.11 Even”. In these cases the recall will be significantly improved. Such examples are too numerous to be enumerated one by one. Different from the above three main methods, association-based approach we present for web page recommendation parses users’ query from which the key words are extracted, and provides candidate links pointed to web pages associated strongly with the query through concept association. It can help web users to get more useful and interesting information from the WWW, and improve the efficiency and quality of information retrieval of a web search engine, especially its recall. 2. Taxonomy of concept association In the paper, we def ine f ive kinds of associations of concept: (1) Superclass Association Proceedings of the 4th IEEE Int’l Workshop on Advanced Issues of E-Commerce and Web-Based Information Systems (WECWIS 2002) 1530-1354/02 $17.00 © 2002 IEEE
向下翻页>>
©2008-现在 cucdc.com 高等教育资讯网 版权所有