Research and Design of an RSS-Based Enterprise Web Search Engine (基于RSS的企业Web搜索引擎研究与设计)

ABSTRACT

As Internet content grows ever more abundant, the sheer volume of information published online paradoxically reduces the value of any individual piece of information. Search engines have consequently become an extremely important tool for a growing number of enterprises seeking to extract intelligence and knowledge from this sea of information, and how to apply search engine techniques within enterprises has become a hot topic in this research area.

This paper analyzes the problems that arise when an enterprise Web search engine applies existing public search engine techniques. Starting from the actual needs of enterprise search services, it aims to improve the effectiveness of the search engine's information collection and to reduce the cost of deployment and operation. It adopts push-mode RSS technology to address the problems of typical pull-mode search engines, such as long refresh periods and high deployment and operating costs, and presents a new RSS Feed auto-discovery technique based on meta-search.

The paper first analyzes the technical specifications and features of RSS. It then studies in detail the key techniques of Web search engines, including Chinese word segmentation, data indexing, and information retrieval, focusing on their working mechanisms, workflows, and implementation methods. Taking the characteristics of enterprise search engines into account, it also improves some of these techniques. Building on these studies, it describes the design and implementation of an RSS-based enterprise Web search engine, with emphasis on module design, workflow, and main data structures.

Keywords: Search Engine, RSS, Chinese Word Segmentation, Information Retrieval
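The abstract contrasts feed-based collection with pull-mode crawling. As a rough illustration only, and not the thesis's own implementation, the following Python sketch reads an RSS 2.0 feed and keeps only the items the engine has not yet seen, which is one way a collector might refresh its index without re-crawling a site. The names `SAMPLE_FEED`, `FeedItem`, `parse_rss_items`, and `select_new_items`, and the example.com URLs, are hypothetical and chosen for this example.

```python
import xml.etree.ElementTree as ET
from dataclasses import dataclass
from typing import List, Set

# Hypothetical RSS 2.0 feed used only for illustration.
SAMPLE_FEED = """<?xml version="1.0"?>
<rss version="2.0">
  <channel>
    <title>Example Corp News</title>
    <link>http://example.com/</link>
    <description>Hypothetical enterprise news feed</description>
    <item>
      <title>Product launch</title>
      <link>http://example.com/news/1</link>
      <description>Details of the launch.</description>
      <pubDate>Mon, 06 Oct 2008 10:00:00 GMT</pubDate>
    </item>
    <item>
      <title>Quarterly report</title>
      <link>http://example.com/news/2</link>
      <description>Summary of results.</description>
      <pubDate>Tue, 07 Oct 2008 09:30:00 GMT</pubDate>
    </item>
  </channel>
</rss>"""


@dataclass
class FeedItem:
    """One feed entry, reduced to the fields an indexer would likely need."""
    title: str
    link: str
    description: str
    pub_date: str


def parse_rss_items(xml_text: str) -> List[FeedItem]:
    """Extract every <item> under <channel> from an RSS 2.0 document."""
    root = ET.fromstring(xml_text)
    items = []
    for node in root.findall("./channel/item"):
        items.append(
            FeedItem(
                title=node.findtext("title", default=""),
                link=node.findtext("link", default=""),
                description=node.findtext("description", default=""),
                pub_date=node.findtext("pubDate", default=""),
            )
        )
    return items


def select_new_items(items: List[FeedItem], seen_links: Set[str]) -> List[FeedItem]:
    """Keep only items whose link has not been indexed before."""
    return [item for item in items if item.link not in seen_links]


if __name__ == "__main__":
    all_items = parse_rss_items(SAMPLE_FEED)
    # Assume the first article was indexed on an earlier run.
    fresh = select_new_items(all_items, {"http://example.com/news/1"})
    for item in fresh:
        print(item.pub_date, item.title, item.link)
```

In this sketch the publisher's feed tells the collector which pages are new, so only those items need to be fetched, segmented, and indexed. A real deployment would also need feed discovery, scheduling, and error handling, none of which is shown here.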