1 Introduction to XML IR
XML Group
2 Outline
• Introduction
• XML Search
• XML Scoring and Ranking
• Conclusion
3 Introduction
[Figure: screenshot of a web search for "xml keyword search". The result list includes the seminar "What Users Want in XML Keyword Search" (Dr. Yi Chen, Arizona State University), "Integrating Keyword Search into XML Query Processing", a paper on the SLCA (smallest lowest common ancestor) problem, and "XRANK: Ranked Keyword Search over XML Documents".]
• Submit keywords
• Top-k ranking
4 Introduction
[Figure: example XML tree. Root Research with two Institute subtrees and one Projects subtree. Four Researcher elements: (Name "Alice", Gender Female), (Name "Joe", Gender Male), (Name "Linda", Gender Female, ProjRef), (Name "John", Gender Male, ProjRef). Two Project elements: (Topic "XML") and (Topic "RDF").]
Q1: Linda
Q2: Female, Linda
Q3: Female, Researcher
The result is an XML fragment. Fragments shown: Researcher (Name "Alice", Gender Female) and Researcher (Name "Linda", Gender Female, ProjRef).
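The queries on this slide can be tried against a small reconstruction of the example tree. The sketch below is only an illustration, not the deck's algorithm: it assumes a keyword matches an element when it equals the element's tag or its text (case-insensitive), and it returns the smallest elements whose subtree covers all keywords. The placement of researchers under the two Institute nodes and the `id` values `p1`/`p2` are assumptions, since the figure does not state them.

```python
import xml.etree.ElementTree as ET

# Reconstruction of the slide's example tree. Which Institute holds which
# Researcher, and the ids p1/p2, are assumptions made for illustration.
DOC = """
<Research>
  <Institute>
    <Researcher><Name>Alice</Name><Gender>Female</Gender></Researcher>
    <Researcher><Name>Joe</Name><Gender>Male</Gender></Researcher>
  </Institute>
  <Institute>
    <Researcher><Name>Linda</Name><Gender>Female</Gender><ProjRef>p1</ProjRef></Researcher>
    <Researcher><Name>John</Name><Gender>Male</Gender><ProjRef>p2</ProjRef></Researcher>
  </Institute>
  <Projects>
    <Project id="p1"><Topic>XML</Topic></Project>
    <Project id="p2"><Topic>RDF</Topic></Project>
  </Projects>
</Research>
"""
ROOT = ET.fromstring(DOC)


def smallest_matches(root, keywords):
    """Return elements whose subtree covers every keyword while no
    descendant's subtree does (the 'smallest' covering fragments)."""
    wanted = {k.lower() for k in keywords}
    results = []

    def visit(node):
        # Terms carried by this element itself: its tag and its own text.
        own = {node.tag.lower()}
        if node.text and node.text.strip():
            own.add(node.text.strip().lower())
        found = own & wanted
        child_complete = False
        for child in node:
            child_found, complete = visit(child)
            found |= child_found
            child_complete = child_complete or complete
        complete = found == wanted
        if complete and not child_complete:
            results.append(node)
        return found, complete

    visit(root)
    return results


q1 = smallest_matches(ROOT, ["Linda"])
q2 = smallest_matches(ROOT, ["Female", "Linda"])
q3 = smallest_matches(ROOT, ["Female", "Researcher"])
```

Under these assumptions Q2 yields the single Researcher fragment for Linda, and Q3 yields the Researcher fragments for Alice and Linda, matching the fragments shown on the slide.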
5 Introduction: Conceptual Model
[Figure: IR conceptual model. Components: Documents, Indexing, Document representation; Query, Formulation, Query representation; Retrieval function; Retrieval results; Relevance feedback.]
• Keywords
• Inverted index (Algorithm Design)
• Matching content + structure (Scoring and Ranking)
• Presentation of related components (Semantic Definition)
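To make the "inverted index" component concrete, here is a minimal sketch of one possible document representation for XML. It assumes each element is labeled with a Dewey-style path (a tuple of child positions), a labeling that the slide does not prescribe. The names `build_index` and `common_prefix` are hypothetical helpers introduced for this illustration.

```python
from collections import defaultdict
import xml.etree.ElementTree as ET

# Tiny sample document, assumed for illustration only.
SAMPLE = ET.fromstring(
    "<Research>"
    "<Researcher><Name>Linda</Name><Gender>Female</Gender></Researcher>"
    "<Researcher><Name>John</Name><Gender>Male</Gender></Researcher>"
    "</Research>"
)


def build_index(root):
    """Map each term (tag or text word) to the Dewey labels of the
    elements that carry it, in document order."""
    index = defaultdict(list)

    def visit(node, label):
        terms = {node.tag.lower()}
        if node.text and node.text.strip():
            terms.update(node.text.lower().split())
        for term in sorted(terms):
            index[term].append(label)
        for i, child in enumerate(node):
            visit(child, label + (i,))

    visit(root, (0,))
    return dict(index)


def common_prefix(a, b):
    """Longest common prefix of two Dewey labels, i.e. the label of the
    lowest common ancestor of the two elements."""
    n = 0
    while n < min(len(a), len(b)) and a[n] == b[n]:
        n += 1
    return a[:n]


INDEX = build_index(SAMPLE)
# Matching content + structure: combine the postings of two keywords.
lca = common_prefix(INDEX["linda"][0], INDEX["female"][0])
```

With this labeling, structural matching reduces to prefix operations on the postings, which is one reason such labels are convenient for keyword search over trees.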
6 Introduction
• XML IR
  – Query Semantics
  – Query Processing (XML Search Algorithm)
  – Scoring and Ranking
  – Result representation
7 Outline
• Introduction
• XML Search
  – XML Search Semantics
  – XML Search Algorithms
• XML Scoring and Ranking
• Conclusion
8 XML Search Languages
• Three classes of XML search languages
  – Keyword search
    • "book xml"
  – Path Expression + Keyword search
    • /book[./title about "xml db"]
  – XQuery + Complex full-text search
    • for $b in /book let score $s := $b ftcontains "xml" && "db" distance 5
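The second and third classes can be emulated in a few lines to show how they differ. This is a rough sketch, not an implementation of either query language. The sample books and the `<lib>` wrapper element are invented for illustration, `about` is approximated as "the element contains at least one of the keywords", and "distance 5" is read as "the two words occur at most 5 token positions apart", which is an assumption about the intended semantics.

```python
import re
import xml.etree.ElementTree as ET

# Hypothetical sample data; the <lib> wrapper is an assumption.
LIB = ET.fromstring(
    "<lib>"
    "<book><title>XML DB internals</title>"
    "<body>An xml store is a db.</body></book>"
    "<book><title>Cooking basics</title>"
    "<body>xml appears here, but only much later in this long text is a db mentioned.</body></book>"
    "<book><title>Storage notes</title>"
    "<body>Any db can hold xml.</body></book>"
    "</lib>"
)


def tokens(elem):
    """Lower-cased word tokens of all text in the element's subtree."""
    return re.findall(r"\w+", " ".join(elem.itertext()).lower())


def about(elem, keywords):
    """Crude relevance score: how many of the keywords occur in elem."""
    words = set(tokens(elem))
    return sum(1 for k in keywords if k.lower() in words)


def near(elem, a, b, max_gap):
    """True if a and b occur at most max_gap token positions apart."""
    toks = tokens(elem)
    pos_a = [i for i, t in enumerate(toks) if t == a]
    pos_b = [i for i, t in enumerate(toks) if t == b]
    return any(abs(i - j) <= max_gap for i in pos_a for j in pos_b)


# Class 2: the path picks the title, the keywords are matched inside it.
path_hits = [b.findtext("title") for b in LIB.findall("book")
             if about(b.find("title"), ["xml", "db"]) > 0]

# Class 3: full-text condition with a proximity constraint on the whole book.
fulltext_hits = [b.findtext("title") for b in LIB.findall("book")
                 if near(b, "xml", "db", 5)]
```

The path-based query only looks inside `title`, so it misses "Storage notes", while the proximity query accepts it and still rejects the book where the two words are far apart.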
9 Search Semantics
Q: Female, XML
[Figure: the example tree from slide 4 (Research with two Institute subtrees and one Projects subtree), viewed as Tree & Graph (IDRef). Two candidate answers are shown, one labeled "ideal result" and the other "wrong result": Researcher (Name "Linda", Gender Female, ProjRef) together with Project (Topic "XML"), and a fragment reaching Topic "XML" through Projects.]
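The difference between the tree view and the graph view can be sketched for Q: Female, XML. The code below is an illustration under assumptions: the tree layout, the `id` attributes `p1`/`p2`, and the convention that a ProjRef's text names the referenced Project are all invented here, since the slide only shows that ProjRef exists. In the tree view, Linda's Gender and the "XML" Topic meet only at the document root. Following the reference connects the Researcher directly to the Project.

```python
import xml.etree.ElementTree as ET

# Assumed reconstruction of the example document (layout and ids are guesses).
DOC = """
<Research>
  <Institute>
    <Researcher><Name>Alice</Name><Gender>Female</Gender></Researcher>
    <Researcher><Name>Joe</Name><Gender>Male</Gender></Researcher>
  </Institute>
  <Institute>
    <Researcher><Name>Linda</Name><Gender>Female</Gender><ProjRef>p1</ProjRef></Researcher>
    <Researcher><Name>John</Name><Gender>Male</Gender><ProjRef>p2</ProjRef></Researcher>
  </Institute>
  <Projects>
    <Project id="p1"><Topic>XML</Topic></Project>
    <Project id="p2"><Topic>RDF</Topic></Project>
  </Projects>
</Research>
"""
root = ET.fromstring(DOC)
parent = {child: node for node in root.iter() for child in node}


def ancestors(node):
    """The node followed by all of its ancestors up to the root."""
    chain = [node]
    while node in parent:
        node = parent[node]
        chain.append(node)
    return chain


def tree_lca(a, b):
    """Lowest common ancestor using parent-child edges only."""
    chain_a = ancestors(a)
    for node in ancestors(b):
        if any(node is other for other in chain_a):
            return node
    return None


linda = next(r for r in root.iter("Researcher") if r.findtext("Name") == "Linda")
xml_topic = next(t for t in root.iter("Topic") if t.text == "XML")

# Tree view: the two keyword matches are connected only at the root.
tree_answer = tree_lca(linda.find("Gender"), xml_topic)

# Graph view: resolve ProjRef to the Project it points at.
projects = {p.get("id"): p for p in root.iter("Project")}


def female_on_topic(topic):
    hits = []
    for r in root.iter("Researcher"):
        proj = projects.get(r.findtext("ProjRef"))
        if (r.findtext("Gender") == "Female" and proj is not None
                and proj.findtext("Topic") == topic):
            hits.append((r.findtext("Name"), proj.get("id")))
    return hits


graph_answer = female_on_topic("XML")
```

Here the tree-only answer is rooted at Research and therefore spans the whole document, while the reference-aware answer is the compact pair of Linda's Researcher element and the XML Project.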
10 Search Semantics
• Factors that affect the semantics
  – Tree & Graph (whether IDRef is considered)
  – Relationship Between Entities
  – Schema (whether the schema is considered)
  – Flexibility of the XML structure