P2P tutorial (ESEC 2001)
© 2001 Karl Aberer, Manfred Hauswirth, ESEC/FSE 2001

Distributed Databases

• Fragmenting large databases (e.g., relational) over physically distributed nodes
• Efficient processing of complex queries (e.g., SQL) by decomposing them
• Efficient update strategies (e.g., lazy vs. eager)
• Consistent transactions (e.g., 2-phase commit)
• Normally approaches rely on central coordination

Distributed Databases vs. Peer-to-Peer

• Data distribution is a key issue for P2P systems
• Approaches in distributed DB that address scalability
  – LH* family of scalable hash index structures [Litwin97]
  – Snowball: scalable storage system for workstation clusters [Vingralek98]
  – Fat-Btree: a scalable B-Tree for parallel DB [Yokota 9]
• Approaches in distributed DB that address autonomy (and scalability)
  – Mariposa: distributed relational DBMS based on an underlying economic model [Stonebraker96]
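
The 2-phase commit named above as an example of consistent transactions also shows why such approaches normally rely on central coordination: a single coordinator collects votes from every node and decides the outcome. The following is a minimal in-memory sketch, not taken from the tutorial. The names `Participant`, `two_phase_commit`, and `can_commit` are hypothetical, and real systems additionally need logging, timeouts, and recovery, which are omitted here.

```python
from dataclasses import dataclass, field


@dataclass
class Participant:
    """A node holding one database fragment (hypothetical illustration)."""
    name: str
    can_commit: bool = True                      # assumption: simulates the node's vote
    data: dict = field(default_factory=dict)     # committed state
    staged: dict = field(default_factory=dict)   # updates awaiting the decision

    def prepare(self, updates: dict) -> bool:
        # Phase 1: stage the updates and vote yes, or vote no.
        if not self.can_commit:
            return False
        self.staged = dict(updates)
        return True

    def commit(self) -> None:
        # Phase 2, global commit: make the staged updates visible.
        self.data.update(self.staged)
        self.staged = {}

    def abort(self) -> None:
        # Phase 2, global abort: discard the staged updates.
        self.staged = {}


def two_phase_commit(participants, updates_by_node) -> bool:
    """Central coordinator: commit only if every participant votes yes."""
    # Phase 1: ask every participant to prepare and collect the votes.
    votes = [p.prepare(updates_by_node.get(p.name, {})) for p in participants]
    # Phase 2: broadcast the single global decision.
    if all(votes):
        for p in participants:
            p.commit()
        return True
    for p in participants:
        p.abort()
    return False


if __name__ == "__main__":
    a, b = Participant("a"), Participant("b")
    ok = two_phase_commit([a, b], {"a": {"x": 1}, "b": {"y": 2}})
    print(ok, a.data, b.data)
```

A single "no" vote aborts the transaction on all nodes, so no fragment ever exposes a partial update. The coordinator is the one component every node depends on, which is the kind of central coordination a P2P system would try to avoid.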