P2P tutorial (ESEC 2001) c Peer-to-peer information systems: concepts and models,state-of-the-art, and future systems Karl Aberer Manfred Hauswirth ■P■ TU EPFL-DSC TU Wien-DSG Distributed Information Distributed Systems Group Systems Laboratory karl.aberer@epfl.ch M.Hauswirth@infosys.tuwien.ac.at Goals of the Tutorial Position the P2P paradigm in the design space of distributed systems Get a detailed overview of state-of-the-art P2P systems Understand the problem of decentralized data management in P2P systems Understand the research issues for future systems Detailed information on the new P-Grid approach for P2P systems 2001 Karl Aberer,Manfred Hauswirth ESEC/FSE 2001 (c)2001 Karl Aberer,Manfred Hauswirth 1
P2P tutorial (ESEC 2001) (c) 2001 Karl Aberer, Manfred Hauswirth 1 Peer-to-peer information systems: concepts and models, state-of-the-art, and future systems Karl Aberer EPFL - DSC Distributed Information Systems Laboratory karl.aberer@epfl.ch Manfred Hauswirth TU Wien - DSG Distributed Systems Group M.Hauswirth@infosys.tuwien.ac.at © 2001 Karl Aberer, Manfred Hauswirth ESEC/FSE 2001 2 Goals of the Tutorial • Position the P2P paradigm in the design space of distributed systems • Get a detailed overview of state-of-the-art P2P systems • Understand the problem of decentralized data management in P2P systems • Understand the research issues for future systems • Detailed information on the new P-Grid approach for P2P systems
P2P tutorial (ESEC 2001) 和 Outline of the Tutorial ·What is P2P? P2P properties Types of P2P Systems -A historical view -Related approaches 。 State-of the-art systems Napster,Gnutella,Freenet ·P2 P Data Management -Gnutella,Freenet,Chord,CAN,P-Grid -Applications of P-Grid ·Gridella ·Managing trust Conclusions and References 2001 Karl Aberer,Manfred Hauswirth ESEC/FSE 2001 Listen-P2P is around P2P systems get a lot of attention currently File-sharing systems Napster,Gnutella,Freenet,etc. -Conferences O'Reilly P2P conference 2001 (conferences.oreilly.com/p2p/) 2001 International Conference on Peer-to-Peer Computing (P2P2001) (www.ida.liu.se/conferences/p2p/p2p2001/) 。etc. P2P is nothing new -see Arpanet 2001 Karl Aberer,Manfred Hauswirth ESEC/FSE 2001 (c)2001 Karl Aberer,Manfred Hauswirth
P2P tutorial (ESEC 2001) (c) 2001 Karl Aberer, Manfred Hauswirth 2 © 2001 Karl Aberer, Manfred Hauswirth ESEC/FSE 2001 3 Outline of the Tutorial • What is P2P? – P2P properties – Types of P2P Systems – A historical view – Related approaches • State-of the-art systems – Napster, Gnutella, Freenet • P2P Data Management – Gnutella, Freenet, Chord, CAN, P-Grid – Applications of P-Grid • Gridella • Managing trust • Conclusions and References © 2001 Karl Aberer, Manfred Hauswirth ESEC/FSE 2001 4 Listen — P2P is around • P2P systems get a lot of attention currently – File-sharing systems • Napster, Gnutella, Freenet, etc. – Conferences • O’Reilly P2P conference 2001 (conferences.oreilly.com/p2p/) • 2001 International Conference on Peer-to-Peer Computing (P2P2001) (www.ida.liu.se/conferences/p2p/p2p2001/) • etc. • P2P is nothing new – see Arpanet
P2P tutorial (ESEC 2001) 和 What is P2P? Every participating node acts as both a client and a server (servent") Every node "pays"its participation by providing access to (some of)its resources Properties: no central coordination no central database no peer has a global view of the system global behavior emerges from local interactions all existing data and services are accessible from any peer peers are autonomous peers and connections are unreliable 82001 Karl Abe ESEC/FSE 2001 What is P2P?...and what isn't? Clay Shirkey (The Accelerator Group): -"Peer-to-peer is a class of applications that take advantage of resources-storage,cycles,content,human presence- available at the edges of the Internet.Because accessing these decentralized resources means operating in an environment of unstable connectivity and unpredictable IP addresses,peer-to-peer nodes must operate outside the DNS and have significant or total autonomy of central servers. P2P "litmus test: Does it allow for variable connectivity and temporary network addresses? Does it give the nodes at the edges of the network significant autonomy? P2P an application-level internet on top of the Internet 2001 Karl Aberer,Manfred Hauswirth ESEC/FSE 2001 (c)2001 Karl Aberer,Manfred Hauswirth 3
P2P tutorial (ESEC 2001) (c) 2001 Karl Aberer, Manfred Hauswirth 3 © 2001 Karl Aberer, Manfred Hauswirth ESEC/FSE 2001 5 What is P2P? • Every participating node acts as both a client and a server (“servent”) • Every node “pays” its participation by providing access to (some of) its resources • Properties: – no central coordination – no central database – no peer has a global view of the system – global behavior emerges from local interactions – all existing data and services are accessible from any peer – peers are autonomous – peers and connections are unreliable © 2001 Karl Aberer, Manfred Hauswirth ESEC/FSE 2001 6 What is P2P? ... and what isn’t? • Clay Shirkey (The Accelerator Group): – “Peer-to-peer is a class of applications that take advantage of resources—storage, cycles, content, human presence— available at the edges of the Internet. Because accessing these decentralized resources means operating in an environment of unstable connectivity and unpredictable IP addresses, peer-to-peer nodes must operate outside the DNS and have significant or total autonomy of central servers.” – P2P “litmus test:” • Does it allow for variable connectivity and temporary network addresses? • Does it give the nodes at the edges of the network significant autonomy? • P2P ~ an application-level internet on top of the Internet
P2P tutorial (ESEC 2001) Types of P2P Systems 。E-commerce systems -eBay,B2B market places,B2B integration servers,... ·File sharing systems -Napster,Gnutella,Freenet,... Distributed Databases Mariposa [Stonebraker96],... ·Networks -Arpanet Mobile ad-hoc networks,Terminodes [Hubaux01],... 2001 Karl Aberer,Manfred Hauswirth ESEC/FSE 2001 System Layers -where is P2P? ·Users User -Commerce and society is P2P 。Application layer uses QoS E-commerce systems can be P2P or centralized Application Information management -Directories are central but could exploits QoS be P2P Information Networks often are P2P Management Internet exploits Qos Network 2001 Karl Aberer,Manfred Hauswirth ESEC/FSE 2001 (c)2001 Karl Aberer,Manfred Hauswirth 4
P2P tutorial (ESEC 2001) (c) 2001 Karl Aberer, Manfred Hauswirth 4 © 2001 Karl Aberer, Manfred Hauswirth ESEC/FSE 2001 7 Types of P2P Systems • E-commerce systems – eBay, B2B market places, B2B integration servers, … • File sharing systems – Napster, Gnutella, Freenet, … • Distributed Databases – Mariposa [Stonebraker96], … • Networks – Arpanet – Mobile ad-hoc networks, Terminodes [Hubaux01], … © 2001 Karl Aberer, Manfred Hauswirth ESEC/FSE 2001 8 User Application Information Management Network QoS Qos exploits QoS exploits uses System Layers — where is P2P? • Users – Commerce and society is P2P • Application layer – E-commerce systems can be P2P or centralized • Information management – Directories are central but could be P2P • Networks often are P2P – Internet
P2P tutorial (ESEC 2001) How much P2P is involved? P2P user P2P P2P interaction application information management eBay yes no no Napster yes yes no Gnutella, yes yes yes Freenet 2001 Karl Aberer,Manfred Hauswirth ESEC/FSE 2001 9 P2P Cooperation Models 。Centralized model - global index held by a central authority (single point of failure) -direct contact between requestors and providers Example:Napster ·Decentralized model Examples:Freenet,Gnutella no global index,no central coordination,global behavior emerges from local interactions,etc. direct contact between requestors and providers(Gnutella)or mediated by a chain of intermediaries (Freenet) ·Hierarchical model introduction of "super-peers" -mix of centralized and decentralized model Example:DNS 2001 Karl Aberer,Manfred Hauswirth ESEC/FSE 2001 10 (c)2001 Karl Aberer,Manfred Hauswirth 5
P2P tutorial (ESEC 2001) (c) 2001 Karl Aberer, Manfred Hauswirth 5 © 2001 Karl Aberer, Manfred Hauswirth ESEC/FSE 2001 9 How much P2P is involved? Gnutella, yes yes yes Freenet Napster yes yes no eBay yes no no P2P information management P2P application P2P user interaction © 2001 Karl Aberer, Manfred Hauswirth ESEC/FSE 2001 10 P2P Cooperation Models • Centralized model – global index held by a central authority (single point of failure) – direct contact between requestors and providers – Example: Napster • Decentralized model – Examples: Freenet, Gnutella – no global index, no central coordination, global behavior emerges from local interactions, etc. – direct contact between requestors and providers (Gnutella) or mediated by a chain of intermediaries (Freenet) • Hierarchical model – introduction of “super-peers” – mix of centralized and decentralized model – Example: DNS