Peer-to-Peer Data Management


Book Description

This lecture introduces systematically into the problem of managing large data collections in peer-to-peer systems. Search over large datasets has always been a key problem in peer-to-peer systems and the peer-to-peer paradigm has incited novel directions in the field of data management. This resulted in many novel peer-to-peer data management concepts and algorithms, for supporting data management tasks in a wider sense, including data integration, document management and text retrieval. The lecture covers four different types of peer-to-peer data management systems that are characterized by the type of data they manage and the search capabilities they support. The first type are structured peer-to-peer data management systems which support structured query capabilities for standard data models. The second type are peer-to-peer data integration systems for querying of heterogeneous databases without requiring a common global schema. The third type are peer-to-peer document retrieval systems that enable document search based both on the textual content and the document structure. Finally, we introduce semantic overlay networks, which support similarity search on information represented in hierarchically organized and multi-dimensional semantic spaces. Topics that go beyond data representation and search are summarized at the end of the lecture. Table of Contents: Introduction / Structured Peer-to-Peer Databases / Peer-to-peer Data Integration / Peer-to-peer Retrieval / Semantic Overlay Networks / Conclusion




P2P Techniques for Decentralized Applications


Book Description

As an alternative to traditional client-server systems, Peer-to-Peer (P2P) systems provide major advantages in terms of scalability, autonomy and dynamic behavior of peers, and decentralization of control. Thus, they are well suited for large-scale data sharing in distributed environments. Most of the existing P2P approaches for data sharing rely on either structured networks (e.g., DHTs) for efficient indexing, or unstructured networks for ease of deployment, or some combination. However, these approaches have some limitations, such as lack of freedom for data placement in DHTs, and high latency and high network traffic in unstructured networks. To address these limitations, gossip protocols which are easy to deploy and scale well, can be exploited. In this book, we will give an overview of these different P2P techniques and architectures, discuss their trade-offs, and illustrate their use for decentralizing several large-scale data sharing applications. Table of Contents: P2P Overlays, Query Routing, and Gossiping / Content Distribution in P2P Systems / Recommendation Systems / Top-k Query Processing in P2P Systems




Data Management in Grid and Peer-to-Peer Systems


Book Description

This book constitutes the refereed proceedings of the First International Conference on Data Management in Grid and Peer-to-Peer Systems, Globe 2008, held in Turin, Italy, in September 2008. P2P and grid computing are important for scale distributed systems and applications that require effective management of voluminous, distributed, and heterogeneous data. The 10 revised full papers presented were carefully reviewed and selected from 23 submissions. The papers are organized in topical sections on P2P storage systems and caching, P2P data integration systems, querying in grid and P2P systems.




Peer-to-Peer Systems and Applications


Book Description

Starting with Napster and Gnutella, peer-to-peer systems became an integrated part of the Internet fabric attracting millions of users. This book provides an introduction to the field. It draws together prerequisites from various fields, presents techniques and methodologies, and gives an overview on the applications of the peer-to-peer paradigm.




Global Data Management


Book Description

An unforeseen growth of the volume and diversity of the data, content and knowledge is being generated all over the globe. Several factors lead to this growing complexity, among them: Size (the sheer increase in the numbers of knowledge producers and users, and in their production / use capabilities); Pervasiveness (in space and time of knowledge, knowledge producers and users); Dynamicity (new and old knowledge items will appear and disappear virtually at any moment); and Unpredictability (the future dynamics of knowledge are unknown not only at design time but also at run time). The situation is made worse by the fact that the complexity of knowledge grows exponentially with the number of interconnected components. The traditional approach of knowledge management and engineering is top-down and centralised, and depends on fixing at design time what can be expressed and how. Global Data Management is playing a crucial role in the development of our networked distributed society. Its importance has been recognised in the IST programme since several years, in particular in its long-term research part, Future and Emerging Technologies (FET). Many of the papers included in this book refer to IST and FET projects currently running or recently completed.




Principles of Distributed Database Systems


Book Description

This third edition of a classic textbook can be used to teach at the senior undergraduate and graduate levels. The material concentrates on fundamental theories as well as techniques and algorithms. The advent of the Internet and the World Wide Web, and, more recently, the emergence of cloud computing and streaming data applications, has forced a renewal of interest in distributed and parallel data management, while, at the same time, requiring a rethinking of some of the traditional techniques. This book covers the breadth and depth of this re-emerging field. The coverage consists of two parts. The first part discusses the fundamental principles of distributed data management and includes distribution design, data integration, distributed query processing and optimization, distributed transaction management, and replication. The second part focuses on more advanced topics and includes discussion of parallel database systems, distributed object management, peer-to-peer data management, web data management, data stream systems, and cloud computing. New in this Edition: • New chapters, covering database replication, database integration, multidatabase query processing, peer-to-peer data management, and web data management. • Coverage of emerging topics such as data streams and cloud computing • Extensive revisions and updates based on years of class testing and feedback Ancillary teaching materials are available.




Peer-to-Peer Computing


Book Description

Peer-to-peer (P2P) technology, or peer computing, is a paradigm that is viewed as a potential technology for redesigning distributed architectures and, consequently, distributed processing. Yet the scale and dynamism that characterize P2P systems demand that we reexamine traditional distributed technologies. A paradigm shift that includes self-reorganization, adaptation and resilience is called for. On the other hand, the increased computational power of such networks opens up completely new applications, such as in digital content sharing, scientific computation, gaming, or collaborative work environments. In this book, Vu, Lupu and Ooi present the technical challenges offered by P2P systems, and the means that have been proposed to address them. They provide a thorough and comprehensive review of recent advances on routing and discovery methods; load balancing and replication techniques; security, accountability and anonymity, as well as trust and reputation schemes; programming models and P2P systems and projects. Besides surveying existing methods and systems, they also compare and evaluate some of the more promising schemes. The need for such a book is evident. It provides a single source for practitioners, researchers and students on the state of the art. For practitioners, this book explains best practice, guiding selection of appropriate techniques for each application. For researchers, this book provides a foundation for the development of new and more effective methods. For students, it is an overview of the wide range of advanced techniques for realizing effective P2P systems, and it can easily be used as a text for an advanced course on Peer-to-Peer Computing and Technologies, or as a companion text for courses on various subjects, such as distributed systems, and grid and cluster computing.




Semantic Web and Peer-to-Peer


Book Description

Just like the industrial society of the last century depended on natural resources, today’s society depends on information and its exchange. Staab and Stuckenschmidt structured the selected contributions into four parts: Part I, "Data Storage and Access", prepares the semantic foundation, i.e. data modelling and querying in a flexible and yet scalable manner. These foundations allow for dealing with the organization of information at the individual peers. Part II, "Querying the Network", considers the routing of queries, as well as continuous queries and personalized queries under the conditions of the permanently changing topological structure of a peer-to-peer network. Part III, "Semantic Integration", deals with the mapping of heterogeneous data representations. Finally Part IV, "Methodology and Systems", reports experiences from case studies and sample applications. The overall result is a state-of-the-art description of the potential of Semantic Web and peer-to-peer technologies for information sharing and knowledge management when applied jointly.




Principles and Practice of Semantic Web Reasoning


Book Description

This book constitutes the refereed proceedings of the Third International Workshop on Principles and Practice of Semantic Web Reasoning, PPSWR 2005, held in Dagstuhl Castle, Germany in September 2005. The 12 revised full papers presented together with 3 invited contributions were carefully reviewed and selected for inclusion in the book. The major aspects of semantic Web research are addressed in the papers, namely semantic Web architectures, language issues, and formal reasoning methods. The advances are investigated in the context of new design principles and challenging applications.




Database and Applications Security


Book Description

This is the first book to provide an in-depth coverage of all the developments, issues and challenges in secure databases and applications. It provides directions for data and application security, including securing emerging applications such as bioinformatics, stream information processing and peer-to-peer computing. Divided into eight sections,