Large-Scale Graph Processing Using Apache Giraph / by Sherif Sakr, Faisal Moeen Orakzai, Ibrahim Abdelaziz, Zuhair Khayyat.
- Format:
- Book
- Author/Creator:
- Sakr, Sherif, 1979- author.
- Orakzai, Faisal Moeen, author.
- Abdelaziz, Ibrahim, author.
- Khayyat, Zuhair, author.
- Series:
- Computer Science (Springer-11645)
- Language:
- English
- Subjects (All):
- Database management.
- Big data.
- Data structures (Computer science).
- Database Management.
- Big Data/Analytics.
- Data Structures.
- Local Subjects:
- Database Management.
- Big Data/Analytics.
- Data Structures.
- Physical Description:
- 1 online resource (XXV, 197 pages) : 102 illustrations, 87 illustrations in color
- Edition:
- First edition 2016.
- Contained In:
- Springer eBooks
- Place of Publication:
- Cham : Springer International Publishing : Imprint: Springer, 2016.
- System Details:
- text file PDF
- Summary:
- This book takes its reader on a journey through Apache Giraph, a popular distributed graph processing platform designed to bring the power of big data processing to graph data. Designed as a step-by-step self-study guide for everyone interested in large-scale graph processing, it describes the fundamental abstractions of the system, its programming models and various techniques for using the system to process graph data at scale, including the implementation of several popular and advanced graph analytics algorithms. The book is organized as follows: Chapter 1 provides general background on the big data phenomenon and a general introduction to the Apache Giraph system, its abstraction, programming model and design architecture. Chapter 2 focuses on Giraph as a platform and how to use it. Using a sample job, it also explains more advanced topics such as monitoring the Giraph application lifecycle and different methods for monitoring Giraph jobs. Chapter 3 then provides an introduction to Giraph programming, introduces the basic Giraph graph model and explains how to write Giraph programs. Chapter 4 discusses in detail the implementation of some popular graph algorithms, including PageRank, connected components, shortest paths and triangle closing. Chapter 5 focuses on advanced Giraph programming, discussing common Giraph algorithmic optimizations, tunable Giraph configurations that determine the system's utilization of the underlying resources, and how to write custom graph input and output formats. Lastly, Chapter 6 highlights two other systems that have been introduced to tackle the challenge of large-scale graph processing, GraphX and GraphLab, and explains the main commonalities and differences between these systems and Apache Giraph. This book serves as an essential reference guide for students, researchers and practitioners in the domain of large-scale graph processing.
It offers step-by-step guidance, with several code examples and the complete source code available in the related GitHub repository. Students will find a comprehensive introduction to, and hands-on practice with, tackling large-scale graph processing problems using the Apache Giraph system, while researchers will discover thorough coverage of the emerging and ongoing advancements in big graph processing systems.
- Contents:
- 1. Introduction
- 2. Getting started with Giraph
- 3. Giraph-In-Action: Implementing Popular Graph Algorithms using Giraph
- 4. Giraph Programming Optimizations: Tips and Tricks
- 5. Similar Systems to Giraph
- 6. Conclusions.
- Other Format:
- Printed edition:
- ISBN:
- 978-3-319-47431-1
- 9783319474311
- Access Restriction:
- Restricted for use by site license.