Learning Spark / Holden Karau, Andy Konwinski, Patrick Wendell, and Matei Zaharia.
Van Pelt Library QA76.9.D343 K363 2015
Available
- Format: Book
- Author/Creator:
  - Karau, Holden, author.
  - Konwinski, Andy, author.
  - Wendell, Patrick, author.
  - Zaharia, Matei, author.
- Language: English
- Subjects (All):
  - Spark (Electronic resource : Apache Software Foundation).
  - Big data.
  - Data mining--Computer programs.
  - Data mining.
  - Computer programs.
- Physical Description: xvi, 256 pages : illustrations ; 24 cm
- Edition: First edition.
- Place of Publication: Beijing ; Sebastopol : O'Reilly, [2015]
- Summary: This book introduces Apache Spark, the open source cluster computing system that makes data analytics fast to write and fast to run. You'll learn how to express parallel jobs with just a few lines of code, and cover applications from simple batch jobs to stream processing and machine learning.-- Source other than Library of Congress.
- Contents:
  - Introduction to data analysis with Spark
  - Downloading Spark and getting started
  - Programming with RDDs
  - Working with key/value pairs
  - Loading and saving your data
  - Advanced Spark programming
  - Running on a cluster
  - Tuning and debugging Spark
  - Spark SQL
  - Spark streaming
  - Machine learning with MLlib.
- Notes:
  - Subtitle on cover: Lightning-fast data analysis.
  - Includes index.
- Local Notes: Acquired for the Penn Libraries with assistance from the Class of 1932 Fund.
- ISBN:
  - 1449358624
  - 9781449358624
- OCLC: 844872440
- Publisher Number: 99964949884