1 option

Learning Spark / Holden Karau, Andy Konwinski, Patrick Wendell, and Matei Zaharia.

QA76.9.D343 K363 2015

Loading location information...

Available This item is available for access.

Format:: Book
Author/Creator:: Karau, Holden, author.; Konwinski, Andy, author.; Wendell, Patrick, author.; Zaharia, Matei, author.
Contributor:: Class of 1932 Fund.
Language:: English
Subjects (All):: Spark (Electronic resource : Apache Software Foundation).; Big data.; Data mining--Computer programs.; Data mining.; Computer programs.
Physical Description:: xvi, 256 pages : illustrations ; 24 cm
Edition:: First edition.
Place of Publication:: Beijing ; Sebastopol : O'Reilly, [2015]
Summary:: This book introduces Apache Spark, the open source cluster computing system that makes data analytics fast to write and fast to run. You'll learn how to express parallel jobs with just a few lines of code, and cover applications from simple batch jobs to stream processing and machine learning.-- Source other than Library of Congress.
Contents:: Introduction to data analysis with Spark; Downloading Spark and getting started; Programming with RDDs; Working with key/value pairs; Loading and saving your data; Advanced Spark programming; Running on a cluster; Tuning and debugging Spark; Spark SQL; Spark streaming; Machine learning with MLlib.
Notes:: Subtitle on cover: Lightning-fast data analysis.; Includes index.
Local Notes:: Acquired for the Penn Libraries with assistance from the Class of 1932 Fund.
ISBN:: 1449358624; 9781449358624
OCLC:: 844872440
Publisher Number:: 99964949884

The Penn Libraries is committed to describing library materials using current, accurate, and responsible language. If you discover outdated or inaccurate language, please fill out this feedback form to report it and suggest alternative language.

1 option

Learning Spark / Holden Karau, Andy Konwinski, Patrick Wendell, and Matei Zaharia.

Find

My Account

Guides