My Account Log in

1 option

Learning Spark / Holden Karau, Andy Konwinski, Patrick Wendell, and Matei Zaharia.

Van Pelt Library QA76.9.D343 K363 2015
Loading location information...

Available This item is available for access.

Log in to request item
Format:
Book
Author/Creator:
Karau, Holden, author.
Konwinski, Andy, author.
Wendell, Patrick, author.
Zaharia, Matei, author.
Contributor:
Class of 1932 Fund.
Language:
English
Subjects (All):
Spark (Electronic resource : Apache Software Foundation).
Big data.
Data mining--Computer programs.
Data mining.
Computer programs.
Physical Description:
xvi, 256 pages : illustrations ; 24 cm
Edition:
First edition.
Place of Publication:
Beijing ; Sebastopol : O'Reilly, [2015]
Summary:
This book introduces Apache Spark, the open source cluster computing system that makes data analytics fast to write and fast to run. You'll learn how to express parallel jobs with just a few lines of code, and cover applications from simple batch jobs to stream processing and machine learning.-- Source other than Library of Congress.
Contents:
Introduction to data analysis with Spark
Downloading Spark and getting started
Programming with RDDs
Working with key/value pairs
Loading and saving your data
Advanced Spark programming
Running on a cluster
Tuning and debugging Spark
Spark SQL
Spark streaming
Machine learning with MLlib.
Notes:
Subtitle on cover: Lightning-fast data analysis.
Includes index.
Local Notes:
Acquired for the Penn Libraries with assistance from the Class of 1932 Fund.
ISBN:
1449358624
9781449358624
OCLC:
844872440
Publisher Number:
99964949884

The Penn Libraries is committed to describing library materials using current, accurate, and responsible language. If you discover outdated or inaccurate language, please fill out this feedback form to report it and suggest alternative language.

Find

Home Release notes

My Account

Shelf Request an item Bookmarks Fines and fees Settings

Guides

Using the Find catalog Using Articles+ Using your account