
Data Analytics Using Spark and Hadoop / Maniyam, Sujee.

O'Reilly Online Learning: Academic/Public Library Edition Available online

Format:
Video
Author/Creator:
Maniyam, Sujee, author.
Language:
English
Subjects (All):
Big data.
Data mining.
Spark (Electronic resource : Apache Software Foundation).
Apache Hadoop.
Genre:
Electronic videos.
Physical Description:
1 online resource (1 video file, approximately 1 hr., 53 min.)
Edition:
1st edition
Publication:
Infinite Skills, 2016.
System Details:
video file
Summary:
Hadoop and Spark are the stars of the Big Data world. This course covers the basics of Spark and how to use Spark and Hadoop together for big data analytics. Designed for developers, architects, and data analysts with a fundamental understanding of Hadoop, it begins with an overview of how Hadoop and Spark are used in today's big data ecosystem before moving into hands-on labs that demonstrate Spark and Spark-Hadoop integration. You'll learn about the Spark shell, RDDs, and DataFrames; how to query data in Hadoop Hive tables from Spark; and how to develop Spark applications and run them on YARN.

Discover how to integrate the Hadoop and Spark big data analytics platforms. Get access to 11 hands-on labs demonstrating the core aspects of Hadoop-Spark integration. Learn the basics of the Spark framework: Spark shell, RDDs, and DataFrames. Explore methods for analyzing data in Hadoop HDFS and Hive using Spark. Gain an understanding of how to write Spark applications and run them on YARN.

Sujee Maniyam is the co-founder of Elephant Scale, a Big Data training company specializing in Hadoop, NoSQL, and data science. An open-source author/developer since 2000, Sujee ran the analytics company CoverCake for five years, founded the Santa Clara Big Data Guru Meet-Up, developed a Hadoop course for Intel, worked as a software engineer for IBM for six years, and is co-author of the O'Reilly title HBase Design Patterns. He earned a Bachelor of Science in Computer Engineering from the University of Melbourne and holds certifications in both Hadoop and Spark.
Participant:
Presenter, Sujee Maniyam.
Notes:
Online resource; Title from title screen (viewed October 17, 2016)
Title from title screen (viewed November 1, 2016).
Date of publication from resource description page.
OCLC:
961944626

