Performing Advanced Analytics on Relational Data with Spark SQL / Armbrust, Michael.

O'Reilly Online Learning: Academic/Public Library Edition Available online

Format:
Video
Author/Creator:
Armbrust, Michael, author.
Language:
English
Subjects (All):
SQL (Computer program language).
Web usage mining.
Database management.
Databases--Design.
Databases.
Genre:
Electronic videos.
Physical Description:
1 online resource (1 video file, approximately 41 min.)
Edition:
1st edition
Place of Publication:
O'Reilly Media, Inc., 2014.
System Details:
video file
Summary:
In this event, we'll examine Spark SQL, a new alpha component that is part of the Apache Spark 1.0 release. Spark SQL lets developers natively query data stored in both existing RDDs and external sources such as Apache Hive. A key feature of Spark SQL is the ability to blur the lines between relational tables and RDDs, making it easy for developers to intermix SQL commands that query external data with complex analytics. In addition to Spark SQL, we'll explore the Catalyst optimizer framework, which allows Spark SQL to automatically rewrite query plans to execute more efficiently.
Participant:
Presenter, Michael Armbrust.
Notes:
Title from title screen (viewed Aug. 4, 2014).
Online resource; Title from title screen (viewed July 1, 2014)
OCLC:
885819473
