My Account Log in

1 option

Building Better Distributed Data Pipelines / McFadin, Patrick.

O'Reilly Online Learning: Academic/Public Library Edition Available online

View online
Format:
Video
Author/Creator:
McFadin, Patrick, author.
Language:
English
Subjects (All):
Data mining.
Electronic data processing.
Genre:
Electronic videos.
Physical Description:
1 online resource (1 video file, approximately 54 min.)
Edition:
1st edition
Place of Publication:
O'Reilly Media, Inc., 2017.
System Details:
video file
Summary:
Patrick McFadin explains the basics of how to build more efficient data pipelines, using Apache Kafka to organize, Apache Cassandra to store, and Apache Spark to analyze. Patrick offers an overview of how Cassandra works and why it can be a perfect fit for data-driven projects. Patrick then demonstrates that with the addition of Spark and Kafka, you can maintain a highly distributed, fault-tolerant, and scaling solution. You’ll leave with a comprehensive view of the many options to make considered choices in your data pipeline projects.
Participant:
Presenter, Patrick McFadin.
Notes:
Online resource; Title from title screen (viewed November 16, 2017)
Title from resource description page (viewed December 19, 2017).
OCLC:
1017738643

The Penn Libraries is committed to describing library materials using current, accurate, and responsible language. If you discover outdated or inaccurate language, please fill out this feedback form to report it and suggest alternative language.

Find

Home Release notes

My Account

Shelf Request an item Bookmarks Fines and fees Settings

Guides

Using the Find catalog Using Articles+ Using your account