My Account Log in

1 option

Applied Data Science Using PySpark : Learn the End-to-End Predictive Model-Building Cycle / by Ramcharan Kakarla, Sundar Krishnan, Balaji Dhamodharan, Venkata Gunnu.

O'Reilly Online Learning: Academic/Public Library Edition Available online

View online
Format:
Book
Author/Creator:
Kakarla, Ramcharan.
Contributor:
Krishnan, Sundar.
Dhamodharan, Balaji.
Gunnu, Venkata.
Series:
Professional and Applied Computing Series
Language:
English
Subjects (All):
Big data.
Machine learning.
Python (Computer program language).
Parallel processing (Electronic computers).
Physical Description:
1 online resource (462 pages)
Edition:
2nd ed. 2024.
Place of Publication:
Berkeley, CA : Apress : Imprint: Apress, 2024.
Summary:
This comprehensive guide with hand-picked examples of daily use cases will walk you through the end-to-end predictive model-building cycle with the latest techniques and tricks of the trade. In Chapters 1, 2 & 3, we will get started with setting up the environment, and the basics of PySpark focusing on data manipulations. In Chapter 4, we will dive into the art of Variable Selection where we demonstrate various selection techniques available in PySpark. In Chapters 5, 6 & 7, we take you on the journey of machine learning algorithms, implementations and fine-tuning techniques. Chapters 8 and 9 will walk you through machine learning pipelines, and various methods available to operationalize the model and serve it through docker/API. Chapter 10 will demonstrate how you can unlock the power of predictive models when used in coherence to create a meaningful impact on your business. Chapter 11 will introduce you to some of the most used and powerful modelling frameworks to unlock real value from data. In this new edition, you will learn predictive modelling frameworks that can quantify customer lifetime values and estimate the return of your predictive modelling investments. This edition also contains methods to measure engagement and identify actionable populations for churn treatments effectively. In addition, a dedicated chapter for experimentation design including steps to efficiently design, conduct, test and measure the results of your models is added. All the codes will be refreshed as needed to reflect the latest stable version of Spark. You will: Learn the overview of end to end predictive model building Understand Multiple variable selection techniques & implementations Work with Operationalizing models Perform Data science experimentations & tips.
Contents:
Chapter 1: Setting up the Pyspark Environment
Chapter 2: PySpark Basics
Chapter 3: Variable Selection
Chapter 4: Variable Selection
Chapter 5: Supervised Learning Algorithms
Chapter 6: Model Evaluation
Chapter 7: Unsupervised Learning and Recommendation Algorithms
Chapter 8: Machine Learning Flow and Automated Pipelines
Chapter 9: Deploying machine learning models
Chapter 10: Experimentation
Chapter 11: Modeling Frameworks.
Notes:
Includes index.
Description based on publisher supplied metadata and other sources.
ISBN:
9798868808203
OCLC:
1478222190

The Penn Libraries is committed to describing library materials using current, accurate, and responsible language. If you discover outdated or inaccurate language, please fill out this feedback form to report it and suggest alternative language.

Find

Home Release notes

My Account

Shelf Request an item Bookmarks Fines and fees Settings

Guides

Using the Find catalog Using Articles+ Using your account