My Account Log in

2 options

Machine learning with R cookbook : analyze data and build predictive models / AshishSingh Bhatia, Yu-Wei Chiu.

Ebook Central College Complete Available online

View online

O'Reilly Online Learning: Academic/Public Library Edition Available online

View online
Format:
Book
Author/Creator:
Bhatia, AshishSingh, author.
Chiu, Yu-Wei, author.
Language:
English
Subjects (All):
R (Computer program language).
Mathematical statistics--Data processing.
Mathematical statistics.
Physical Description:
1 online resource (1 volume) : illustrations
Edition:
Second edition.
Other Title:
Analyze data and build predictive models
Place of Publication:
Birmingham, England ; Mumbai, [India] : Packt, 2017.
System Details:
text file
Summary:
Explore over 110 recipes to analyze data and build predictive models with simple and easy-to-use R code About This Book Apply R to simplify predictive modeling with short and simple code Use machine learning to solve problems ranging from small to big data Build a training and testing dataset, applying different classification methods. Who This Book Is For This book is for data science professionals, data analysts, or people who have used R for data analysis and machine learning who now wish to become the go-to person for machine learning with R. Those who wish to improve the efficiency of their machine learning models and need to work with different kinds of data set will find this book very insightful. What You Will Learn Create and inspect transaction datasets and perform association analysis with the Apriori algorithm Visualize patterns and associations using a range of graphs and find frequent item-sets using the Eclat algorithm Compare differences between each regression method to discover how they solve problems Detect and impute missing values in air quality data Predict possible churn users with the classification approach Plot the autocorrelation function with time series analysis Use the Cox proportional hazards model for survival analysis Implement the clustering method to segment customer data Compress images with the dimension reduction method Incorporate R and Hadoop to solve machine learning problems on big data In Detail Big data has become a popular buzzword across many industries. An increasing number of people have been exposed to the term and are looking at how to leverage big data in their own businesses, to improve sales and profitability. However, collecting, aggregating, and visualizing data is just one part of the equation. Being able to extract useful information from data is another task, and a much more challenging one. Machine Learning with R Cookbook, Second Edition uses a practical approach to teach you how to perform machine learning with R. Each chapter is divided into several simple recipes. Through the step-by-step instructions provided in each recipe, you will be able to construct a predictive model by using a variety of machine learning packages. In this book, you will first learn to set up the R environment and use simple R commands to explore data. The next topic covers how to perform statistical analysis with machine learning analysis and assess created models, covered in detail later on in the book. Yo...
Contents:
Cover
Copyright
Credits
About the Authors
About the Reviewers
www.PacktPub.com
Customer Feedback
Table of Contents
Preface
Chapter 1: Practical Machine Learning with R
Introduction
Downloading and installing R
Getting ready
How to do it...
How it works...
See also
Downloading and installing RStudio
Installing and loading packages
Understanding of basic data structures
Data types
Data structures
Vectors
Lists
Array
Matrix
DataFrame
Basic commands for subsetting
Data input
Reading and writing data
There's more...
Manipulating data
Applying basic statistics
Visualizing data
Getting a dataset for machine learning
Chapter 2: Data Exploration with Air Quality Datasets
Using air quality dataset
Converting attributes to factor
Detecting missing values
Imputing missing values
Exploring and visualizing data
Getting ready.
How to do it...
Predicting values from datasets
Chapter 3: Analyzing Time Series Data
Looking at time series data
Plotting and forecasting time series data
Extracting, subsetting, merging, filling, and padding
Successive differences and moving averages
Exponential smoothing
Plotting the autocorrelation function
Chapter 4: R and Statistics
Understanding data sampling in R
Operating a probability distribution in R
Working with univariate descriptive statistics in R
Performing correlations and multivariate analysis
Conducting an exact binomial test
Performing a student's t-test
Performing the Kolmogorov-Smirnov test
Understanding the Wilcoxon Rank Sum and Signed Rank test
Working with Pearson's Chi-squared test
How it works.
There's more...
Conducting a one-way ANOVA
Performing a two-way ANOVA
Chapter 5: Understanding Regression Analysis
Different types of regression
Fitting a linear regression model with lm
Summarizing linear model fits
Using linear regression to predict unknown values
Generating a diagnostic plot of a fitted model
Fitting multiple regression
Summarizing multiple regression
Using multiple regression to predict unknown values
Fitting a polynomial regression model with lm
Fitting a robust linear regression model with rlm
Studying a case of linear regression on SLID data
Applying the Gaussian model for generalized linear regression
Applying the Poisson model for generalized linear regression
Applying the Binomial model for generalized linear regression
See also.
Fitting a generalized additive model to data
Visualizing a generalized additive model
Diagnosing a generalized additive model
Chapter 6: Survival Analysis
Loading and observing data
Viewing the summary of survival analysis
Visualizing the Survival Curve
Using the log-rank test
Using the COX proportional hazard model
Nelson-Aalen Estimator of cumulative hazard
Chapter 7: Classification 1 - Tree, Lazy, and Probabilistic
Preparing the training and testing datasets
Building a classification model with recursive partitioning trees
Visualizing a recursive partitioning tree
Measuring the prediction performance of a recursive partitioning tree
Pruning a recursive partitioning tree
Handling missing data and split and surrogate variables
Building a classification model with a conditional inference tree.
Getting ready
Control parameters in conditional inference trees
Visualizing a conditional inference tree
Measuring the prediction performance of a conditional inference tree
Classifying data with the k-nearest neighbor classifier
Classifying data with logistic regression
Classifying data with the Naïve Bayes classifier
Chapter 8: Classification 2 - Neural Network and SVM
Classifying data with a support vector machine
Choosing the cost of a support vector machine
Visualizing an SVM fit
Predicting labels based on a model trained by a support vector machine
Tuning a support vector machine
The basics of neural network
Training a neural network with neuralnet
Visualizing a neural network trained by neuralnet
Predicting labels based on a model trained by neuralnet
Training a neural network with nnet.
Notes:
Includes index.
Description based on online resource; title from PDF title page (EBC, viewed December 2, 2017).
OCLC:
1011524936

The Penn Libraries is committed to describing library materials using current, accurate, and responsible language. If you discover outdated or inaccurate language, please fill out this feedback form to report it and suggest alternative language.

My Account

Shelf Request an item Bookmarks Fines and fees Settings

Guides

Using the Library Catalog Using Articles+ Library Account