My Account Log in

3 options

R for data science : learn and explore the fundamentals of data science with R / Dan Toomey.

EBSCOhost Academic eBook Collection (North America) Available online

View online

Ebook Central Academic Complete Available online

View online

Ebook Central College Complete Available online

View online
Format:
Book
Author/Creator:
Toomey, Dan, author.
Series:
Community experience distilled.
Community Experience Distilled
Language:
English
Subjects (All):
R (Computer program language).
Mathematical statistics--Data processing.
Mathematical statistics.
Physical Description:
1 online resource (364 p.)
Edition:
1st ed.
Place of Publication:
Birmingham, England : Packt Publishing, 2014.
Language Note:
English
Summary:
If you are a data analyst who has a firm grip on some advanced data analysis techniques and wants to learn how to leverage the features of R, this is the book for you. You should have some basic knowledge of the R language and should know about some data science topics.
Contents:
Cover; Copyright; Credits; About the Author; About the Reviewers; www.PacktPub.com; Table of Contents; Preface; Chapter 1: Data Mining Patterns; Cluster analysis; K-means clustering; Usage; Example; K-medoids clustering; Usage; Example; Hierarchical clustering; Usage; Example; Expectation-maximization; Usage; List of model names; Example; Density estimation; Usage; Example; Anomaly detection; Show outliers; Example; Example; Another anomaly detection example; Calculating anomalies; Usage; Example 1; Example 2; Association rules; Mine for associations; Usage; Example; Questions; Summary
Chapter 2: Data Mining SequencesPatterns; Eclat; Usage; Using eclat to find similarities in adult behavior; Finding frequent items in a dataset; An example focusing on highest frequency; arulesNBMiner; Usage; Mining the Agrawal data for frequent sets; Apriori; Usage; Evaluating associations in a shopping basket; Determining sequences using TraMineR; Usage; Determining sequences in training and careers; Similarities in the sequence; Sequence metrics; Usage; Example; Questions; Summary; Chapter 3: Text Mining; Packages; Text processing; Example; Creating a corpus; Text clusters; Word graphics
Analyzing the XML textQuestions; Summary; Chapter 4: Data Analysis - Regression Analysis; Packages; Simple regression; Multiple regression; Multivariate regression analysis; Robust regression; Questions; Summary; Chapter 5: Data Analysis - Correlation; Packages; Correlation; Example; Visualizing correlations; Covariance; Pearson correlation; Polychoric correlation; Tetrachoric correlation; A heterogeneous correlation matrix; Partial correlation; Questions; Summary; Chapter 6: Data Analysis - Clustering; Packages; K-means clustering; Example; Optimal number of clusters; Medoids clusters
The cascadeKM functionSelecting clusters based on Bayesian information; Affinity propagation clustering; Gap statistic to estimate the number of clusters; Hierarchical clustering; Questions; Summary; Chapter 7: Data Visualization - R Graphics; Packages; Interactive graphics; The latticist package; Bivariate binning display; Mapping; Plotting points on a map; Plotting points on a world map; Google Maps; The ggplot2 package; Questions; Summary; Chapter 8: Data Visualization - Plotting; Packages; Scatter plots; Regression line; A lowess line; scatterplot; Scatterplot matrices
splom - display matrix datacpairs - plot matrix data; Density scatter plots; Bar charts and plots; Bar plot; Usage; Bar chart; ggplot2; Word cloud; Questions; Summary; Chapter 9: Data Visualization - 3D; Packages; Generating 3D graphics; Lattice Cloud - 3D scatterplot; scatterplot3d; scatter3d; cloud3d; RgoogleMaps; vrmlgenbar3D; Big Data; pbdR; bigmemory; Research areas; Rcpp; parallel; microbenchmark; pqR; SAP integration; roxygen2; bioconductor; swirl; pipes; Questions; Summary; Chapter 10: Machine Learning in Action; Packages; Dataset; Data partitioning; Model; Linear model; Prediction
Logistic regression
Notes:
Includes index.
Description based on online resource; title from PDF title page (ebrary, viewed January 15, 2015).
ISBN:
9781784392659
1784392650
OCLC:
900882999

The Penn Libraries is committed to describing library materials using current, accurate, and responsible language. If you discover outdated or inaccurate language, please fill out this feedback form to report it and suggest alternative language.

My Account

Shelf Request an item Bookmarks Fines and fees Settings

Guides

Using the Library Catalog Using Articles+ Library Account