My Account Log in

3 options

Data smart : using data science to transform information into insight / John W. Foreman.

Ebook Central Academic Complete Available online

View online

Ebook Central College Complete Available online

View online

O'Reilly Online Learning: Academic/Public Library Edition Available online

View online
Format:
Book
Author/Creator:
Foreman, John W., author.
Language:
English
Subjects (All):
Data mining.
Web sites--Design.
Web sites.
Web usage mining.
Physical Description:
1 online resource (434 p.)
Edition:
1st ed.
Other Title:
Using data science to transform information into insight
Place of Publication:
Indianapolis : Wiley, [2014]
Language Note:
English
System Details:
text file
Summary:
Data Science gets thrown around in the press like it's magic. Major retailers are predicting everything from when their customers are pregnant to when they want a new pair of Chuck Taylors. It's a brave new world where seemingly meaningless data can be transformed into valuable insight to drive smart business decisions. But how does one exactly do data science? Do you have to hire one of these priests of the dark arts, the ""data scientist,"" to extract this gold from your data? Nope. Data science is little more than using straight-forward steps to process raw data into actionable insight.
Contents:
Cover; Title Page; Copyright; Contents; Chapter 1 Everything You Ever Needed to Know about Spreadsheets but Were Too Afraid to Ask; Some Sample Data; Moving Quickly with the Control Button; Copying Formulas and Data Quickly; Formatting Cells; Paste Special Values; Inserting Charts; Locating the Find and Replace Menus; Formulas for Locating and Pulling Values; Using VLOOKUP to Merge Data; Filtering and Sorting; Using PivotTables; Using Array Formulas; Solving Stuff with Solver; OpenSolver: I Wish We Didn't Need This, but We Do; Wrapping Up
Chapter 2 Cluster Analysis Part I: Using K-Means to Segment Your Customer Base Girls Dance with Girls, Boys Scratch Their Elbows; Getting Real: K-Means Clustering Subscribers in E-mail Marketing; Joey Bag O' Donuts Wholesale Wine Emporium; The Initial Dataset; Determining What to Measure; Start with Four Clusters; Euclidean Distance: Measuring Distances as the Crow Flies; Distances and Cluster Assignments for Everybody!; Solving for the Cluster Centers; Making Sense of the Results; Getting the Top Deals by Cluster; The Silhouette: A Good Way to Let Different K Values Duke It Out
How about Five Clusters? Solving for Five Clusters; Getting the Top Deals for All Five Clusters; Computing the Silhouette for 5-Means Clustering; K-Medians Clustering and Asymmetric Distance Measurements; Using K-Medians Clustering; Getting a More Appropriate Distance Metric; Putting It All in Excel; The Top Deals for the 5-Medians Clusters; Wrapping Up; Chapter 3 Naive Bayes and the Incredible Lightness of Being an Idiot; When You Name a Product Mandrill, You're Going to Get Some Signal and Some Noise; The World's Fastest Intro to Probability Theory; Totaling Conditional Probabilities
Joint Probability, the Chain Rule, and Independence What Happens in a Dependent Situation?; Bayes Rule; Using Bayes Rule to Create an AI Model; High-Level Class Probabilities Are Often Assumed to Be Equal; A Couple More Odds and Ends; Let's Get This Excel Party Started; Removing Extraneous Punctuation; Splitting on Spaces; Counting Tokens and Calculating Probabilities; And We Have a Model! Let's Use It; Wrapping Up; Chapter 4 Optimization Modeling: Because That "Fresh Squeezed" Orange Juice Ain't Gonna Blend Itself; Why Should Data Scientists Know Optimization?
Starting with a Simple Trade-Off Representing the Problem as a Polytope; Solving by Sliding the Level Set; The Simplex Method: Rooting around the Corners; Working in Excel; There's a Monster at the End of This Chapter; Fresh from the Grove to Your Glass...with a Pit Stop Through a Blending Model; You Use a Blending Model; Let's Start with Some Specs; Coming Back to Consistency; Putting the Data into Excel; Setting Up the Problem in Solver; Lowering Your Standards; Dead Squirrel Removal: The Minimax Formulation; If-Then and the "Big M" Constraint
Multiplying Variables: Cranking Up the Volume to 11
Notes:
Description based upon print version of record.
Includes bibliographical references and index.
Description based on online resource; title from PDF title page (ebrary, viewed December 9, 2013).
ISBN:
1-118-83986-2
1-118-66148-6
OCLC:
862611724

The Penn Libraries is committed to describing library materials using current, accurate, and responsible language. If you discover outdated or inaccurate language, please fill out this feedback form to report it and suggest alternative language.

My Account

Shelf Request an item Bookmarks Fines and fees Settings

Guides

Using the Library Catalog Using Articles+ Library Account