Clojure data analysis cookbook : dive into data analysis with Clojure through over 100 practical recipes for every stage of the analysis and collection process / Eric Rochester.
- Format:
- Book
- Author/Creator:
- Rochester, Eric, author.
- Language:
- English
- Subjects (All):
- JavaScript (Computer program language)--Testing.
- JavaScript (Computer program language).
- JavaScript (Computer program language)--Programmed instruction.
- Physical Description:
- 1 online resource (372 p.)
- Edition:
- 2nd ed.
- Place of Publication:
- Birmingham, England : Packt Publishing Ltd, 2015.
- Language Note:
- English
- Summary:
- This book is for those with a basic knowledge of Clojure, who are looking to push the language to excel with data analysis.
- Contents:
- Cover; Copyright; Credits; About the Author; About the Reviewers; www.PacktPub.com; Table of Contents; Preface; Chapter 1: Importing Data for Analysis; Introduction; Creating a new project; Reading CSV data into Incanter datasets; Reading JSON data into Incanter datasets; Reading data from Excel with Incanter; Reading data from JDBC databases; Reading XML data into Incanter datasets; Scraping data from tables in web pages; Scraping textual data from web pages; Reading RDF data; Querying RDF data with SPARQL; Aggregating data from different formats; Chapter 2: Cleaning and Validating Data
- Introduction; Cleaning data with regular expressions; Maintaining consistency with synonym maps; Identifying and removing duplicate data; Regularizing numbers; Calculating relative values; Parsing dates and times; Lazily processing very large data sets; Sampling from very large data sets; Fixing spelling errors; Parsing custom data formats; Validating data with Valip; Chapter 3: Managing Complexity with Concurrent Programming; Introduction; Managing program complexity with STM; Managing program complexity with agents; Getting better performance with commute; Combining agents and STM
- Maintaining consistency with ensure; Introducing safe side effects into the STM; Maintaining data consistency with validators; Monitoring processing with watchers; Debugging concurrent programs with watchers; Recovering from errors in agents; Managing large inputs with sized queues; Chapter 4: Improving Performance with Parallel Programming; Introduction; Parallelizing processing with pmap; Parallelizing processing with Incanter; Partitioning Monte Carlo simulations for better pmap performance; Finding the optimal partition size with simulated annealing; Combining function calls with reducers
- Parallelizing with reducers; Generating online summary statistics for data streams with reducers; Using type hints; Benchmarking with Criterium; Chapter 5: Distributed Data Processing with Cascalog; Introduction; Initializing Cascalog and Hadoop for distributed processing; Querying data with Cascalog; Distributing data with Apache HDFS; Parsing CSV files with Cascalog; Executing complex queries with Cascalog; Aggregating data with Cascalog; Defining new Cascalog operators; Composing Cascalog queries; Transforming data with Cascalog; Chapter 6: Working with Incanter Datasets; Introduction
- Loading Incanter's sample datasets; Loading Clojure data structures into datasets; Viewing datasets interactively with view; Converting datasets to matrices; Using infix formulas in Incanter; Selecting columns with $; Selecting rows with $; Filtering datasets with where; Grouping data with group-by; Saving datasets to CSV and JSON; Projecting from multiple datasets with join; Chapter 7: Statistical Data Analysis with Incanter; Introduction; Generating summary statistics with rollup; Working with changes in values; Scaling variables to simplify variable relationships
- Working with time series data with Incanter Zoo
- Notes:
- "Quick answer to common problems"--Cover.
- Includes index.
- Description based on online resource; title from PDF title page (ebrary, viewed February 12, 2015).
- ISBN:
- 9781784399955
- 1784399957