1 option
Principles of data wrangling : practical techniques for data preparation / Tye Rattenbury [and four others].
- Format:
- Book
- Author/Creator:
- Rattenbury, Ty, author.
- Rattenbury, Tye, author.
- Language:
- English
- Subjects (All):
- Data mining.
- Electronic data processing--Data preparation.
- Electronic data processing.
- Physical Description:
- 1 online resource (84 pages) : illustrations, tables
- Edition:
- First edtion.
- Other Title:
- Practical techniques for data preparation
- Place of Publication:
- Beijing, [China] : O'Reilly, 2017.
- System Details:
- text file
- Summary:
- A key task that any aspiring data-driven organization needs to learn is data wrangling, the process of converting raw data into something truly useful. This practical guide provides business analysts with an overview of various data wrangling techniques and tools, and puts the practice of data wrangling into context by asking, "What are you trying to do and why?" Wrangling data consumes roughly 50-80% of an analyst’s time before any kind of analysis is possible. Written by key executives at Trifacta, this book walks you through the wrangling process by exploring several factors—time, granularity, scope, and structure—that you need to consider as you begin to work with data. You’ll learn a shared language and a comprehensive understanding of data wrangling, with an emphasis on recent agile analytic processes used by many of today’s data-driven organizations. Appreciate the importance—and the satisfaction—of wrangling data the right way. Understand what kind of data is available Choose which data to use and at what level of detail Meaningfully combine multiple sources of data Decide how to distill the results to a size and shape that can drive downstream analysis
- Contents:
- A data workflow framework
- The dynamics of data wrangling
- Profiling
- Transformation : structuring
- Transformation : enriching
- Using transformation to clean data
- Roles and responsibilities
- Data wrangling tools.
- Notes:
- Description based on online resource; title from PDF title page (ebrary, viewed July 26, 2017).
- ISBN:
- 9781491938874
- 1491938870
- 9781491938911
- 1491938919
- 9781491938898
- 1491938897
- OCLC:
- 993879257
The Penn Libraries is committed to describing library materials using current, accurate, and responsible language. If you discover outdated or inaccurate language, please fill out this feedback form to report it and suggest alternative language.