My Account Log in

1 option

A Guide to Improving Data Integrity and Adoption / Roper, Jessica.

O'Reilly Online Learning: Academic/Public Library Edition Available online

View online
Format:
Book
Author/Creator:
Roper, Jessica, author.
Language:
English
Subjects (All):
Electronic data processing--Management.
Electronic data processing.
Information storage and retrieval systems--Management.
Information storage and retrieval systems.
Information technology--Management.
Information technology.
Data mining.
Physical Description:
1 online resource (38 pages)
Edition:
1st edition
Place of Publication:
O'Reilly Media, Inc., 2016.
System Details:
text file
Summary:
For most companies, quality data is key to measuring success and planning for business goals. But achieving data accuracy and integrity can be a daunting task given the messy nature of data in the wild. How can you trust that source data is accurate? What data should be excluded as invalid? What steps can you take to ensure that all the data is transformed correctly? How do you know if your conclusions are accurate? This report presents a case study from a large and critical data project at Spiceworks, the vibrant network, online community, and marketplace for IT professionals. Author Jessica Roper, a senior developer in Spiceworks’ data analytics division, demonstrates ways to think about data verification, processing, analysis, and automation. You’ll also get a guide to tools for determining whether the data you collect and use is reliable and accurate. Understand what’s involved in vetting data for trustworthiness Learn strategies and test cases for verifying raw data sources and working with transformations Become familiar with the data at each layer and create tests between each transformation to ensure consistency Understand which edge cases to look for, and what trends and outliers to expect Depend on data monitors to identify anomalies and system issues Automate process and acceptance tests to monitor and ensure reliability Work with other teams and groups to improve and validate data accuracy Increase adoption by using data to measure success
Notes:
Online resource; Title from title page (viewed December 15, 2016)
Includes bibliographical references.
ISBN:
9781491981573
1491981571
OCLC:
1039099707

The Penn Libraries is committed to describing library materials using current, accurate, and responsible language. If you discover outdated or inaccurate language, please fill out this feedback form to report it and suggest alternative language.

My Account

Shelf Request an item Bookmarks Fines and fees Settings

Guides

Using the Library Catalog Using Articles+ Library Account