Automating data quality monitoring : scaling beyond rules with machine learning

Book Cover
Average Rating
Published
Sebastopol, CA : O'Reilly Media, Inc., 2024.
Status
Available Online

Description

Loading Description...

More Details

Format
Edition
First edition.
Language
English
ISBN
1098145909, 9781098145903

Notes

Description
The world's businesses ingest a combined 2.5 quintillion bytes of data every day. But how much of this vast amount of data--used to build products, power AI systems, and drive business decisions--is poor quality or just plain bad? This practical book shows you how to ensure that the data your organization relies on contains only high-quality records. Most data engineers, data analysts, and data scientists genuinely care about data quality, but they often don't have the time, resources, or understanding to create a data quality monitoring solution that succeeds at scale. In this book, Jeremy Stanley and Paige Schwartz from Anomalo explain how you can use automated data quality monitoring to cover all your tables efficiently, proactively alert on every category of issue, and resolve problems immediately. This book will help you: Learn why data quality is a business imperative Understand and assess unsupervised learning models for detecting data issues Implement notifications that reduce alert fatigue and let you triage and resolve issues quickly Integrate automated data quality monitoring with data catalogs, orchestration layers, and BI and ML systems Understand the limits of automated data quality monitoring and how to overcome them Learn how to deploy and manage your monitoring solution at scale Maintain automated data quality monitoring for the long term.
Local note
O'Reilly O'Reilly Online Learning: Academic/Public Library Edition

Discover More

Also in this Series

Checking series information...

More Like This

Loading more titles like this title...

Reviews from GoodReads

Loading GoodReads Reviews.

Citations

APA Citation, 7th Edition (style guide)

Stanley, J., & Schwartz, P. (2024). Automating data quality monitoring: scaling beyond rules with machine learning (First edition.). O'Reilly Media, Inc..

Chicago / Turabian - Author Date Citation, 17th Edition (style guide)

Stanley, Jeremy and Paige, Schwartz. 2024. Automating Data Quality Monitoring: Scaling Beyond Rules With Machine Learning. Sebastopol, CA: O'Reilly Media, Inc.

Chicago / Turabian - Humanities (Notes and Bibliography) Citation, 17th Edition (style guide)

Stanley, Jeremy and Paige, Schwartz. Automating Data Quality Monitoring: Scaling Beyond Rules With Machine Learning Sebastopol, CA: O'Reilly Media, Inc, 2024.

Harvard Citation (style guide)

Stanley, J. and Schwartz, P. (2024). Automating data quality monitoring: scaling beyond rules with machine learning. First edn. Sebastopol, CA: O'Reilly Media, Inc.

MLA Citation, 9th Edition (style guide)

Stanley, Jeremy,, and Paige Schwartz. Automating Data Quality Monitoring: Scaling Beyond Rules With Machine Learning First edition., O'Reilly Media, Inc., 2024.

Note! Citations contain only title, author, edition, publisher, and year published. Citations should be used as a guideline and should be double checked for accuracy. Citation formats are based on standards as of August 2021.

Staff View

Grouped Work ID
6e1236aa-0782-2dda-7a4b-71a15c477866-eng
Go To Grouped Work View in Staff Client

Grouping Information

Grouped Work ID6e1236aa-0782-2dda-7a4b-71a15c477866-eng
Full titleautomating data quality monitoring scaling beyond rules with machine learning
Authorstanley jeremy
Grouping Categorybook
Last Update2025-02-19 03:28:59AM
Last Indexed2025-05-22 03:21:22AM

Book Cover Information

Image SourcecontentCafe
First LoadedJan 23, 2025
Last UsedJan 23, 2025

Marc Record

First DetectedDec 17, 2024 06:58:31 AM
Last File Modification TimeDec 17, 2024 06:58:31 AM
SuppressedRecord had no items

MARC Record

LEADER03055cam a2200433 i 4500
001on1417195764
003OCoLC
00520241217065601.0
006m     o  d        
007cr cnu---unuuu
008240114t20242024cau     o     000 0 eng d
020 |a 1098145909|q electronic book
020 |a 9781098145903|q electronic book
035 |a (OCoLC)1417195764
037 |a 9781098145927|b O'Reilly Media
040 |a YDX|b eng|e rda|c YDX|d OCLCO|d ORMDA|d OCLCO|d YDX|d N$T|d YDX
049 |a MAIN
050 4|a HF5548.2|b .S73 2024
08204|a 658.05|2 23/eng/20240116
1001 |a Stanley, Jeremy,|e author.
24510|a Automating data quality monitoring :|b scaling beyond rules with machine learning /|c Jeremy Stanley and Paige Schwartz.
250 |a First edition.
264 1|a Sebastopol, CA :|b O'Reilly Media, Inc.,|c 2024.
264 4|c ©2024
300 |a 1 online resource
336 |a text|b txt|2 rdacontent
337 |a computer|b c|2 rdamedia
338 |a online resource|b cr|2 rdacarrier
520 |a The world's businesses ingest a combined 2.5 quintillion bytes of data every day. But how much of this vast amount of data--used to build products, power AI systems, and drive business decisions--is poor quality or just plain bad? This practical book shows you how to ensure that the data your organization relies on contains only high-quality records. Most data engineers, data analysts, and data scientists genuinely care about data quality, but they often don't have the time, resources, or understanding to create a data quality monitoring solution that succeeds at scale. In this book, Jeremy Stanley and Paige Schwartz from Anomalo explain how you can use automated data quality monitoring to cover all your tables efficiently, proactively alert on every category of issue, and resolve problems immediately. This book will help you: Learn why data quality is a business imperative Understand and assess unsupervised learning models for detecting data issues Implement notifications that reduce alert fatigue and let you triage and resolve issues quickly Integrate automated data quality monitoring with data catalogs, orchestration layers, and BI and ML systems Understand the limits of automated data quality monitoring and how to overcome them Learn how to deploy and manage your monitoring solution at scale Maintain automated data quality monitoring for the long term.
588 |a Description based on online resource; title from digital title page (viewed on February 20, 2024).
590 |a O'Reilly|b O'Reilly Online Learning: Academic/Public Library Edition
650 0|a Business|x Data processing.|9 31545
7001 |a Schwartz, Paige,|e author.
77608|i Print version:|z 1098145933|z 9781098145934|w (OCoLC)1382625824
85640|u https://library.access.arlingtonva.us/login?url=https://learning.oreilly.com/library/view/~/9781098145927/?ar|x O'Reilly|z eBook
938 |a YBP Library Services|b YANK|n 20655427
938 |a EBSCOhost|b EBSC|n 3772900
994 |a 92|b VIA
999 |c 362095|d 362095