ACTIVE MACHINE LEARNING WITH PYTHON refine and elevate data quality over quantity with active learning

Book Cover
Average Rating
Published
Birmingham, UK : Packt Publishing Ltd., 2024.
Status
Available Online

Description

Use active machine learning with Python to improve the accuracy of predictive models, streamline the data analysis process, and adapt to evolving data trends, fostering innovation and progress across diverse fields Key Features Learn how to implement a pipeline for optimal model creation from large datasets and at lower costs Gain profound insights within your data while achieving greater efficiency and speed Apply your knowledge to real-world use cases and solve complex ML problems Purchase of the print or Kindle book includes a free PDF eBook Book Description Building accurate machine learning models requires quality data--lots of it. However, for most teams, assembling massive datasets is time-consuming, expensive, or downright impossible. Led by Margaux Masson-Forsythe, a seasoned ML engineer and advocate for surgical data science and climate AI advancements, this hands-on guide to active machine learning demonstrates how to train robust models with just a fraction of the data using Python's powerful active learning tools. You'll master the fundamental techniques of active learning, such as membership query synthesis, stream-based sampling, and pool-based sampling and gain insights for designing and implementing active learning algorithms with query strategy and Human-in-the-Loop frameworks. Exploring various active machine learning techniques, you'll learn how to enhance the performance of computer vision models like image classification, object detection, and semantic segmentation and delve into a machine AL method for selecting the most informative frames for labeling large videos, addressing duplicated data. You'll also assess the effectiveness and efficiency of active machine learning systems through performance evaluation. By the end of the book, you'll be able to enhance your active learning projects by leveraging Python libraries, frameworks, and commonly used tools. What you will learn Master the fundamentals of active machine learning Understand query strategies for optimal model training with minimal data Tackle class imbalance, concept drift, and other data challenges Evaluate and analyze active learning model performance Integrate active learning libraries into workflows effectively Optimize workflows for human labelers Explore the finest active learning tools available today Who this book is for Ideal for data scientists and ML engineers aiming to maximize model performance while minimizing costly data labeling, this book is your guide to optimizing ML workflows and prioritizing quality over quantity. Whether you're a technical practitioner or team lead, you'll benefit from the proven methods presented in this book to slash data requirements and iterate faster. Basic Python proficiency and familiarity with machine learning concepts such as datasets and convolutional neural networks is all you need to get started.

More Details

Format
Edition
1st edition.
Language
English
ISBN
9781835462683, 1835462685

Notes

Description
Use active machine learning with Python to improve the accuracy of predictive models, streamline the data analysis process, and adapt to evolving data trends, fostering innovation and progress across diverse fields Key Features Learn how to implement a pipeline for optimal model creation from large datasets and at lower costs Gain profound insights within your data while achieving greater efficiency and speed Apply your knowledge to real-world use cases and solve complex ML problems Purchase of the print or Kindle book includes a free PDF eBook Book Description Building accurate machine learning models requires quality data--lots of it. However, for most teams, assembling massive datasets is time-consuming, expensive, or downright impossible. Led by Margaux Masson-Forsythe, a seasoned ML engineer and advocate for surgical data science and climate AI advancements, this hands-on guide to active machine learning demonstrates how to train robust models with just a fraction of the data using Python's powerful active learning tools. You'll master the fundamental techniques of active learning, such as membership query synthesis, stream-based sampling, and pool-based sampling and gain insights for designing and implementing active learning algorithms with query strategy and Human-in-the-Loop frameworks. Exploring various active machine learning techniques, you'll learn how to enhance the performance of computer vision models like image classification, object detection, and semantic segmentation and delve into a machine AL method for selecting the most informative frames for labeling large videos, addressing duplicated data. You'll also assess the effectiveness and efficiency of active machine learning systems through performance evaluation. By the end of the book, you'll be able to enhance your active learning projects by leveraging Python libraries, frameworks, and commonly used tools. What you will learn Master the fundamentals of active machine learning Understand query strategies for optimal model training with minimal data Tackle class imbalance, concept drift, and other data challenges Evaluate and analyze active learning model performance Integrate active learning libraries into workflows effectively Optimize workflows for human labelers Explore the finest active learning tools available today Who this book is for Ideal for data scientists and ML engineers aiming to maximize model performance while minimizing costly data labeling, this book is your guide to optimizing ML workflows and prioritizing quality over quantity. Whether you're a technical practitioner or team lead, you'll benefit from the proven methods presented in this book to slash data requirements and iterate faster. Basic Python proficiency and familiarity with machine learning concepts such as datasets and convolutional neural networks is all you need to get started.
Local note
O'Reilly O'Reilly Online Learning: Academic/Public Library Edition

Table of Contents

Cover
Title Page
Copyright and Credits
Contributors
Table of Contents
Preface
Part 1: Fundamentals of Active Machine Learning
Chapter 1: Introducing Active Machine Learning
Understanding active machine learning systems
Definition
Potential range of applications
Key components of active machine learning systems
Exploring query strategies scenarios
Membership query synthesis
Stream-based selective sampling
Pool-based sampling
Comparing active and passive learning
Summary
Chapter 2: Designing Query Strategy Frameworks
Technical requirements
Exploring uncertainty sampling methods
Understanding query-by-committee approaches
Maximum disagreement
Vote entropy
Average KL divergence
Labeling with EMC sampling
Sampling with EER
Understanding density-weighted sampling methods
Summary
Chapter 3: Managing the Human in the Loop
Technical requirements
Designing interactive learning systems and workflows
Exploring human-in-the-loop labeling tools
Common labeling platforms
Handling model-label disagreements
Programmatically identifying mismatches
Manual review of conflicts
Effectively managing human-in-the-loop systems
Ensuring annotation quality and dataset balance
Assess annotator skills
Use multiple annotators
Balanced sampling
Summary
Part 2: Active Machine Learning in Practice
Chapter 4: Applying Active Learning to Computer Vision
Technical requirements
Implementing active ML for an image classification project
Building a CNN for the CIFAR dataset
Applying uncertainty sampling to improve classification performance
Applying active ML to an object detection project
Preparing and training our model
Analyzing the evaluation metrics
Implementing an active ML strategy
Using active ML for a segmentation project
Summary
Chapter 5: Leveraging Active Learning for Big Data
Technical requirements
Implementing ML models for video analysis
Selecting the most informative frames with Lightly
Using Lightly to select the best frames to label for object detection
SSL with active ML
Summary
Part 3: Applying Active Machine Learning to Real-World Projects
Chapter 6: Evaluating and Enhancing Efficiency
Technical requirements
Creating efficient active ML pipelines
Monitoring active ML pipelines
Determining when to stop active ML runs
Enhancing production model monitoring with active ML
Challenges in monitoring production models
Active ML to monitor models in production
Early detection for data drift and model decay
Summary
Chapter 7: Utilizing Tools and Packages for Active ML
Technical requirements
Mastering Python packages for enhanced active ML
scikit-learn
modAL
Getting familiar with the active ML tools
Summary
Index
Other Books You May Enjoy

Discover More

Reviews from GoodReads

Loading GoodReads Reviews.

Citations

APA Citation, 7th Edition (style guide)

Masson-Forsythe, M. (2024). ACTIVE MACHINE LEARNING WITH PYTHON: refine and elevate data quality over quantity with active learning (1st edition.). Packt Publishing Ltd..

Chicago / Turabian - Author Date Citation, 17th Edition (style guide)

Masson-Forsythe, Margaux. 2024. ACTIVE MACHINE LEARNING WITH PYTHON: Refine and Elevate Data Quality Over Quantity With Active Learning. Birmingham, UK: Packt Publishing Ltd.

Chicago / Turabian - Humanities (Notes and Bibliography) Citation, 17th Edition (style guide)

Masson-Forsythe, Margaux. ACTIVE MACHINE LEARNING WITH PYTHON: Refine and Elevate Data Quality Over Quantity With Active Learning Birmingham, UK: Packt Publishing Ltd, 2024.

Harvard Citation (style guide)

Masson-Forsythe, M. (2024). ACTIVE MACHINE LEARNING WITH PYTHON: refine and elevate data quality over quantity with active learning. 1st edn. Birmingham, UK: Packt Publishing Ltd.

MLA Citation, 9th Edition (style guide)

Masson-Forsythe, Margaux. ACTIVE MACHINE LEARNING WITH PYTHON: Refine and Elevate Data Quality Over Quantity With Active Learning 1st edition., Packt Publishing Ltd., 2024.

Note! Citations contain only title, author, edition, publisher, and year published. Citations should be used as a guideline and should be double checked for accuracy. Citation formats are based on standards as of August 2021.

Staff View

Grouped Work ID
f2ab77ef-eff4-db48-457f-b9a2c56b318d-eng
Go To Grouped Work View in Staff Client

Grouping Information

Grouped Work IDf2ab77ef-eff4-db48-457f-b9a2c56b318d-eng
Full titleactive machine learning with python refine and elevate data quality over quantity with active learning
Authormasson forsythe margaux
Grouping Categorybook
Last Update2025-01-24 12:33:29PM
Last Indexed2025-05-03 03:39:44AM

Book Cover Information

Image Sourcedefault
First LoadedFeb 12, 2025
Last UsedFeb 12, 2025

Marc Record

First DetectedDec 16, 2024 11:30:04 PM
Last File Modification TimeDec 17, 2024 08:29:13 AM
SuppressedRecord had no items

MARC Record

LEADER07447cam a22004697a 4500
001on1428526449
003OCoLC
00520241217082657.0
006m     o  d        
007cr |n|||||||||
008240404s2024    enk     o     000 0 eng d
019 |a 1428903804
020 |a 9781835462683|q (electronic bk.)
020 |a 1835462685|q (electronic bk.)
035 |a (OCoLC)1428526449|z (OCoLC)1428903804
037 |a 9781835464946|b O'Reilly Media
037 |a 10522554|b IEEE
040 |a YDX|b eng|c YDX|d OCLCO|d ORMDA|d OCLCO|d EBLCP|d OCLCQ|d IEEEE
049 |a MAIN
050 4|a QA76.73.P98
08204|a 005.13/3|2 23/eng/20240409
1001 |a Masson-Forsythe, Margaux,|e author.
24510|a ACTIVE MACHINE LEARNING WITH PYTHON|h [electronic resource] :|b refine and elevate data quality over quantity with active learning /|c Margaux Masson-Forsythe.
250 |a 1st edition.
260 |a Birmingham, UK :|b Packt Publishing Ltd.,|c 2024.
300 |a 1 online resource
5050 |a Cover -- Title Page -- Copyright and Credits -- Contributors -- Table of Contents -- Preface -- Part 1: Fundamentals of Active Machine Learning -- Chapter 1: Introducing Active Machine Learning -- Understanding active machine learning systems -- Definition -- Potential range of applications -- Key components of active machine learning systems -- Exploring query strategies scenarios -- Membership query synthesis -- Stream-based selective sampling -- Pool-based sampling -- Comparing active and passive learning -- Summary -- Chapter 2: Designing Query Strategy Frameworks
5058 |a Technical requirements -- Exploring uncertainty sampling methods -- Understanding query-by-committee approaches -- Maximum disagreement -- Vote entropy -- Average KL divergence -- Labeling with EMC sampling -- Sampling with EER -- Understanding density-weighted sampling methods -- Summary -- Chapter 3: Managing the Human in the Loop -- Technical requirements -- Designing interactive learning systems and workflows -- Exploring human-in-the-loop labeling tools -- Common labeling platforms -- Handling model-label disagreements -- Programmatically identifying mismatches -- Manual review of conflicts
5058 |a Effectively managing human-in-the-loop systems -- Ensuring annotation quality and dataset balance -- Assess annotator skills -- Use multiple annotators -- Balanced sampling -- Summary -- Part 2: Active Machine Learning in Practice -- Chapter 4: Applying Active Learning to Computer Vision -- Technical requirements -- Implementing active ML for an image classification project -- Building a CNN for the CIFAR dataset -- Applying uncertainty sampling to improve classification performance -- Applying active ML to an object detection project -- Preparing and training our model
5058 |a Analyzing the evaluation metrics -- Implementing an active ML strategy -- Using active ML for a segmentation project -- Summary -- Chapter 5: Leveraging Active Learning for Big Data -- Technical requirements -- Implementing ML models for video analysis -- Selecting the most informative frames with Lightly -- Using Lightly to select the best frames to label for object detection -- SSL with active ML -- Summary -- Part 3: Applying Active Machine Learning to Real-World Projects -- Chapter 6: Evaluating and Enhancing Efficiency -- Technical requirements -- Creating efficient active ML pipelines
5058 |a Monitoring active ML pipelines -- Determining when to stop active ML runs -- Enhancing production model monitoring with active ML -- Challenges in monitoring production models -- Active ML to monitor models in production -- Early detection for data drift and model decay -- Summary -- Chapter 7: Utilizing Tools and Packages for Active ML -- Technical requirements -- Mastering Python packages for enhanced active ML -- scikit-learn -- modAL -- Getting familiar with the active ML tools -- Summary -- Index -- Other Books You May Enjoy
520 |a Use active machine learning with Python to improve the accuracy of predictive models, streamline the data analysis process, and adapt to evolving data trends, fostering innovation and progress across diverse fields Key Features Learn how to implement a pipeline for optimal model creation from large datasets and at lower costs Gain profound insights within your data while achieving greater efficiency and speed Apply your knowledge to real-world use cases and solve complex ML problems Purchase of the print or Kindle book includes a free PDF eBook Book Description Building accurate machine learning models requires quality data--lots of it. However, for most teams, assembling massive datasets is time-consuming, expensive, or downright impossible. Led by Margaux Masson-Forsythe, a seasoned ML engineer and advocate for surgical data science and climate AI advancements, this hands-on guide to active machine learning demonstrates how to train robust models with just a fraction of the data using Python's powerful active learning tools. You'll master the fundamental techniques of active learning, such as membership query synthesis, stream-based sampling, and pool-based sampling and gain insights for designing and implementing active learning algorithms with query strategy and Human-in-the-Loop frameworks. Exploring various active machine learning techniques, you'll learn how to enhance the performance of computer vision models like image classification, object detection, and semantic segmentation and delve into a machine AL method for selecting the most informative frames for labeling large videos, addressing duplicated data. You'll also assess the effectiveness and efficiency of active machine learning systems through performance evaluation. By the end of the book, you'll be able to enhance your active learning projects by leveraging Python libraries, frameworks, and commonly used tools. What you will learn Master the fundamentals of active machine learning Understand query strategies for optimal model training with minimal data Tackle class imbalance, concept drift, and other data challenges Evaluate and analyze active learning model performance Integrate active learning libraries into workflows effectively Optimize workflows for human labelers Explore the finest active learning tools available today Who this book is for Ideal for data scientists and ML engineers aiming to maximize model performance while minimizing costly data labeling, this book is your guide to optimizing ML workflows and prioritizing quality over quantity. Whether you're a technical practitioner or team lead, you'll benefit from the proven methods presented in this book to slash data requirements and iterate faster. Basic Python proficiency and familiarity with machine learning concepts such as datasets and convolutional neural networks is all you need to get started.
590 |a O'Reilly|b O'Reilly Online Learning: Academic/Public Library Edition
650 0|a Machine learning.|9 46043
650 0|a Python (Computer program language)|9 71333
650 0|a Data mining.|9 71797
77608|i Print version:|z 1835464947|z 9781835464946|w (OCoLC)1406083929
85640|u https://library.access.arlingtonva.us/login?url=https://learning.oreilly.com/library/view/~/9781835464946/?ar|x O'Reilly|z eBook
938 |a YBP Library Services|b YANK|n 306309663
938 |a ProQuest Ebook Central|b EBLB|n EBL31233992
994 |a 92|b VIA
999 |c 360764|d 360764