APACHE SPARK FOR MACHINE LEARNING build and deploy high-performance big data AI solutions for large-scale clusters

Book Cover
Average Rating
Published
Birmingham, UK : Packt Publishing Ltd., 2024.
Status
Available Online

Description

Develop your data science skills with Apache Spark to solve real-world problems for Fortune 500 companies using scalable algorithms on large cloud computing clusters Key Features Apply techniques to analyze big data and uncover valuable insights for machine learning Learn to use cloud computing clusters for training machine learning models on large datasets Discover practical strategies to overcome challenges in model training, deployment, and optimization Purchase of the print or Kindle book includes a free PDF eBook Book Description In the world of big data, efficiently processing and analyzing massive datasets for machine learning can be a daunting task. Written by Deepak Gowda, a data scientist with over a decade of experience and 30+ patents, this book provides a hands-on guide to mastering Spark's capabilities for efficient data processing, model building, and optimization. With Deepak's expertise across industries such as supply chain, cybersecurity, and data center infrastructure, he makes complex concepts easy to follow through detailed recipes. This book takes you through core machine learning concepts, highlighting the advantages of Spark for big data analytics. It covers practical data preprocessing techniques, including feature extraction and transformation, supervised learning methods with detailed chapters on regression and classification, and unsupervised learning through clustering and recommendation systems. You'll also learn to identify frequent patterns in data and discover effective strategies to deploy and optimize your machine learning models. Each chapter features practical coding examples and real-world applications to equip you with the knowledge and skills needed to tackle complex machine learning tasks. By the end of this book, you'll be ready to handle big data and create advanced machine learning models with Apache Spark. What you will learn Master Apache Spark for efficient, large-scale data processing and analysis Understand core machine learning concepts and their applications with Spark Implement data preprocessing techniques for feature extraction and transformation Explore supervised learning methods - regression and classification algorithms Apply unsupervised learning for clustering tasks and recommendation systems Discover frequent pattern mining techniques to uncover data trends Who this book is for This book is ideal for data scientists, ML engineers, data engineers, students, and researchers who want to deepen their knowledge of Apache Spark's tools and algorithms. It's a must-have for those struggling to scale models for real-world problems and a valuable resource for preparing for interviews at Fortune 500 companies, focusing on large dataset analysis, model training, and deployment.

More Details

Format
Language
English
ISBN
9781835460016, 1835460011

Notes

Description
Develop your data science skills with Apache Spark to solve real-world problems for Fortune 500 companies using scalable algorithms on large cloud computing clusters Key Features Apply techniques to analyze big data and uncover valuable insights for machine learning Learn to use cloud computing clusters for training machine learning models on large datasets Discover practical strategies to overcome challenges in model training, deployment, and optimization Purchase of the print or Kindle book includes a free PDF eBook Book Description In the world of big data, efficiently processing and analyzing massive datasets for machine learning can be a daunting task. Written by Deepak Gowda, a data scientist with over a decade of experience and 30+ patents, this book provides a hands-on guide to mastering Spark's capabilities for efficient data processing, model building, and optimization. With Deepak's expertise across industries such as supply chain, cybersecurity, and data center infrastructure, he makes complex concepts easy to follow through detailed recipes. This book takes you through core machine learning concepts, highlighting the advantages of Spark for big data analytics. It covers practical data preprocessing techniques, including feature extraction and transformation, supervised learning methods with detailed chapters on regression and classification, and unsupervised learning through clustering and recommendation systems. You'll also learn to identify frequent patterns in data and discover effective strategies to deploy and optimize your machine learning models. Each chapter features practical coding examples and real-world applications to equip you with the knowledge and skills needed to tackle complex machine learning tasks. By the end of this book, you'll be ready to handle big data and create advanced machine learning models with Apache Spark. What you will learn Master Apache Spark for efficient, large-scale data processing and analysis Understand core machine learning concepts and their applications with Spark Implement data preprocessing techniques for feature extraction and transformation Explore supervised learning methods - regression and classification algorithms Apply unsupervised learning for clustering tasks and recommendation systems Discover frequent pattern mining techniques to uncover data trends Who this book is for This book is ideal for data scientists, ML engineers, data engineers, students, and researchers who want to deepen their knowledge of Apache Spark's tools and algorithms. It's a must-have for those struggling to scale models for real-world problems and a valuable resource for preparing for interviews at Fortune 500 companies, focusing on large dataset analysis, model training, and deployment.
Local note
O'Reilly O'Reilly Online Learning: Academic/Public Library Edition

Table of Contents

Table of Contents An Overview of Machine Learning Concepts Data Processing with Spark Feature Extraction and Transformation Building a Regression System Building a Classification System Building a Clustering System Building a Recommendation System Mining Frequent Patterns Deploying a Model.

Discover More

Reviews from GoodReads

Loading GoodReads Reviews.

Citations

APA Citation, 7th Edition (style guide)

Gowda, D. (2024). APACHE SPARK FOR MACHINE LEARNING: build and deploy high-performance big data AI solutions for large-scale clusters . Packt Publishing Ltd..

Chicago / Turabian - Author Date Citation, 17th Edition (style guide)

Gowda, Deepak. 2024. APACHE SPARK FOR MACHINE LEARNING: Build and Deploy High-performance Big Data AI Solutions for Large-scale Clusters. Birmingham, UK: Packt Publishing Ltd.

Chicago / Turabian - Humanities (Notes and Bibliography) Citation, 17th Edition (style guide)

Gowda, Deepak. APACHE SPARK FOR MACHINE LEARNING: Build and Deploy High-performance Big Data AI Solutions for Large-scale Clusters Birmingham, UK: Packt Publishing Ltd, 2024.

Harvard Citation (style guide)

Gowda, D. (2024). APACHE SPARK FOR MACHINE LEARNING: build and deploy high-performance big data AI solutions for large-scale clusters. Birmingham, UK: Packt Publishing Ltd.

MLA Citation, 9th Edition (style guide)

Gowda, Deepak. APACHE SPARK FOR MACHINE LEARNING: Build and Deploy High-performance Big Data AI Solutions for Large-scale Clusters Packt Publishing Ltd., 2024.

Note! Citations contain only title, author, edition, publisher, and year published. Citations should be used as a guideline and should be double checked for accuracy. Citation formats are based on standards as of August 2021.

Staff View

Grouped Work ID
56038055-ca25-e3ed-8411-5f861fd58c4b-eng
Go To Grouped Work View in Staff Client

Grouping Information

Grouped Work ID56038055-ca25-e3ed-8411-5f861fd58c4b-eng
Full titleapache spark for machine learning build and deploy high performance big data ai solutions for large scale clusters
Authorgowda deepak
Grouping Categorybook
Last Update2025-01-24 12:33:29PM
Last Indexed2025-05-22 03:16:51AM

Book Cover Information

Image Sourcegoogle_isbn
First LoadedDec 23, 2024
Last UsedMay 17, 2025

Marc Record

First DetectedDec 16, 2024 11:30:33 PM
Last File Modification TimeDec 17, 2024 08:39:34 AM
SuppressedRecord had no items

MARC Record

LEADER04694cam a22004217a 4500
001on1458612836
003OCoLC
00520241217082904.0
006m     o  d        
007cr |n|||||||||
008241005s2024    enk     o     000 0 eng d
019 |a 1458762222
020 |a 9781835460016|q (electronic bk.)
020 |a 1835460011|q (electronic bk.)
035 |a (OCoLC)1458612836|z (OCoLC)1458762222
037 |a 9781804618165|b O'Reilly Media
037 |a 10763460|b IEEE
040 |a YDX|b eng|c YDX|d OCLCO|d EBLCP|d OCLCQ|d ORMDA|d OCLCO|d IEEEE
049 |a MAIN
050 4|a Q325.5
08204|a 006.3/1|2 23/eng/20241112
1001 |a Gowda, Deepak,|e author.
24510|a APACHE SPARK FOR MACHINE LEARNING|h [electronic resource] :|b build and deploy high-performance big data AI solutions for large-scale clusters /|c Deepak Gowda.
260 |a Birmingham, UK :|b Packt Publishing Ltd.,|c 2024.
300 |a 1 online resource
5050 |a Table of Contents An Overview of Machine Learning Concepts Data Processing with Spark Feature Extraction and Transformation Building a Regression System Building a Classification System Building a Clustering System Building a Recommendation System Mining Frequent Patterns Deploying a Model.
520 |a Develop your data science skills with Apache Spark to solve real-world problems for Fortune 500 companies using scalable algorithms on large cloud computing clusters Key Features Apply techniques to analyze big data and uncover valuable insights for machine learning Learn to use cloud computing clusters for training machine learning models on large datasets Discover practical strategies to overcome challenges in model training, deployment, and optimization Purchase of the print or Kindle book includes a free PDF eBook Book Description In the world of big data, efficiently processing and analyzing massive datasets for machine learning can be a daunting task. Written by Deepak Gowda, a data scientist with over a decade of experience and 30+ patents, this book provides a hands-on guide to mastering Spark's capabilities for efficient data processing, model building, and optimization. With Deepak's expertise across industries such as supply chain, cybersecurity, and data center infrastructure, he makes complex concepts easy to follow through detailed recipes. This book takes you through core machine learning concepts, highlighting the advantages of Spark for big data analytics. It covers practical data preprocessing techniques, including feature extraction and transformation, supervised learning methods with detailed chapters on regression and classification, and unsupervised learning through clustering and recommendation systems. You'll also learn to identify frequent patterns in data and discover effective strategies to deploy and optimize your machine learning models. Each chapter features practical coding examples and real-world applications to equip you with the knowledge and skills needed to tackle complex machine learning tasks. By the end of this book, you'll be ready to handle big data and create advanced machine learning models with Apache Spark. What you will learn Master Apache Spark for efficient, large-scale data processing and analysis Understand core machine learning concepts and their applications with Spark Implement data preprocessing techniques for feature extraction and transformation Explore supervised learning methods - regression and classification algorithms Apply unsupervised learning for clustering tasks and recommendation systems Discover frequent pattern mining techniques to uncover data trends Who this book is for This book is ideal for data scientists, ML engineers, data engineers, students, and researchers who want to deepen their knowledge of Apache Spark's tools and algorithms. It's a must-have for those struggling to scale models for real-world problems and a valuable resource for preparing for interviews at Fortune 500 companies, focusing on large dataset analysis, model training, and deployment.
590 |a O'Reilly|b O'Reilly Online Learning: Academic/Public Library Edition
63000|a Spark (Electronic resource : Apache Software Foundation)
650 0|a Machine learning.|9 46043
650 0|a Big data.|9 403931
650 0|a Information retrieval.|9 43126
77608|i Print version:|z 1804618160|z 9781804618165|w (OCoLC)1456588233
85640|u https://library.access.arlingtonva.us/login?url=https://learning.oreilly.com/library/view/~/9781804618165/?ar|x O'Reilly|z eBook
938 |a YBP Library Services|b YANK|n 306695608
938 |a ProQuest Ebook Central|b EBLB|n EBL31694952
994 |a 92|b VIA
999 |c 361335|d 361335