Data science for dummies

Book Cover
Average Rating
Published
Hoboken, NJ : For Dummies, 2021.
Status
Available Online

Description

Monetize your company’s data and data science expertise without spending a fortune on hiring independent strategy consultants to help

What if there was one simple, clear process for ensuring that all your company’s data science projects achieve a high a return on investment? What if you could validate your ideas for future data science projects, and select the one idea that’s most prime for achieving profitability while also moving your company closer to its business vision? There is.

Industry-acclaimed data science consultant, Lillian Pierson, shares her proprietary STAR Framework – A simple, proven process for leading profit-forming data science projects.

Not sure what data science is yet? Don’t worry! Parts 1 and 2 of Data Science For Dummies will get all the bases covered for you. And if you’re already a data science expert? Then you really won’t want to miss the data science strategy and data monetization gems that are shared in Part 3 onward throughout this book.

Data Science For Dummies demonstrates:

  • The only process you’ll ever need to lead profitable data science projects
  • Secret, reverse-engineered data monetization tactics that no one’s talking about
  • The shocking truth about how simple natural language processing can be
  • How to beat the crowd of data professionals by cultivating your own unique blend of data science expertise

Whether you’re new to the data science field or already a decade in, you’re sure to learn something new and incredibly valuable from Data Science For Dummies. Discover how to generate massive business wins from your company’s data by picking up your copy today.

More Details

Format
Edition
Third edition.
Language
English
ISBN
9781119811619, 1119811619

Notes

Description
Monetize your company's data and data science expertise without spending a fortune on hiring independent strategy consultants to help What if there was one simple, clear process for ensuring that all your company's data science projects achieve a high a return on investment? What if you could validate your ideas for future data science projects, and select the one idea that's most prime for achieving profitability while also moving your company closer to its business vision? There is. Industry-acclaimed data science consultant, Lillian Pierson, shares her proprietary STAR Framework - A simple, proven process for leading profit-forming data science projects. Not sure what data science is yet? Don't worry! Parts 1 and 2 of Data Science For Dummies will get all the bases covered for you. And if you're already a data science expert? Then you really won't want to miss the data science strategy and data monetization gems that are shared in Part 3 onward throughout this book. Data Science For Dummies demonstrates: The only process you'll ever need to lead profitable data science projects Secret, reverse-engineered data monetization tactics that no one's talking about The shocking truth about how simple natural language processing can be How to beat the crowd of data professionals by cultivating your own unique blend of data science expertise Whether you're new to the data science field or already a decade in, you're sure to learn something new and incredibly valuable from Data Science For Dummies. Discover how to generate massive business wins from your company's data by picking up your copy today.
Local note
O'Reilly O'Reilly Online Learning: Academic/Public Library Edition

Table of Contents

Introduction 1

About This Book 3

Foolish Assumptions 3

Icons Used in This Book 4

Beyond the Book 4

Where to Go from Here 4

Part 1: Getting Started with Data Science 5

Chapter 1: Wrapping Your Head Around Data Science 7

Seeing Who Can Make Use of Data Science 8

Inspecting the Pieces of the Data Science Puzzle 10

Collecting, querying, and consuming data 11

Applying mathematical modeling to data science tasks 12

Deriving insights from statistical methods 12

Coding, coding, coding

it's just part of the game 13

Applying data science to a subject area 13

Communicating data insights 14

Exploring Career Alternatives That Involve Data Science 15

The data implementer 16

The data leader 16

The data entrepreneur 17

Chapter 2: Tapping into Critical Aspects of Data Engineering 19

Defining Big Data and the Three Vs 19

Grappling with data volume 21

Handling data velocity 21

Dealing with data variety 22

Identifying Important Data Sources 23

Grasping the Differences among Data Approaches 24

Defining data science 25

Defining machine learning engineering 26

Defining data engineering 26

Comparing machine learning engineers, data scientists, and data engineers 27

Storing and Processing Data for Data Science 28

Storing data and doing data science directly in the cloud 28

Storing big data on-premise 32

Processing big data in real-time 35

Part 2: Using Data Science to Extract Meaning from Your Data 37

Chapter 3: Machine Learning Means Using a Machine to Learn from Data 39

Defining Machine Learning and Its Processes 40

Walking through the steps of the machine learning process 40

Becoming familiar with machine learning terms 41

Considering Learning Styles 42

Learning with supervised algorithms 42

Learning with unsupervised algorithms 43

Learning with reinforcement 43

Seeing What You Can Do 43

Selecting algorithms based on function 44

Using Spark to generate real-time big data analytics 48

Chapter 4: Math, Probability, and Statistical Modeling 51

Exploring Probability and Inferential Statistics 52

Probability distributions 53

Conditional probability with Naïve Bayes 55

Quantifying Correlation 56

Calculating correlation with Pearson's r 56

Ranking variable-pairs using Spearman's rank correlation 58

Reducing Data Dimensionality with Linear Algebra 59

Decomposing data to reduce dimensionality 59

Reducing dimensionality with factor analysis 63

Decreasing dimensionality and removing outliers with PCA 64

Modeling Decisions with Multiple Criteria Decision-Making 65

Turning to traditional MCDM 65

Focusing on fuzzy MCDM 67

Introducing Regression Methods 67

Linear regression 67

Logistic regression 69

Ordinary least squares (OLS) regression methods 70

Detecting Outliers 70

Analyzing extreme values 70

Detecting outliers with univariate analysis 71

Detecting outliers with multivariate analysis 73

Introducing Time Series Analysis 73

Identifying patterns in time series 74

Modeling univariate time series data 75

Chapter 5: Grouping Your Way into Accurate Predictions 77

Starting with Clustering Basics 78

Getting to know clustering algorithms 79

Examining clustering similarity metrics 81

Identifying Clusters in Your Data 82

Clustering with the k-means algorithm 82

Estimating clusters with kernel density estimation (KDE) 84

Clustering with hierarchical algorithms 84

Dabbling in the DBScan neighborhood 87

Categorizing Data with Decision Tree and Random Forest Algorithms 88

Drawing a Line between Clustering and Classification 89

Introducing instance-based learning classifiers 90

Getting to know classification algorithms 90

Making Sense of Data with Nearest Neighbor Analysis 93

Classifying Data with Average Nearest Neighbor Algorithms 94

Classifying with K-Nearest Neighbor Algorithms 97

Understanding how the k-nearest neighbor algorithm works 98

Knowing when to use the k-nearest neighbor algorithm 99

Exploring common applications of k-nearest neighbour algorithms 100

Solving Real-World Problems with Nearest Neighbor Algorithms 100

Seeing k-nearest neighbor algorithms in action 101

Seeing average nearest neighbor algorithms in action 101

Chapter 6: Coding Up Data Insights and Decision Engines 103

Seeing Where Python and R Fit into Your Data Science Strategy 104

Using Python for Data Science 104

Sorting out the various Python data types 106

Putting loops to good use in Python 109

Having fun with functions 110

Keeping cool with classes 112

Checking out some useful Python libraries 114

Using Open Source R for Data Science 120

Comprehending R's basic vocabulary 121

Delving into functions and operators 124

Iterating in R 127

Observing how objects work 129

Sorting out R's popular statistical analysis packages 131

Examining packages for visualizing, mapping, and graphing in R 133

Chapter 7: Generating Insights with Software Applications 137

Choosing the Best Tools for Your Data Science Strategy 138

Getting a Handle on SQL and Relational Databases 139

Investing Some Effort into Database Design 144

Defining data types 144

Designing constraints properly 145

Normalizing your database 145

Narrowing the Focus with SQL Functions 147

Making Life Easier with Excel 151

Using Excel to quickly get to know your data 152

Reformatting and summarizing with PivotTables 157

Automating Excel tasks with macros 158

Chapter 8: Telling Powerful Stories with Data 161

Data Visualizations: The Big Three 162

Data storytelling for decision makers 162

Data showcasing for analysts 163

Designing data art for activists 164

Designing to Meet the Needs of Your Target Audience 164

Step 1: Brainstorm (All about Eve) 165

Step 2: Define the purpose 166

Step 3: Choose the most functional visualization type for your purpose 166

Picking the Most Appropriate Design Style 167

Inducing a calculating, exacting response 167

Eliciting a strong emotional response 168

Selecting the Appropriate Data Graphic Type 170

Standard chart graphics 171

Comparative graphics 173

Statistical plots 176

Topology structures 179

Spatial plots and maps 180

Testing Data Graphics 183

Adding Context 184

Creating context with data 184

Creating context with annotations 185

Creating context with graphical elements 186

Part 3: Taking Stock of Your Data Science Capabilities 187

Chapter 9: Developing Your Business Acumen 189

Bridging the Business Gap 189

Contrasting business acumen with subject matter expertise 190

Defining business acumen 191

Traversing the Business Landscape 192

Seeing how data roles support the business in making money 192

Leveling up your business acumen 195

Fortifying your leadership skills 196

Surveying Use Cases and Case Studies 197

Documen

Discover More

Author Notes

Loading Author Notes...

Similar Titles From NoveList

NoveList provides detailed suggestions for titles you might like if you enjoyed this book. Suggestions are based on recommendations from librarians and other contributors.
Data science for dummies - Pierson, Lillian
These have the subjects "Information retrieval," "Data mining," and "Information technology."

Reviews from GoodReads

Loading GoodReads Reviews.

Citations

APA Citation, 7th Edition (style guide)

Pierson, L. (2021). Data science for dummies (Third edition.). For Dummies.

Chicago / Turabian - Author Date Citation, 17th Edition (style guide)

Pierson, Lillian. 2021. Data Science for Dummies. Hoboken, NJ: For Dummies.

Chicago / Turabian - Humanities (Notes and Bibliography) Citation, 17th Edition (style guide)

Pierson, Lillian. Data Science for Dummies Hoboken, NJ: For Dummies, 2021.

Harvard Citation (style guide)

Pierson, L. (2021). Data science for dummies. Third edn. Hoboken, NJ: For Dummies.

MLA Citation, 9th Edition (style guide)

Pierson, Lillian. Data Science for Dummies Third edition., For Dummies, 2021.

Note! Citations contain only title, author, edition, publisher, and year published. Citations should be used as a guideline and should be double checked for accuracy. Citation formats are based on standards as of August 2021.

Staff View

Grouped Work ID
a77631e1-3733-d566-f9e3-661e7a84abae-eng
Go To Grouped Work View in Staff Client

Grouping Information

Grouped Work IDa77631e1-3733-d566-f9e3-661e7a84abae-eng
Full titledata science for dummies
Authorpierson lillian
Grouping Categorybook
Last Update2025-01-24 12:33:29PM
Last Indexed2025-05-22 03:32:02AM

Book Cover Information

Image Sourcesyndetics
First LoadedDec 21, 2024
Last UsedMar 4, 2025

Marc Record

First DetectedDec 16, 2024 11:19:37 PM
Last File Modification TimeDec 17, 2024 08:19:22 AM
SuppressedRecord had no items

MARC Record

LEADER11032cam a22004691i 4500
001on1285688053
003OCoLC
00520241217081631.0
006m        d        
007cr |||||||||||
008210831s2021    nju     o     000 0 eng d
015 |a GBC1F1452|2 bnb
0167 |a 020324020|2 Uk
019 |a 1289843165
020 |a 9781119811619|q (ePub ebook)
020 |a 1119811619
035 |a (OCoLC)1285688053|z (OCoLC)1289843165
037 |a 9781119811619|b Wiley
037 |a 9781119811558|b O'Reilly Media
040 |a UKMGB|b eng|e rda|e pn|c UKMGB|d OCLCF|d UKAHL|d OCLCO|d ORMDA|d OCLCO|d OCLCQ|d OCLCO
049 |a MAIN
050 4|a T58.5
08204|a 006.312|2 23
1001 |a Pierson, Lillian,|e author.|9 350190
24510|a Data science for dummies /|c Lillian Pierson.
250 |a Third edition.
264 1|a Hoboken, NJ :|b For Dummies,|c 2021.
300 |a 1 online resource
336 |a text|2 rdacontent
337 |a computer|2 rdamedia
338 |a online resource|2 rdacarrier
5050 |a <P><b>Introduction</b><b> 1</b></p> <p>About This Book 3</p> <p>Foolish Assumptions 3</p> <p>Icons Used in This Book 4</p> <p>Beyond the Book 4</p> <p>Where to Go from Here 4</p> <p><b>Part 1: Getting Started with Data Science</b><b> 5</b></p> <p><b>Chapter 1: Wrapping Your Head Around Data Science</b><b> 7</b></p> <p>Seeing Who Can Make Use of Data Science 8</p> <p>Inspecting the Pieces of the Data Science Puzzle 10</p> <p>Collecting, querying, and consuming data 11</p> <p>Applying mathematical modeling to data science tasks 12</p> <p>Deriving insights from statistical methods 12</p> <p>Coding, coding, coding -- it's just part of the game 13</p> <p>Applying data science to a subject area 13</p> <p>Communicating data insights 14</p> <p>Exploring Career Alternatives That Involve Data Science 15</p> <p>The data implementer 16</p> <p>The data leader 16</p> <p>The data entrepreneur 17</p> <p><b>Chapter 2: Tapping into Critical Aspects of Data Engineering</b><b> 19</b></p> <p>Defining Big Data and the Three Vs 19</p> <p>Grappling with data volume 21</p> <p>Handling data velocity 21</p> <p>Dealing with data variety 22</p> <p>Identifying Important Data Sources 23</p> <p>Grasping the Differences among Data Approaches 24</p> <p>Defining data science 25</p> <p>Defining machine learning engineering 26</p> <p>Defining data engineering 26</p> <p>Comparing machine learning engineers, data scientists, and data engineers 27</p> <p>Storing and Processing Data for Data Science 28</p> <p>Storing data and doing data science directly in the cloud 28</p> <p>Storing big data on-premise 32</p> <p>Processing big data in real-time 35</p> <p><b>Part 2: Using Data Science to Extract Meaning from Your Data </b><b>37</b></p> <p><b>Chapter 3: Machine Learning Means Using a Machine to Learn from Data</b><b> 39</b></p> <p>Defining Machine Learning and Its Processes 40</p> <p>Walking through the steps of the machine learning process 40</p> <p>Becoming familiar with machine learning terms 41</p> <p>Considering Learning Styles 42</p> <p>Learning with supervised algorithms 42</p> <p>Learning with unsupervised algorithms 43</p> <p>Learning with reinforcement 43</p> <p>Seeing What You Can Do 43</p> <p>Selecting algorithms based on function 44</p> <p>Using Spark to generate real-time big data analytics 48</p> <p><b>Chapter 4: Math, Probability, and Statistical Modeling</b><b> 51</b></p> <p>Exploring Probability and Inferential Statistics 52</p> <p>Probability distributions 53</p> <p>Conditional probability with Naïve Bayes 55</p> <p>Quantifying Correlation 56</p> <p>Calculating correlation with Pearson's r 56</p> <p>Ranking variable-pairs using Spearman's rank correlation 58</p> <p>Reducing Data Dimensionality with Linear Algebra 59</p> <p>Decomposing data to reduce dimensionality 59</p> <p>Reducing dimensionality with factor analysis 63</p> <p>Decreasing dimensionality and removing outliers with PCA 64</p> <p>Modeling Decisions with Multiple Criteria Decision-Making 65</p> <p>Turning to traditional MCDM 65</p> <p>Focusing on fuzzy MCDM 67</p> <p>Introducing Regression Methods 67</p> <p>Linear regression 67</p> <p>Logistic regression 69</p> <p>Ordinary least squares (OLS) regression methods 70</p> <p>Detecting Outliers 70</p> <p>Analyzing extreme values 70</p> <p>Detecting outliers with univariate analysis 71</p> <p>Detecting outliers with multivariate analysis 73</p> <p>Introducing Time Series Analysis 73</p> <p>Identifying patterns in time series 74</p> <p>Modeling univariate time series data 75</p> <p><b>Chapter 5: Grouping Your Way into Accurate Predictions</b><b> 77</b></p> <p>Starting with Clustering Basics 78</p> <p>Getting to know clustering algorithms 79</p> <p>Examining clustering similarity metrics 81</p> <p>Identifying Clusters in Your Data 82</p> <p>Clustering with the k-means algorithm 82</p> <p>Estimating clusters with kernel density estimation (KDE) 84</p> <p>Clustering with hierarchical algorithms 84</p> <p>Dabbling in the DBScan neighborhood 87</p> <p>Categorizing Data with Decision Tree and Random Forest Algorithms 88</p> <p>Drawing a Line between Clustering and Classification 89</p> <p>Introducing instance-based learning classifiers 90</p> <p>Getting to know classification algorithms 90</p> <p>Making Sense of Data with Nearest Neighbor Analysis 93</p> <p>Classifying Data with Average Nearest Neighbor Algorithms 94</p> <p>Classifying with K-Nearest Neighbor Algorithms 97</p> <p>Understanding how the k-nearest neighbor algorithm works 98</p> <p>Knowing when to use the k-nearest neighbor algorithm 99</p> <p>Exploring common applications of k-nearest neighbour algorithms 100</p> <p>Solving Real-World Problems with Nearest Neighbor Algorithms 100</p> <p>Seeing k-nearest neighbor algorithms in action 101</p> <p>Seeing average nearest neighbor algorithms in action 101</p> <p><b>Chapter 6: Coding Up Data Insights and Decision Engines</b><b> 103</b></p> <p>Seeing Where Python and R Fit into Your Data Science Strategy 104</p> <p>Using Python for Data Science 104</p> <p>Sorting out the various Python data types 106</p> <p>Putting loops to good use in Python 109</p> <p>Having fun with functions 110</p> <p>Keeping cool with classes 112</p> <p>Checking out some useful Python libraries 114</p> <p>Using Open Source R for Data Science 120</p> <p>Comprehending R's basic vocabulary 121</p> <p>Delving into functions and operators 124</p> <p>Iterating in R 127</p> <p>Observing how objects work 129</p> <p>Sorting out R's popular statistical analysis packages 131</p> <p>Examining packages for visualizing, mapping, and graphing in R 133</p> <p><b>Chapter 7: Generating Insights with Software Applications</b><b> 137</b></p> <p>Choosing the Best Tools for Your Data Science Strategy 138</p> <p>Getting a Handle on SQL and Relational Databases 139</p> <p>Investing Some Effort into Database Design 144</p> <p>Defining data types 144</p> <p>Designing constraints properly 145</p> <p>Normalizing your database 145</p> <p>Narrowing the Focus with SQL Functions 147</p> <p>Making Life Easier with Excel 151</p> <p>Using Excel to quickly get to know your data 152</p> <p>Reformatting and summarizing with PivotTables 157</p> <p>Automating Excel tasks with macros 158</p> <p><b>Chapter 8: Telling Powerful Stories with Data</b><b> 161</b></p> <p>Data Visualizations: The Big Three 162</p> <p>Data storytelling for decision makers 162</p> <p>Data showcasing for analysts 163</p> <p>Designing data art for activists 164</p> <p>Designing to Meet the Needs of Your Target Audience 164</p> <p>Step 1: Brainstorm (All about Eve) 165</p> <p>Step 2: Define the purpose 166</p> <p>Step 3: Choose the most functional visualization type for your purpose 166</p> <p>Picking the Most Appropriate Design Style 167</p> <p>Inducing a calculating, exacting response 167</p> <p>Eliciting a strong emotional response 168</p> <p>Selecting the Appropriate Data Graphic Type 170</p> <p>Standard chart graphics 171</p> <p>Comparative graphics 173</p> <p>Statistical plots 176</p> <p>Topology structures 179</p> <p>Spatial plots and maps 180</p> <p>Testing Data Graphics 183</p> <p>Adding Context 184</p> <p>Creating context with data 184</p> <p>Creating context with annotations 185</p> <p>Creating context with graphical elements 186</p> <p><b>Part 3: Taking Stock of Your Data Science Capabilities </b><b>187</b></p> <p><b>Chapter 9: Developing Your Business Acumen</b><b> 189</b></p> <p>Bridging the Business Gap 189</p> <p>Contrasting business acumen with subject matter expertise 190</p> <p>Defining business acumen 191</p> <p>Traversing the Business Landscape 192</p> <p>Seeing how data roles support the business in making money 192</p> <p>Leveling up your business acumen 195</p> <p>Fortifying your leadership skills 196</p> <p>Surveying Use Cases and Case Studies 197</p> <p>Documen
520 |a Monetize your company's data and data science expertise without spending a fortune on hiring independent strategy consultants to help What if there was one simple, clear process for ensuring that all your company's data science projects achieve a high a return on investment? What if you could validate your ideas for future data science projects, and select the one idea that's most prime for achieving profitability while also moving your company closer to its business vision? There is. Industry-acclaimed data science consultant, Lillian Pierson, shares her proprietary STAR Framework - A simple, proven process for leading profit-forming data science projects. Not sure what data science is yet? Don't worry! Parts 1 and 2 of Data Science For Dummies will get all the bases covered for you. And if you're already a data science expert? Then you really won't want to miss the data science strategy and data monetization gems that are shared in Part 3 onward throughout this book. Data Science For Dummies demonstrates: The only process you'll ever need to lead profitable data science projects Secret, reverse-engineered data monetization tactics that no one's talking about The shocking truth about how simple natural language processing can be How to beat the crowd of data professionals by cultivating your own unique blend of data science expertise Whether you're new to the data science field or already a decade in, you're sure to learn something new and incredibly valuable from Data Science For Dummies. Discover how to generate massive business wins from your company's data by picking up your copy today.
588 |a Description based on CIP data; resource not viewed.
590 |a O'Reilly|b O'Reilly Online Learning: Academic/Public Library Edition
650 0|a Data mining.|9 71797
650 0|a Business|x Data processing.|9 31545
77608|i Print version:|z 9781119811558
85640|u https://library.access.arlingtonva.us/login?url=https://learning.oreilly.com/library/view/~/9781119811558/?ar|x O'Reilly|z eBook
938 |a Askews and Holts Library Services|b ASKH|n AH39163136
994 |a 92|b VIA
999 |c 358233|d 358233