Data science for dummies
Description
Monetize your company’s data and data science expertise without spending a fortune on hiring independent strategy consultants to help
What if there was one simple, clear process for ensuring that all your company’s data science projects achieve a high a return on investment? What if you could validate your ideas for future data science projects, and select the one idea that’s most prime for achieving profitability while also moving your company closer to its business vision? There is.
Industry-acclaimed data science consultant, Lillian Pierson, shares her proprietary STAR Framework – A simple, proven process for leading profit-forming data science projects.
Not sure what data science is yet? Don’t worry! Parts 1 and 2 of Data Science For Dummies will get all the bases covered for you. And if you’re already a data science expert? Then you really won’t want to miss the data science strategy and data monetization gems that are shared in Part 3 onward throughout this book.
Data Science For Dummies demonstrates:
- The only process you’ll ever need to lead profitable data science projects
- Secret, reverse-engineered data monetization tactics that no one’s talking about
- The shocking truth about how simple natural language processing can be
- How to beat the crowd of data professionals by cultivating your own unique blend of data science expertise
Whether you’re new to the data science field or already a decade in, you’re sure to learn something new and incredibly valuable from Data Science For Dummies. Discover how to generate massive business wins from your company’s data by picking up your copy today.
More Details
Notes
Table of Contents
Introduction 1
About This Book 3
Foolish Assumptions 3
Icons Used in This Book 4
Beyond the Book 4
Where to Go from Here 4
Part 1: Getting Started with Data Science 5
Chapter 1: Wrapping Your Head Around Data Science 7
Seeing Who Can Make Use of Data Science 8
Inspecting the Pieces of the Data Science Puzzle 10
Collecting, querying, and consuming data 11
Applying mathematical modeling to data science tasks 12
Deriving insights from statistical methods 12
Coding, coding, coding
Applying data science to a subject area 13
Communicating data insights 14
Exploring Career Alternatives That Involve Data Science 15
The data implementer 16
The data leader 16
The data entrepreneur 17
Chapter 2: Tapping into Critical Aspects of Data Engineering 19
Defining Big Data and the Three Vs 19
Grappling with data volume 21
Handling data velocity 21
Dealing with data variety 22
Identifying Important Data Sources 23
Grasping the Differences among Data Approaches 24
Defining data science 25
Defining machine learning engineering 26
Defining data engineering 26
Comparing machine learning engineers, data scientists, and data engineers 27
Storing and Processing Data for Data Science 28
Storing data and doing data science directly in the cloud 28
Storing big data on-premise 32
Processing big data in real-time 35
Part 2: Using Data Science to Extract Meaning from Your Data 37
Chapter 3: Machine Learning Means Using a Machine to Learn from Data 39
Defining Machine Learning and Its Processes 40
Walking through the steps of the machine learning process 40
Becoming familiar with machine learning terms 41
Considering Learning Styles 42
Learning with supervised algorithms 42
Learning with unsupervised algorithms 43
Learning with reinforcement 43
Seeing What You Can Do 43
Selecting algorithms based on function 44
Using Spark to generate real-time big data analytics 48
Chapter 4: Math, Probability, and Statistical Modeling 51
Exploring Probability and Inferential Statistics 52
Probability distributions 53
Conditional probability with Naïve Bayes 55
Quantifying Correlation 56
Calculating correlation with Pearson's r 56
Ranking variable-pairs using Spearman's rank correlation 58
Reducing Data Dimensionality with Linear Algebra 59
Decomposing data to reduce dimensionality 59
Reducing dimensionality with factor analysis 63
Decreasing dimensionality and removing outliers with PCA 64
Modeling Decisions with Multiple Criteria Decision-Making 65
Turning to traditional MCDM 65
Focusing on fuzzy MCDM 67
Introducing Regression Methods 67
Linear regression 67
Logistic regression 69
Ordinary least squares (OLS) regression methods 70
Detecting Outliers 70
Analyzing extreme values 70
Detecting outliers with univariate analysis 71
Detecting outliers with multivariate analysis 73
Introducing Time Series Analysis 73
Identifying patterns in time series 74
Modeling univariate time series data 75
Chapter 5: Grouping Your Way into Accurate Predictions 77
Starting with Clustering Basics 78
Getting to know clustering algorithms 79
Examining clustering similarity metrics 81
Identifying Clusters in Your Data 82
Clustering with the k-means algorithm 82
Estimating clusters with kernel density estimation (KDE) 84
Clustering with hierarchical algorithms 84
Dabbling in the DBScan neighborhood 87
Categorizing Data with Decision Tree and Random Forest Algorithms 88
Drawing a Line between Clustering and Classification 89
Introducing instance-based learning classifiers 90
Getting to know classification algorithms 90
Making Sense of Data with Nearest Neighbor Analysis 93
Classifying Data with Average Nearest Neighbor Algorithms 94
Classifying with K-Nearest Neighbor Algorithms 97
Understanding how the k-nearest neighbor algorithm works 98
Knowing when to use the k-nearest neighbor algorithm 99
Exploring common applications of k-nearest neighbour algorithms 100
Solving Real-World Problems with Nearest Neighbor Algorithms 100
Seeing k-nearest neighbor algorithms in action 101
Seeing average nearest neighbor algorithms in action 101
Chapter 6: Coding Up Data Insights and Decision Engines 103
Seeing Where Python and R Fit into Your Data Science Strategy 104
Using Python for Data Science 104
Sorting out the various Python data types 106
Putting loops to good use in Python 109
Having fun with functions 110
Keeping cool with classes 112
Checking out some useful Python libraries 114
Using Open Source R for Data Science 120
Comprehending R's basic vocabulary 121
Delving into functions and operators 124
Iterating in R 127
Observing how objects work 129
Sorting out R's popular statistical analysis packages 131
Examining packages for visualizing, mapping, and graphing in R 133
Chapter 7: Generating Insights with Software Applications 137
Choosing the Best Tools for Your Data Science Strategy 138
Getting a Handle on SQL and Relational Databases 139
Investing Some Effort into Database Design 144
Defining data types 144
Designing constraints properly 145
Normalizing your database 145
Narrowing the Focus with SQL Functions 147
Making Life Easier with Excel 151
Using Excel to quickly get to know your data 152
Reformatting and summarizing with PivotTables 157
Automating Excel tasks with macros 158
Chapter 8: Telling Powerful Stories with Data 161
Data Visualizations: The Big Three 162
Data storytelling for decision makers 162
Data showcasing for analysts 163
Designing data art for activists 164
Designing to Meet the Needs of Your Target Audience 164
Step 1: Brainstorm (All about Eve) 165
Step 2: Define the purpose 166
Step 3: Choose the most functional visualization type for your purpose 166
Picking the Most Appropriate Design Style 167
Inducing a calculating, exacting response 167
Eliciting a strong emotional response 168
Selecting the Appropriate Data Graphic Type 170
Standard chart graphics 171
Comparative graphics 173
Statistical plots 176
Topology structures 179
Spatial plots and maps 180
Testing Data Graphics 183
Adding Context 184
Creating context with data 184
Creating context with annotations 185
Creating context with graphical elements 186
Part 3: Taking Stock of Your Data Science Capabilities 187
Chapter 9: Developing Your Business Acumen 189
Bridging the Business Gap 189
Contrasting business acumen with subject matter expertise 190
Defining business acumen 191
Traversing the Business Landscape 192
Seeing how data roles support the business in making money 192
Leveling up your business acumen 195
Fortifying your leadership skills 196
Surveying Use Cases and Case Studies 197
Documen
Similar Titles From NoveList
Reviews from GoodReads
Citations
Pierson, L. (2021). Data science for dummies (Third edition.). For Dummies.
Chicago / Turabian - Author Date Citation, 17th Edition (style guide)Pierson, Lillian. 2021. Data Science for Dummies. Hoboken, NJ: For Dummies.
Chicago / Turabian - Humanities (Notes and Bibliography) Citation, 17th Edition (style guide)Pierson, Lillian. Data Science for Dummies Hoboken, NJ: For Dummies, 2021.
Harvard Citation (style guide)Pierson, L. (2021). Data science for dummies. Third edn. Hoboken, NJ: For Dummies.
MLA Citation, 9th Edition (style guide)Pierson, Lillian. Data Science for Dummies Third edition., For Dummies, 2021.
Staff View
Grouping Information
Grouped Work ID | a77631e1-3733-d566-f9e3-661e7a84abae-eng |
---|---|
Full title | data science for dummies |
Author | pierson lillian |
Grouping Category | book |
Last Update | 2025-01-24 12:33:29PM |
Last Indexed | 2025-05-22 03:32:02AM |
Book Cover Information
Image Source | syndetics |
---|---|
First Loaded | Dec 21, 2024 |
Last Used | Mar 4, 2025 |
Marc Record
First Detected | Dec 16, 2024 11:19:37 PM |
---|---|
Last File Modification Time | Dec 17, 2024 08:19:22 AM |
Suppressed | Record had no items |
MARC Record
LEADER | 11032cam a22004691i 4500 | ||
---|---|---|---|
001 | on1285688053 | ||
003 | OCoLC | ||
005 | 20241217081631.0 | ||
006 | m d | ||
007 | cr ||||||||||| | ||
008 | 210831s2021 nju o 000 0 eng d | ||
015 | |a GBC1F1452|2 bnb | ||
016 | 7 | |a 020324020|2 Uk | |
019 | |a 1289843165 | ||
020 | |a 9781119811619|q (ePub ebook) | ||
020 | |a 1119811619 | ||
035 | |a (OCoLC)1285688053|z (OCoLC)1289843165 | ||
037 | |a 9781119811619|b Wiley | ||
037 | |a 9781119811558|b O'Reilly Media | ||
040 | |a UKMGB|b eng|e rda|e pn|c UKMGB|d OCLCF|d UKAHL|d OCLCO|d ORMDA|d OCLCO|d OCLCQ|d OCLCO | ||
049 | |a MAIN | ||
050 | 4 | |a T58.5 | |
082 | 0 | 4 | |a 006.312|2 23 |
100 | 1 | |a Pierson, Lillian,|e author.|9 350190 | |
245 | 1 | 0 | |a Data science for dummies /|c Lillian Pierson. |
250 | |a Third edition. | ||
264 | 1 | |a Hoboken, NJ :|b For Dummies,|c 2021. | |
300 | |a 1 online resource | ||
336 | |a text|2 rdacontent | ||
337 | |a computer|2 rdamedia | ||
338 | |a online resource|2 rdacarrier | ||
505 | 0 | |a <P><b>Introduction</b><b> 1</b></p> <p>About This Book 3</p> <p>Foolish Assumptions 3</p> <p>Icons Used in This Book 4</p> <p>Beyond the Book 4</p> <p>Where to Go from Here 4</p> <p><b>Part 1: Getting Started with Data Science</b><b> 5</b></p> <p><b>Chapter 1: Wrapping Your Head Around Data Science</b><b> 7</b></p> <p>Seeing Who Can Make Use of Data Science 8</p> <p>Inspecting the Pieces of the Data Science Puzzle 10</p> <p>Collecting, querying, and consuming data 11</p> <p>Applying mathematical modeling to data science tasks 12</p> <p>Deriving insights from statistical methods 12</p> <p>Coding, coding, coding -- it's just part of the game 13</p> <p>Applying data science to a subject area 13</p> <p>Communicating data insights 14</p> <p>Exploring Career Alternatives That Involve Data Science 15</p> <p>The data implementer 16</p> <p>The data leader 16</p> <p>The data entrepreneur 17</p> <p><b>Chapter 2: Tapping into Critical Aspects of Data Engineering</b><b> 19</b></p> <p>Defining Big Data and the Three Vs 19</p> <p>Grappling with data volume 21</p> <p>Handling data velocity 21</p> <p>Dealing with data variety 22</p> <p>Identifying Important Data Sources 23</p> <p>Grasping the Differences among Data Approaches 24</p> <p>Defining data science 25</p> <p>Defining machine learning engineering 26</p> <p>Defining data engineering 26</p> <p>Comparing machine learning engineers, data scientists, and data engineers 27</p> <p>Storing and Processing Data for Data Science 28</p> <p>Storing data and doing data science directly in the cloud 28</p> <p>Storing big data on-premise 32</p> <p>Processing big data in real-time 35</p> <p><b>Part 2: Using Data Science to Extract Meaning from Your Data </b><b>37</b></p> <p><b>Chapter 3: Machine Learning Means Using a Machine to Learn from Data</b><b> 39</b></p> <p>Defining Machine Learning and Its Processes 40</p> <p>Walking through the steps of the machine learning process 40</p> <p>Becoming familiar with machine learning terms 41</p> <p>Considering Learning Styles 42</p> <p>Learning with supervised algorithms 42</p> <p>Learning with unsupervised algorithms 43</p> <p>Learning with reinforcement 43</p> <p>Seeing What You Can Do 43</p> <p>Selecting algorithms based on function 44</p> <p>Using Spark to generate real-time big data analytics 48</p> <p><b>Chapter 4: Math, Probability, and Statistical Modeling</b><b> 51</b></p> <p>Exploring Probability and Inferential Statistics 52</p> <p>Probability distributions 53</p> <p>Conditional probability with Naïve Bayes 55</p> <p>Quantifying Correlation 56</p> <p>Calculating correlation with Pearson's r 56</p> <p>Ranking variable-pairs using Spearman's rank correlation 58</p> <p>Reducing Data Dimensionality with Linear Algebra 59</p> <p>Decomposing data to reduce dimensionality 59</p> <p>Reducing dimensionality with factor analysis 63</p> <p>Decreasing dimensionality and removing outliers with PCA 64</p> <p>Modeling Decisions with Multiple Criteria Decision-Making 65</p> <p>Turning to traditional MCDM 65</p> <p>Focusing on fuzzy MCDM 67</p> <p>Introducing Regression Methods 67</p> <p>Linear regression 67</p> <p>Logistic regression 69</p> <p>Ordinary least squares (OLS) regression methods 70</p> <p>Detecting Outliers 70</p> <p>Analyzing extreme values 70</p> <p>Detecting outliers with univariate analysis 71</p> <p>Detecting outliers with multivariate analysis 73</p> <p>Introducing Time Series Analysis 73</p> <p>Identifying patterns in time series 74</p> <p>Modeling univariate time series data 75</p> <p><b>Chapter 5: Grouping Your Way into Accurate Predictions</b><b> 77</b></p> <p>Starting with Clustering Basics 78</p> <p>Getting to know clustering algorithms 79</p> <p>Examining clustering similarity metrics 81</p> <p>Identifying Clusters in Your Data 82</p> <p>Clustering with the k-means algorithm 82</p> <p>Estimating clusters with kernel density estimation (KDE) 84</p> <p>Clustering with hierarchical algorithms 84</p> <p>Dabbling in the DBScan neighborhood 87</p> <p>Categorizing Data with Decision Tree and Random Forest Algorithms 88</p> <p>Drawing a Line between Clustering and Classification 89</p> <p>Introducing instance-based learning classifiers 90</p> <p>Getting to know classification algorithms 90</p> <p>Making Sense of Data with Nearest Neighbor Analysis 93</p> <p>Classifying Data with Average Nearest Neighbor Algorithms 94</p> <p>Classifying with K-Nearest Neighbor Algorithms 97</p> <p>Understanding how the k-nearest neighbor algorithm works 98</p> <p>Knowing when to use the k-nearest neighbor algorithm 99</p> <p>Exploring common applications of k-nearest neighbour algorithms 100</p> <p>Solving Real-World Problems with Nearest Neighbor Algorithms 100</p> <p>Seeing k-nearest neighbor algorithms in action 101</p> <p>Seeing average nearest neighbor algorithms in action 101</p> <p><b>Chapter 6: Coding Up Data Insights and Decision Engines</b><b> 103</b></p> <p>Seeing Where Python and R Fit into Your Data Science Strategy 104</p> <p>Using Python for Data Science 104</p> <p>Sorting out the various Python data types 106</p> <p>Putting loops to good use in Python 109</p> <p>Having fun with functions 110</p> <p>Keeping cool with classes 112</p> <p>Checking out some useful Python libraries 114</p> <p>Using Open Source R for Data Science 120</p> <p>Comprehending R's basic vocabulary 121</p> <p>Delving into functions and operators 124</p> <p>Iterating in R 127</p> <p>Observing how objects work 129</p> <p>Sorting out R's popular statistical analysis packages 131</p> <p>Examining packages for visualizing, mapping, and graphing in R 133</p> <p><b>Chapter 7: Generating Insights with Software Applications</b><b> 137</b></p> <p>Choosing the Best Tools for Your Data Science Strategy 138</p> <p>Getting a Handle on SQL and Relational Databases 139</p> <p>Investing Some Effort into Database Design 144</p> <p>Defining data types 144</p> <p>Designing constraints properly 145</p> <p>Normalizing your database 145</p> <p>Narrowing the Focus with SQL Functions 147</p> <p>Making Life Easier with Excel 151</p> <p>Using Excel to quickly get to know your data 152</p> <p>Reformatting and summarizing with PivotTables 157</p> <p>Automating Excel tasks with macros 158</p> <p><b>Chapter 8: Telling Powerful Stories with Data</b><b> 161</b></p> <p>Data Visualizations: The Big Three 162</p> <p>Data storytelling for decision makers 162</p> <p>Data showcasing for analysts 163</p> <p>Designing data art for activists 164</p> <p>Designing to Meet the Needs of Your Target Audience 164</p> <p>Step 1: Brainstorm (All about Eve) 165</p> <p>Step 2: Define the purpose 166</p> <p>Step 3: Choose the most functional visualization type for your purpose 166</p> <p>Picking the Most Appropriate Design Style 167</p> <p>Inducing a calculating, exacting response 167</p> <p>Eliciting a strong emotional response 168</p> <p>Selecting the Appropriate Data Graphic Type 170</p> <p>Standard chart graphics 171</p> <p>Comparative graphics 173</p> <p>Statistical plots 176</p> <p>Topology structures 179</p> <p>Spatial plots and maps 180</p> <p>Testing Data Graphics 183</p> <p>Adding Context 184</p> <p>Creating context with data 184</p> <p>Creating context with annotations 185</p> <p>Creating context with graphical elements 186</p> <p><b>Part 3: Taking Stock of Your Data Science Capabilities </b><b>187</b></p> <p><b>Chapter 9: Developing Your Business Acumen</b><b> 189</b></p> <p>Bridging the Business Gap 189</p> <p>Contrasting business acumen with subject matter expertise 190</p> <p>Defining business acumen 191</p> <p>Traversing the Business Landscape 192</p> <p>Seeing how data roles support the business in making money 192</p> <p>Leveling up your business acumen 195</p> <p>Fortifying your leadership skills 196</p> <p>Surveying Use Cases and Case Studies 197</p> <p>Documen | |
520 | |a Monetize your company's data and data science expertise without spending a fortune on hiring independent strategy consultants to help What if there was one simple, clear process for ensuring that all your company's data science projects achieve a high a return on investment? What if you could validate your ideas for future data science projects, and select the one idea that's most prime for achieving profitability while also moving your company closer to its business vision? There is. Industry-acclaimed data science consultant, Lillian Pierson, shares her proprietary STAR Framework - A simple, proven process for leading profit-forming data science projects. Not sure what data science is yet? Don't worry! Parts 1 and 2 of Data Science For Dummies will get all the bases covered for you. And if you're already a data science expert? Then you really won't want to miss the data science strategy and data monetization gems that are shared in Part 3 onward throughout this book. Data Science For Dummies demonstrates: The only process you'll ever need to lead profitable data science projects Secret, reverse-engineered data monetization tactics that no one's talking about The shocking truth about how simple natural language processing can be How to beat the crowd of data professionals by cultivating your own unique blend of data science expertise Whether you're new to the data science field or already a decade in, you're sure to learn something new and incredibly valuable from Data Science For Dummies. Discover how to generate massive business wins from your company's data by picking up your copy today. | ||
588 | |a Description based on CIP data; resource not viewed. | ||
590 | |a O'Reilly|b O'Reilly Online Learning: Academic/Public Library Edition | ||
650 | 0 | |a Data mining.|9 71797 | |
650 | 0 | |a Business|x Data processing.|9 31545 | |
776 | 0 | 8 | |i Print version:|z 9781119811558 |
856 | 4 | 0 | |u https://library.access.arlingtonva.us/login?url=https://learning.oreilly.com/library/view/~/9781119811558/?ar|x O'Reilly|z eBook |
938 | |a Askews and Holts Library Services|b ASKH|n AH39163136 | ||
994 | |a 92|b VIA | ||
999 | |c 358233|d 358233 |