Catalog Search Results
Author
Publisher
Princeton University Press
Language
English
Description
"Data describe and represent the world. However, no matter how big they may be, data sets don't - indeed cannot - capture everything. Data are measurements - and, as such, they represent only what has been measured. They don't necessarily capture all the information that is relevant to the questions we may want to ask. If we do not take into account what may be missing/unknown in the data we have, we may find ourselves unwittingly asking questions...
Author
Publisher
O'Reilly Media, Inc
Pub. Date
[2022]
Language
English
Description
By 2025, the estimated global volume of data is expected to reach 180 ZB, more than double the amount collected in 2020. Yet despite data's increased presence and value within organizations, solutions for ensuring data quality have received little attention. This report examines how adequate observability combined with best practices for data logging and monitoring can help organizations efficiently redistribute their management of data quality and...
Author
Publisher
Academic Press
Pub. Date
[2018]
Language
English
Description
Principles and Practice of Big Data: Preparing, Sharing, and Analyzing Complex Information, Second Edition updates and expands on the first edition, bringing a set of techniques and algorithms that are tailored to Big Data projects. The book stresses the point that most data analyses conducted on large, complex data sets can be achieved without the use of specialized suites of software (e.g., Hadoop), and without expensive hardware (e.g., supercomputers)....
6) Google BigQuery: the definitive guide : data warehousing, analytics, and machine learning at Scale
Author
Publisher
O'Reilly Media
Pub. Date
[2019]
Language
English
Formats
Description
"Derive insights from petabyte-scale datasets while building a collaborative, agile workplace in the process. This practical book is the canonical reference to Google BigQuery whose storage system lets you consolidate data from across your enterprise, and whose query engine enables you to condust interactive analysis and machine learning on large datasets. BigQuery enables enterprises to efficiently store, query, ingest, and learn from data in one...
7) Data lakes
Series
Publisher
Wiley
Pub. Date
2020
Language
English
Description
The concept of a data lake is less than 10 years old, but they are already hugely implemented within large companies. Their goal is to efficiently deal with ever-growing volumes of heterogeneous data, while also facing various sophisticated user needs. However, defining and building a data lake is still a challenge, as no consensus has been reached so far. Data Lakes presents recent outcomes and trends in the field of data repositories. The main topics...
Author
Publisher
Packt Publishing, Limited
Pub. Date
2022
Language
English
Description
Process tabular data and build high-performance query engines on modern CPUs and GPUs using Apache Arrow, a standardized language-independent memory format, for optimal performance Key Features Learn about Apache Arrow's data types and interoperability with pandas and Parquet Work with Apache Arrow Flight RPC, Compute, and Dataset APIs to produce and consume tabular data Reviewed, contributed, and supported by Dremio, the co-creator of Apache Arrow...
Author
Publisher
Apress
Pub. Date
[2019]
Language
English
Description
At first glance, the skills required to work in the data science field appear to be self-explanatory. Do not be fooled. Impactful data science demands an interdisciplinary knowledge of business philosophy, project management, salesmanship, presentation, and more. In Managing Your Data Science Projects, author Robert de Graaf explores important concepts that are frequently overlooked in much of the instructional literature that is available to data...
Author
Publisher
Packt Publishing, Limited
Pub. Date
2023
Language
English
Description
A hands-on guide to working on use cases helping you ingest, analyze, and serve insightful data from IoT as well as telemetry data sources using Azure Synapse Data Explorer Free PDF included with this book Key Features Augment advanced analytics projects with your IoT and application data Expand your existing Azure Synapse environments with unstructured data Build industry-level projects on integration, experimentation, and dashboarding with Azure...
Author
Publisher
Apress
Pub. Date
[2022]
Language
English
Description
Implement the Snowflake Data Cloud using best practices and reap the benefits of scalability and low-cost from the industry-leading, cloud-based, data warehousing platform. This book provides a detailed how-to explanation, and assumes familiarity with Snowflake core concepts and principles. It is a project-oriented book with a hands-on approach to designing, developing, and implementing your Data Cloud with security at the center. As you work through...
Author
Publisher
Apress L.P
Pub. Date
[2022]
Language
English
Description
Understand the essentials of the Snowflake Database and the overall Snowflake Data Cloud. This book covers how Snowflake's architecture is different from prior on-premises and cloud databases. The authors also discuss, from an insider perspective, how Snowflake grew so fast to become the largest software IPO of all time. Snowflake was the first database made specifically to be optimized with a cloud architecture. This book helps you get started using...
Author
Language
English
Formats
Description
An irreverent, provocative, and visually fascinating look at what our online lives reveal about who we really are--and how this deluge of data will transform the science of human behavior. Big Data is used to spy on us, hire and fire us, and sell us things we don't need. In Dataclysm, Christian Rudder puts this flood of information to an entirely different use: understanding human nature. Drawing on terabytes of data from Twitter, Facebook, Reddit,...
17) THE ENTERPRISE DATA CATALOG: improve data discovery, ensure data governance, and enable innovation
Author
Publisher
O'Reilly Media, Inc
Pub. Date
[2023]
Language
English
Description
Combing the web is simple, but how do you search for data at work? It's difficult and time-consuming, and can sometimes seem impossible. This book introduces a practical solution: the data catalog. Data analysts, data scientists, and data engineers will learn how to create true data discovery in their organizations, making the catalog a key enabler for data-driven innovation and data governance.
Publisher
John Wiley & Sons, Inc
Pub. Date
2021.
Language
English
Description
"Written and painstakingly edited by leading experts in their respective fields, this volume offers a state-of-the-art overview of Big Data issues, concerns, and responses in survey methodology. Like several other books in the Wiley Series in Survey Methodology, this work has been prepared in conjunction with an international conference on the topic by the Survey Research Methods Section of the American Statistical Association. The conference and...
Publisher
Scrivener Publishing
Pub. Date
2022.
Language
English
Description
BIG DATA ANALYTICS AND MACHINE INTELLIGENCE IN BIOMEDICAL AND HEALTH INFORMATICS Provides coverage of developments and state-of-the-art methods in the broad and diversified data analytics field and applicable areas such as big data analytics, data mining, and machine intelligence in biomedical and health informatics. The novel applications of Big Data Analytics and machine intelligence in the biomedical and healthcare sector is an emerging field comprising...
Didn't find it?
Can't find what you are looking for? Try our Materials Request Service. Submit Request