Stream processing with Apache Flink : fundamentals, implementation, and operation of streaming applications

Book Cover
Average Rating
Published
Sebastopol, CA : O'Reilly Media, Inc., 2019.
Status
Available Online

Description

Get started with Apache Flink, the open source framework that powers some of the world's largest stream processing applications. With this practical book, you'll explore the fundamental concepts of parallel stream processing and discover how this technology differs from traditional batch data processing.

Longtime Apache Flink committers Fabian Hueske and Vasia Kalavri show you how to implement scalable streaming applications with Flink's DataStream API and continuously run and maintain these applications in operational environments. Stream processing is ideal for many use cases, including low-latency ETL, streaming analytics, and real-time dashboards as well as fraud detection, anomaly detection, and alerting. You can process continuous data of any kind, including user interactions, financial transactions, and IoT data, as soon as you generate them.

  • Learn concepts and challenges of distributed stateful stream processing
  • Explore Flink's system architecture, including its event-time processing mode and fault-tolerance model
  • Understand the fundamentals and building blocks of the DataStream API, including its time-based and statefuloperators
  • Read data from and write data to external systems with exactly-once consistency
  • Deploy and configure Flink clusters
  • Operate continuously running streaming applications

More Details

Format
Edition
First edition.
Language
English
ISBN
9781491974261, 1491974265, 9781491974247, 1491974249

Notes

Bibliography
Includes bibliographical references and index.
Description
"Get started with Apache Flink, the open source framework that powers some of the world's largest stream processing applications. With this practical book, you'll explore the fundamental concepts of parallel stream processing and discover how this technology differs from traditional batch data processing. Longtime Apache Flink committers Fabian Hueske and Vasia Kalavri show you how to implement scalable streaming applications with Flink's DataStream API and continuously run and maintain these applications in operational environments. Stream processing is ideal for many use cases, including low-latency ETL, streaming analytics, and real-time dashboards as well as fraud detection, anomaly detection, and alerting. You can process continuous data of any kind, including user interactions, financial transactions, and loT data, as soon as you generate them."-- Provided by publisher
Local note
O'Reilly O'Reilly Online Learning: Academic/Public Library Edition

Table of Contents

Cover; Copyright; Table of Contents; Preface; What You Will Learn in This Book; Conventions Used in This Book; Using Code Examples; O'Reilly Online Learning; How to Contact Us; Acknowledgments; Chapter 1. Introduction to Stateful Stream Processing; Traditional Data Infrastructures; Transactional Processing; Analytical Processing; Stateful Stream Processing; Event-Driven Applications; Data Pipelines; Streaming Analytics; The Evolution of Open Source Stream Processing; A Bit of History; A Quick Look at Flink; Running Your First Flink Application; Summary
Chapter 2. Stream Processing FundamentalsIntroduction to Dataflow Programming; Dataflow Graphs; Data Parallelism and Task Parallelism; Data Exchange Strategies; Processing Streams in Parallel; Latency and Throughput; Operations on Data Streams; Time Semantics; What Does One Minute Mean in Stream Processing?; Processing Time; Event Time; Watermarks; Processing Time Versus Event Time; State and Consistency Models; Task Failures; Result Guarantees; Summary; Chapter 3. The Architecture of Apache Flink; System Architecture; Components of a Flink Setup; Application Deployment; Task Execution
Highly Available SetupData Transfer in Flink; Credit-Based Flow Control; Task Chaining; Event-Time Processing; Timestamps; Watermarks; Watermark Propagation and Event Time; Timestamp Assignment and Watermark Generation; State Management; Operator State; Keyed State; State Backends; Scaling Stateful Operators; Checkpoints, Savepoints, and State Recovery; Consistent Checkpoints; Recovery from a Consistent Checkpoint; Flink's Checkpointing Algorithm; Performace Implications of Checkpointing; Savepoints; Summary; Chapter 4. Setting Up a Development Environment for Apache Flink; Required Software
Run and Debug Flink Applications in an IDEImport the Book's Examples in an IDE; Run Flink Applications in an IDE; Debug Flink Applications in an IDE; Bootstrap a Flink Maven Project; Summary; Chapter 5. The DataStream API (v1.7); Hello, Flink!; Set Up the Execution Environment; Read an Input Stream; Apply Transformations; Output the Result; Execute; Transformations; Basic Transformations; KeyedStream Transformations; Multistream Transformations; Distribution Transformations; Setting the Parallelism; Types; Supported Data Types; Creating Type Information for Data Types
Explicitly Providing Type InformationDefining Keys and Referencing Fields; Field Positions; Field Expressions; Key Selectors; Implementing Functions; Function Classes; Lambda Functions; Rich Functions; Including External and Flink Dependencies; Summary; Chapter 6. Time-Based and Window Operators; Configuring Time Characteristics; Assigning Timestamps and Generating Watermarks; Watermarks, Latency, and Completeness; Process Functions; TimerService and Timers; Emitting to Side Outputs; CoProcessFunction; Window Operators; Defining Window Operators; Built-in Window Assigners

Discover More

Author Notes

Loading Author Notes...

Similar Titles From NoveList

NoveList provides detailed suggestions for titles you might like if you enjoyed this book. Suggestions are based on recommendations from librarians and other contributors.
These have the subjects "Open source software" and "Computers / Data Science / General."
These have the subjects "Open source software" and "Computers / Data Science / General."
These have the subjects "Open source software" and "Computers / Data Science / General."

Reviews from GoodReads

Loading GoodReads Reviews.

Citations

APA Citation, 7th Edition (style guide)

Hueske, F., & Kalavri, V. (2019). Stream processing with Apache Flink: fundamentals, implementation, and operation of streaming applications (First edition.). O'Reilly Media, Inc..

Chicago / Turabian - Author Date Citation, 17th Edition (style guide)

Hueske, Fabian and Vasiliki, Kalavri. 2019. Stream Processing With Apache Flink: Fundamentals, Implementation, and Operation of Streaming Applications. Sebastopol, CA: O'Reilly Media, Inc.

Chicago / Turabian - Humanities (Notes and Bibliography) Citation, 17th Edition (style guide)

Hueske, Fabian and Vasiliki, Kalavri. Stream Processing With Apache Flink: Fundamentals, Implementation, and Operation of Streaming Applications Sebastopol, CA: O'Reilly Media, Inc, 2019.

Harvard Citation (style guide)

Hueske, F. and Kalavri, V. (2019). Stream processing with apache flink: fundamentals, implementation, and operation of streaming applications. First edn. Sebastopol, CA: O'Reilly Media, Inc.

MLA Citation, 9th Edition (style guide)

Hueske, Fabian,, and Vasiliki Kalavri. Stream Processing With Apache Flink: Fundamentals, Implementation, and Operation of Streaming Applications First edition., O'Reilly Media, Inc., 2019.

Note! Citations contain only title, author, edition, publisher, and year published. Citations should be used as a guideline and should be double checked for accuracy. Citation formats are based on standards as of August 2021.

Staff View

Grouped Work ID
9b704435-ee32-fc80-2988-da3110c64783-eng
Go To Grouped Work View in Staff Client

Grouping Information

Grouped Work ID9b704435-ee32-fc80-2988-da3110c64783-eng
Full titlestream processing with apache flink fundamentals implementation and operation of streaming applications
Authorhueske fabian
Grouping Categorybook
Last Update2025-01-24 12:33:29PM
Last Indexed2025-05-22 03:29:52AM

Book Cover Information

Image Sourcesyndetics
First LoadedAug 29, 2024
Last UsedMay 10, 2025

Marc Record

First DetectedMar 21, 2023 12:10:40 PM
Last File Modification TimeOct 21, 2024 08:17:30 AM
SuppressedRecord had no items

MARC Record

LEADER06893cam a2200673 i 4500
001on1097184105
003OCoLC
00520241021081439.0
006m     o  d        
007cr cnu---unuuu
008190415t20192019caua    ob    001 0 eng d
015 |a GBB7F9145|2 bnb
0167 |a 018495486|2 Uk
019 |a 1097313086|a 1099281163|a 1122591126|a 1202569676|a 1240527902
020 |a 9781491974261|q (electronic book)
020 |a 1491974265|q (electronic book)
020 |a 9781491974247|q (electronic book)
020 |a 1491974249|q (electronic book)
035 |a (OCoLC)1097184105|z (OCoLC)1097313086|z (OCoLC)1099281163|z (OCoLC)1122591126|z (OCoLC)1202569676|z (OCoLC)1240527902
037 |a 54C11709-AD5F-496B-9763-40531BDF5272|b OverDrive, Inc.|n http://www.overdrive.com
040 |a N$T|b eng|e rda|e pn|c N$T|d N$T|d EBLCP|d TEFOD|d YDXIT|d UMI|d MERER|d YDX|d UKAHL|d OCLCQ|d VT2|d OCLCQ|d OCLCO|d NZAUC|d OCLCQ|d OCLCO|d OCLCL
049 |a MAIN
050 4|a TK5105.887|b .H84 2019
072 7|a COM|x 000000|2 bisacsh
08204|a 006.7/876|2 23
1001 |a Hueske, Fabian,|e author.
24510|a Stream processing with Apache Flink :|b fundamentals, implementation, and operation of streaming applications /|c Fabian Hueske and Vasiliki Kalavri.
250 |a First edition.
264 1|a Sebastopol, CA :|b O'Reilly Media, Inc.,|c 2019.
264 4|c ©2019
300 |a 1 online resource
336 |a text|b txt|2 rdacontent
337 |a computer|b c|2 rdamedia
338 |a online resource|b cr|2 rdacarrier
504 |a Includes bibliographical references and index.
5050 |a Cover; Copyright; Table of Contents; Preface; What You Will Learn in This Book; Conventions Used in This Book; Using Code Examples; O'Reilly Online Learning; How to Contact Us; Acknowledgments; Chapter 1. Introduction to Stateful Stream Processing; Traditional Data Infrastructures; Transactional Processing; Analytical Processing; Stateful Stream Processing; Event-Driven Applications; Data Pipelines; Streaming Analytics; The Evolution of Open Source Stream Processing; A Bit of History; A Quick Look at Flink; Running Your First Flink Application; Summary
5058 |a Chapter 2. Stream Processing FundamentalsIntroduction to Dataflow Programming; Dataflow Graphs; Data Parallelism and Task Parallelism; Data Exchange Strategies; Processing Streams in Parallel; Latency and Throughput; Operations on Data Streams; Time Semantics; What Does One Minute Mean in Stream Processing?; Processing Time; Event Time; Watermarks; Processing Time Versus Event Time; State and Consistency Models; Task Failures; Result Guarantees; Summary; Chapter 3. The Architecture of Apache Flink; System Architecture; Components of a Flink Setup; Application Deployment; Task Execution
5058 |a Highly Available SetupData Transfer in Flink; Credit-Based Flow Control; Task Chaining; Event-Time Processing; Timestamps; Watermarks; Watermark Propagation and Event Time; Timestamp Assignment and Watermark Generation; State Management; Operator State; Keyed State; State Backends; Scaling Stateful Operators; Checkpoints, Savepoints, and State Recovery; Consistent Checkpoints; Recovery from a Consistent Checkpoint; Flink's Checkpointing Algorithm; Performace Implications of Checkpointing; Savepoints; Summary; Chapter 4. Setting Up a Development Environment for Apache Flink; Required Software
5058 |a Run and Debug Flink Applications in an IDEImport the Book's Examples in an IDE; Run Flink Applications in an IDE; Debug Flink Applications in an IDE; Bootstrap a Flink Maven Project; Summary; Chapter 5. The DataStream API (v1.7); Hello, Flink!; Set Up the Execution Environment; Read an Input Stream; Apply Transformations; Output the Result; Execute; Transformations; Basic Transformations; KeyedStream Transformations; Multistream Transformations; Distribution Transformations; Setting the Parallelism; Types; Supported Data Types; Creating Type Information for Data Types
5058 |a Explicitly Providing Type InformationDefining Keys and Referencing Fields; Field Positions; Field Expressions; Key Selectors; Implementing Functions; Function Classes; Lambda Functions; Rich Functions; Including External and Flink Dependencies; Summary; Chapter 6. Time-Based and Window Operators; Configuring Time Characteristics; Assigning Timestamps and Generating Watermarks; Watermarks, Latency, and Completeness; Process Functions; TimerService and Timers; Emitting to Side Outputs; CoProcessFunction; Window Operators; Defining Window Operators; Built-in Window Assigners
520 |a "Get started with Apache Flink, the open source framework that powers some of the world's largest stream processing applications. With this practical book, you'll explore the fundamental concepts of parallel stream processing and discover how this technology differs from traditional batch data processing. Longtime Apache Flink committers Fabian Hueske and Vasia Kalavri show you how to implement scalable streaming applications with Flink's DataStream API and continuously run and maintain these applications in operational environments. Stream processing is ideal for many use cases, including low-latency ETL, streaming analytics, and real-time dashboards as well as fraud detection, anomaly detection, and alerting. You can process continuous data of any kind, including user interactions, financial transactions, and loT data, as soon as you generate them."--|c Provided by publisher
5880 |a Online resource; title from digital title page (viewed on May 13, 2019).
590 |a O'Reilly|b O'Reilly Online Learning: Academic/Public Library Edition
63000|a Apache Flink (Electronic resource)
650 0|a Streaming technology (Telecommunications)|v Software.
650 0|a Big data.|9 403931
655 7|a Software|2 fast
7001 |a Kalavri, Vasiliki,|e author.
758 |i has work:|a Stream processing with Apache Flink (Text)|1 https://id.oclc.org/worldcat/entity/E39PCG8VFRqY4y3Gk8V796JXYd|4 https://id.oclc.org/worldcat/ontology/hasWork
77608|i Print version:|a Hueske, Fabian.|t Stream processing with Apache Flink.|d Beijing : O'Reilly, 2017|z 9781491974292|w (OCoLC)975362966
85640|u https://library.access.arlingtonva.us/login?url=https://learning.oreilly.com/library/view/~/9781098184858/?ar|x O'Reilly|z eBook
85640|u https://library.access.arlingtonva.us/login?url=https://learning.oreilly.com/library/view/~/9781098195687/?ar|x O'Reilly|z eBook
85640|u https://library.access.arlingtonva.us/login?url=https://learning.oreilly.com/library/view/~/9781491974285/?ar|x O'Reilly|z eBook
938 |a Askews and Holts Library Services|b ASKH|n AH36183993
938 |a ProQuest Ebook Central|b EBLB|n EBL5750356
938 |a EBSCOhost|b EBSC|n 2102045
938 |a YBP Library Services|b YANK|n 16164325
994 |a 92|b VIA
999 |c 288262|d 288262