Concurrent Data Processing in Elixir
Fast, Resilient Applications with OTP, GenStage, Flow, and Broadway
by: Svilen Gospodinov
Published | 2021-08-10 |
---|---|
Internal code | sgdpelixir |
Print status | In Print |
Pages | 174 |
User level | Intermediate |
Keywords | Elixir, OTP, GenStage, Flow, Broadway, concurrency, big data, scalability, data processing, data pipelines |
Related titles | - Designing Elixir Systems with OTP |
ISBN | 9781680508192 |
Other ISBN |
Channel epub: 9781680508963 Channel PDF: 9781680508970 Kindle: 9781680508949 Safari: 9781680508956 Kindle: 9781680508949 |
BISACs | COM051000 COMPUTERS / Programming / GeneralCOM018000 COMPUTERS / Data ProcessingCOM018000 COMPUTERS / Data Processing |
Highlight
Learn different ways of writing concurrent code in Elixir and increase your application’s performance, without sacrificing scalability or fault-tolerance. Most projects benefit from running background tasks and processing data concurrently, but the world of OTP and various libraries can be challenging. Which Supervisor and what strategy to use? What about GenServer? Maybe you need back-pressure, but is GenStage, Flow, or Broadway a better choice? You will learn everything you need to know to answer these questions, start building highly concurrent applications in no time, and write code that’s not only fast, but also resilient to errors and easy to scale.
Description
Whether you are building a high-frequency stock trading application or a consumer web app, you need to know how to leverage concurrency to build applications that are fast and efficient. Elixir and the OTP offer a range of powerful tools, and this guide will show you how to choose the best tool for each job, and use it effectively to quickly start building highly concurrent applications.
Learn about Tasks, supervision trees, and the different types of Supervisors available to you. Understand why processes and process linking are the building blocks of concurrency in Elixir. Get comfortable with the OTP and use the GenServer behavior to maintain process state for long-running jobs. Easily scale the number of running processes using the Registry. Handle large volumes of data and traffic spikes with GenStage, using back-pressure to your advantage. Create your first multi-stage data processing pipeline using producer, consumer, and producer-consumer stages. Process large collections with Flow, using MapReduce and more in parallel. Thanks to Broadway, you will see how easy it is to integrate with popular message broker systems, or even existing GenStage producers.
Start building the high-performance and fault-tolerant applications Elixir is famous for today.
Contents and Extracts
- Introduction
- Who Should Read This Book?
- About This Book
- About the Code
- Online Resources
- Easy Concurrency with the Task Module
- Introducing the Task Module
- Creating Our Playground excerpt
- Starting Tasks and Retrieving Results
- Managing Series of Tasks
- Linking Processes
- Meeting the Supervisor
- Understanding Let It Crash
- Wrapping Up
- Long-Running Processes Using GenServer excerpt
- Starting with a Basic GenServer
- GenServer Callbacks In Depth
- Building a Job-Processing System
- Introducing DynamicSupervisor
- Implementing a Supervisor
- Naming Processes Using the Registry
- Inspecting Supervisors at Runtime
- Wrapping Up
- Data-Processing Pipelines with GenStage excerpt
- Understanding Back-Pressure
- Introducing GenStage
- Building Your Data-Processing Pipeline
- Adding Concurrency with ConsumerSupervisor
- Creating Multi-Stage Data Pipelines
- Choosing the Right Dispatcher
- Wrapping Up
- Processing Collections with Flow
- Analyzing Airports Data
- Creating Flows and Reading Files
- Performing Reduce Concurrently with Partitions
- Using Windows and Triggers
- Adding Flow to a GenStage Pipeline
- Wrapping Up
- Data-Ingestion Pipelines with Broadway
- Processing Ticketing Events
- Broadway Callbacks In Depth
- Batching Messages
- Using GenStage Producers
- Wrapping Up