This is the 227th edition of my blog series on Stream Data Integration and Stream Analytics!
As usual, find below the new blog articles, presentations, videos and software releases from last week. Happy reading and stay safe!
News and Blog Posts
General
- An Event-Driven Book of Reference to facilitate Modernization by Shahir A. Daya & Mehryar Maalem
- Real-time analytics begins to find business vocation by Stephen Pritchard
- Thinking in Events: From Databases to Distributed Collaboration Software by Martin Kleppmann
Kafka
- Processing Time-Series Data with Redis and Apache Kafka by Abhishek Gupta
- Building Real-Time Event Streams in the Cloud, On Premises, or Both with Confluent by Jeff Bean
- Saxo Bank’s Best Practices for a Distributed Domain-Driven Architecture Founded on the Data Mesh by Graham Stirling
- Serverless Kafka in a Cloud-native Data Lake Architecture by Kai Waehner
- Online, Managed Schema Evolution with ksqlDB Migrations by Zara Lim
- Kafka weather real time dashboard with Spring & Thymeleaf by Igor De Souza
- Data dump to data catalog for Apache Kafka by Andrew Stevenson
- Crossing the Streams: The New Streaming Foreign-Key Join Feature in Kafka Streams by John Roesler & Adam Bellemare
- Kafka for Cybersecurity (Part 1 of 6) – Data in Motion as Backbone by Kai Waehner
- Kafka for Cybersecurity (Part 2 of 6) – Cyber Situational Awareness by Kai Waehner
- Kafka upgrade improvements by Jakub Stejskal
- Low Latency Real-Time Cache Updates with Amazon ElastiCache for Redis and Confluent Cloud Kafka by Jobin George, Joseph Morais, and Roberto Luna Rojas
- Getting started with Red Hat OpenShift Streams for Apache Kafka by Bernard Tison
- Configuring Kafka Sources and Sinks in Kubernetes by Sebastien Goasguen
- Getting started with Kafka Connector for Azure Cosmos DB using Docker by Abhishek Gupta
- A Practical Guide for Kafka Cost Reduction by Elad Leev
- Create a Data Analysis Pipeline with Apache Kafka and RStudio by Patrick Neff
- Tyrannical Data and Its Antidotes in the Microservices World by Sam Newman
Spark
- What’s New in Apache Spark 3.1 Release for Structured Streaming by Yuanjian Li, Shixiong Zhu & Bo Zhang
Flink
- How to Analyze CDC Data in Iceberg Data Lake Using Flink by Apache Flink Community China
- How to identify the source of backpressure? by Piotr Nowojski
StreamSets
- Introducing StreamSets Summer ‘21 by Raji Narayanan
- Announcing StreamSets Transformer Engine 4.0.0 by Dash Desai
- StreamSets Engine For Snowpark by Dash Desai
- It’s Summer… And Even Data Engineers Need a Break by Karen Henke
- Model Experiments, Tracking and Registration using MLflow on Databricks by Dash Desai
New Presentations
- Apache Kafka and the Data Mesh by Ben Stopford & Michael G. Noll
New Videos
- Finding Workload Balance: Cruise Control for Kafka on Kubernetes by Paolo Patierno
- Apache Kafka and the Data Mesh by Ben Stopford & Michael G. Noll
- Thinking in Events: From Databases to Distributed Collaboration Software by Martin Kleppmann
New Podcasts
- Data-Driven Digitalization with Apache Kafka in the Food Industry at BAADER by Confluent Streaming Audio (#165)
- Chaos Engineering with Apache Kafka and Gremlin by Confluent Streaming Audio (#164)
- Automated Event-Driven Architectures and Microservices with Apache Kafka and SmartBear by Confluent Streaming Audio (#166)
New Releases
Please let me know if this is of interest. Please tweet your projects, blog posts, presentations, and videos to @gschmutz to get them listed in next week’s edition!