Get the latest tech news

“Streaming vs. Batch” Is a Wrong Dichotomy, and I Think It's Confusing


Often times, "Stream vs. Batch" is discussed as if it’s one or the other, but to me this does not make that much sense really.

The prevailing trend towards storage/compute separation in data streaming and processing architectures (for instance, thinking of platforms such as WarpStream, and Diskless Kafka at large) further accelerates this development. Unless a source represents a consumable stream of changes itself (you see where this is going), a pull system may miss updates happening between fetch attempts, as well as deletes. A streaming system lets you put your data to the location where you need it, in the format you need it, and in the shape you need it (think denormalization), immediately as it gets produced or updated.

Get the Android app

Or read this on Hacker News

Read more on:

Photo of Batch

Batch

Photo of wrong dichotomy

wrong dichotomy

Related news:

News photo

Batch data processing is too slow for real-time AI: How open-source Apache Airflow 3.0 solves the challenge with event-driven data orchestration

News photo

Here's our first batch of Xbox Game Pass titles for April

News photo

Here's our first batch of Xbox Game Pass titles for March