Stream Processing vs. Message Processing: What’s the Difference?

How to fix unassigned shards issue in Elasticsearch?

July 10, 2023

What is the difference between client and cluster deploy modes in Spark?

July 24, 2023

Published by Big Data In Real World at July 17, 2023

Message Processing

Message processing involves receiving and processing messages from a message queue or a pub/sub system. Message processing is commonly used to integrate different systems or components, and it provides a decoupling mechanism that enables different systems to work together without being tightly coupled.

We apply simple computations on the messages — in most cases individually per message.

Eg. RabbitMQ

Stream Processing

Stream processing is the technique of processing real-time data streams as they occur. It involves continuously processing and analyzing data in real-time as it flows through a system. In stream processing applications or platforms, we can apply complex operations on multiple input streams and multiple records or messages at the same time performing complex operations on messages like aggregations and joins.

Eg. Kafka

Kafka vs. RabbitMQ

RabbitMQ is a message processing platform. Producer(s) ingest messages into RabbitMQ. Consumer(s) pick up messages, process them and messages get removed from RabbitMQ once all the consumers consume the message.

Kafka is a message processing platform at its core but it is also a stream processing platform as well. Typical messaging systems do not have the ability to “rewind” and access previously delivered messages, as they are automatically deleted once all subscribed consumers have received them. In contrast, Kafka uses a pull-based model, in which consumers retrieve data from Kafka, and retain messages for a configurable period of time. As a result, Kafka has the ability to store and retrieve messages that have already been sent, even after they have been consumed by subscribers.

In addition to above, Kafka streams allow us to apply complex operations like aggregation, joins, window and other analytic operations on real-time streaming data.

Big Data In Real World

We are a group of Big Data engineers who are passionate about Big Data and related Big Data technologies. We have designed, developed, deployed and maintained Big Data applications ranging from batch to real time streaming big data platforms. We have seen a wide range of real world big data problems, implemented some innovative and complex (or simple, depending on how you look at it) solutions.

Comments are closed.

Stream Processing vs. Message Processing: What’s the Difference?

How to fix unassigned shards issue in Elasticsearch?

What is the difference between client and cluster deploy modes in Spark?

How to fix unassigned shards issue in Elasticsearch?

What is the difference between client and cluster deploy modes in Spark?

Message Processing

Stream Processing

Kafka vs. RabbitMQ

Big Data In Real World

Related posts

How does a consumer know the offset to read after restart in Kafka?

How to list topics without accessing Zookeeper in Kafka?

How to fix Kafka Broker may not be available on 127.0.0.1 error?