1. By default, Kafka tries to send records as soon as possible
- It will have up to 5 reuests in flight, meaning up to messages individually sent at the same time.
- After this, if more messages have to be sent while others are in flight, Kafka is smart and will start batching them while they wait to send them all at once.
2. This smart batching allows Kafka to increase throughput while maintaining very low latency.
3. Batches have higher compression ratio so better efficiency
4. So how can we control the batching mechanism?
- Linger.ms: Number of milliseconds a producer is willing to wait before sending a batch out. (default 0)
- By introducing some lag (for example linger.ms=5), we increase the chances of messages being sent together in a batch
- So at the expense of introducing a small delay, we can increase throughput, compression and efficiency of our producer.
- If a batch is full (see batch.size) before the end of the linger.ms period, it will be sent to Kafka right away!
* Linger.ms