Kafka producers

How a Kafka producer writes messages to topics — message keys, serializers, partitioning strategies and acknowledgment modes explained for beginners.

By Stéphane Derosiaux · July 23, 2026

Learn how Kafka producers send data to topics

Producers are applications that write data to Kafka topics. Understanding how producers work, including message keys, serialization, and partitioning, is essential for building reliable data pipelines.

What you'll learn:

How producers send messages to Kafka topics
How message keys affect partitioning and ordering
The structure of a Kafka message
How serialization converts data to bytes

Kafka producers

Once a topic has been created with Kafka, the next step is to send data into the topic. This is where Kafka producers come in.

Applications that send data into topics are known as Kafka producers. Applications typically integrate a Kafka client library to write to Apache Kafka. Excellent client libraries exist for almost all programming languages that are popular today including Python, Java, Go, and others.

Apache Kafka Producers send data into Kafka. These messages are then routed by the broker to the relevant topics and partitions.

A Kafka producer sends messages to a topic, and messages are distributed to partitions according to a mechanism such as key hashing (more on it below).

For a message to be successfully written into a Kafka topic, a producer has to specify a level of acknowledgment (acks). This subject will be introduced in depth in the topic replication section.

Message keys

Each event message contains an optional key and a value.

In case the key (key=null) is not specified by the producer, messages are distributed evenly across partitions in a topic. This means messages are sent in a round-robin fashion (partition p0 then p1 then p2, etc... then back to p0 and so on...).

If a key is sent (key != null), then all messages that share the same key will always be sent and stored in the same Kafka partition. A key can be anything to identify a message - a string, numeric value, binary value, etc.

Kafka message keys are commonly used when there is a need for message ordering for all messages sharing the same field. For example, in the scenario of tracking trucks in a fleet, we want data from trucks to be in order at the individual truck level. In that case, we can choose the key to be truck_id. In the example shown below, the data from the truck with id truck\_id\_123 will always go to partition p0.

Apache Kafka Producer sending trucking fleet data into 2 Kafka Brokers.

When to use message keys

Use case	Key recommendation
Order processing	Order ID (keep all events for an order together)
User activity tracking	User ID (maintain user event sequence)
IoT sensor data	Device ID (preserve per-device ordering)
Log aggregation	No key needed (maximize throughput)
Metrics collection	No key needed (even distribution)

Key selection rule of thumb
Use a message key when you need ordering guarantees for related messages. Skip the key when maximum throughput is more important than ordering.

Kafka message anatomy

Kafka messages are created by the producer. A Kafka message consists of the following elements:

Diagram showing how Kafka Producers structure a message created by the Apache Kafka Producer.

Structure of a Kafka message:

Key. Key is optional in the Kafka message and it can be null. A key may be a string, number, or any object and then the key is serialized into binary format.
Value. The value represents the content of the message and can also be null. The value format is arbitrary and is then also serialized into binary format.
Compression Type. Kafka messages may be compressed. The compression type can be specified as part of the message. Options are none, gzip, lz4, snappy, and zstd
Headers. There can be a list of optional Kafka message headers in the form of key-value pairs. It is common to add headers to specify metadata about the message, especially for tracing.
Partition + Offset. Once a message is sent into a Kafka topic, it receives a partition number and an offset id. The combination of topic+partition+offset uniquely identifies the message
Timestamp. A timestamp is added either by the user or the system in the message.

Kafka message serializers

In many programming languages, the key and value are represented as objects, which greatly increases the code readability. However, Kafka brokers expect byte arrays as keys and values of messages. The process of transforming the producer's programmatic representation of the object to binary is called message serialization.

As shown below, we have a message with an Integer key and a String value. Since the key is an integer, we have to use an IntegerSerializer to convert it into a byte array. For the value, since it is a string, we have to use a StringSerializer.

Message serialization diagram showing how Apache Kafka Producers integer and string serializers.

Message serialization

As part of the Java Client SDK for Apache Kafka, several serializers already exist, such as string (which supersedes JSON), integer, float. Other serializers may have to be written by the users, but commonly distributed Kafka serializers exist and are efficiently written for formats such as JSON-Schema, Apache Avro and Protobuf, thanks to the Confluent schema registry.

If you are not using a JVM-based programming language for serialization and deserialization, ensure that your Kafka client library supports the data formats that you need!

Common serialization formats

Format	Best for	Schema support
String/JSON	Flexibility, debugging	No built-in
Avro	Schema evolution, compact	Schema Registry
Protobuf	Performance, cross-language	Schema Registry

Kafka message key hashing

A Kafka partitioner is a code logic that takes a record and determines to which partition to send it into.

Kafka Producers use default partitioning logic to assign Kafka messages to the appropriate Apache Kafka Partition.

Default partitioner

In that effect, it is common for partitioners to use the Kafka message keys to route a message into a specific topic-partition. As a reminder, all messages with the same key will go to the same partition.

Kafka key hashing is the process of determining the mapping of a key to a partition.

In the default Kafka partitioner, the keys are hashed using the murmur2 algorithm, with the formula below for the curious:

targetPartition = Math.abs(Utils.murmur2(keyBytes)) % (numPartitions - 1)

You can override the default partitioner via the producer property partitioner.class, although it is not advisable unless you know what you are doing.

If you increase the number of partitions for a topic, the same key may hash to a different partition. This breaks ordering guarantees for existing keys.

See it in practice with Conduktor
Conduktor Console lets you produce messages to topics directly from the UI. Test different keys, values, and headers to see how messages are distributed across partitions.

Next steps

Learn how consumers read data to understand the other side of the pipeline
Explore producer batching to optimize throughput
Configure producer acknowledgments for reliability