Kafka topics

What is a Kafka topic? A Kafka topic organizes data into partitions with immutable offsets. See how Kafka topics work with examples of ordering and replay.

By Stéphane Derosiaux · July 23, 2026

Learn about Kafka topics, partitions and offsets

Topics are the fundamental unit of organization in Apache Kafka. Understanding how topics, partitions, and offsets work together is essential for building effective data streaming applications.

What you'll learn:

What Kafka topics are and how they organize data
How partitions enable scalability and parallelism
What offsets are and how they track message position
How messages are ordered within partitions

Kafka topics, partitions and offsets

Beginner

Advanced

What is a Kafka topic?

Similar to how databases have tables to organize and segment datasets, Kafka uses the concept of topics to organize related messages.

A topic is identified by its name. For example, we may have a topic called logs that may contain log messages from our application, and another topic called purchases that may contain purchase data from our application as it happens.

A Kafka Cluster with 4 Topics shown in a diagram

Kafka topics can contain any kind of message in any format, and the sequence of all these messages is called a data stream.

Unlike database tables, Kafka topics are not query-able. Instead, we have to create Kafka producers to send data to the topic and Kafka consumers to read the data from the topic in order.

By default, data in Kafka topics is deleted after one week (also called the default message retention period), and this value is configurable. This mechanism of deleting old data ensures a Kafka cluster does not run out of disk space by recycling topics over time.

What are Kafka partitions?

Topics are broken down into a number of partitions. A single topic may have more than one partition, it is common to see topics with 100 partitions.

The number of partitions of a topic is specified at the time of topic creation. Partitions are numbered starting from 0 to N-1, where N is the number of partitions. The figure below shows a topic with three partitions, with messages being appended to the end of each one.

Kafka Topics are broken into partitions for improved fault tolerance. This diagram shows a Kafka Topic with 3 partitions and their respective offsets.

The offset is an integer value that Kafka adds to each message as it is written into a partition. Each message in a given partition has a unique offset.

Kafka topics are immutable: once data is written to a partition, it cannot be changed.

Why use partitions?

Partitions serve two critical purposes:

Scalability: Data is distributed across multiple brokers, allowing the cluster to handle more data than a single server could
Parallelism: Multiple consumers can read from different partitions simultaneously, increasing throughput

Diagram of a topic named orders with three partitions spread across brokers: Partition 0 on Broker 1 is read by Consumer 1, Partition 1 on Broker 2 by Consumer 2, and Partition 2 on Broker 3 by Consumer 3, each consumer reading its own partition in parallel.

Kafka topic example

Apache Kafka has many real world applications. This diagram shows how Apache Kafka can be used for fleet tracking in the transport industry.

A traffic company wants to track its fleet of trucks. Each truck is fitted with a GPS locator that reports its position to Kafka. We can create a topic named - trucks\_gps to which the trucks publish their positions. Each truck may send a message to Kafka every 20 seconds, each message will contain the truck ID and the truck position (latitude and longitude). The topic may be split into a suitable number of partitions, say 10. There may be different consumers of the topic. For example, an application that displays truck locations on a dashboard or another application that sends notifications if an event of interest occurs.

What are Kafka offsets?

Apache Kafka offsets represent the position of a message within a Kafka partition. Offset numbering for every partition starts at 0 and is incremented for each message sent to a specific Kafka partition. This means that Kafka offsets only have a meaning for a specific partition, e.g., offset 3 in partition 0 doesn't represent the same data as offset 3 in partition 1.

Kafka offset ordering: if a topic has more than one partition, Kafka guarantees the order of messages within a partition, but there is no ordering of messages across partitions.

Even though we know that messages in Kafka topics are deleted over time (as seen above), the offsets are not re-used. They continually are incremented in a never-ending sequence.

Message ordering

Here is how message ordering works:

Scenario	Ordering guarantee
Messages with same key	Ordered (same partition)
Messages without key	Not ordered (round-robin)
Messages across partitions	Not ordered

Once a topic is created, you can increase the partition count but cannot decrease it. Plan your partition count carefully, considering future growth.

See it in practice with Conduktor
Conduktor Console provides a visual interface for creating and managing topics. Browse topic messages, view partition distribution, and monitor topic metrics in real-time.

Next steps

Learn how producers write to topics to understand how data enters partitions
Learn about Kafka consumers to understand how data is read from topics
Explore topic configuration for advanced settings like retention and compaction