High-Level Design Tutorial 0/42 lessons ~6 min read Lesson 23“Scalable systems, HLD interviews & case studies”

Kafka Basics

Apache Kafka is a distributed commit log for high-throughput event streaming.

Course progress0%

Focus

10 guided sections

Practice signal

Examples included

Career prep

Interview Q&A included

Introduction

Apache Kafka is a distributed commit log for high-throughput event streaming. Producers append records to topics partitioned for parallelism; consumer groups coordinate partition assignment for scale. Unlike traditional queues, Kafka retains messages for configurable periods — enabling replay, audit, and multiple independent consumers.

Kafka appears in HLD for activity feeds, CDC pipelines, metrics ingestion, and microservice choreography. Know topics, partitions, offsets, consumer groups, and replication factor in interviews.

This lesson covers Kafka topology, ordering guarantees, and Spring Kafka integration patterns.

Understanding the topic

Key concepts

Topic: named log split into partitions for parallel writes/reads.
Partition key: same key → same partition preserves per-key ordering.
Offset: monotonic position in partition; consumers commit offsets after processing.
Consumer group: one consumer per partition max — scale consumers ≤ partitions.
Replication factor RF=3: leader + followers; ISR for durability.
Retention: time (7 days) or size — not a task queue that deletes on read.

text

flowchart LR
  Producer -->|produce| Topic
  Topic --> C1[Consumer Group A]
  Topic --> C2[Consumer Group B]

Internal architecture

Architecture overview

text

flowchart LR
  Producer -->|produce| Topic
  Topic --> C1[Consumer Group A]
  Topic --> C2[Consumer Group B]

Step-by-step explanation

Producers with idempotent + acks=all for durability.
Broker cluster 3+ nodes, RF=3, min.insync.replicas=2.
Consumers in group 'notification' and separate group 'analytics' on same topic.
Schema Registry (Avro) for compatible evolution.
Kafka Connect for CDC from PostgreSQL to downstream systems.
Monitor consumer lag, under-replicated partitions, disk usage.

Informative example

Spring Kafka producer with idempotence and consumer with manual offset commit:

yaml

spring:
  kafka:
    bootstrap-servers: kafka:9092
    producer:
      acks: all
      enable-idempotence: true
      key-serializer: org.apache.kafka.common.serialization.StringSerializer
      value-serializer: org.springframework.kafka.support.serializer.JsonSerializer
    consumer:
      group-id: order-processor
      auto-offset-reset: earliest
      enable-auto-commit: false
      key-deserializer: org.apache.kafka.common.serialization.StringDeserializer
      value-deserializer: org.springframework.kafka.support.serializer.JsonDeserializer
    listener:
      ack-mode: manual

Partition by orderId for ordered lifecycle events. Increase partitions before consumers for scale.

Real-world use

Real-world use cases

Uber trip events: driver location, status updates to multiple services.
Banking transaction audit log with 7-year retention compliance.
E-commerce CDC: product changes to search and cache invalidation.
Netflix-style viewing events to recommendation pipeline.

Best practices

Size partitions for target throughput — rebalancing costly later.
Use idempotent producers and transactional outbox for exactly-once-ish flows.
Dead letter topic for poison messages after retries.
Don't over-partition tiny topics — broker overhead.
Secure with SASL/SSL in production.
Capacity plan disk: retention × ingest rate.

Common mistakes

Consumer count > partitions — idle consumers.
No key on messages needing ordering — random partition breaks sequence.
auto-commit before processing — message loss on crash.
Single broker RF=1 in production — data loss on disk failure.
Using Kafka as request-response — wrong latency profile.

Advanced interview questions

Q1BeginnerWhat is a Kafka topic?

Distributed commit log divided into partitions for parallel publish/subscribe.

Q2BeginnerWhat is a consumer group?

Set of consumers sharing work — each partition consumed by one member in group.

Q3IntermediateHow preserve ordering in Kafka?

Use same partition key for related events; ordering guaranteed per partition only.

Q4IntermediateKafka vs RabbitMQ?

Kafka log with retention and replay; RabbitMQ queue-oriented routing with delete-after-ack task semantics.

Q5AdvancedSize Kafka for 1M events/sec.

Partition count from target throughput per partition (~10-50MB/s), RF=3, 6+ brokers, disk NVMe, consumer groups per downstream, monitor lag, Schema Registry.

Summary

Kafka is a durable distributed log for high-throughput events. Partitions enable parallelism; keys preserve per-entity order. Consumer groups scale processing with partition count cap. Retention enables replay and multiple subscribers. Idempotent producers and manual commits improve reliability. Event streaming patterns compose Kafka with stream processors.

Ready to mark this lesson complete?Track your journey across the entire course.