Distributed Spring Batch Coordination, Part 2: How Database-Backed Partitioning Works

#springbatch #java #cloudnative #opensource

📘 Part 2: How Database-Backed Partitioning Works

In Part 1, we discussed the challenges with traditional Spring Batch scaling — especially when relying on Kafka or RabbitMQ. In this part, let’s explore how we can simplify distributed coordination using a relational database as the central source of truth.

💡 Key Idea

Rather than broadcasting partition instructions via messaging middleware, the master node writes coordination state into the database. Worker nodes read from the database to discover which partitions they are responsible for — no messaging layer required.

⚙️ Core Coordination Tables

This model relies on three lightweight tables:

BATCH_NODES: Registers active nodes in the cluster
BATCH_JOB_COORDINATION: Tracks coordination for each partitioned step
BATCH_PARTITIONS: Stores partition metadata and execution state (assigned node, status, result)

These tables allow for real-time visibility into job execution without external queues or in-memory state.

🔁 Execution Flow

Master node receives the job request
It queries BATCH_NODES to find all currently active nodes
Using either:
- 🌀 Round-Robin, or
- 🎯 Fixed-Node allocation the master assigns partitions and stores them in BATCH_PARTITIONS
Workers poll for tasks where assigned_node = self
Once complete, they update their partition status
The master monitors for completion and performs final aggregation, if needed