Distribute Work Across Workers with AMQP Queues (2026)

AMQP queues are often thought of as simple message buffers, but their true power lies in their ability to orchestrate complex distributed workflows.

Consider a common scenario: processing incoming orders. A web server receives orders and needs to hand them off to a fleet of workers for fulfillment. Instead of the web server directly calling worker APIs (which is brittle and doesn’t scale), we can use AMQP.

Here’s a simplified setup:

# RabbitMQ configuration snippet
queues:
  - name: "order_processing_queue"
    durable: true
    arguments: {}

exchanges:
  - name: "order_exchange"
    type: "direct"
    durable: true
    internal: false
    auto_delete: false
    arguments: {}

bindings:
  - exchange: "order_exchange"
    queue: "order_processing_queue"
    routing_key: "new_order"

The web server, acting as a producer, publishes messages to order_exchange with a routing key of new_order.

# Producer (web server) example using pika
import pika

connection = pika.BlockingConnection(pika.ConnectionParameters('localhost'))
channel = connection.channel()

channel.exchange_declare(exchange='order_exchange', exchange_type='direct', durable=True)
channel.queue_declare(queue='order_processing_queue', durable=True)
channel.queue_bind(exchange='order_exchange', queue='order_processing_queue', routing_key='new_order')

order_data = {"order_id": "12345", "item": "widget", "quantity": 2}
channel.basic_publish(
    exchange='order_exchange',
    routing_key='new_order',
    body=str(order_data),
    properties=pika.BasicProperties(
        delivery_mode=2,  # Make message persistent
    ))

print(" [x] Sent 'new_order'")
connection.close()

Now, multiple worker processes can connect to the same order_processing_queue and consume messages.

# Consumer (worker) example using pika
import pika
import json
import time

def process_order(order_data):
    print(f"Processing order: {order_data}")
    # Simulate work
    time.sleep(5)
    print(f"Finished processing order: {order_data['order_id']}")

def callback(ch, method, properties, body):
    order_data = json.loads(body)
    process_order(order_data)
    ch.basic_ack(delivery_tag=method.delivery_tag) # Acknowledge message

connection = pika.BlockingConnection(pika.ConnectionParameters('localhost'))
channel = connection.channel()

channel.queue_declare(queue='order_processing_queue', durable=True)

# Set prefetch_count to limit how many unacknowledged messages a worker can hold
channel.basic_qos(prefetch_count=1)

channel.basic_consume(queue='order_processing_queue',
                      on_message_callback=callback)

print(' [*] Waiting for messages. To exit press CTRL+C')
channel.start_consuming()

When a worker receives a message, it calls basic_ack to signal that processing is complete. If a worker crashes before acknowledging, the message is automatically redelivered to another available worker. This guarantees at-least-once delivery.

The prefetch_count in the worker’s basic_qos is crucial. Setting it to 1 means each worker will only take one message at a time and will not fetch another until it has acknowledged the current one. This prevents a single fast worker from hogging all the messages while others sit idle. If you have 10 workers and set prefetch_count=10, each worker could grab up to 10 messages. If you have 10 workers and set prefetch_count=1, each worker will only ever hold one message at a time, ensuring even distribution if processing times vary.

The magic of AMQP here is that the broker (RabbitMQ) handles the distribution. You don’t need to write logic for load balancing or worker discovery. You just have workers connect to the queue and the broker ensures messages are delivered to available consumers.

The most surprising thing about this setup is how robust it is. If your web server goes down, orders are still queued. If a worker crashes mid-task, the message isn’t lost; it’s returned to the queue for another worker. This decoupling is the core of building resilient distributed systems.

When you start multiple workers all consuming from the same queue, the AMQP broker will round-robin messages to them by default. This is the simplest form of load balancing. Each worker gets a roughly equal share of the incoming work.

If you want to ensure that a specific type of message always goes to a specific worker or group of workers, you’d use topic exchanges and more complex routing keys. This allows for sophisticated work distribution patterns beyond simple round-robin.