Running Debezium against a Postgres outbox table looks to the database like one extra logical replica. The cost is mostly CPU for logical decoding on the primary (single-digit percent under steady load), a small amount of memory per walsender, and one replication slot that pins WAL until Debezium acknowledges it. The biggest production risk is not CPU; it is unbounded WAL growth when the connector falls behind or dies, which has crashed primaries in the wild. Cap it with max_slot_wal_keep_size, use a dedicated publication for the outbox table only, set REPLICA IDENTITY DEFAULT, partition the table by time, and wire alerts to pg_replication_slots.confirmed_flush_lsn lag. Done right, the steady-state overhead is small and far cheaper than polling the outbox table from the application.
Key Takeaways
- Debezium for Postgres uses logical replication. It looks like one extra replica to the database, but the cost is mostly CPU for decoding plus a replication slot that holds WAL on the primary.
- The biggest operational risk is WAL bloat. If Debezium goes down or lags, `pg_wal/` can fill the disk and crash the primary. Use `max_slot_wal_keep_size` (Postgres 13+) to cap it.
- Steady-state CPU overhead on the primary is usually 5 to 15 percent for a wildcard publication on a busy database, but only single-digit percent for an outbox-only publication with small inserts.
- Use a narrow publication (`CREATE PUBLICATION outbox_pub FOR TABLE outbox`). Wildcard publications force the primary to decode every change before filtering, which is pure waste.
- Set `REPLICA IDENTITY DEFAULT` (the primary key) on the outbox table. Never use `REPLICA IDENTITY FULL` here. It bloats every WAL record with the full row image.
- Partition the outbox table by day or by hour and drop old partitions instead of deleting rows. Row-by-row `DELETE` from a hot CDC table creates dead tuples, index bloat, and extra WAL traffic.
- Monitor `pg_replication_slots.confirmed_flush_lsn` lag, walsender CPU, and reorder buffer spill files. Alert before the disk fills, not when it fills.
- If the outbox table is rarely written to but the rest of the database is busy, add a [Debezium heartbeat](https://debezium.io/documentation/reference/stable/connectors/postgresql.html) so the slot keeps advancing and WAL keeps recycling.
- Debezium plus the outbox pattern is the right default. Application-level polling against the outbox table looks simpler but causes index bloat, lock contention, and is hard to make exactly-once. CDC just reads the WAL the database is already writing.
You proposed the transactional outbox pattern for your service. The design review goes well until the DBA on the call says: “So you want me to add a third logical replica to the primary? That is going to slow the database down.”
That single sentence kills more outbox rollouts than any other concern. The DBA’s worry is real, but it is also often overstated. The truth sits in the middle. Debezium does add load to your Postgres primary, but the load lands in very specific places, the steady-state cost for an outbox-only stream is small, and the actual production risk is almost never CPU. It is WAL retention.
This post is the answer you wish you had ready in that meeting. We will walk through what Debezium actually does to a Postgres primary, where the overhead lives (CPU, memory, disk, network), why the outbox pattern is the best-case workload for change data capture, and the configuration knobs and monitoring that keep your DBA from paging you at 3 AM. The previous deep dive on the transactional outbox pattern covered the why and the how. This one is about what your database team needs to hear before they sign off.
If you are still deciding between Debezium and a polling relay, or between Kafka and another broker, the Kafka vs RabbitMQ vs SQS and How Kafka Works posts will help frame the trade-offs.
What Debezium Actually Does to Postgres
Debezium for Postgres is built on top of logical replication. From the database’s point of view, the connector behaves like one more logical replica. Three components on the database side carry all of the weight.
flowchart TB
App["fa:fa-server App<br/>(INSERT into outbox)"]
subgraph PG["fa:fa-database Postgres Primary"]
direction TB
WAL["fa:fa-scroll WAL<br/>(write-ahead log)"]
Slot["fa:fa-bookmark Replication Slot<br/>(persistent cursor)"]
Plugin["fa:fa-cog Logical Decoding<br/>(pgoutput)"]
WS["fa:fa-network-wired walsender<br/>(one per connection)"]
WAL --> Slot --> Plugin --> WS
end
DBZ["fa:fa-stream Debezium Connector"]
KC["fa:fa-server Kafka Connect"]
K["fa:fa-paper-plane Kafka"]
App --> WAL
WS --> DBZ
DBZ --> KC --> K
classDef src fill:#dbeafe,stroke:#1d4ed8,stroke-width:2px,color:#0f172a
classDef pg fill:#fff3e0,stroke:#f57c00,stroke-width:2px,color:#0f172a
classDef sink fill:#c8e6c9,stroke:#388e3c,stroke-width:2px,color:#0f172a
class App src
class WAL,Slot,Plugin,WS pg
class DBZ,KC,K sink
A quick tour of each piece:
- Replication slot (`pg_replication_slots`). A persistent cursor into the WAL stream. As long as the slot exists, Postgres will not recycle any WAL segments past the slot's confirmed position. This is the single most important concept in this post.
- Logical decoding plugin. Postgres ships with `pgoutput` since version 10. The older `wal2json` exists, but you should not reach for it on a new project; we explain why below. The plugin is the code that takes physical WAL records and turns them into logical change events (INSERT, UPDATE, DELETE) that an external consumer can understand.
- walsender process. One backend process per active replication connection from Debezium. It reads the WAL, runs it through the plugin, applies publication and column filters, reorders changes by transaction commit, and streams the result over the network to the connector. Both the slot and its walsender are visible from plain SQL, as shown below.
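If you want to see those three pieces from SQL rather than take the diagram on faith, two catalog views cover them. A minimal sketch, assuming the connector has already created its slot (names vary per deployment):

```sql
-- One row per replication slot: the persistent cursor Debezium holds.
SELECT slot_name, plugin, active, active_pid, restart_lsn, confirmed_flush_lsn
FROM pg_replication_slots;

-- One row per connected walsender, including the lag columns discussed below.
SELECT pid, application_name, state, sent_lsn, write_lag, flush_lag, replay_lag
FROM pg_stat_replication;
```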
So yes, your DBA is right that Debezium “looks like a replica” to the database. The cost profile is different from a streaming physical replica, but the mental model of “another consumer of the WAL pipeline” is correct. The disagreement is about what that actually costs.
Where the Overhead Actually Lands
There are four places the load shows up. CPU is the one everyone talks about. Disk is the one that takes down your production database.
CPU: mostly on the primary
The walsender does the decoding work, and the walsender lives on the primary. There is no way to push that to a standby. The cost depends on three things: how many tables are in the publication, how big the transactions are, and how much filtering happens at the publication level versus inside the walsender.
| Source | Impact | Notes |
|---|---|---|
| Logical decoding (pgoutput) | Moderate | The walsender reads physical WAL and decodes it into logical change events. CPU work on the primary, not on a standby. |
| Reorder buffer | Can spike | Postgres reassembles transactions in commit order. Long-running or large transactions cause memory then disk spilling and visible CPU churn. |
| Filtering | Low to moderate | If you publish all tables but only care about the outbox, the primary still decodes everything before filtering. Use PUBLICATION FOR TABLE outbox to cut this to near zero. |
| TOAST and large columns | Low for outbox | Outbox payloads are small JSON. UPDATE-heavy tables with TOAST columns are far more expensive to decode. |
A reasonable rule of thumb from the Debezium project’s own performance tests and from posts like Reorchestrate’s measurements is that Debezium adds little to no CPU on the primary under normal OLTP load when the publication is narrow. For a busy database with a wildcard publication, expect 5 to 15 percent extra CPU steady state, with spikes higher during bulk jobs and schema migrations. For an outbox-only publication on a small table with small inserts, the overhead is usually below 5 percent.
The Microsoft Azure Postgres CDC tuning guide also points out that multiple slots multiply the work. Each slot decodes the entire WAL stream independently, so two Debezium connectors against the same database do roughly twice the decoding work, even if they care about different tables. This matters when you start adding more services.
Memory: per walsender, not free but not huge
Each walsender has its own backend memory (around 10 to 20 MB of static cost) and a reorder buffer governed by logical_decoding_work_mem (default 64 MB). When a transaction grows beyond the reorder buffer, Postgres spills it to disk in the slot’s directory under pg_replslot/<slot>/.
flowchart LR
TX["fa:fa-database Long Transaction"]
RB["fa:fa-microchip Reorder Buffer<br/>logical_decoding_work_mem<br/>(default 64MB)"]
Spill["fa:fa-hard-drive Spill Files<br/>pg_replslot/<slot>/"]
Out["fa:fa-arrow-right Sent to walsender"]
TX --> RB
RB -->|"fits"| Out
RB -->|"too big"| Spill
Spill --> Out
classDef src fill:#dbeafe,stroke:#1d4ed8,stroke-width:2px,color:#0f172a
classDef mem fill:#fff3e0,stroke:#f57c00,stroke-width:2px,color:#0f172a
classDef disk fill:#fde2e2,stroke:#b91c1c,stroke-width:2px,color:#0f172a
classDef sink fill:#c8e6c9,stroke:#388e3c,stroke-width:2px,color:#0f172a
class TX src
class RB mem
class Spill disk
class Out sink
For an outbox table where every transaction inserts a single small row, the reorder buffer will rarely spill. For a database where the outbox table sits next to a long-running batch job that updates millions of rows in one transaction, the spill files for that one transaction can hit gigabytes. The spill is disk, not memory, but you still pay for the I/O.
Tune logical_decoding_work_mem up if you see frequent spills under real load. Tune it down only if you run many slots on a memory-tight host.
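Postgres 14 and later also keep per-slot decoding statistics, which is the easiest way to tell whether raising `logical_decoding_work_mem` is worth it. A sketch against the `pg_stat_replication_slots` view:

```sql
-- Spill counters accumulate since the last stats reset. Steady growth under
-- normal load means the reorder buffer is too small for your transactions.
SELECT slot_name,
       spill_txns,
       spill_count,
       pg_size_pretty(spill_bytes) AS spilled
FROM pg_stat_replication_slots;
```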
Disk and WAL retention: this is the one that matters
This is the section to read twice. Almost every Debezium production incident traces back to here.
A replication slot prevents WAL recycling until the consumer has acknowledged it. That sounds reasonable until something causes the consumer to stop acknowledging. The Kafka Connect worker dies. The connector enters a failed state. The downstream Kafka cluster goes into a rolling restart. A network partition isolates Debezium from Postgres for an hour. Your on-call engineer pauses the connector for a deployment and forgets to resume.
Whatever the reason, WAL keeps accumulating in pg_wal/ on the primary, and the primary has no way to push back. As Gunnar Morling’s deep dive on Postgres replication slots puts it, an inactive slot is a “ticking time bomb” for your database disk. Multiple production outages have been postmortem’d as “Debezium went down on Friday, disk filled by Sunday, primary crashed Monday morning.”
flowchart TB
subgraph T1["fa:fa-clock T+0: healthy"]
direction LR
P1["fa:fa-database Primary"]
WAL1["fa:fa-scroll pg_wal/<br/>~2 GB"]
S1["fa:fa-bookmark Slot at LSN N"]
DBZ1["fa:fa-stream Debezium ack at LSN N"]
P1 --> WAL1 --> S1 --> DBZ1
end
subgraph T2["fa:fa-hourglass-half T+8h: connector down"]
direction LR
P2["fa:fa-database Primary<br/>(still writing)"]
WAL2["fa:fa-scroll pg_wal/<br/>~120 GB and growing"]
S2["fa:fa-bookmark Slot still at LSN N"]
DBZ2["fa:fa-times-circle Debezium DOWN"]
P2 --> WAL2 --> S2 --> DBZ2
end
T1 ~~~ T2
classDef ok fill:#c8e6c9,stroke:#388e3c,stroke-width:2px,color:#0f172a
classDef warn fill:#fde2e2,stroke:#b91c1c,stroke-width:2px,color:#0f172a
class P1,WAL1,S1,DBZ1 ok
class P2,WAL2,S2,DBZ2 warn
The fix is a combination of three things, in order of importance.
- Set `max_slot_wal_keep_size` (Postgres 13 and later). This caps how much WAL a single slot is allowed to retain. When the cap is exceeded, Postgres invalidates the slot. Debezium will then need to be re-snapshotted from the source tables, which is operationally annoying but far preferable to your primary crashing. A typical value is 50 to 200 GB depending on disk size and recovery tolerance.
- Monitor slot lag in bytes, not just rows. Use `pg_wal_lsn_diff(pg_current_wal_lsn(), confirmed_flush_lsn)` per slot; a query sketch follows this list. Alert at a few hundred megabytes (warning) and a few gigabytes (page).
- Alarm on inactive slots. A slot with `active = false` for more than a few minutes is a sign the connector is gone. Page on it. The minutes you save here are the difference between a graceful failover and a primary that is down.
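Here is a minimal sketch of that lag query, using only the catalog columns named above; wire it into whatever metrics exporter you already run.

```sql
-- Bytes of WAL between the server's current write position and what the
-- consumer has confirmed. Alert on this number, not on row counts.
SELECT slot_name,
       active,
       pg_size_pretty(
           pg_wal_lsn_diff(pg_current_wal_lsn(), confirmed_flush_lsn)
       ) AS flush_lag
FROM pg_replication_slots
WHERE slot_type = 'logical';
```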
We covered why disk pressure on a primary is a particularly nasty failure mode in How OpenAI scales Postgres and in the broader How databases store data internally write-up. The TL;DR is the same: when the primary’s disk fills, you do not get a graceful degradation. You get a hard crash and a recovery from backup.
Network: small for outbox, large for full database
The logical replication stream from primary to Debezium carries roughly the size of WAL records for tables in the publication, plus protocol overhead. For an outbox-only publication, this is small (proportional to outbox insert rate, which is one row per business write). For a full-database publication on a busy OLTP system, you are looking at a meaningful fraction of the primary’s network throughput.
The hop from Debezium to Kafka is a separate network path that does not load the database at all. Putting the Debezium worker in the same data center or VPC as Postgres is good practice. Streaming from Postgres to Debezium across regions works, but it adds latency and increases the chance of slot lag if the link blips.
Debezium vs a Streaming Replica: What Your DBA Means
The “another replica” framing your DBA used is partially right. Here is the side-by-side that usually settles the conversation.
| Aspect | Streaming (physical) replica | Debezium (logical) |
|---|---|---|
| WAL decoding on primary | None | Yes (extra CPU for pgoutput) |
| Replays WAL on a standby | Yes (separate machine) | No, Debezium just consumes events |
| Holds WAL on primary | Yes, until streamed | Yes, until acknowledged via slot |
| Risk of unbounded WAL growth | Low (replicas usually keep up) | Higher (Kafka Connect outages are common) |
| Network out of primary | Full WAL stream | Only the publication scope |
| Per-table filtering | No | Yes (publications, column filters, row filters) |
| Failover protocol | Built into Postgres | Manual, or via failover-slot tooling (Postgres 17 slot sync, Patroni) |
Two takeaways. First, Debezium is not “free” the way a read replica is “free” once it exists. There is real CPU on the primary for decoding. Second, Debezium is also not “the same as another replica” the way the DBA might fear. You can scope it down to a single small table and pay almost nothing in steady state, which is exactly what the outbox pattern lets you do.
Why the Outbox Pattern Is the Best Case for Debezium
If your only use of CDC was streaming an entire OLTP database, your DBA’s worry would be more justified. The outbox pattern is different. It is the friendliest possible workload for logical decoding for four reasons.
flowchart LR
A["fa:fa-table outbox<br/>narrow, INSERT-only,<br/>small payloads"]
B["fa:fa-filter Publication<br/>FOR TABLE outbox"]
C["fa:fa-cog Decoding<br/>cheap per row"]
D["fa:fa-broom Sweeper<br/>partition + DROP"]
A --> B --> C --> D
classDef step fill:#dbeafe,stroke:#1d4ed8,stroke-width:2px,color:#0f172a
class A,B,C,D step
- Narrow publication. Only the outbox table is turned into events for the connector. Every other table in the database is invisible to it from the moment you write `CREATE PUBLICATION outbox_pub FOR TABLE outbox`.
- Small payloads. Outbox rows are typically a few hundred bytes of metadata plus a JSON payload that consumers actually need. There are no large TOAST columns, no full-row UPDATE traffic, no cascading FK updates.
- Insert-only workload. The outbox is append-only by design. There are no UPDATE replication amplifiers (no full-row reconstruction, no TOAST chasing) unless you accidentally enable `REPLICA IDENTITY FULL`.
- Self-cleaning by partition drop. Because Debezium and the outbox event router deliver events from the WAL, published rows never need to be read from the table again, so you can drop entire daily partitions instead of running row-by-row DELETE, which would otherwise generate WAL traffic of its own.
Concrete settings for the outbox table
These are the defaults you want for a Postgres outbox table that streams to Kafka via Debezium.
```sql
CREATE TABLE outbox (
    id             bigserial,
    aggregate_type text NOT NULL,
    aggregate_id   text NOT NULL,
    event_type     text NOT NULL,
    payload        jsonb NOT NULL,
    created_at     timestamptz NOT NULL DEFAULT now(),
    -- On a partitioned table, the primary key must include the partition key.
    PRIMARY KEY (id, created_at)
) PARTITION BY RANGE (created_at);

ALTER TABLE outbox REPLICA IDENTITY DEFAULT;

CREATE PUBLICATION outbox_pub FOR TABLE outbox;
```
Five rules ride on those few lines.
- `REPLICA IDENTITY DEFAULT`, not `FULL`. Default uses the primary key for replication identity. `FULL` writes the entire old row image into every WAL record for UPDATE and DELETE. For an INSERT-only table you do not need it, and it bloats every WAL record without giving the consumer any information the new-row image does not already provide.
- Keep the table narrow. Store only what you need to publish. Reference business entities by ID. Do not dump the whole order row into the outbox payload.
- Partition by time. Daily partitions are a good starting point. Dropping a partition (`DROP TABLE` on it, optionally after a `DETACH PARTITION`) is metadata-only and generates almost no WAL; `DELETE FROM outbox WHERE created_at < ...` is row-by-row, generates WAL for every row, and creates index bloat that you then have to vacuum. A lifecycle sketch follows this list.
- Index sparingly. The relay (or Debezium) reads via the WAL, not the table, so most of the indexes you might add for "querying" the outbox are dead weight. Keep the primary key, and add indexes only for sweepers that actually run.
- Dedicated tablespace if needed. If your DBA is worried about I/O contention with hot OLTP tables, put the outbox on its own tablespace. The append-only access pattern lives well on a separate disk.
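A sketch of the partition lifecycle those rules imply. The partition names and the schedule are illustrative; most teams drive this from pg_partman or a small cron job. One assumption worth calling out: with a partitioned outbox you will likely want `publish_via_partition_root = true` (Postgres 13 and later) so Debezium sees changes under the parent table name instead of one stream per daily partition.

```sql
-- Create the next day's partition ahead of time (names are illustrative).
CREATE TABLE outbox_2026_05_01 PARTITION OF outbox
    FOR VALUES FROM ('2026-05-01') TO ('2026-05-02');

-- Cleanup is catalog work plus a file unlink: almost no WAL, no dead tuples.
ALTER TABLE outbox DETACH PARTITION outbox_2026_05_01;
DROP TABLE outbox_2026_05_01;

-- Optional: publish changes under the parent table's name (Postgres 13+).
ALTER PUBLICATION outbox_pub SET (publish_via_partition_root = true);
```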
A mental picture of the steady-state path
sequenceDiagram
autonumber
participant App as App
participant DB as Postgres
participant WAL as WAL
participant WS as walsender
participant DBZ as Debezium
participant K as Kafka
App->>DB: BEGIN
App->>DB: INSERT INTO orders ...
App->>DB: INSERT INTO outbox ...
App->>DB: COMMIT
DB->>WAL: write commit record
WAL-->>WS: pgoutput decode
WS-->>DBZ: logical change event
DBZ-->>K: produce(topic = aggregate_type)
K-->>DBZ: ack
DBZ-->>DB: confirm flush LSN
DB->>WAL: free WAL up to confirmed LSN
Notice the last two steps. The slot only advances when Debezium tells Postgres “I have safely shipped everything up to LSN X.” That ack is the heartbeat that keeps the disk from filling. Anything that breaks this loop, even briefly, accumulates WAL.
The WAL Bloat Mitigations, Step by Step
You have already met the headline mitigation (max_slot_wal_keep_size). Here is the full set of defenses, ordered roughly by how much they help.
1. Cap WAL retention per slot
```
# postgresql.conf
wal_level = logical
max_replication_slots = 10
max_wal_senders = 10
max_slot_wal_keep_size = 100GB
logical_decoding_work_mem = 256MB
```
max_slot_wal_keep_size is the brick wall that prevents a single misbehaving slot from killing the primary. When a slot exceeds the cap, Postgres invalidates the slot. Re-snapshotting Debezium is annoying. A primary crash on a Sunday is worse.
2. Use a Debezium heartbeat for quiet outbox tables
If your outbox table receives writes infrequently while the rest of the database is busy (think a low-volume admin service in a database that is shared with a busier service), the slot’s confirmed_flush_lsn will not advance past the latest activity in the publication. Meanwhile the rest of the database keeps writing WAL that the slot still pins. The result is exactly the same as a connector being down: WAL keeps growing.
The fix is a Debezium heartbeat. Two ways to implement it.
flowchart LR
Quiet["fa:fa-bed Outbox table<br/>(idle for 6h)"]
HB["fa:fa-heart-pulse Heartbeat<br/>tick every 30s"]
HBT["fa:fa-table heartbeat table<br/>or pg_logical_emit_message"]
Slot["fa:fa-bookmark Slot advances"]
WAL["fa:fa-broom WAL recycled"]
Quiet --> HB --> HBT --> Slot --> WAL
classDef src fill:#dbeafe,stroke:#1d4ed8,stroke-width:2px,color:#0f172a
classDef hb fill:#fff3e0,stroke:#f57c00,stroke-width:2px,color:#0f172a
classDef ok fill:#c8e6c9,stroke:#388e3c,stroke-width:2px,color:#0f172a
class Quiet src
class HB,HBT hb
class Slot,WAL ok
- Heartbeat table. Debezium periodically updates a row in a small dedicated table that sits inside the publication. The change flows through the WAL like any other, the slot advances, WAL recycles. You can also query the table to see “is the heartbeat ticking?” from SQL. Donghua’s measurements on this approach showed slot lag dropping from gigabytes to a few hundred bytes after enabling heartbeats.
- `pg_logical_emit_message()`. Postgres can emit a logical message into the WAL without touching any table. This is even cleaner because it does not pollute your schema with a heartbeat table.
Pick whichever fits your operational style. Either one solves the “quiet outbox in a busy database” problem.
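For reference, here is what each option can look like in SQL. With Debezium, the insert below is the kind of statement you would hand to the connector's `heartbeat.action.query` setting; the table name and message prefix are illustrative.

```sql
-- Option A: a tiny heartbeat table that lives inside the publication.
CREATE TABLE IF NOT EXISTS debezium_heartbeat (id int PRIMARY KEY, ts timestamptz NOT NULL);
ALTER PUBLICATION outbox_pub ADD TABLE debezium_heartbeat;

-- The statement the connector runs on every heartbeat tick:
INSERT INTO debezium_heartbeat (id, ts) VALUES (1, now())
ON CONFLICT (id) DO UPDATE SET ts = EXCLUDED.ts;

-- Option B: emit a logical message without touching any table (Postgres 9.6+).
SELECT pg_logical_emit_message(false, 'debezium-heartbeat', now()::text);
```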
3. Use a narrow, explicit publication
```sql
CREATE PUBLICATION outbox_pub FOR TABLE outbox;
```
Not FOR ALL TABLES. Not FOR TABLES IN SCHEMA public. Specifically the outbox table.
The reason is subtle. The walsender reads every WAL record regardless of the publication; filtering happens inside the output plugin. With pgoutput and a narrow `FOR TABLE` publication, changes for unpublished tables are discarded early and cheaply, before they are serialized and shipped. With a wildcard publication (filtering only in Debezium's `table.include.list`), the walsender pays the full decode-and-ship cost for every change in the database, and Debezium throws most of it away. For a busy database with a quiet outbox, that is pure waste.
4. Tune the reorder buffer for your workload
If you observe spill files in pg_replslot/<slot>/ under normal load, raise logical_decoding_work_mem. A common starting point is 256 MB; some teams go to 1 GB on hosts that can afford it. The cost is memory per active walsender, so multiply by the number of slots you run.
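The setting can be changed without a restart. A minimal sketch, assuming a role allowed to run `ALTER SYSTEM`:

```sql
-- Raise the per-walsender reorder buffer, then reload the configuration.
ALTER SYSTEM SET logical_decoding_work_mem = '256MB';
SELECT pg_reload_conf();

-- Sanity-check what a session now sees.
SHOW logical_decoding_work_mem;
```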
5. Watch for long-running transactions in the rest of the database
Logical decoding cannot ship the events for a transaction until that transaction commits. A transaction that stays open for an hour holds the slot at its starting LSN for that whole hour, which means an hour of WAL is pinned, even if the outbox table never sees a write from that transaction. Long transactions are a sin for many reasons; this is one more. The Postgres internals: how queries execute post covers transaction visibility in more depth.
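Finding the offender is a one-liner against `pg_stat_activity`. A sketch, with a five-minute threshold that you should tune to your own workload:

```sql
-- Transactions open long enough to pin WAL for every logical slot on the server.
SELECT pid,
       now() - xact_start AS open_for,
       state,
       left(query, 60) AS current_query
FROM pg_stat_activity
WHERE xact_start IS NOT NULL
  AND now() - xact_start > interval '5 minutes'
ORDER BY xact_start;
```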
Pgoutput vs Wal2json (and Why You Should Care)
Postgres ships with pgoutput since version 10. It is also what AWS RDS, GCP Cloud SQL, and Azure Database for PostgreSQL provide out of the box, with no extension installation required. wal2json is older, requires a separate extension, and serializes events to JSON text instead of the binary pgoutput format.
| Plugin | Format | Install | Filtering | Recommendation |
|---|---|---|---|---|
| `pgoutput` | Binary, native protocol | Built into Postgres 10+ | Publications, column lists, row filters | Default for all new projects |
| `wal2json` | JSON text | Separate extension, often needs OS package | Limited | Legacy only, do not start here |
pgoutput is faster, smaller on the wire, and supports finer-grained filtering. The filtering matters because Postgres can skip work for excluded rows before decoding them, which directly saves CPU on the primary. There is essentially no reason to use wal2json on a new Debezium project today. If you inherit a setup that uses it, plan a migration.
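On Postgres 15 and later the filtering can go beyond table selection: publications accept row filters and column lists, and pgoutput applies them on the primary before anything is shipped. A hedged sketch; the publication name and filter are illustrative, and the insert-only `publish` setting sidesteps the replica-identity restrictions that row filters carry for UPDATE and DELETE:

```sql
-- Postgres 15+: ship only the rows downstream consumers should ever see.
CREATE PUBLICATION outbox_pub_filtered
    FOR TABLE outbox WHERE (event_type <> 'internal')
    WITH (publish = 'insert');
```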
Debezium vs Polling: A Fair Comparison
The original transactional outbox post introduces both options. Once you have read the database-impact analysis above, the comparison gets sharper.
| Concern | Debezium / CDC | App-level polling |
|---|---|---|
| Primary CPU | Logical decoding cost | Repeated index scans, lock contention |
| Latency | Milliseconds | Polling interval (typically 100ms to 5s) |
| WAL retention risk | Yes, must monitor | None |
| Index bloat from cleanup | None (CDC just reads WAL) | Significant if you DELETE row-by-row |
| Lock contention with writers | None | Yes (SELECT FOR UPDATE SKIP LOCKED battles writers) |
| At-least-once delivery | Easy (LSN-based checkpoint) | Possible but you have to build it |
| Operational complexity | Higher (Kafka Connect, slot ops) | Lower (just an app job) |
| Throughput ceiling | Very high | Hits a wall around a few thousand events per second |
For most teams that are already running Kafka, Debezium is the right default for an outbox, provided you put the WAL-retention monitoring in place. For small teams without Kafka, polling is fine to start with, and you can switch to CDC later because the outbox table schema does not change.
What to Tell Your DB Team
If you remember nothing else from this post, remember the script for the design review.
- “Yes, the primary does extra CPU work for logical decoding. For an outbox-only publication on a small INSERT-heavy table, expect single-digit percent steady state.”
- “The bigger operational risk is WAL retention, not steady-state CPU. We will set
  `max_slot_wal_keep_size`, alarm on slot lag in bytes, and page on inactive slots.”
- “We will scope the publication tightly: `CREATE PUBLICATION outbox_pub FOR TABLE outbox`. No wildcards.”
- “We will use `REPLICA IDENTITY DEFAULT` on the outbox table. Not `FULL`.”
- “We will partition the outbox table by day and drop old partitions instead of running `DELETE`. The WAL impact of cleanup is essentially zero.”
- “We will add a Debezium heartbeat so the slot keeps advancing during quiet hours.”
- “Steady-state overhead for an outbox-only stream is far cheaper than the alternative of polling the outbox table from the application, which would cause index bloat and lock contention with our writers.”
That conversation usually ends with “fine, but you own the slot lag dashboard.” Which, fairly, you should.
Monitoring Checklist
Before going to production, every one of these signals should be on a dashboard, and the most important ones should be on a pager.
flowchart TB
subgraph PG["fa:fa-database Postgres signals"]
A1["pg_replication_slots<br/>active, confirmed_flush_lsn"]
A2["pg_stat_replication<br/>write_lag, flush_lag, replay_lag"]
A3["pg_wal/ disk usage"]
A4["walsender CPU per process"]
A5["reorder buffer spill files<br/>pg_ls_dir('pg_replslot/<slot>')"]
end
subgraph DBZ["fa:fa-stream Debezium signals"]
B1["MilliSecondsBehindSource"]
B2["QueueRemainingCapacity"]
B3["connector state (RUNNING / FAILED)"]
B4["snapshot status"]
end
subgraph K["fa:fa-paper-plane Kafka signals"]
C1["consumer lag on outbox topic"]
C2["broker disk usage"]
end
classDef pg fill:#dbeafe,stroke:#1d4ed8,stroke-width:2px,color:#0f172a
classDef dbz fill:#fff3e0,stroke:#f57c00,stroke-width:2px,color:#0f172a
classDef k fill:#c8e6c9,stroke:#388e3c,stroke-width:2px,color:#0f172a
class A1,A2,A3,A4,A5 pg
class B1,B2,B3,B4 dbz
class C1,C2 k
Concrete alert thresholds that have served real teams well.
| Signal | Warning | Page |
|---|---|---|
| Slot retained WAL bytes | 1 GB | 5 GB or 50 percent of max_slot_wal_keep_size |
| Slot inactive duration | 1 minute | 5 minutes |
| `pg_wal/` disk usage | 60 percent | 80 percent |
| walsender CPU sustained | 50 percent of one core | 90 percent for 5 minutes |
| Debezium `MilliSecondsBehindSource` | 10 seconds | 60 seconds |
| Connector state | TASK_FAILED for 1 minute | TASK_FAILED for 5 minutes |
| Reorder buffer spill rate | Any sustained spilling | Spills exceed available disk on slot directory |
The pattern from the thundering herd post applies here too: alarming late is alarming useless. The disk filling is a binary event with no soft-fail mode.
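On Postgres 13 and later, two extra columns on `pg_replication_slots` make the first rows of that table easy to wire up: `wal_status` flips to `lost` once `max_slot_wal_keep_size` has invalidated the slot, and `safe_wal_size` is the remaining headroom (NULL when no cap is set). A sketch:

```sql
-- One row per slot: how close each one is to being invalidated.
SELECT slot_name,
       active,
       wal_status,                                 -- reserved | extended | unreserved | lost
       pg_size_pretty(safe_wal_size) AS headroom,  -- bytes left before invalidation
       pg_size_pretty(
           pg_wal_lsn_diff(pg_current_wal_lsn(), restart_lsn)
       ) AS retained_wal
FROM pg_replication_slots;
```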
Failure Modes You Will Actually See
Patterns that show up in postmortems for Debezium plus outbox setups, with the fix.
| Failure | Symptom | Fix |
|---|---|---|
| Connector down over the weekend | Disk fills on Monday | max_slot_wal_keep_size plus paging on inactive slot |
| Slot frozen on quiet outbox | WAL grows even though outbox is calm | Add a Debezium heartbeat |
| Long-running transaction elsewhere | Slot LSN does not advance, WAL pins for hours | Find and kill the transaction; alert on pg_stat_activity open transactions older than 5 minutes |
| Wildcard publication | Walsender CPU spikes during unrelated table activity | Switch to `FOR TABLE outbox` |
| `REPLICA IDENTITY FULL` left on | WAL traffic doubles or worse during any UPDATE | `ALTER TABLE outbox REPLICA IDENTITY DEFAULT` |
| Row-by-row DELETE for cleanup | Index bloat, vacuum churn, extra WAL | Partition by time and drop old partitions |
| Reorder buffer always spilling | Decoding latency, high disk I/O on slot directory | Raise `logical_decoding_work_mem` |
| Slot invalidated by `max_slot_wal_keep_size` | Debezium cannot resume from old LSN | Plan a re-snapshot; this is by design and protects the database |
| Two connectors against the same DB | CPU on the primary roughly doubles | Consolidate into one connector with multiple table includes if possible |
| Network partition between Debezium and Postgres | Slot lag grows | Place Debezium close to the database; alert on slot inactivity |
A Production-Shaped Architecture
Putting all of the above together, the production-shaped architecture for an outbox stream looks like this. The diagram below mirrors the production flow your operations team will actually be on call for.
flowchart TB
subgraph App["fa:fa-server Service"]
API["fa:fa-network-wired API"]
BL["fa:fa-cogs Business Logic"]
end
subgraph PG["fa:fa-database Postgres Primary"]
Biz["fa:fa-table business tables"]
OB["fa:fa-table outbox (partitioned by day)"]
Pub["fa:fa-filter PUBLICATION outbox_pub<br/>FOR TABLE outbox"]
Slot["fa:fa-bookmark slot: outbox_slot<br/>pgoutput, max_slot_wal_keep_size=100GB"]
HB["fa:fa-heart-pulse Debezium heartbeat<br/>(pg_logical_emit_message every 30s)"]
end
subgraph CDC["fa:fa-stream Kafka Connect + Debezium"]
DBZ["fa:fa-cog Debezium PG Connector"]
SMT["fa:fa-route Outbox Event Router SMT"]
end
subgraph K["fa:fa-paper-plane Kafka"]
T1["fa:fa-list topic: order-events"]
T2["fa:fa-list topic: payment-events"]
end
subgraph Mon["fa:fa-chart-line Monitoring"]
M1["fa:fa-bell Slot lag alarm"]
M2["fa:fa-bell Inactive slot alarm"]
M3["fa:fa-bell pg_wal disk alarm"]
M4["fa:fa-bell Connector state alarm"]
end
API --> BL
BL --> Biz
BL --> OB
OB --> Pub
Pub --> Slot
HB --> Slot
Slot --> DBZ
DBZ --> SMT
SMT --> T1
SMT --> T2
Slot -. metrics .-> M1
Slot -. metrics .-> M2
PG -. metrics .-> M3
DBZ -. metrics .-> M4
classDef app fill:#dbeafe,stroke:#1d4ed8,stroke-width:2px,color:#0f172a
classDef pg fill:#fff3e0,stroke:#f57c00,stroke-width:2px,color:#0f172a
classDef cdc fill:#e0f2f1,stroke:#00796b,stroke-width:2px,color:#0f172a
classDef k fill:#c8e6c9,stroke:#388e3c,stroke-width:2px,color:#0f172a
classDef mon fill:#fde2e2,stroke:#b91c1c,stroke-width:2px,color:#0f172a
class API,BL app
class Biz,OB,Pub,Slot,HB pg
class DBZ,SMT cdc
class T1,T2 k
class M1,M2,M3,M4 mon
Six things to defend in a design review.
- One narrow publication scoped to the outbox table. No wildcards.
- One slot per connector, with `max_slot_wal_keep_size` set, so a misbehaving consumer cannot kill the primary.
- A heartbeat so the slot keeps advancing through quiet periods.
- Time-partitioned outbox table with drop-old-partitions cleanup.
- Outbox event router doing topic routing and key extraction inside Kafka Connect, not in your application.
- Monitoring on slot lag, disk, connector state, and walsender CPU, with paging thresholds calibrated to the real disk size.
Practical Lessons for Software Developers
A short list of things that have surprised teams in production.
Logical decoding is not free, but it is also not the bogeyman
The mental model your DBA brings is “another replica means another full WAL apply path.” The reality is “another walsender process that decodes WAL and ships filtered events.” For an outbox-shaped workload, the cost is small. For a wildcard publication on a busy OLTP database, the cost is real. Be honest about which one you are running.
Disk fills, then everything else fails
The first time you see “primary down” because of a stuck Debezium slot, you stop arguing with your DBA about whether to monitor slot lag. Build the alarm before the first incident. The cost of building it is one afternoon. The cost of skipping it is a Sunday.
Pick one source of truth for “what changed”
The value of CDC plus the outbox is that the WAL becomes the canonical timeline of business events. Do not also publish events from the application layer for “speed.” Pick one path. Two paths give you the dual-write problem in a new costume.
Cleanup is part of the design, not an afterthought
A `DELETE FROM outbox WHERE published = true` job feels obvious. It also generates WAL for every row, creates dead tuples, requires VACUUM to reclaim space, and competes for locks with writers. Time partitioning with partition drops skips all of that and is essentially free in WAL terms. We covered the same idea for hot tables in the database locks deep dive and the Postgres internals post.
Treat slots like persistent state
A replication slot is not a transient connection. It survives across Debezium restarts, Postgres restarts, even host reboots. That is a feature (it lets the connector resume cleanly) and a hazard (a slot left behind by a removed connector quietly pins WAL). Treat slot creation and deletion like database migrations: deliberate, reviewed, observable.
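Decommissioning deserves the same ceremony. A sketch of the checks before dropping a slot left behind by a removed connector; the slot name is illustrative, and you should only do this once you are certain nothing will resume from it:

```sql
-- Confirm the slot is genuinely orphaned before touching it.
SELECT slot_name, active, active_pid, restart_lsn
FROM pg_replication_slots
WHERE active = false;

-- Then drop it; the WAL it pinned becomes removable at the next checkpoint.
SELECT pg_drop_replication_slot('outbox_slot');
```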
Make consumers idempotent, regardless
Even with CDC and LSN-based checkpoints, you get at-least-once delivery, not exactly-once. The walsender can ship an event, then crash before the LSN ack reaches Postgres, and the event will be sent again on resume. Your consumers must already handle this. This is the same lesson as the transactional outbox post: idempotency at the consumer is non-negotiable.
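What consumer-side idempotency can look like in practice, sketched as a dedup table keyed by the outbox row id. The table and column names are illustrative; the same idea works with any unique event id carried in the message.

```sql
-- Runs inside the consumer's own transaction, alongside the actual work.
CREATE TABLE IF NOT EXISTS processed_events (
    event_id     bigint PRIMARY KEY,               -- the outbox id carried in the message
    processed_at timestamptz NOT NULL DEFAULT now()
);

-- If the insert conflicts, this event was already handled: skip the work.
INSERT INTO processed_events (event_id) VALUES (42)
ON CONFLICT (event_id) DO NOTHING;
```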
Cross-Cutting References
This post deliberately stays focused on the database-impact analysis. For neighboring topics, the hub posts on this blog go deeper:
- Transactional outbox pattern for the original “why” and the polling-based relay implementation.
- How Kafka works for the broker mechanics that Debezium produces into.
- Kafka vs RabbitMQ vs SQS for picking the right destination broker.
- Saga pattern for distributed transactions for how the outbox feeds saga steps.
- Postgres internals: how queries execute for transaction visibility and the WAL mechanics that logical decoding sits on top of.
- How databases store data internally for the storage engine view that explains why long transactions hurt.
- How OpenAI scales Postgres for a real-world example of a high-volume Postgres deployment.
- Database locks explained for the lock contention story that polling causes and CDC avoids.
- Thundering herd problem for the “alert too late, fail too hard” lesson that applies directly to slot monitoring.
- Circuit breaker pattern for protecting downstream services if the consumer falls behind.
- System design cheat sheet for the broader catalog of patterns this fits into.
Further Reading
These are the sources I keep going back to when this topic comes up.
- Gunnar Morling’s Mastering Postgres Replication Slots is the single best operational guide on slot management I have read. Read it before going to production.
- The Debezium PostgreSQL connector documentation is dense but authoritative on configuration and edge cases.
- The Debezium Outbox Event Router reference covers the SMT that does aggregate-based topic routing.
- PostgreSQL Replication Slots and Debezium from RisingWave has solid concrete numbers on WAL accumulation under common failure scenarios.
- Performance Tuning for CDC: Managing Replication Lag with Debezium from the Microsoft Postgres team is one of the few vendor posts that explicitly discusses reorder buffer spilling and `logical_decoding_work_mem` tuning under load.
- Debezium for CDC in Production: Pain Points and Limitations from Estuary is the honest counter-piece. Read it to understand where CDC stops being the easy answer.
- Improving Debezium performance from the Debezium team itself, with measured numbers from recent releases.
- The PostgreSQL logical replication docs for the underlying mechanism, which is the same one Debezium plugs into.
Wrapping Up
The outbox pattern with Debezium and Postgres is not a free lunch. It is, however, a remarkably good lunch for the price. The CPU cost on the primary is real but small for a properly scoped publication. The memory cost is small. The network cost is small. The disk cost is small in steady state, but it is the one that can ruin your weekend if you do not put the alarms in place.
If you take one thing away from this post, take this: the failure mode of Debezium plus an outbox is not slowness. It is unbounded WAL growth from a misbehaving slot. Set `max_slot_wal_keep_size`. Page on slot inactivity. Page on slot lag in bytes. Use a heartbeat. Use a narrow publication. Use `REPLICA IDENTITY DEFAULT`. Partition the outbox table and drop old partitions. Do those seven things and you will spend more time talking about the events on Kafka than about the database under it.
Your DBA was right to ask the question. They will be even more right to sign off once you walk them through the answers in this post.
For more on the broader pattern and the alternatives, see the Transactional outbox pattern, How Kafka works, Kafka vs RabbitMQ vs SQS, Saga pattern, Postgres internals, How databases store data internally, How OpenAI scales Postgres, Database locks, Thundering herd problem, Circuit breaker pattern, the System design cheat sheet, the full archive, and the Distributed systems hub.
Further reading: Gunnar Morling’s Mastering Postgres Replication Slots, the Debezium PostgreSQL connector docs, the Outbox Event Router SMT, and the PostgreSQL logical replication docs.