Playbook

AI SDR Deliverability: Controls for Outbound at Scale

An AI SDR sends faster than any human team. Deliverability is the system that keeps that volume in the inbox. Here's what the GTM Engineer instruments.

By Rome Thorndike | June 2026

An AI SDR changes the deliverability math. A human SDR sends maybe 50 cold emails a day and self-corrects when replies stop. An agent sends thousands across a fleet of mailboxes and keeps going, because it has no feel for a domain going cold. The agent writes the email. You own the rails it sends on. Get the rails wrong and a fresh domain torches in 4-5 days.

This is the deliverability layer of the broader operator job covered in how GTM Engineers manage AI SDRs. That piece is the full playbook across guardrails, governance, and attribution. This one goes deep on the one control surface that decides whether any of it reaches a human: inbox placement. If you haven't set the agent's behavioral limits yet, start with AI SDR guardrails first, then come back here for the sending side.

Authentication: SPF, DKIM, DMARC on every sending domain

Every domain the agent sends from needs three DNS records, no exceptions. SPF lists the servers allowed to send for your domain. DKIM signs each message so the receiving server can confirm it wasn't altered in transit. DMARC ties the two together and tells receivers what to do when a message fails: nothing, quarantine, or reject.

Google and Yahoo made all three mandatory for bulk senders in February 2024. Microsoft extended the same requirement to Outlook, Hotmail, and Live consumer domains in May 2025. If a message fails authentication, these providers reject it at the SMTP level with a hard bounce. Microsoft's rejection code is explicit: 550 5.7.515 Access denied, sending domain does not meet the required authentication level. That's not a spam-folder placement you can recover from. It's a wall.

Start DMARC at p=none with an rua aggregate-report address so you can watch alignment without affecting delivery. After 2-4 weeks of clean reports, move to p=quarantine. The technical setup mirrors the cold-email side; the cold email deliverability guide has the exact records to publish. For an AI SDR, the difference is scale. You're not authenticating one domain, you're authenticating 10 or 20, and a single missing DKIM key on one of them means a fifth of your volume bounces. Audit the full fleet before the agent sends its first batch, and re-audit whenever you add a domain.

Bulk-sender rules the agent triggers fast

The provider thresholds kick in at 5,000 messages per day to a given mailbox provider. A human team rarely crosses that. An AI SDR running 20 mailboxes at 75 sends each blows past it in an afternoon. Treat every sending domain as a bulk sender from the first send.

Three rules matter beyond authentication. One-click unsubscribe per RFC 8058: commercial mail must carry a List-Unsubscribe header and an List-Unsubscribe-Post header so a recipient can opt out in a single click, and you have to honor it within two days. Wire the unsubscribe endpoint into your suppression list so the agent never touches that address again. Second, the spam-complaint rate stays under 0.3%, which I'll cover next, because it's the metric most likely to kill an AI SDR program. Third, valid forward and reverse DNS on the sending IPs, plus a PTR record that resolves. Miss any of these and the providers stop trusting the domain, regardless of how clean the copy is.

Complaint-rate monitoring and automatic throttling

The 0.3% spam-complaint ceiling is the single number that decides whether an AI SDR survives. Google asks senders to stay under 0.1% and treats 0.3% as a hard enforcement line. Cross it and Gmail starts routing your domain to spam wholesale, not just the offending campaign. Yahoo holds the same threshold. The problem with an agent is speed: by the time complaints surface in a dashboard, the agent has already sent thousands more.

Build the monitor as a control loop, not a report. Pull Google Postmaster Tools and Yahoo's Sender Hub feedback data on a schedule, compute complaints per thousand sent per domain, and wire two thresholds. A soft alert at 0.1% pages you and flags the domain. A hard auto-throttle at 0.2%, set deliberately below the 0.3% breach, pauses new sends on that domain, lets the in-flight queue drain, and shifts volume to healthy domains. The agent keeps working; it just stops feeding a domain that's heading for the cliff.

Tie throttle events back to segments. A complaint spike usually traces to one list source or one message variant the agent over-sent. Surface that in the alert so you fix the input, not just the symptom. A blind pause buys you a day. Killing the bad segment keeps the domain alive for months.

Send caps and domain warmup

Per-domain and per-mailbox send caps are the throttle you set before the agent ever runs. A warmed mailbox in 2026 tolerates roughly 50-75 cold sends a day, lower than the 100+ that worked in 2023, because the providers tightened. Cap the agent at the mailbox level and let it spread volume across the fleet rather than hammering any single inbox. If you need 1,000 sends a day, that's 15-20 mailboxes at 60 each, not 4 at 250.

Warmup is non-negotiable for a new domain. A fresh domain that starts at full volume looks exactly like a spammer to the receiving filters. Ramp it: 20-30 sends a day for the first week, climbing to full volume over 3-4 weeks while warmup traffic runs alongside the real sends to build positive engagement signals. The 2026 warm-up strategy has the day-by-day schedule. The mistake teams make with an AI SDR is pointing it at a freshly bought domain and expecting the agent's volume to ramp itself. It won't. The agent sends what you let it send, so the cap and the warmup schedule live in your config, not the agent's prompt.

List hygiene: suppress unverified and bounced

Bad addresses drive bounces, bounces drive provider distrust, and an AI SDR will cheerfully send to every row you hand it. Two suppression rules protect the fleet. First, verify before send: run every address through verification and suppress anything that comes back invalid, risky, or catch-all unknown. Don't let the agent send to an unverified list. Second, suppress on bounce: a hard bounce means the address is dead, so it goes on the global suppression list immediately and the agent never retries it.

Keep a single global suppression list synced across every campaign and every domain. Unsubscribes, hard bounces, complaints, and do-not-contact entries all land there, and the agent checks it before composing. The failure mode without this is an agent that re-sends to an address that already complained, which is the fastest way to push a domain over the complaint ceiling. Hygiene is cheap. A burned domain and a 2-week warmup to replace it is not.

Monitoring and alerting

The agent runs unattended, so the instrumentation has to watch for it. Track four metrics per domain, per day: authentication pass rate from DMARC aggregate reports, spam-complaint rate from postmaster feeds, bounce rate split into hard and soft, and inbox-placement rate from seed-list tests across Gmail, Yahoo, and Outlook. Each one gets a threshold and an action.

Bounce rate auto-pauses a domain at 3%. Complaint rate auto-throttles at 0.2%. An authentication pass-rate drop below 95% pages you, because it usually means a DNS record changed or a key rotated and a chunk of volume is now bouncing silently. Inbox placement below 80% on any provider flags the domain for a warmup reset before the agent sends more. Route all of it to one alerts channel with the domain, the metric, the value, and the triggering segment, so the on-call response is a fix, not an investigation.

Deliverability for an AI SDR is the same discipline you'd apply to your own cold outbound, instrumented to run at the agent's speed and to fail safe when a number drifts. The agents that scale cleanly in 2026 aren't the ones with the best copy. They're the ones wired to throttle themselves before a domain dies. For the operator view across the rest of the stack, the managing AI SDRs playbook covers governance and attribution, and the Claude Code sales agent build shows where these deliverability checks slot into an agent you wire yourself.

Authoritative references for the bulk-sender rules: Google's email sender guidelines and Yahoo's sender best practices.

Frequently Asked Questions

What does AI SDR deliverability mean?

It's the set of controls that keep an AI SDR's outbound landing in the inbox instead of getting bounced or spam-foldered. The agent writes and sends. You own the sending infrastructure underneath it: authenticated domains, send caps, complaint-rate monitoring, warmup, and list hygiene. An AI SDR can triple your send volume overnight, which means it can also burn a domain in days if the rails aren't there. Deliverability is the rail.

Do the Google and Yahoo bulk-sender rules apply to AI SDR outbound?

Yes, once you cross 5,000 messages a day to a given provider. Google and Yahoo made authentication mandatory in February 2024, Microsoft followed for Outlook consumer domains in May 2025. All three require SPF, DKIM, and DMARC, one-click unsubscribe on commercial mail, and a spam-complaint rate under 0.3%. An AI SDR running across a fleet of mailboxes hits the 5,000/day threshold fast, so treat every sending domain as a bulk sender from day one.

How do I keep an AI SDR's spam-complaint rate under 0.3%?

Pipe provider feedback loops and postmaster data into a monitor that reads complaints per thousand sent, per domain. Set a soft alert at 0.1% and a hard auto-throttle at 0.2%, well below the 0.3% ceiling. When a domain crosses the soft line, the system pauses new sends on it, drains the in-flight queue, and routes volume to healthy domains while you find the bad segment. Complaints are a lagging signal, so you throttle on the trend, not the breach.

Who owns deliverability when the AI SDR is a vendor product?

You do. The vendor owns the model that writes the email. You own the domains, the DNS records, the suppression list, the send caps, and the monitoring. Buy the agent, build the rails. The outreach copy is close to a commodity in 2026, so the durable work sits in the deliverability instrumentation around the agent, and that stays yours no matter whose model sends the mail.

Source: State of GTM Engineering Report 2026 (n=228). Salary data combines survey responses from 228 GTM Engineers across 32 countries with analysis of 3,342 job postings.