Last updated May 2, 202615 min read

How We Fixed the Email Delivery Issue: Case SendGrid

We found an email delivery issue only because the reporting numbers stopped making sense. More people were supposed to receive emails, but fewer emails were actually delivered. The fix was not one magic button. It was a chain of small checks across SendGrid, Cloudflare DNS, Microsoft responses, SPF, DMARC, DKIM, and dedicated IP usage.

Email delivery pipeline with SendGrid, DNS authentication, and Microsoft inbox checks shown as connected checkpoints.

Share this article

Share this article on LinkedIn

Summarize with AI

ChatGPTClaudePerplexity
Table of Content

In-Short

The Numbers Exposed the Problem

The story started with a simple contradiction. More users became eligible for email campaigns, but fewer emails reached inboxes.

A technical fix had added around 1K users back into triggered email lists. That should have increased delivery. Instead, delivered emails dropped. That gap was the warning light on the dashboard.

One Fix Revealed Another Problem

The first problem was inside the campaign logic. Some users had a slightly different status, so emails were not triggered for them.

After that was corrected, the list grew. But the larger send volume exposed a second issue: Microsoft inboxes started delaying or rejecting emails. The technical logic was fixed, but the delivery system was not ready for the extra pressure.

The Real Answer Was Hidden in Event History

SendGrid support did not give a useful answer. The useful answer came from the Activity Feed.

Inside one deferred email, there was a small option to review the full response history. That was the key. It showed Microsoft responses like SPF pass, DKIM fail, DMARC pass, IP throttling, and authentication-level rejection. Once those messages were visible, the problem became readable.

Email Delivery Is Like Border Control

Sending an email is not just pressing send. It is more like crossing a border.

SPF says who is allowed to send. DKIM signs the message. DMARC tells the receiver what to do if something looks fake. The IP address carries reputation. If one document is wrong, the border officer slows everything down. If the signature fails, the officer can refuse entry completely.

The First Signal Was a Reporting Contradiction

Long Read

The problem did not start as a DNS problem. It started as a reporting problem.

We noticed that more people were subscribed or eligible for emails, but fewer emails were triggered and delivered. That made no business sense. If the eligible audience grows, the sent and delivered numbers should normally move in the same direction.

This is why reporting matters. A dashboard is not only there to say whether numbers are good or bad. It is there to catch contradictions. This case connects closely with the same logic behind engagement reporting: if the system says users should be reached, but the communication layer fails, the campaign result is already damaged before marketing performance can even be judged.

About the author

Nikita Goncharenko

Nikita Goncharenko

Project Manager

Nikita Goncharenko coordinates digital projects, migrations, and delivery workflows by turning unclear needs into structured execution.