Case Study|Entity Resolution

An Army of AI Agents vs. 1.69 Million Broken Records

How AI agents that reason through data chaos turned an unsolvable problem into a competitive advantage

The Problem

Offers going to wrong people. Real owners missed entirely. Competitors reaching them first. Everyone knew the data was broken—but fixing it would take 50 analysts months of work. Not worth it.

The Solution

An army of AI agents that reason through data chaos—understanding that “Miller, John” and “John Miller” are the same person without being told. 10 agents doing in hours what 50 analysts couldn't do in months.

The Result

Problems that were previously unsolvable are now solvable. Better targeting. Faster deals. A capability their competitors don't have—and can't easily replicate.

1.69M

Records Processed

1.25M

Unique Owners

125x

Cost Savings

10x

Time to Market

The Outcome

A Capability Their Competitors Don't Have

Better targeting, faster deals, and ownership tracking that competitors can't match—because they're still using the broken data.

More Accurate Offers

Know exactly who owns what before making contact. No more sending offers to the wrong person or missing key owners.

Targeting ConfidenceBefore → After

62%

94%

Faster to Deals

Skip manual research. Data is ready on day one. Start outreach immediately instead of spending weeks cleaning data.

Time to First Contact
Before:
6 weeks
After:
Hours

Track Ownership Changes

See when properties change hands before competitors. First-mover advantage on new deals.

Change DetectionLive
UNIFIED-1234 ownership transferred
12 new properties linked
Address update detected

The Competitive Advantage

Everyone working with public tax data has these problems. Fixing them was never financially feasible—until AI agents made it possible. This client did. Their competitors haven't.

“Before this, we were sending four mailers to the same person and missing actual property owners entirely. Now we know exactly who owns what before we reach out.”

The Challenge

Public Data, Private Headaches

The underlying data comes from public tax rolls and county records. Anyone who has worked with government data knows the quality varies wildly - one county assigns the same ID to hundreds of different people, another scatters one person across dozens of IDs. Every company using this data inherits these problems.

670 Different People. One ID.

Owner ID 817132 was assigned to 670 completely unrelated people. Different names, different states, nothing in common.

BEFORE: Owner ID 817132
├──Sims Frances A
├──Pendergast Finis H
├──Police Billy
├──Palmer Ruth C
├──Partridge Charlyn
└──... 665 more unrelated people

Exclude one person? You'd exclude all 670.

AFTER: Separated IDs
COL-10001Sims Frances A
COL-10002Pendergast Finis H
COL-10003Police Billy
...670 unique IDs total

Each person can be managed individually.

44,421

Collision Groups

670

Worst Case

51,609

New IDs Created

What Didn't Work

Not Financially Feasible Before Agentic AI

Rules-based systems can't handle the infinite variations in how names and addresses appear across county records.

The Rules-Based Problem

How many rules would you need to match these?

The Transformation

From ID Chaos to Clean Profiles

See how fragmented, conflicting records become unified owner profiles with AI-validated confidence scores.

Raw Data from Provider

Owner Records Export

1.69M records with broken IDs

44,421 collision groups
ID: 817132ID shared with 669 other people!

John Miller

123 Oak St, Denver CO

ID: 817132Different person, same ID

Paula Richardson

456 Pine Ave, Boulder CO

ID: 234567Same as record #1?

Miller, J

123 Oak Street

ID: 891234Also same person?

JOHN MILLER

Oak St 123, Denver

670
Worst collision
99,581
Duplicate groups
???
True owner count

Click the toggle above to see the transformation

How We Did It

10 AI Agents Working in Parallel

A detailed look at how the agents reasoned through 1.69 million records.

Entity Resolution Pipeline

Raw Records

1.69M owner records

Fix Collisions

Separate shared IDs

Consolidate

Merge duplicates

Cross-Reference

Find overlaps

Match Records

Connect stable records

Clean Data

1.25M unique owners

1Step 1 of 7

Inconsistent Public Records

Owner ID 817132 had 670 different people assigned to it. Miller Paula appeared under 4 separate IDs. Anyone working with government data at the county level knows this pain.

1.69M

Total Records

44,421

Collision Groups

99,581

Unstable Groups

The Results

What an Army of AI Agents Accomplished

We deployed 10 agents for this project. We could have deployed hundreds. The reasoning scales infinitely—that's what changed.

10x faster
1.69M records1.25M unique owners

44,421 collision groups fixed. 99,581 unstable IDs consolidated.

$25,000
Manual cost
$200AI processing
125x

cost savings

10x

Faster to Market

1

Record per Owner

100%

Decisions Documented

Works for Your Industry

The Same Approach, Tailored to Your Data

Whatever messy data you deal with, we can clean it up.

Oil & Gas

Mineral Rights Ownership

Track ownership changes across millions of records. Reach the right person on the first try.

First-mover advantage

Real Estate

Property & Owner Records

Same property, different records over time. Track ownership accurately.

10x faster research

Financial Services

Customer Data Unification

One customer, multiple products, multiple names. Create a single customer view.

Single source of truth

Healthcare

Patient Record Matching

Same patient, different visits, different systems. Unify records across facilities.

90%+ duplicate reduction

Manufacturing

Supplier Consolidation

Same vendor, different names from M&A. Consolidate your supplier base.

30%+ duplicate suppliers found

Your Industry

Custom Data Challenges

Any messy data that needs matching, deduplication, or consolidation.

Let's discuss your use case

Your data could be your competitive advantage

If your industry has messy data that everyone struggles with, we can help you fix it. Talk to us about your specific challenges.

From this project

1.69M

Records Processed

125x

Cost Savings

10x

Faster to Market