Part 2: From F to B: How We Achieved a 122% Improvement in Healthcare Data Quality

In Part 1, we revealed that average healthcare data quality scores just 36/100 (F grade). We quantified the costs: 10x higher AI expenses, clinical risks from missing diagnoses, and regulatory exposure. Now, here’s how we systematically fixed it.

The Solution: Systematic, Measured Improvement

After applying our 13-step data refinement process across 3 main layers, we re-measured the same patient records.

Average post-refinement score: 80/100 (B grade – Good)

Improvement: 44 points = 122% increase

What Changed – Layer by Layer

Layer 1: Raw Data Collection

Connected to ALL major networks (Patient Access API, TEFCA, CMS)
Integrated ALL source types (EMR, payer, pharmacy, lab, imaging)
Result: Multi-source completeness improved from 25/100 → 75/100

Layer 2: Derived Data Standardization

Mapped custom codes to standards (LOINC, RxNorm, CVX, SNOMED)
Applied clinical logic (active vs. completed medications)
Deduplicated across sources using medical terminologies
Result: Conformity improved from 38/100 → 82/100

Layer 3: Composition for AI Optimization

Grouped related information (brand/generic drugs)
Created timeline views optimized for AI processing
Added clinical context (prescriber info, encounter types)
Result: Availability improved from 45/100 → 85/100

Real Patient Example: Before and After

Patient: 43-year-old with chronic conditions

Before (36/100 – F grade):

The Data:

700+ medication records (dispenses, prescriptions, patient statements)
21 medications marked “active” (including medications from 2021—five years ago)
4,462 lab and vital readings
Different coding systems (LOINC, custom Epic codes, no codes at all)
Same labs with different names and units (9000mg vs 9g)
Missing RxNorm codes, using custom Cerner systems
Brand and generic versions of same drug listed separately
Labs miscategorized as vitals and vice versa

The Scores:

Availability: 45/100 (D) – Missing critical resources
Accuracy: 52/100 (F) – Invalid values and formats
Conformity: 38/100 (F) – Non-standard codes
Plausibility: 48/100 (F) – Implausible timelines
Completeness: 25/100 (F) – Only 19% longitudinal coverage, single-source

Critical Issue: Crohn’s disease and Lupus diagnoses exist in Patient Access API network but completely absent from TEFCA.

After (80/100 – B grade):

The Data:

29 clinically relevant medications
3 actually active medications (correctly identified using clinical logic)
Brand and generic drugs properly grouped (Jakafi + Ruxolitinib)
All codes standardized to RxNorm
Prescriber information filled in from national provider directory
24 key lab findings + 6 vitals (older data accessible if needed)
All standardized to LOINC codes
Results and reference ranges properly extracted
Organized in timeline view optimized for AI processing

The Scores:

Availability: 85/100 (B) – All critical resources present
Accuracy: 88/100 (B) – Clinical validation applied
Conformity: 82/100 (B) – Standardized terminologies
Plausibility: 78/100 (C) – Temporal and clinical validation
Completeness: 72/100 (C) – Connected all networks, multi-source validation

Critical Fix: Crohn’s disease diagnosis is now visible (retrieved from Patient Access API and connected across networks).

The Clinical Impact: What Each Dimension Prevents

Scenario 1: The Missing Diagnosis (Completeness Issue)

Without Refinement:

AI sees: Abdominal pain complaint
AI recommends: Ibuprofen (Advil) for pain relief
Hidden risk: Patient has Crohn’s disease (only in disconnected network)
Outcome: NSAIDs can cause serious complications for Crohn’s patients

With b.well Refinement:

AI sees: Abdominal pain + Crohn’s disease diagnosis
AI recommends: Acetaminophen (Tylenol) instead, avoids NSAIDs
Outcome: Safe, appropriate recommendation

PIQI Dimension Improved: Multi-Source Completeness (25/100 → 75/100)

Scenario 2: The Medication Time Bomb (Availability + Accuracy Issue)

Without Refinement:

AI sees: 21 “active” medications (including 5-year-old prescriptions)
AI checks: Drug interactions across all 21
Risk: False positive interactions, alert fatigue
Worse risk: Recommends both brand (Jakafi) and generic (Ruxolitinib) versions = double dosing

With b.well Refinement:

AI sees: 3 actually active medications (validated with clinical logic)
AI checks: Interactions only for current medications
Outcome: Accurate interaction checking, brand/generic properly grouped

PIQI Dimension Improved: Availability + Accuracy (45/100 → 85/100)

Scenario 3: The Token Cost Explosion (Temporal Density + Currency Issue)

Without Refinement:

AI processes: 4,462 lab results in context window
Token cost: ~$50 per query (at GPT-4 pricing)
Response time: 30+ seconds
Risk: Context window exhaustion, important findings buried in noise

With b.well Refinement:

AI processes: 24 key lab findings (older data accessible if needed)
Token cost: ~$5 per query
Response time: 3 seconds
Outcome: 10x cost reduction, 10x speed improvement, better focus

PIQI Dimension Improved: Temporal Density + Currency (25/100 → 72/100)

The Differentiator: De-Duplication Across Sources

Here’s what most companies miss: The same clinical event appears differently across every data source.

Example: COVID-19 Vaccination

From Walgreens (Pharmacy):
Code: CVX 208 (COVID-19 vaccine)
Date: 2024-01-15
Lot: ABC123

From Cigna (Payer):
Code: CPT 91300 (COVID immunization admin)
Date: 2024-01-15
Claim: Walgreens Pharmacy

From One Medical (EMR):
Code: Custom Epic code “COVID_VAX_2024”
Date: 2024-01-15
Note: “Patient received Pfizer booster”

Without Proper De-Duplication:

AI sees: 3 different COVID vaccinations on the same day
AI recommends: “You may be due for your COVID booster.”
Patient confusion: “But I just got one!”

With b.well’s Medical Terminology-Based De-Duplication:

AI sees: 1 COVID vaccination (cross-validated across 3 sources)
AI recommends: “Your COVID vaccination is current.”
Added benefit: Lot number from pharmacy + clinical note from EMR = complete record

PIQI Dimension Improved: Multi-Source Completeness + Conformity

This is why conformity to medical terminologies (LOINC, RxNorm, CVX, SNOMED) is critical. It’s not just about standards compliance—it’s the foundation for intelligent de-duplication across sources.

The Solution: Systematic, Measured Improvement

What Changed – Layer by Layer

Layer 1: Raw Data Collection

Layer 2: Derived Data Standardization

Layer 3: Composition for AI Optimization

Real Patient Example: Before and After

Before (36/100 – F grade):

After (80/100 – B grade):

The Clinical Impact: What Each Dimension Prevents

Scenario 1: The Missing Diagnosis (Completeness Issue)

Scenario 2: The Medication Time Bomb (Availability + Accuracy Issue)

Scenario 3: The Token Cost Explosion (Temporal Density + Currency Issue)

The Differentiator: De-Duplication Across Sources

Without Proper De-Duplication:

With b.well’s Medical Terminology-Based De-Duplication:

You May Also Like

Join us on our mission to simplify healthcare, one person at a time.