Entity Patterns

Discover patterns in entity resolution data โ€” name matching heuristics, geographic distribution, confidence scoring, and data quality signals. 13 patterns detected.

13Patterns
8High Significance
88%Avg Confidence
4Strengthening
Aa

Name Matching

3
94%

Abbreviation expansion improves match rate by 23% โ€” "Mgmt" โ†’ "Management", "Intl" โ†’ "International" account for 41% of fuzzy corrections.

high8,420 data points
91%

Legal suffix stripping (LLC, Ltd, Corp, GmbH) reduces false negatives by 18% without impacting precision.

high12,847 data points
87%

Token-order-invariant matching catches 12% more fund names where words appear in different order (e.g. "Total Return PIMCO" vs "PIMCO Total Return").

medium3,210 data pointsstrengthening
Gl

Geographic Distribution

3
96%

US-domiciled entities represent 58% of resolution volume. Auto-resolve rate is 12% higher for US entities due to richer GLEIF coverage.

high7,450 data points
82%

Cayman Islands and Luxembourg fund registrations cluster around quarter-end dates, causing 3x spike in resolution requests.

medium1,890 data pointsstrengthening
74%

APAC entity names with transliteration variants (Nomura/้‡Žๆ‘) require multi-script matching โ€” current Jaro-Winkler underperforms by 15%.

high920 data points
Sc

Confidence & Scoring

3
93%

Bimodal confidence distribution: 68% of resolutions score >90% (auto-resolve) or <40% (no match). The 40-90% review band contains only 32% of volume.

high12,847 data points
89%

Adding LEI as input boosts composite score by avg 28 points. Only 23% of incoming records include LEI โ€” opportunity to enrich upstream.

high2,950 data points
86%

Country code match contributes disproportionate signal โ€” wrong country drops confidence by avg 31 points even when name is 95%+ similar.

medium4,100 data points
DQ

Data Quality

2
88%

GLEIF cache staleness: 8.3% of cached entities have registration status changes within 90 days. Recommend 30-day refresh cycle for active counterparties.

high2,500 data points
81%

Duplicate entity detection: 3.1% of resolved entities map to multiple LEIs (merged, transferred, or lapsed registrations). Ownership chain analysis resolves 78% of ambiguity.

medium1,640 data pointsstrengthening
Lk

Relationships & Linkage

2
92%

Fund-to-manager clustering: top 20 asset managers control 74% of resolved fund entities. Manager LEI lookup resolves 89% of ambiguous fund names.

high5,680 data points
85%

S&P Global PMID linkage available for 62% of resolved entities โ€” enables downstream portfolio analytics and risk aggregation.

medium7,960 data pointsstrengthening
v0.1.0