hcris-analyst

A Claude Code skill for U.S. nonprofit hospital financials.

Two of the most cited datasets in U.S. hospital research, CMS Medicare Cost Reports (HCRIS) and IRS Form 990 Schedule H, describe overlapping aspects of the same hospitals, but with different field definitions, different fiscal periods, and different file formats. CMS ships 100,000-row long-skinny CSVs; the IRS ships 990s as bulk ZIPs of XML. Joining the two at the hospital level takes real work.
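To make "long-skinny" concrete, here is a minimal sketch of the kind of reshaping such a file needs before it can be joined to anything: one row per worksheet cell is folded into one record per cost report. The column names follow the HCRIS numeric-file layout as commonly documented, but treat them as an assumption; the header row, the `pivot_long_to_wide` helper, and all values are invented for illustration and are not taken from trove.

```python
import csv
import io
from collections import defaultdict

# Invented sample in a long layout: one row per (report, worksheet, line, column) cell.
# Column names are assumed; the values are made up for this example.
LONG_CSV = """RPT_REC_NUM,WKSHT_CD,LINE_NUM,CLMN_NUM,ITM_VAL_NUM
1001,G300000,00100,00100,5000000
1001,G300000,00200,00100,3000000
1002,G300000,00100,00100,900000
"""


def pivot_long_to_wide(text):
    """Return {report_id: {"worksheet:line:column": value}} from long-format CSV text."""
    wide = defaultdict(dict)
    for row in csv.DictReader(io.StringIO(text)):
        # Each cell is addressed by its worksheet code, line number, and column number.
        key = f'{row["WKSHT_CD"]}:{row["LINE_NUM"]}:{row["CLMN_NUM"]}'
        wide[row["RPT_REC_NUM"]][key] = float(row["ITM_VAL_NUM"])
    return dict(wide)


wide = pivot_long_to_wide(LONG_CSV)
```

After the pivot, each report is a single dictionary keyed by cell address, which is a shape that can be joined to other per-hospital records.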

hcris-analyst is a Claude Code skill that queries the trove-published bundles where that work has already been done. It runs in your own Claude Code session, translates natural-language questions into DuckDB SQL, knows where the data lives, and handles the definitional caveats that trip up naive cross-form comparisons.
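As a rough illustration of the kind of query involved, the sketch below rolls cost-report rows up to the EIN level through a CCN-to-EIN crosswalk and then joins them to Schedule H values. Everything in it is hypothetical: the table names, column names, identifiers, and figures are invented, and it assumes one EIN may cover several CCNs. The skill generates DuckDB SQL; Python's built-in `sqlite3` is swapped in here only so the example runs without extra installs, and the query sticks to plain SQL.

```python
import sqlite3

# In-memory stand-in for the published bundles; all tables and values are invented.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE hcris (ccn TEXT, net_patient_revenue REAL);
CREATE TABLE crosswalk (ccn TEXT, ein TEXT);
CREATE TABLE schedule_h (ein TEXT, charity_care_cost REAL);
INSERT INTO hcris VALUES ('000001', 600.0), ('000002', 400.0), ('000003', 500.0);
INSERT INTO crosswalk VALUES ('000001', 'EIN-A'), ('000002', 'EIN-A'), ('000003', 'EIN-B');
INSERT INTO schedule_h VALUES ('EIN-A', 50.0), ('EIN-B', 10.0);
""")

QUERY = """
WITH hcris_by_ein AS (
    -- Sum facility-level (CCN) rows up to the filer level (EIN).
    SELECT x.ein, SUM(h.net_patient_revenue) AS net_patient_revenue
    FROM hcris AS h
    JOIN crosswalk AS x ON x.ccn = h.ccn
    GROUP BY x.ein
)
SELECT e.ein,
       e.net_patient_revenue,
       s.charity_care_cost,
       s.charity_care_cost / e.net_patient_revenue AS charity_care_share
FROM hcris_by_ein AS e
JOIN schedule_h AS s ON s.ein = e.ein
ORDER BY e.ein
"""

rows = conn.execute(QUERY).fetchall()
for ein, revenue, charity, share in rows:
    print(ein, revenue, charity, round(share, 4))
```

Aggregating before the join matters in this sketch: joining CCN-level rows directly to EIN-level Schedule H rows would repeat each Schedule H value once per facility and inflate any totals built on it.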

Install

/plugin marketplace add cbetz/trove
/plugin install trove@trove

Or copy the skill directory directly:

git clone https://github.com/cbetz/trove
cp -r trove/skills/hcris-analyst ~/.claude/skills/

Example prompts

What this skill does

What this skill doesn't do

Coverage

1,295 nonprofit U.S. hospital systems, matched at the EIN level. HCRIS data comes from the hospital cost report form CMS-2552-10 for FY2023. IRS Form 990 Schedule H data covers TY2022, drawn from the 2024, 2025, and 2026 release years. The CCN↔EIN crosswalk is the December 2024 vintage from Community Benefit Insight (RTI International / RWJF). Browse the index at /hospitals/.

Source and license

The project is MIT-licensed. Source code, parsers, and raw artifacts are at github.com/cbetz/trove. The underlying data (CMS HCRIS, IRS 990 e-file, and CDC SVI) is U.S. government work in the public domain. The CCN↔EIN crosswalk is from Community Benefit Insight: RTI Press DOI 10.3768/rtipress.2023.op.0080.2302.