saturn·

cms medicaid

saturn notebook · generated 2026-04-22 Report Notebook

Overview

Source: /home/coolhand/datasets/accessibility-atlas/cms_medicaid_enrollment_2026.csv

Saturn profiled 10,302 rows across 44 columns. The stats below are deterministic and machine-readable; the prose is a language-model interpretation of those stats (opt-in, added after the fact, never sees raw rows).

[2]:
!pip install saturn-dissect
import subprocess
subprocess.run([
    "saturn", "analyze", "/home/coolhand/datasets/accessibility-atlas/cms_medicaid_enrollment_2026.csv",
    "--findings", "cms-medicaid.json",
    "--llm", "anthropic:claude-opus-4-7",
])

Summary confidence: high

This dataset captures monthly state-level Medicaid and CHIP performance reports (10,302 rows × 44 columns) covering enrollment counts, application volumes, eligibility determinations by processing-time bucket, and call-center metrics across all 51 state jurisdictions. The reporting structure is clean and balanced — each state contributes 202 rows, and the Final Report and Preliminary/Updated flags split exactly 50/50 — but most numeric metrics are heavily right-skewed and riddled with outliers, since large states like California dwarf smaller ones (e.g., Total Medicaid Enrollment ranges from 0 to 13.2M with skew 3.6). Two things deserve a closer look first: the very high null rates on operational metrics (Total Adult Medicaid Enrollment is 85% null; call-center fields ~70% null), which suggests many states simply don't report these, and the Medicaid-expansion split (73% Y vs 27% N) which is a natural lens for comparing enrollment and processing outcomes. The free-text 'footnotes' columns are also worth scanning — they reveal systematic data-quality caveats (e.g., 'Incorrectly reporting processing time at application level') that should temper any cross-state comparison.

citing: row_count · column_count · Total Medicaid Enrollment · Total Medicaid and CHIP Enrollment · Total Adult Medicaid Enrollment · State Expanded Medicaid · State Abbreviation · Final Report · Preliminary or Updated · Average Call Center Wait Time (Minutes) · Average Call Center Abandonment Rate · Reporting Period

Out[4]:

saturn.schema() · 44 columns

column kind n null% unique alerts
State Abbreviation categorical 10,302 0.0% 51
State Name categorical 10,302 0.0% 51
Reporting Period numeric 10,302 0.0% 102
State Expanded Medicaid categorical 10,302 0.0% 2
Preliminary or Updated categorical 10,302 0.0% 2
Final Report categorical 10,302 0.0% 2
New Applications Submitted to Medicaid and CHIP Agencies numeric 10,302 0.5% 5,378 high_skew outliers
New Applications Submitted to Medicaid and CHIP Agencies - footnotes categorical 10,302 76.6% 18 null_rate
Applications for Financial Assistance Submitted to the State Based Marketplace numeric 10,302 0.5% 1,373 high_skew outliers
Applications for Financial Assistance Submitted to the State Based Marketplace - footnotes categorical 10,302 97.4% 3 null_rate
Total Applications for Financial Assistance Submitted at State Level numeric 10,302 0.5% 5,591 high_skew outliers
Total Applications for Financial Assistance Submitted at State Level - footnotes categorical 10,302 73.4% 17 null_rate
Individuals Determined Eligible for Medicaid at Application numeric 10,302 0.5% 5,568 high_skew outliers
Individuals Determined Eligible for Medicaid at Application - footnotes categorical 10,302 72.8% 18 null_rate
Individuals Determined Eligible for CHIP at Application numeric 10,302 0.5% 3,064 high_skew outliers
Individuals Determined Eligible for CHIP at Application - footnotes categorical 10,302 91.0% 7 null_rate
Total Medicaid and CHIP Determinations numeric 10,302 0.5% 5,587 high_skew outliers
Total Medicaid and CHIP Determinations - footnotes categorical 10,302 81.6% 12 null_rate
Medicaid and CHIP Child Enrollment numeric 10,302 0.5% 8,094 high_skew outliers
Medicaid and CHIP Child Enrollment - footnotes categorical 10,302 92.2% 7 null_rate
Total Medicaid and CHIP Enrollment numeric 10,302 0.0% 8,309 high_skew outliers
Total Medicaid and CHIP Enrollment - footnotes categorical 10,302 93.1% 9 null_rate
Total Medicaid Enrollment numeric 10,302 0.5% 8,221 high_skew outliers
Total Medicaid Enrollment - footnotes categorical 10,302 93.2% 7 null_rate
Total CHIP Enrollment numeric 10,302 0.5% 7,918 high_skew outliers
Total CHIP Enrollment - footnotes categorical 10,302 95.9% 4 null_rate
Total Adult Medicaid Enrollment numeric 10,302 84.7% 1,415 high_skew null_rate
Total Adult Medicaid Enrollment - footnotes categorical 10,302 98.8% 4 null_rate
Total Medicaid and CHIP Determinations Processed in Less than 24 Hours numeric 10,302 43.1% 2,701 high_skew outliers null_rate
Total Medicaid and CHIP Determinations Processed in Less than 24 Hours - footnotes categorical 10,302 89.2% 20 null_rate
Total Medicaid and CHIP Determinations Processed Between 24 Hours and 7 Days numeric 10,302 43.1% 2,559 high_skew outliers null_rate
Total Medicaid and CHIP Determinations Processed Between 24 Hours and 7 Days - footnotes categorical 10,302 89.2% 21 null_rate
Total Medicaid and CHIP Determinations Processed Between 8 Days and 30 Days numeric 10,302 43.1% 2,608 high_skew outliers null_rate
Total Medicaid and CHIP Determinations Processed Between 8 Days and 30 Days - footnotes categorical 10,302 89.3% 16 null_rate
Total Medicaid and CHIP Determinations Processed between 31 days and 45 days numeric 10,302 43.1% 1,923 high_skew outliers null_rate
Total Medicaid and CHIP Determinations Processed between 31 days and 45 days - footnotes categorical 10,302 89.3% 15 null_rate
Total Medicaid and CHIP Determinations Processed in More than 45 Days numeric 10,302 43.1% 1,691 high_skew outliers null_rate
Total Medicaid and CHIP Determinations Processed in More than 45 Days - footnotes categorical 10,302 89.3% 15 null_rate
Total Call Center Volume (Number of Calls) numeric 10,302 69.5% 1,592 high_skew outliers null_rate
Total Call Center Volume (Number of Calls) - footnotes categorical 10,302 71.0% 27 null_rate
Average Call Center Wait Time (Minutes) numeric 10,302 69.5% 63 null_rate
Average Call Center Wait Time (Minutes) - footnotes categorical 10,302 69.7% 44 null_rate
Average Call Center Abandonment Rate numeric 10,302 69.5% 408 null_rate
Average Call Center Abandonment Rate - footnotes categorical 10,302 69.7% 36 null_rate
Fig 1.
Total Medicaid and CHIP Enrollment · Shows the heavy right-skew of state enrollment totals, ranging from near-zero up to ~14.5M.
Show data table
Histogram bins for Total Medicaid and CHIP Enrollment (median: 1031822.0).
bincount
0 – 3.616e+052640
3.616e+05 – 7.231e+051192
7.231e+05 – 1.085e+061581
1.085e+06 – 1.446e+061149
1.446e+06 – 1.808e+061171
1.808e+06 – 2.169e+06726
2.169e+06 – 2.531e+06291
2.531e+06 – 2.893e+06207
2.893e+06 – 3.254e+06331
3.254e+06 – 3.616e+06143
3.616e+06 – 3.977e+06175
3.977e+06 – 4.339e+06106
4.339e+06 – 4.7e+0684
4.7e+06 – 5.062e+0642
5.062e+06 – 5.423e+0626
5.423e+06 – 5.785e+0619
5.785e+06 – 6.147e+0675
6.147e+06 – 6.508e+0620
6.508e+06 – 6.87e+0651
6.87e+06 – 7.231e+0635
7.231e+06 – 7.593e+0632
7.593e+06 – 7.954e+063
7.954e+06 – 8.316e+060
8.316e+06 – 8.678e+060
8.678e+06 – 9.039e+060
9.039e+06 – 9.401e+060
9.401e+06 – 9.762e+060
9.762e+06 – 1.012e+070
1.012e+07 – 1.049e+070
1.049e+07 – 1.085e+070
1.085e+07 – 1.121e+070
1.121e+07 – 1.157e+076
1.157e+07 – 1.193e+0735
1.193e+07 – 1.229e+0738
1.229e+07 – 1.265e+0711
1.265e+07 – 1.302e+0710
1.302e+07 – 1.338e+0723
1.338e+07 – 1.374e+0739
1.374e+07 – 1.41e+0721
1.41e+07 – 1.446e+0718
Fig 2.
State Expanded Medicaid · Roughly 73% of rows come from Medicaid-expansion states, a key segmentation variable.
Show data table
Top values for State Expanded Medicaid (2 unique shown, of 2 total).
valuecountshare
Y747572.6%
N282727.4%
Fig 3.
Average Call Center Wait Time (Minutes) · Median wait is 5 minutes but the tail extends to 72 — useful for spotting service-level outliers.
Show data table
Histogram bins for Average Call Center Wait Time (Minutes) (median: 5.0).
bincount
0 – 1.8942
1.8 – 3.6417
3.6 – 5.4329
5.4 – 7.2188
7.2 – 980
9 – 10.8116
10.8 – 12.6168
12.6 – 14.4103
14.4 – 16.2106
16.2 – 1843
18 – 19.886
19.8 – 21.675
21.6 – 23.473
23.4 – 25.250
25.2 – 2720
27 – 28.844
28.8 – 30.648
30.6 – 32.447
32.4 – 34.238
34.2 – 3615
36 – 37.828
37.8 – 39.628
39.6 – 41.46
41.4 – 43.215
43.2 – 4512
45 – 46.814
46.8 – 48.610
48.6 – 50.42
50.4 – 52.24
52.2 – 542
54 – 55.80
55.8 – 57.610
57.6 – 59.412
59.4 – 61.23
61.2 – 632
63 – 64.82
64.8 – 66.64
66.6 – 68.40
68.4 – 70.20
70.2 – 723
Fig 4.
Reporting Period · Confirms time coverage from Sept 2013 through 2025, roughly uniform across 102 monthly periods.
Show data table
Histogram bins for Reporting Period (median: 202107.5).
bincount
2.013e+05 – 2.013e+0551
2.013e+05 – 2.014e+050
2.014e+05 – 2.014e+050
2.014e+05 – 2.014e+050
2.014e+05 – 2.015e+050
2.015e+05 – 2.015e+050
2.015e+05 – 2.015e+050
2.015e+05 – 2.015e+050
2.015e+05 – 2.016e+050
2.016e+05 – 2.016e+050
2.016e+05 – 2.016e+050
2.016e+05 – 2.017e+050
2.017e+05 – 2.017e+050
2.017e+05 – 2.017e+05714
2.017e+05 – 2.018e+050
2.018e+05 – 2.018e+050
2.018e+05 – 2.018e+051224
2.018e+05 – 2.018e+050
2.018e+05 – 2.019e+050
2.019e+05 – 2.019e+05918
2.019e+05 – 2.019e+05306
2.019e+05 – 2.02e+050
2.02e+05 – 2.02e+050
2.02e+05 – 2.02e+051224
2.02e+05 – 2.021e+050
2.021e+05 – 2.021e+050
2.021e+05 – 2.021e+051224
2.021e+05 – 2.021e+050
2.021e+05 – 2.022e+050
2.022e+05 – 2.022e+05918
2.022e+05 – 2.022e+05306
2.022e+05 – 2.023e+050
2.023e+05 – 2.023e+050
2.023e+05 – 2.023e+051224
2.023e+05 – 2.024e+050
2.024e+05 – 2.024e+050
2.024e+05 – 2.024e+051224
2.024e+05 – 2.024e+050
2.024e+05 – 2.025e+050
2.025e+05 – 2.025e+05969
Fig 5.
State Abbreviation · Verifies balanced coverage — every one of the 51 jurisdictions contributes exactly 202 rows.
Show data table
Top values for State Abbreviation (20 unique shown, of 51 total).
valuecountshare
AK2022.0%
AL2022.0%
AR2022.0%
AZ2022.0%
CA2022.0%
CO2022.0%
CT2022.0%
DC2022.0%
DE2022.0%
FL2022.0%
GA2022.0%
HI2022.0%
IA2022.0%
ID2022.0%
IL2022.0%
IN2022.0%
KS2022.0%
KY2022.0%
LA2022.0%
MA2022.0%
Fig 6.
Per-column null rate across the corpus. Columns are ordered by input position.
Show data table
Per-column null rate across the corpus.
columnkindnull %
State Abbreviationcategorical0.0%
State Namecategorical0.0%
Reporting Periodnumeric0.0%
State Expanded Medicaidcategorical0.0%
Preliminary or Updatedcategorical0.0%
Final Reportcategorical0.0%
New Applications Submitted to Medicaid and CHIP Agenciesnumeric0.5%
New Applications Submitted to Medicaid and CHIP Agencies - footnotescategorical76.6%
Applications for Financial Assistance Submitted to the State Based Marketplacenumeric0.5%
Applications for Financial Assistance Submitted to the State Based Marketplace - footnotescategorical97.4%
Total Applications for Financial Assistance Submitted at State Levelnumeric0.5%
Total Applications for Financial Assistance Submitted at State Level - footnotescategorical73.4%
Individuals Determined Eligible for Medicaid at Applicationnumeric0.5%
Individuals Determined Eligible for Medicaid at Application - footnotescategorical72.8%
Individuals Determined Eligible for CHIP at Applicationnumeric0.5%
Individuals Determined Eligible for CHIP at Application - footnotescategorical91.0%
Total Medicaid and CHIP Determinationsnumeric0.5%
Total Medicaid and CHIP Determinations - footnotescategorical81.6%
Medicaid and CHIP Child Enrollmentnumeric0.5%
Medicaid and CHIP Child Enrollment - footnotescategorical92.2%
Total Medicaid and CHIP Enrollmentnumeric0.0%
Total Medicaid and CHIP Enrollment - footnotescategorical93.1%
Total Medicaid Enrollmentnumeric0.5%
Total Medicaid Enrollment - footnotescategorical93.2%
Total CHIP Enrollmentnumeric0.5%
Total CHIP Enrollment - footnotescategorical95.9%
Total Adult Medicaid Enrollmentnumeric84.7%
Total Adult Medicaid Enrollment - footnotescategorical98.8%
Total Medicaid and CHIP Determinations Processed in Less than 24 Hoursnumeric43.1%
Total Medicaid and CHIP Determinations Processed in Less than 24 Hours - footnotescategorical89.2%
Total Medicaid and CHIP Determinations Processed Between 24 Hours and 7 Daysnumeric43.1%
Total Medicaid and CHIP Determinations Processed Between 24 Hours and 7 Days - footnotescategorical89.2%
Total Medicaid and CHIP Determinations Processed Between 8 Days and 30 Daysnumeric43.1%
Total Medicaid and CHIP Determinations Processed Between 8 Days and 30 Days - footnotescategorical89.3%
Total Medicaid and CHIP Determinations Processed between 31 days and 45 daysnumeric43.1%
Total Medicaid and CHIP Determinations Processed between 31 days and 45 days - footnotescategorical89.3%
Total Medicaid and CHIP Determinations Processed in More than 45 Daysnumeric43.1%
Total Medicaid and CHIP Determinations Processed in More than 45 Days - footnotescategorical89.3%
Total Call Center Volume (Number of Calls)numeric69.5%
Total Call Center Volume (Number of Calls) - footnotescategorical71.0%
Average Call Center Wait Time (Minutes)numeric69.5%
Average Call Center Wait Time (Minutes) - footnotescategorical69.7%
Average Call Center Abandonment Ratenumeric69.5%
Average Call Center Abandonment Rate - footnotescategorical69.7%
Fig 7.
Pearson correlation across numeric columns (sampled, bounded).
Show data table
Pearson correlation across 12 numeric columns (values clipped to 2 decimals).
Reporting PeriodNew Applications Submitted to Medicaid and CHIP AgenciesApplications for Financial Assistance Submitted to the State Based MarketplaceTotal Applications for Financial Assistance Submitted at State LevelIndividuals Determined Eligible for Medicaid at ApplicationIndividuals Determined Eligible for CHIP at ApplicationTotal Medicaid and CHIP DeterminationsMedicaid and CHIP Child EnrollmentTotal Medicaid and CHIP EnrollmentTotal Medicaid EnrollmentTotal CHIP EnrollmentTotal Adult Medicaid Enrollment
Reporting Period+1.00-0.03+0.09+0.06+0.07+0.05+0.07+0.11+0.11+0.11+0.07+0.06
New Applications Submitted to Medicaid and CHIP Agencies-0.03+1.00-0.07+0.49+0.60+0.62+0.61+0.70+0.59+0.62+0.52+0.37
Applications for Financial Assistance Submitted to the State Based Marketplace+0.09-0.07+1.00+0.83+0.25+0.11+0.24+0.24+0.27+0.30+0.29+0.08
Total Applications for Financial Assistance Submitted at State Level+0.06+0.49+0.83+1.00+0.55+0.44+0.54+0.60+0.56+0.61+0.54+0.27
Individuals Determined Eligible for Medicaid at Application+0.07+0.60+0.25+0.55+1.00+0.89+1.00+0.65+0.61+0.64+0.63+0.38
Individuals Determined Eligible for CHIP at Application+0.05+0.62+0.11+0.44+0.89+1.00+0.91+0.55+0.46+0.48+0.50+0.27
Total Medicaid and CHIP Determinations+0.07+0.61+0.24+0.54+1.00+0.91+1.00+0.65+0.60+0.63+0.62+0.38
Medicaid and CHIP Child Enrollment+0.11+0.70+0.24+0.60+0.65+0.55+0.65+1.00+0.94+0.96+0.92+0.61
Total Medicaid and CHIP Enrollment+0.11+0.59+0.27+0.56+0.61+0.46+0.60+0.94+1.00+0.98+0.95+0.68
Total Medicaid Enrollment+0.11+0.62+0.30+0.61+0.64+0.48+0.63+0.96+0.98+1.00+0.96+0.69
Total CHIP Enrollment+0.07+0.52+0.29+0.54+0.63+0.50+0.62+0.92+0.95+0.96+1.00+0.68
Total Adult Medicaid Enrollment+0.06+0.37+0.08+0.27+0.38+0.27+0.38+0.61+0.68+0.69+0.68+1.00

State Abbreviation categorical feature

Two-letter US state abbreviation, with 51 distinct values (the 50 states plus DC) and zero nulls across 10,302 rows. The distribution is perfectly uniform: every state appears exactly 202 times, and entropy_ratio is 1.0, indicating this column was constructed as a balanced grid rather than sampled from real-world frequencies.

Treatment: One-hot or target-encode for modelling; useful as a join key on state-level lookups.

anthropic:claude-opus-4-7 · confidence high
Out[13]:

saturn.columns["State Abbreviation"].stats

statvalue
n10,302
nulls0 (0.0%)
unique51
top_value AK
top_rate 0.01961
cardinality 51
entropy 5.672
entropy_ratio 1
Fig 8.
Top values for State Abbreviation.
Show data table
Top values for State Abbreviation (20 unique shown, of 51 total).
valuecountshare
AK2022.0%
AL2022.0%
AR2022.0%
AZ2022.0%
CA2022.0%
CO2022.0%
CT2022.0%
DC2022.0%
DE2022.0%
FL2022.0%
GA2022.0%
HI2022.0%
IA2022.0%
ID2022.0%
IL2022.0%
IN2022.0%
KS2022.0%
KY2022.0%
LA2022.0%
MA2022.0%

State Name categorical feature

This column holds US state names, with 51 distinct values (the 50 states plus District of Columbia) and zero nulls across 10,302 rows. The distribution is perfectly uniform: every state appears exactly 202 times, yielding a top_rate of 0.0196 and an entropy_ratio of 1.0. That balance suggests the dataset was constructed as a state-by-something grid rather than sampled from real-world frequencies.

Treatment: use as a categorical grouping key; one-hot or target-encode for modelling.

anthropic:claude-opus-4-7 · confidence high
Out[16]:

saturn.columns["State Name"].stats

statvalue
n10,302
nulls0 (0.0%)
unique51
top_value Alaska
top_rate 0.01961
cardinality 51
entropy 5.672
entropy_ratio 1
Fig 9.
Top values for State Name.
Show data table
Top values for State Name (20 unique shown, of 51 total).
valuecountshare
Alaska2022.0%
Alabama2022.0%
Arkansas2022.0%
Arizona2022.0%
California2022.0%
Colorado2022.0%
Connecticut2022.0%
District of Columbia2022.0%
Delaware2022.0%
Florida2022.0%
Georgia2022.0%
Hawaii2022.0%
Iowa2022.0%
Idaho2022.0%
Illinois2022.0%
Indiana2022.0%
Kansas2022.0%
Kentucky2022.0%
Louisiana2022.0%
Massachusetts2022.0%

Reporting Period numeric timestamp

This column encodes a year-month reporting period as a packed integer (YYYYMM), spanning 201309 to 202510 across 102 distinct values with no nulls. Values cluster between Q1 201906 and Q3 202309, consistent with monthly reporting cadence rather than a true continuous numeric. The 'std' of 249.6 and IQR of 403 are artifacts of the YYYYMM encoding (gaps between ..12 and ..01) and should not be read as a real numeric spread.

Treatment: Parse YYYYMM into a proper month/date type before any time-series analysis.

anthropic:claude-opus-4-7 · confidence high
Out[19]:

saturn.columns["Reporting Period"].stats

statvalue
n10,302
nulls0 (0.0%)
unique102
min 201,309
max 202,510
mean 2.021e+05
median 2.021e+05
std 249.6
q1 201,906
q3 202,309
iqr 403
skew -0.1301
kurtosis -0.8155
n_outliers 0
outlier_rate 0
zero_rate 0
Fig 10.
Distribution of Reporting Period. Vertical dash marks the median.
Show data table
Histogram bins for Reporting Period (median: 202107.5).
bincount
2.013e+05 – 2.013e+0551
2.013e+05 – 2.014e+050
2.014e+05 – 2.014e+050
2.014e+05 – 2.014e+050
2.014e+05 – 2.015e+050
2.015e+05 – 2.015e+050
2.015e+05 – 2.015e+050
2.015e+05 – 2.015e+050
2.015e+05 – 2.016e+050
2.016e+05 – 2.016e+050
2.016e+05 – 2.016e+050
2.016e+05 – 2.017e+050
2.017e+05 – 2.017e+050
2.017e+05 – 2.017e+05714
2.017e+05 – 2.018e+050
2.018e+05 – 2.018e+050
2.018e+05 – 2.018e+051224
2.018e+05 – 2.018e+050
2.018e+05 – 2.019e+050
2.019e+05 – 2.019e+05918
2.019e+05 – 2.019e+05306
2.019e+05 – 2.02e+050
2.02e+05 – 2.02e+050
2.02e+05 – 2.02e+051224
2.02e+05 – 2.021e+050
2.021e+05 – 2.021e+050
2.021e+05 – 2.021e+051224
2.021e+05 – 2.021e+050
2.021e+05 – 2.022e+050
2.022e+05 – 2.022e+05918
2.022e+05 – 2.022e+05306
2.022e+05 – 2.023e+050
2.023e+05 – 2.023e+050
2.023e+05 – 2.023e+051224
2.023e+05 – 2.024e+050
2.024e+05 – 2.024e+050
2.024e+05 – 2.024e+051224
2.024e+05 – 2.024e+050
2.024e+05 – 2.025e+050
2.025e+05 – 2.025e+05969

State Expanded Medicaid categorical feature

Binary Y/N flag indicating whether the state expanded Medicaid, fully populated across 10302 rows. The split is imbalanced: 72.6% 'Y' versus 2827 'N' records, but entropy ratio of 0.85 shows both classes are well represented. No nulls or unexpected categories.

Treatment: Encode as a 0/1 indicator for modelling.

anthropic:claude-opus-4-7 · confidence high
Out[22]:

saturn.columns["State Expanded Medicaid"].stats

statvalue
n10,302
nulls0 (0.0%)
unique2
top_value Y
top_rate 0.7256
cardinality 2
entropy 0.8477
entropy_ratio 0.8477
Fig 11.
Top values for State Expanded Medicaid.
Show data table
Top values for State Expanded Medicaid (2 unique shown, of 2 total).
valuecountshare
Y747572.6%
N282727.4%

Preliminary or Updated categorical feature

Binary flag distinguishing 'Preliminary' (P) from 'Updated' (U) records, with exactly 5151 of each across 10302 rows. The perfectly even 50/50 split (entropy_ratio 1.0) is unusual for a real-world status field and suggests the dataset was constructed by pairing each preliminary record with its updated counterpart. No nulls.

Treatment: Encode as a binary indicator; consider whether to filter to only 'U' rows to avoid double-counting paired records.

anthropic:claude-opus-4-7 · confidence high
Out[25]:

saturn.columns["Preliminary or Updated"].stats

statvalue
n10,302
nulls0 (0.0%)
unique2
top_value U
top_rate 0.5
cardinality 2
entropy 1
entropy_ratio 1
Fig 12.
Top values for Preliminary or Updated.
Show data table
Top values for Preliminary or Updated (2 unique shown, of 2 total).
valuecountshare
U515150.0%
P515150.0%

Final Report categorical label

Binary Y/N flag indicating whether a final report exists, with no nulls across 10302 rows. The split is exactly balanced at 5151 each, yielding maximum entropy (1.0) — this perfect 50/50 is unusual in real-world reporting data and suggests deliberate stratified sampling or class balancing.

Treatment: Encode as 0/1 and use directly as a binary target or stratification key.

anthropic:claude-opus-4-7 · confidence high
Out[28]:

saturn.columns["Final Report"].stats

statvalue
n10,302
nulls0 (0.0%)
unique2
top_value Y
top_rate 0.5
cardinality 2
entropy 1
entropy_ratio 1
Fig 13.
Top values for Final Report.
Show data table
Top values for Final Report (2 unique shown, of 2 total).
valuecountshare
Y515150.0%
N515150.0%

New Applications Submitted to Medicaid and CHIP Agencies numeric feature

This column counts new Medicaid/CHIP applications submitted to state agencies, with 10,302 rows and 5,378 unique values. The distribution is heavily right-skewed (skew 4.08, kurtosis 23.5): the median is 14,644 but the mean is 29,897 and the max reaches 733,651, with 11.5% of values flagged as outliers and 4% exact zeros. The 0.5% null rate is negligible, but the spread between Q1 (4,511) and Q3 (29,973) confirms wide cross-state or cross-period variation.

Treatment: Apply a log1p transform before modelling to tame the skew and outliers.

anthropic:claude-opus-4-7 · confidence high
Out[31]:

saturn.columns["New Applications Submitted to Medicaid and CHIP Agencies"].stats

statvalue
n10,302
nulls51 (0.5%)
unique5,378
min 0
max 733,651
mean 2.99e+04
median 14,644
std 4.911e+04
q1 4,511
q3 29,973
iqr 25,462
skew 4.077
kurtosis 23.55
n_outliers 1,175
outlier_rate 0.1146
zero_rate 0.04019
alert: high_skewskew=+4.08
alert: outliers11.5% rows beyond 1.5 IQR
Fig 14.
Distribution of New Applications Submitted to Medicaid and CHIP Agencies. Vertical dash marks the median.
Show data table
Histogram bins for New Applications Submitted to Medicaid and CHIP Agencies (median: 14644.0).
bincount
0 – 1.834e+045923
1.834e+04 – 3.668e+042165
3.668e+04 – 5.502e+04640
5.502e+04 – 7.337e+04475
7.337e+04 – 9.171e+04348
9.171e+04 – 1.1e+05193
1.1e+05 – 1.284e+05106
1.284e+05 – 1.467e+0596
1.467e+05 – 1.651e+0550
1.651e+05 – 1.834e+0534
1.834e+05 – 2.018e+054
2.018e+05 – 2.201e+056
2.201e+05 – 2.384e+0520
2.384e+05 – 2.568e+0529
2.568e+05 – 2.751e+0550
2.751e+05 – 2.935e+0534
2.935e+05 – 3.118e+0518
3.118e+05 – 3.301e+0530
3.301e+05 – 3.485e+0516
3.485e+05 – 3.668e+056
3.668e+05 – 3.852e+052
3.852e+05 – 4.035e+054
4.035e+05 – 4.218e+050
4.218e+05 – 4.402e+050
4.402e+05 – 4.585e+050
4.585e+05 – 4.769e+050
4.769e+05 – 4.952e+050
4.952e+05 – 5.136e+050
5.136e+05 – 5.319e+050
5.319e+05 – 5.502e+050
5.502e+05 – 5.686e+050
5.686e+05 – 5.869e+050
5.869e+05 – 6.053e+050
6.053e+05 – 6.236e+050
6.236e+05 – 6.419e+050
6.419e+05 – 6.603e+050
6.603e+05 – 6.786e+050
6.786e+05 – 6.97e+050
6.97e+05 – 7.153e+050
7.153e+05 – 7.337e+052

New Applications Submitted to Medicaid and CHIP Agencies - footnotes categorical metadata

This is a free-form footnotes/caveats field annotating Medicaid/CHIP application counts, explaining what each state's submission does or does not include (e.g., renewals, administrative transfers, other programs). It is sparsely populated with a 0.766 null rate, and across 10302 rows only 18 distinct notes appear; the most common, 'Includes Renewals and/or Redeterminations', covers 27% of non-null entries. High entropy ratio (0.84) indicates the remaining notes are spread fairly evenly, suggesting heterogeneous reporting practices across states.

Treatment: Treat as data-quality caveat flags; split on ';' and one-hot encode the component clauses rather than modelling the raw string.

anthropic:claude-opus-4-7 · confidence high
Out[34]:

saturn.columns["New Applications Submitted to Medicaid and CHIP Agencies - footnotes"].stats

statvalue
n10,302
nulls7,891 (76.6%)
unique18
top_value Includes Renewals and/or Redeterminations
top_rate 0.27
cardinality 18
entropy 3.502
entropy_ratio 0.8399
alert: null_rate76.6% null
Fig 15.
Top values for New Applications Submitted to Medicaid and CHIP Agencies - footnotes.
Show data table
Top values for New Applications Submitted to Medicaid and CHIP Agencies - footnotes (18 unique shown, of 18 total).
valuecountshare
Includes Renewals and/or Redeterminations6516.3%
Does Not Include All Non-MAGI Applications Submitted to Medicaid and CHIP Agencies2452.4%
Includes Renewals and/or Redeterminations; Includes Administrative Data Transfers1971.9%
Includes Applications for Other Programs (e.g, QHPs, SNAP, etc.)1931.9%
Includes Applications for Other Programs (e.g, QHPs, SNAP, etc.); Includes Accounts Transferred from FFM1921.9%
Does Not Include All Applications for Limited-Benefit Programs Submitted to Medicaid and CHIP Agencies1901.8%
Includes Administrative Data Transfers; Does Not Include All Medicaid Applications Submitted to Medicaid and CHIP Agencies1611.6%
Does Not Include All Medicaid Applications Submitted to Medicaid and CHIP Agencies1121.1%
Includes SBM data991.0%
Includes Accounts Transferred from FFM; Does Not Include All Applications for Limited-Benefit Programs Submitted to Medicaid and CHIP Agencies970.9%
Includes Accounts Transferred from FFM; Does Not Include All Medicaid Applications Submitted to Medicaid and CHIP Agencies; Does Not Include All CHIP Applications Submitted to Medicaid and CHIP Agencies; Includes Administrative Data Transfers910.9%
Includes Administrative Data Transfers; Includes Accounts Transferred from FFM; Does Not Include All Medicaid Applications Submitted to Medicaid and CHIP Agencies; Does Not Include All CHIP Applications Submitted to Medicaid and CHIP Agencies670.7%
Does Not Include All Medicaid Applications Submitted to Medicaid and CHIP Agencies; Includes Administrative Data Transfers340.3%
Includes Duplicates260.3%
Count is of individuals as opposed to applications210.2%
Does Not Include All Medicaid Applications Submitted to Medicaid and CHIP Agencies; Includes Duplicates190.2%
Includes Accounts Transferred from FFM80.1%
Includes Administrative Data Transfers; Includes Accounts Transferred from FFM80.1%

Applications for Financial Assistance Submitted to the State Based Marketplace numeric feature

This column appears to be a count of applications for financial assistance submitted to a state-based health insurance marketplace, recorded across 10,302 rows. The distribution is extremely sparse and skewed: 77% of values are zero, the median and IQR are both 0, yet the maximum reaches 762,069 with a mean of 11,228 and std of 55,394. Skew (8.41) and kurtosis (82.6) are severe, and 23% of rows are flagged as outliers, indicating a small set of very large submission counts dominate the signal.

Treatment: Consider a zero-vs-nonzero indicator plus log1p transform before modelling, given 77% zeros and heavy right tail.

anthropic:claude-opus-4-7 · confidence high
Out[37]:

saturn.columns["Applications for Financial Assistance Submitted to the State Based Marketplace"].stats

statvalue
n10,302
nulls51 (0.5%)
unique1,373
min 0
max 762,069
mean 1.123e+04
median 0
std 5.539e+04
q1 0
q3 0
iqr 0
skew 8.415
kurtosis 82.64
n_outliers 2,357
outlier_rate 0.2299
zero_rate 0.7701
alert: high_skewskew=+8.41
alert: outliers23.0% rows beyond 1.5 IQR
Fig 16.
Distribution of Applications for Financial Assistance Submitted to the State Based Marketplace. Vertical dash marks the median.
Show data table
Histogram bins for Applications for Financial Assistance Submitted to the State Based Marketplace (median: 0.0).
bincount
0 – 1.905e+049479
1.905e+04 – 3.81e+04115
3.81e+04 – 5.716e+0457
5.716e+04 – 7.621e+04139
7.621e+04 – 9.526e+04141
9.526e+04 – 1.143e+0573
1.143e+05 – 1.334e+0577
1.334e+05 – 1.524e+0525
1.524e+05 – 1.715e+0514
1.715e+05 – 1.905e+0512
1.905e+05 – 2.096e+054
2.096e+05 – 2.286e+052
2.286e+05 – 2.477e+052
2.477e+05 – 2.667e+050
2.667e+05 – 2.858e+056
2.858e+05 – 3.048e+056
3.048e+05 – 3.239e+058
3.239e+05 – 3.429e+054
3.429e+05 – 3.62e+058
3.62e+05 – 3.81e+0510
3.81e+05 – 4.001e+054
4.001e+05 – 4.191e+054
4.191e+05 – 4.382e+054
4.382e+05 – 4.572e+050
4.572e+05 – 4.763e+050
4.763e+05 – 4.953e+050
4.953e+05 – 5.144e+0510
5.144e+05 – 5.334e+056
5.334e+05 – 5.525e+057
5.525e+05 – 5.716e+054
5.716e+05 – 5.906e+052
5.906e+05 – 6.097e+056
6.097e+05 – 6.287e+054
6.287e+05 – 6.478e+054
6.478e+05 – 6.668e+054
6.668e+05 – 6.859e+050
6.859e+05 – 7.049e+052
7.049e+05 – 7.24e+052
7.24e+05 – 7.43e+054
7.43e+05 – 7.621e+052

Applications for Financial Assistance Submitted to the State Based Marketplace - footnotes categorical metadata

This is a footnotes/qualifier column attached to SBM financial-assistance application counts, carrying free-form caveats about what the underlying figures include. It's almost entirely empty (97.43% null) with only 3 distinct annotations across 10,302 rows; when present, 83% read 'Includes Renewals and/or Redeterminations'. The notes warn that some counts mix in renewals, duplicates, or exclude certain Medicaid applications — material caveats for anyone aggregating the parent metric.

Treatment: Keep as a qualifier flag joined to the parent metric; do not model directly.

anthropic:claude-opus-4-7 · confidence high
Out[40]:

saturn.columns["Applications for Financial Assistance Submitted to the State Based Marketplace - footnotes"].stats

statvalue
n10,302
nulls10,037 (97.4%)
unique3
top_value Includes Renewals and/or Redeterminations
top_rate 0.8302
cardinality 3
entropy 0.8241
entropy_ratio 0.52
alert: null_rate97.4% null
Fig 17.
Top values for Applications for Financial Assistance Submitted to the State Based Marketplace - footnotes.
Show data table
Top values for Applications for Financial Assistance Submitted to the State Based Marketplace - footnotes (3 unique shown, of 3 total).
valuecountshare
Includes Renewals and/or Redeterminations2202.1%
Includes Duplicates; Includes Renewals and/or Redeterminations260.3%
Does Not Include All Medicaid Applications Received By the SBM; Includes Duplicates; Includes Renewals and/or Redeterminations190.2%

Total Applications for Financial Assistance Submitted at State Level numeric feature

Numeric count of financial-assistance applications aggregated to the state level, with 10,302 records and 5,591 unique values. The distribution is severely right-skewed (skew 4.40, kurtosis 26.4): the median is 18,257 but the mean is 41,125 and the max reaches 762,069, with 12.2% of rows flagged as outliers. About 2% of values are zero and 0.5% are null, so a long tail of high-volume states dominates the spread (std 72,081 vs IQR 33,613).

Treatment: Apply a log1p transform before modelling to tame the heavy right tail.

anthropic:claude-opus-4-7 · confidence high
Out[43]:

saturn.columns["Total Applications for Financial Assistance Submitted at State Level"].stats

statvalue
n10,302
nulls51 (0.5%)
unique5,591
min 0
max 762,069
mean 4.113e+04
median 18,257
std 7.208e+04
q1 6,494
q3 40,107
iqr 33,613
skew 4.396
kurtosis 26.4
n_outliers 1,249
outlier_rate 0.1218
zero_rate 0.02039
alert: high_skewskew=+4.40
alert: outliers12.2% rows beyond 1.5 IQR
Fig 18.
Distribution of Total Applications for Financial Assistance Submitted at State Level. Vertical dash marks the median.
Show data table
Histogram bins for Total Applications for Financial Assistance Submitted at State Level (median: 18257.0).
bincount
0 – 1.905e+045336
1.905e+04 – 3.81e+042304
3.81e+04 – 5.716e+04717
5.716e+04 – 7.621e+04403
7.621e+04 – 9.526e+04332
9.526e+04 – 1.143e+05257
1.143e+05 – 1.334e+05204
1.334e+05 – 1.524e+05139
1.524e+05 – 1.715e+05112
1.715e+05 – 1.905e+0554
1.905e+05 – 2.096e+0537
2.096e+05 – 2.286e+0528
2.286e+05 – 2.477e+0520
2.477e+05 – 2.667e+0551
2.667e+05 – 2.858e+0560
2.858e+05 – 3.048e+0530
3.048e+05 – 3.239e+0536
3.239e+05 – 3.429e+0528
3.429e+05 – 3.62e+0512
3.62e+05 – 3.81e+0516
3.81e+05 – 4.001e+058
4.001e+05 – 4.191e+054
4.191e+05 – 4.382e+054
4.382e+05 – 4.572e+050
4.572e+05 – 4.763e+050
4.763e+05 – 4.953e+050
4.953e+05 – 5.144e+0510
5.144e+05 – 5.334e+056
5.334e+05 – 5.525e+057
5.525e+05 – 5.716e+054
5.716e+05 – 5.906e+052
5.906e+05 – 6.097e+056
6.097e+05 – 6.287e+054
6.287e+05 – 6.478e+054
6.478e+05 – 6.668e+054
6.668e+05 – 6.859e+050
6.859e+05 – 7.049e+052
7.049e+05 – 7.24e+052
7.24e+05 – 7.43e+056
7.43e+05 – 7.621e+052

Total Applications for Financial Assistance Submitted at State Level - footnotes categorical metadata

Free-form footnote annotations qualifying state-level application counts, drawn from a small controlled vocabulary of 17 caveat strings (often semicolon-concatenated combinations). 73.44% of rows are null, and among the 2,737 populated rows the single value "Includes Renewals and/or Redeterminations" accounts for 42.5%, signalling that most states append the same scope caveat. Entropy ratio of 0.73 confirms the long tail is thin — these are methodology flags, not data.

Treatment: Keep as a qualifier when interpreting the parent metric; split on ';' into boolean flags rather than modelling as a feature.

anthropic:claude-opus-4-7 · confidence high
Out[46]:

saturn.columns["Total Applications for Financial Assistance Submitted at State Level - footnotes"].stats

statvalue
n10,302
nulls7,566 (73.4%)
unique17
top_value Includes Renewals and/or Redeterminations
top_rate 0.4251
cardinality 17
entropy 2.977
entropy_ratio 0.7283
alert: null_rate73.4% null
Fig 19.
Top values for Total Applications for Financial Assistance Submitted at State Level - footnotes.
Show data table
Top values for Total Applications for Financial Assistance Submitted at State Level - footnotes (17 unique shown, of 17 total).
valuecountshare
Includes Renewals and/or Redeterminations116311.3%
Does Not Include All Non-MAGI Applications2452.4%
Includes Renewals and/or Redeterminations; Includes Administrative Data Transfers1971.9%
Includes Applications for Other Programs (e.g, QHPs, SNAP, etc.)1931.9%
Includes Applications for Other Programs (e.g, QHPs, SNAP, etc.); Includes Accounts Transferred from FFM1921.9%
Does Not Include All Applications for Limited-Benefit Programs1901.8%
Includes Administrative Data Transfers; Does Not Include All Medicaid Applications; Does Not Include All CHIP Applications1611.6%
Includes Accounts Transferred from FFM; Does Not Include All Applications for Limited-Benefit Programs970.9%
Includes Accounts Transferred from FFM; Does Not Include All Medicaid Applications; Does Not Include All CHIP Applications; Includes Administrative Data Transfers910.9%
Includes Administrative Data Transfers; Includes Accounts Transferred from FFM; Does Not Include All Medicaid Applications; Does Not Include All CHIP Applications670.7%
Does Not Include All Medicaid Applications; Does Not Include All CHIP Applications; Includes Administrative Data Transfers340.3%
Includes Duplicates260.3%
Does Not Include All Medicaid Applications240.2%
Count is of individuals as opposed to applications210.2%
Does Not Include All Medicaid Applications; Includes Duplicates190.2%
Includes Accounts Transferred from FFM80.1%
Includes Administrative Data Transfers; Includes Accounts Transferred from FFM80.1%

Individuals Determined Eligible for Medicaid at Application numeric feature

This column counts individuals determined eligible for Medicaid at application, likely aggregated per reporting unit (state/month). The distribution is heavily right-skewed (skew 2.93, kurtosis 11.06) with a median of 11,008 but a max of 435,560, and roughly 10.3% of rows flagged as outliers. About 5.9% of values are zero and 0.5% are null, so a small but non-trivial share of records report no eligibility activity.

Treatment: Log1p-transform before modelling to tame skew and outliers.

anthropic:claude-opus-4-7 · confidence high
Out[49]:

saturn.columns["Individuals Determined Eligible for Medicaid at Application"].stats

statvalue
n10,302
nulls51 (0.5%)
unique5,568
min 0
max 435,560
mean 2.744e+04
median 11,008
std 4.159e+04
q1 4,344
q3 32,631
iqr 28,287
skew 2.932
kurtosis 11.06
n_outliers 1,051
outlier_rate 0.1025
zero_rate 0.05863
alert: high_skewskew=+2.93
alert: outliers10.3% rows beyond 1.5 IQR
Fig 20.
Distribution of Individuals Determined Eligible for Medicaid at Application. Vertical dash marks the median.
Show data table
Histogram bins for Individuals Determined Eligible for Medicaid at Application (median: 11008.0).
bincount
0 – 1.089e+045085
1.089e+04 – 2.178e+041748
2.178e+04 – 3.267e+04862
3.267e+04 – 4.356e+04850
4.356e+04 – 5.444e+04382
5.444e+04 – 6.533e+04138
6.533e+04 – 7.622e+04144
7.622e+04 – 8.711e+04146
8.711e+04 – 9.8e+04136
9.8e+04 – 1.089e+05135
1.089e+05 – 1.198e+05107
1.198e+05 – 1.307e+0572
1.307e+05 – 1.416e+0554
1.416e+05 – 1.524e+0577
1.524e+05 – 1.633e+0562
1.633e+05 – 1.742e+0574
1.742e+05 – 1.851e+0549
1.851e+05 – 1.96e+0540
1.96e+05 – 2.069e+0516
2.069e+05 – 2.178e+0527
2.178e+05 – 2.287e+057
2.287e+05 – 2.396e+0511
2.396e+05 – 2.504e+056
2.504e+05 – 2.613e+056
2.613e+05 – 2.722e+053
2.722e+05 – 2.831e+054
2.831e+05 – 2.94e+050
2.94e+05 – 3.049e+052
3.049e+05 – 3.158e+050
3.158e+05 – 3.267e+050
3.267e+05 – 3.376e+051
3.376e+05 – 3.484e+053
3.484e+05 – 3.593e+050
3.593e+05 – 3.702e+050
3.702e+05 – 3.811e+050
3.811e+05 – 3.92e+050
3.92e+05 – 4.029e+050
4.029e+05 – 4.138e+050
4.138e+05 – 4.247e+052
4.247e+05 – 4.356e+052

Individuals Determined Eligible for Medicaid at Application - footnotes categorical metadata

This is a footnotes/caveats column annotating Medicaid eligibility determination counts, with 18 distinct qualifier strings (often semicolon-concatenated combinations like 'Includes Renewals and/or Redeterminations; Includes CHIP'). 72.8% of rows are null, meaning most records carry no caveat, but where present the top note 'Includes Renewals and/or Redeterminations' covers 28.1% of non-null entries. High entropy ratio (0.82) across only 18 values indicates the caveats are spread fairly evenly, signalling that the underlying count column is not measured consistently across reporters.

Treatment: Split on '; ' into boolean caveat flags and use them to qualify or filter the associated count column.

anthropic:claude-opus-4-7 · confidence high
Out[52]:

saturn.columns["Individuals Determined Eligible for Medicaid at Application - footnotes"].stats

statvalue
n10,302
nulls7,500 (72.8%)
unique18
top_value Includes Renewals and/or Redeterminations
top_rate 0.2812
cardinality 18
entropy 3.41
entropy_ratio 0.8178
alert: null_rate72.8% null
Fig 21.
Top values for Individuals Determined Eligible for Medicaid at Application - footnotes.
Show data table
Top values for Individuals Determined Eligible for Medicaid at Application - footnotes (18 unique shown, of 18 total).
valuecountshare
Includes Renewals and/or Redeterminations7887.6%
Does Not Include All Medicaid Determinations Made At Application3863.7%
Includes Renewals and/or Redeterminations; Includes CHIP2602.5%
Includes Renewals and/or Redeterminations; Does Not Include All Medicaid Determinations Made At Application2162.1%
Count is of Households, Not Individuals2032.0%
Includes CHIP1921.9%
Includes Renewals and/or Redeterminations; Does Not Include All MAGI Determinations Made At Application1801.7%
Includes CHIP; Includes Renewals and/or Redeterminations1021.0%
Does Not Include All Determinations for Limited-Benefit Programs Made At Application970.9%
Count is of Households, Not Individuals; Includes Renewals and/or Redeterminations830.8%
Includes CHIP; Count is of Households, Not Individuals; Includes Renewals and/or Redeterminations770.7%
Count is of Households, Not Individuals; Does Not Include All Non-MAGI Determinations Made At Application740.7%
Count is of Households, Not Individuals; Includes Renewals and/or Redeterminations; Includes CHIP670.7%
Includes Renewals and/or Redeterminations; Count is of Households, Not Individuals250.2%
Includes Conditional Eligibility Determinations; Includes Presumptive Eligibility Determinations; Does Not Include All CHIP Determinations Made At Application210.2%
Includes Final Eligibility Determinations Made by FFM; Includes Conditional Eligibility Determinations; Includes Presumptive Eligibility Determinations150.1%
Does Not Include All MAGI Determinations Made At Application; Includes Renewals and/or Redeterminations90.1%
Does Not Include All Non-MAGI Determinations Made At Application70.1%

Individuals Determined Eligible for CHIP at Application numeric feature

Monthly or periodic count of individuals determined eligible for CHIP (Children's Health Insurance Program) at the application stage, reported per submitting entity. The distribution is heavily right-skewed (skew 3.31, kurtosis 13.83) with a median of 679 but mean 2374.6 and max 44,881, and 11.4% of rows are exact zeros. Roughly 12.6% of values (1,295 rows) flag as outliers, suggesting a long tail of very large reporting units alongside many small or inactive ones.

Treatment: Apply a log1p transform before modelling and consider modelling the zero-mass separately.

anthropic:claude-opus-4-7 · confidence high
Out[55]:

saturn.columns["Individuals Determined Eligible for CHIP at Application"].stats

statvalue
n10,302
nulls51 (0.5%)
unique3,064
min 0
max 44,881
mean 2375
median 679
std 4296
q1 142
q3 2,421
iqr 2,279
skew 3.31
kurtosis 13.83
n_outliers 1,295
outlier_rate 0.1263
zero_rate 0.1137
alert: high_skewskew=+3.31
alert: outliers12.6% rows beyond 1.5 IQR
Fig 22.
Distribution of Individuals Determined Eligible for CHIP at Application. Vertical dash marks the median.
Show data table
Histogram bins for Individuals Determined Eligible for CHIP at Application (median: 679.0).
bincount
0 – 11226282
1122 – 22441289
2244 – 3366774
3366 – 4488376
4488 – 5610207
5610 – 6732198
6732 – 7854187
7854 – 8976159
8976 – 1.01e+04171
1.01e+04 – 1.122e+0483
1.122e+04 – 1.234e+0494
1.234e+04 – 1.346e+0480
1.346e+04 – 1.459e+0463
1.459e+04 – 1.571e+0442
1.571e+04 – 1.683e+0425
1.683e+04 – 1.795e+0436
1.795e+04 – 1.907e+0422
1.907e+04 – 2.02e+0420
2.02e+04 – 2.132e+0426
2.132e+04 – 2.244e+0433
2.244e+04 – 2.356e+0419
2.356e+04 – 2.468e+0419
2.468e+04 – 2.581e+048
2.581e+04 – 2.693e+048
2.693e+04 – 2.805e+042
2.805e+04 – 2.917e+043
2.917e+04 – 3.029e+043
3.029e+04 – 3.142e+042
3.142e+04 – 3.254e+048
3.254e+04 – 3.366e+046
3.366e+04 – 3.478e+044
3.478e+04 – 3.59e+040
3.59e+04 – 3.703e+040
3.703e+04 – 3.815e+040
3.815e+04 – 3.927e+040
3.927e+04 – 4.039e+040
4.039e+04 – 4.151e+040
4.151e+04 – 4.264e+040
4.264e+04 – 4.376e+040
4.376e+04 – 4.488e+042

Individuals Determined Eligible for CHIP at Application - footnotes categorical metadata

Free-text footnote annotations qualifying CHIP eligibility counts at application, populated only when a caveat applies. The column is 90.99% null with just 7 distinct notes across 10,302 rows; among the 928 populated cells, 'Includes Renewals and/or Redeterminations' dominates at 46.55%. The notes flag methodology caveats (renewals mixed in, household vs individual counts, incomplete determinations) that materially affect comparability of the numeric column they annotate.

Treatment: Keep as a caveat flag; binarize or one-hot the handful of footnote categories before comparing the associated count column across rows.

anthropic:claude-opus-4-7 · confidence high
Out[58]:

saturn.columns["Individuals Determined Eligible for CHIP at Application - footnotes"].stats

statvalue
n10,302
nulls9,374 (91.0%)
unique7
top_value Includes Renewals and/or Redeterminations
top_rate 0.4655
cardinality 7
entropy 1.939
entropy_ratio 0.6905
alert: null_rate91.0% null
Fig 23.
Top values for Individuals Determined Eligible for CHIP at Application - footnotes.
Show data table
Top values for Individuals Determined Eligible for CHIP at Application - footnotes (7 unique shown, of 7 total).
valuecountshare
Includes Renewals and/or Redeterminations4324.2%
Includes Renewals and/or Redeterminations; Does Not Include All CHIP Determinations Made At Application2162.1%
Does Not Include All CHIP Determinations Made At Application2001.9%
Count is of Households, Not Individuals370.4%
Includes Conditional Eligibility Determinations; Includes Presumptive Eligibility Determinations; Does Not Include All CHIP Determinations Made At Application210.2%
Includes Final Eligibility Determinations Made by FFM; Includes Conditional Eligibility Determinations; Includes Presumptive Eligibility Determinations150.1%
Count is of Households, Not Individuals; Includes Renewals and/or Redeterminations70.1%

Total Medicaid and CHIP Determinations numeric feature

This is a numeric feature recording total Medicaid and CHIP eligibility determinations, likely aggregated per state-month or similar reporting unit. The distribution is heavily right-skewed (skew 2.92, kurtosis 10.77) with a median of 11,977 against a mean of 29,811 and a max of 467,780, and roughly 10.5% of rows flagged as outliers. About 5.5% of values are zero and 0.5% are null, suggesting some non-reporting or inactive periods.

Treatment: Log-transform (log1p) before modelling to tame the heavy right tail.

anthropic:claude-opus-4-7 · confidence high
Out[61]:

saturn.columns["Total Medicaid and CHIP Determinations"].stats

statvalue
n10,302
nulls51 (0.5%)
unique5,587
min 0
max 467,780
mean 2.981e+04
median 11,977
std 4.535e+04
q1 4,739
q3 35,059
iqr 30,320
skew 2.922
kurtosis 10.77
n_outliers 1,076
outlier_rate 0.105
zero_rate 0.05492
alert: high_skewskew=+2.92
alert: outliers10.5% rows beyond 1.5 IQR
Fig 24.
Distribution of Total Medicaid and CHIP Determinations. Vertical dash marks the median.
Show data table
Histogram bins for Total Medicaid and CHIP Determinations (median: 11977.0).
bincount
0 – 1.169e+045036
1.169e+04 – 2.339e+041757
2.339e+04 – 3.508e+04896
3.508e+04 – 4.678e+04844
4.678e+04 – 5.847e+04386
5.847e+04 – 7.017e+04137
7.017e+04 – 8.186e+04137
8.186e+04 – 9.356e+04131
9.356e+04 – 1.053e+05147
1.053e+05 – 1.169e+05149
1.169e+05 – 1.286e+05120
1.286e+05 – 1.403e+0560
1.403e+05 – 1.52e+0552
1.52e+05 – 1.637e+0565
1.637e+05 – 1.754e+0564
1.754e+05 – 1.871e+0564
1.871e+05 – 1.988e+0551
1.988e+05 – 2.105e+0543
2.105e+05 – 2.222e+0526
2.222e+05 – 2.339e+0533
2.339e+05 – 2.456e+0511
2.456e+05 – 2.573e+059
2.573e+05 – 2.69e+0511
2.69e+05 – 2.807e+054
2.807e+05 – 2.924e+052
2.924e+05 – 3.041e+056
3.041e+05 – 3.158e+050
3.158e+05 – 3.274e+052
3.274e+05 – 3.391e+050
3.391e+05 – 3.508e+050
3.508e+05 – 3.625e+054
3.625e+05 – 3.742e+050
3.742e+05 – 3.859e+050
3.859e+05 – 3.976e+050
3.976e+05 – 4.093e+050
4.093e+05 – 4.21e+050
4.21e+05 – 4.327e+050
4.327e+05 – 4.444e+051
4.444e+05 – 4.561e+051
4.561e+05 – 4.678e+052

Total Medicaid and CHIP Determinations - footnotes categorical metadata

Free-form footnote annotations attached to state Medicaid/CHIP determination counts, describing caveats like inclusion of renewals or exclusions of certain determination types. The column is 81.58% null, leaving only ~1,900 populated rows across 12 distinct caveat strings, with 'Includes Renewals and/or Redeterminations' alone covering 47.1% of non-null entries. Several values are semicolon-concatenated combinations of base caveats, so the 12-way cardinality understates the underlying flag set.

Treatment: Split on '; ' into boolean caveat flags rather than treating as a single categorical.

anthropic:claude-opus-4-7 · confidence high
Out[64]:

saturn.columns["Total Medicaid and CHIP Determinations - footnotes"].stats

statvalue
n10,302
nulls8,404 (81.6%)
unique12
top_value Includes Renewals and/or Redeterminations
top_rate 0.471
cardinality 12
entropy 2.42
entropy_ratio 0.675
alert: null_rate81.6% null
Fig 25.
Top values for Total Medicaid and CHIP Determinations - footnotes.
Show data table
Top values for Total Medicaid and CHIP Determinations - footnotes (12 unique shown, of 12 total).
valuecountshare
Includes Renewals and/or Redeterminations8948.7%
Does Not Include All Medicaid Determinations Made At Application; Does Not Include All CHIP Determinations Made At Application3343.2%
Includes Renewals and/or Redeterminations; Does Not Include All Medicaid Determinations Made At Application; Does Not Include All CHIP Determinations Made At Application2162.1%
Count is of Households, Not Individuals; Includes Renewals and/or Redeterminations1511.5%
Does Not Include All Determinations for Limited-Benefit Programs Made At Application970.9%
Does Not Include All Non-MAGI Determinations Made At Application810.8%
Does Not Include All Medicaid Determinations Made At Application490.5%
Includes Renewals and/or Redeterminations; Count is of Households, Not Individuals250.2%
Includes Conditional Eligibility Determinations; Includes Presumptive Eligibility Determinations; Does Not Include All CHIP Determinations Made At Application210.2%
Includes Final Eligibility Determinations Made by FFM; Includes Conditional Eligibility Determinations; Includes Presumptive Eligibility Determinations150.1%
Count is of Households, Not Individuals120.1%
Does Not Include All Non-MAGI Determinations Made At Application; Does Not Include All Medicaid Determinations Made At Application; Does Not Include All CHIP Determinations Made At Application30.0%

Medicaid and CHIP Child Enrollment numeric feature

This column reports counts of Medicaid and CHIP child enrollees, with 10,302 rows and 8,094 unique values spanning 0 to 5,339,904. The distribution is heavily right-skewed (skew 2.80, kurtosis 8.85) with a mean of 740,683 well above the median of 511,370, and roughly 7.8% of values flagged as outliers. About 2.1% of rows are exactly zero and 0.5% are null, suggesting a mix of small and very large reporting units.

Treatment: Log-transform (log1p) before modelling to tame the heavy right tail.

anthropic:claude-opus-4-7 · confidence high
Out[67]:

saturn.columns["Medicaid and CHIP Child Enrollment"].stats

statvalue
n10,302
nulls51 (0.5%)
unique8,094
min 0
max 5.34e+06
mean 7.407e+05
median 511,370
std 9.294e+05
q1 156,330
q3 836,539
iqr 680,209
skew 2.799
kurtosis 8.851
n_outliers 804
outlier_rate 0.07843
zero_rate 0.02127
alert: high_skewskew=+2.80
alert: outliers7.8% rows beyond 1.5 IQR
Fig 26.
Distribution of Medicaid and CHIP Child Enrollment. Vertical dash marks the median.
Show data table
Histogram bins for Medicaid and CHIP Child Enrollment (median: 511370.0).
bincount
0 – 1.335e+052347
1.335e+05 – 2.67e+051120
2.67e+05 – 4.005e+051049
4.005e+05 – 5.34e+05815
5.34e+05 – 6.675e+051121
6.675e+05 – 8.01e+05986
8.01e+05 – 9.345e+05783
9.345e+05 – 1.068e+06174
1.068e+06 – 1.201e+0694
1.201e+06 – 1.335e+06316
1.335e+06 – 1.468e+06404
1.468e+06 – 1.602e+06196
1.602e+06 – 1.735e+0631
1.735e+06 – 1.869e+0611
1.869e+06 – 2.002e+060
2.002e+06 – 2.136e+060
2.136e+06 – 2.269e+060
2.269e+06 – 2.403e+0622
2.403e+06 – 2.536e+06217
2.536e+06 – 2.67e+0695
2.67e+06 – 2.803e+0620
2.803e+06 – 2.937e+0624
2.937e+06 – 3.07e+0620
3.07e+06 – 3.204e+0616
3.204e+06 – 3.337e+0655
3.337e+06 – 3.471e+0628
3.471e+06 – 3.604e+0630
3.604e+06 – 3.738e+066
3.738e+06 – 3.871e+0612
3.871e+06 – 4.005e+0620
4.005e+06 – 4.138e+0616
4.138e+06 – 4.272e+066
4.272e+06 – 4.405e+0616
4.405e+06 – 4.539e+060
4.539e+06 – 4.672e+060
4.672e+06 – 4.806e+068
4.806e+06 – 4.939e+0640
4.939e+06 – 5.073e+0662
5.073e+06 – 5.206e+0658
5.206e+06 – 5.34e+0633

Medicaid and CHIP Child Enrollment - footnotes categorical metadata

This column holds free-form footnotes annotating the Medicaid and CHIP child enrollment counts, explaining caveats like enrollment-period basis or data completeness. It is null 92.19% of the time, and among the 805 populated rows there are only 7 distinct messages, with 'Includes Individuals Enrolled At Any Time in Month (Not a Point-in-Time Count)' covering 49.57% of them. The presence of 'Unable to provide data due to system limitations' (218 rows) is a meaningful data-quality flag tied to specific reporters.

Treatment: Keep as a qualifier flag joined to the enrollment metric; do not model directly.

anthropic:claude-opus-4-7 · confidence high
Out[70]:

saturn.columns["Medicaid and CHIP Child Enrollment - footnotes"].stats

statvalue
n10,302
nulls9,497 (92.2%)
unique7
top_value Includes Individuals Enrolled At Any Time in Month (Not a Point-in-Time Count)
top_rate 0.4957
cardinality 7
entropy 1.854
entropy_ratio 0.6605
alert: null_rate92.2% null
Fig 27.
Top values for Medicaid and CHIP Child Enrollment - footnotes.
Show data table
Top values for Medicaid and CHIP Child Enrollment - footnotes (7 unique shown, of 7 total).
valuecountshare
Includes Individuals Enrolled At Any Time in Month (Not a Point-in-Time Count)3993.9%
Unable to provide data due to system limitations2182.1%
Includes Retroactive Enrollments1201.2%
Does Not Include All Full-Benefit Child Medicaid enrollees420.4%
Does Not Include All Full-Benefit MAGI Child enrollees110.1%
Includes Individuals Enrolled At Any Time in Month (Not a Point-in-Time Count); Includes Retroactive Enrollments80.1%
Does Not Include All Full-Benefit Child Medicaid enrollees; Does Not Include All CHIP enrollees70.1%

Total Medicaid and CHIP Enrollment numeric feature

State-level (or similar geographic) totals of Medicaid and CHIP enrollees, ranging from 0 to about 14.46M with a median of roughly 1.03M and mean of 1.57M. The distribution is heavily right-skewed (skew 3.67, kurtosis 16.6) with 692 outliers (6.7%), consistent with a few very large states dominating the tail. Nulls and zeros are negligible (0.02% and 0.03%).

Treatment: Log-transform before modelling to tame the heavy right tail.

anthropic:claude-opus-4-7 · confidence high
Out[73]:

saturn.columns["Total Medicaid and CHIP Enrollment"].stats

statvalue
n10,302
nulls2 (0.0%)
unique8,309
min 0
max 1.446e+07
mean 1.567e+06
median 1.032e+06
std 2.054e+06
q1 349,361
q3 1.805e+06
iqr 1.455e+06
skew 3.67
kurtosis 16.57
n_outliers 692
outlier_rate 0.06718
zero_rate 0.0002913
alert: high_skewskew=+3.67
alert: outliers6.7% rows beyond 1.5 IQR
Fig 28.
Distribution of Total Medicaid and CHIP Enrollment. Vertical dash marks the median.
Show data table
Histogram bins for Total Medicaid and CHIP Enrollment (median: 1031822.0).
bincount
0 – 3.616e+052640
3.616e+05 – 7.231e+051192
7.231e+05 – 1.085e+061581
1.085e+06 – 1.446e+061149
1.446e+06 – 1.808e+061171
1.808e+06 – 2.169e+06726
2.169e+06 – 2.531e+06291
2.531e+06 – 2.893e+06207
2.893e+06 – 3.254e+06331
3.254e+06 – 3.616e+06143
3.616e+06 – 3.977e+06175
3.977e+06 – 4.339e+06106
4.339e+06 – 4.7e+0684
4.7e+06 – 5.062e+0642
5.062e+06 – 5.423e+0626
5.423e+06 – 5.785e+0619
5.785e+06 – 6.147e+0675
6.147e+06 – 6.508e+0620
6.508e+06 – 6.87e+0651
6.87e+06 – 7.231e+0635
7.231e+06 – 7.593e+0632
7.593e+06 – 7.954e+063
7.954e+06 – 8.316e+060
8.316e+06 – 8.678e+060
8.678e+06 – 9.039e+060
9.039e+06 – 9.401e+060
9.401e+06 – 9.762e+060
9.762e+06 – 1.012e+070
1.012e+07 – 1.049e+070
1.049e+07 – 1.085e+070
1.085e+07 – 1.121e+070
1.121e+07 – 1.157e+076
1.157e+07 – 1.193e+0735
1.193e+07 – 1.229e+0738
1.229e+07 – 1.265e+0711
1.265e+07 – 1.302e+0710
1.302e+07 – 1.338e+0723
1.338e+07 – 1.374e+0739
1.374e+07 – 1.41e+0721
1.41e+07 – 1.446e+0718

Total Medicaid and CHIP Enrollment - footnotes categorical metadata

Footnote annotations qualifying the Total Medicaid and CHIP Enrollment figure for each row, drawn from a controlled vocabulary of just 9 phrases. The column is sparse — 93.11% null — and when present is dominated by a single caveat, "Includes Individuals Enrolled At Any Time in Month (Not a Point-in-Time Count)", which accounts for 56.19% of non-null values (399 of ~710). The remaining notes flag scope exclusions or retroactive counts that materially affect comparability across states or months.

Treatment: Keep as a qualifier flag joined to the enrollment figure; do not model directly, but filter or stratify analyses by these caveats.

anthropic:claude-opus-4-7 · confidence high
Out[76]:

saturn.columns["Total Medicaid and CHIP Enrollment - footnotes"].stats

statvalue
n10,302
nulls9,592 (93.1%)
unique9
top_value Includes Individuals Enrolled At Any Time in Month (Not a Point-in-Time Count)
top_rate 0.562
cardinality 9
entropy 1.832
entropy_ratio 0.578
alert: null_rate93.1% null
Fig 29.
Top values for Total Medicaid and CHIP Enrollment - footnotes.
Show data table
Top values for Total Medicaid and CHIP Enrollment - footnotes (9 unique shown, of 9 total).
valuecountshare
Includes Individuals Enrolled At Any Time in Month (Not a Point-in-Time Count)3993.9%
Includes Enrollees in Other Financial Assistance Programs Not Enrolled in Medicaid or CHIP1351.3%
Includes Retroactive Enrollments1201.2%
Does Not Include All Full-Benefit Medicaid enrollees; Does Not Include All CHIP enrollees190.2%
Does Not Include All Full-Benefit Non-MAGI enrollees110.1%
Does Not Include All Full-Benefit Medicaid enrollees100.1%
Includes Individuals Enrolled At Any Time in Month (Not a Point-in-Time Count); Includes Limited-Benefit Enrollees; Includes Retroactive Enrollments80.1%
Does Not Include All Individuals Conditionally Eligible50.0%
Unable to Provide Data due to System Limitations30.0%

Total Medicaid Enrollment numeric feature

Counts of Medicaid enrollees, almost certainly aggregated by state-month or similar reporting unit (10,302 rows, 8,221 unique values, near-zero nulls). The distribution is heavily right-skewed (skew 3.61, kurtosis 16.1) with a median of 949,244 but a max of 13,160,563, and 691 rows (6.7%) flagged as outliers — consistent with a few very large states dominating the tail. Minimum is 0 but only 0.03% of rows are zero, so true empties are rare.

Treatment: log-transform before regression to tame the right skew.

anthropic:claude-opus-4-7 · confidence high
Out[79]:

saturn.columns["Total Medicaid Enrollment"].stats

statvalue
n10,302
nulls51 (0.5%)
unique8,221
min 0
max 1.316e+07
mean 1.433e+06
median 949,244
std 1.865e+06
q1 319,127
q3 1.647e+06
iqr 1.328e+06
skew 3.613
kurtosis 16.11
n_outliers 691
outlier_rate 0.06741
zero_rate 0.0002927
alert: high_skewskew=+3.61
alert: outliers6.7% rows beyond 1.5 IQR
Fig 30.
Distribution of Total Medicaid Enrollment. Vertical dash marks the median.
Show data table
Histogram bins for Total Medicaid Enrollment (median: 949244.0).
bincount
0 – 3.29e+052655
3.29e+05 – 6.58e+051219
6.58e+05 – 9.87e+051452
9.87e+05 – 1.316e+061130
1.316e+06 – 1.645e+061219
1.645e+06 – 1.974e+06738
1.974e+06 – 2.303e+06276
2.303e+06 – 2.632e+06180
2.632e+06 – 2.961e+06355
2.961e+06 – 3.29e+06147
3.29e+06 – 3.619e+06171
3.619e+06 – 3.948e+06137
3.948e+06 – 4.277e+0654
4.277e+06 – 4.606e+0642
4.606e+06 – 4.935e+0638
4.935e+06 – 5.264e+0614
5.264e+06 – 5.593e+0678
5.593e+06 – 5.922e+0639
5.922e+06 – 6.251e+0637
6.251e+06 – 6.58e+0626
6.58e+06 – 6.909e+0630
6.909e+06 – 7.238e+0613
7.238e+06 – 7.567e+060
7.567e+06 – 7.896e+060
7.896e+06 – 8.225e+060
8.225e+06 – 8.554e+060
8.554e+06 – 8.883e+060
8.883e+06 – 9.212e+060
9.212e+06 – 9.541e+060
9.541e+06 – 9.87e+060
9.87e+06 – 1.02e+073
1.02e+07 – 1.053e+0727
1.053e+07 – 1.086e+0735
1.086e+07 – 1.119e+0720
1.119e+07 – 1.152e+0710
1.152e+07 – 1.184e+0711
1.184e+07 – 1.217e+0724
1.217e+07 – 1.25e+0734
1.25e+07 – 1.283e+0721
1.283e+07 – 1.316e+0716

Total Medicaid Enrollment - footnotes categorical metadata

Footnote annotations qualifying the Total Medicaid Enrollment figures, drawn from a small controlled vocabulary of 7 distinct caveats. The column is 93.21% null, so only 700 of 10302 rows carry any note, and 57% of those flag that the count includes individuals enrolled at any time in the month rather than a point-in-time count. A handful of rows even disclose 'Unable to Provide Data due to System Limitations', which materially affects how the paired enrollment number should be read.

Treatment: Keep as a qualifier flag joined to the enrollment value; do not impute, and exclude or footnote rows with the system-limitation note when aggregating.

anthropic:claude-opus-4-7 · confidence high
Out[82]:

saturn.columns["Total Medicaid Enrollment - footnotes"].stats

statvalue
n10,302
nulls9,602 (93.2%)
unique7
top_value Includes Individuals Enrolled At Any Time in Month (Not a Point-in-Time Count)
top_rate 0.57
cardinality 7
entropy 1.725
entropy_ratio 0.6144
alert: null_rate93.2% null
Fig 31.
Top values for Total Medicaid Enrollment - footnotes.
Show data table
Top values for Total Medicaid Enrollment - footnotes (7 unique shown, of 7 total).
valuecountshare
Includes Individuals Enrolled At Any Time in Month (Not a Point-in-Time Count)3993.9%
Includes Enrollees in Other Financial Assistance Programs Not Enrolled in Medicaid or CHIP1351.3%
Includes Retroactive Enrollments1201.2%
Does Not Include All Full-Benefit Medicaid enrollees240.2%
Does Not Include All Full-Benefit Non-MAGI enrollees110.1%
Includes Individuals Enrolled At Any Time in Month (Not a Point-in-Time Count); Includes Limited-Benefit Enrollees; Includes Retroactive Enrollments80.1%
Unable to Provide Data due to System Limitations30.0%

Total CHIP Enrollment numeric feature

Likely a count of children enrolled in CHIP (Children's Health Insurance Program), probably aggregated by state and time period across 10,302 rows. The distribution is heavily right-skewed (skew 3.90, kurtosis 18.21) with the mean (136,663) far above the median (78,933) and a max of 1,317,347, plus 542 outliers (5.3%). Near-zero zero_rate (0.03%) and low null_rate (0.5%) indicate the field is consistently populated.

Treatment: log-transform before regression to tame the heavy right tail.

anthropic:claude-opus-4-7 · confidence high
Out[85]:

saturn.columns["Total CHIP Enrollment"].stats

statvalue
n10,302
nulls51 (0.5%)
unique7,918
min 0
max 1.317e+06
mean 1.367e+05
median 78,933
std 2.026e+05
q1 26,056
q3 172,021
iqr 145,965
skew 3.897
kurtosis 18.21
n_outliers 542
outlier_rate 0.05287
zero_rate 0.0002927
alert: high_skewskew=+3.90
alert: outliers5.3% rows beyond 1.5 IQR
Fig 32.
Distribution of Total CHIP Enrollment. Vertical dash marks the median.
Show data table
Histogram bins for Total CHIP Enrollment (median: 78933.0).
bincount
0 – 3.293e+042984
3.293e+04 – 6.587e+041521
6.587e+04 – 9.88e+041323
9.88e+04 – 1.317e+05963
1.317e+05 – 1.647e+05780
1.647e+05 – 1.976e+05717
1.976e+05 – 2.305e+05428
2.305e+05 – 2.635e+05359
2.635e+05 – 2.964e+05303
2.964e+05 – 3.293e+05186
3.293e+05 – 3.623e+05137
3.623e+05 – 3.952e+0510
3.952e+05 – 4.281e+0512
4.281e+05 – 4.611e+058
4.611e+05 – 4.94e+058
4.94e+05 – 5.269e+0521
5.269e+05 – 5.599e+0565
5.599e+05 – 5.928e+05112
5.928e+05 – 6.257e+0567
6.257e+05 – 6.587e+0518
6.587e+05 – 6.916e+0528
6.916e+05 – 7.245e+050
7.245e+05 – 7.575e+050
7.575e+05 – 7.904e+050
7.904e+05 – 8.233e+050
8.233e+05 – 8.563e+050
8.563e+05 – 8.892e+050
8.892e+05 – 9.221e+050
9.221e+05 – 9.551e+050
9.551e+05 – 9.88e+050
9.88e+05 – 1.021e+060
1.021e+06 – 1.054e+060
1.054e+06 – 1.087e+060
1.087e+06 – 1.12e+060
1.12e+06 – 1.153e+060
1.153e+06 – 1.186e+060
1.186e+06 – 1.219e+060
1.219e+06 – 1.251e+0636
1.251e+06 – 1.284e+0615
1.284e+06 – 1.317e+06150

Total CHIP Enrollment - footnotes categorical metadata

This is a sparse footnote/qualifier column annotating CHIP enrollment counts, present on only ~4% of rows (null_rate 0.9592). Among the 420 non-null entries, four distinct notes appear, dominated by 'Includes Individuals Enrolled At Any Time in Month (Not a Point-in-Time Count)' at 75.95%, with smaller mentions of retroactive enrollments, incomplete coverage, and system-limitation gaps. These notes signal that the accompanying enrollment numbers are not methodologically uniform across rows.

Treatment: Keep as a caveat flag joined to the enrollment column; do not use as a modelling feature.

anthropic:claude-opus-4-7 · confidence high
Out[88]:

saturn.columns["Total CHIP Enrollment - footnotes"].stats

statvalue
n10,302
nulls9,882 (95.9%)
unique4
top_value Includes Individuals Enrolled At Any Time in Month (Not a Point-in-Time Count)
top_rate 0.7595
cardinality 4
entropy 1.033
entropy_ratio 0.5167
alert: null_rate95.9% null
Fig 33.
Top values for Total CHIP Enrollment - footnotes.
Show data table
Top values for Total CHIP Enrollment - footnotes (4 unique shown, of 4 total).
valuecountshare
Includes Individuals Enrolled At Any Time in Month (Not a Point-in-Time Count)3193.1%
Includes Retroactive Enrollments730.7%
Does Not Include All CHIP enrollees250.2%
Unable to Provide Data due to System Limitations30.0%

Total Adult Medicaid Enrollment numeric feature

Counts of adult Medicaid enrollees, almost certainly aggregated by state and reporting period. Coverage is sparse — 84.65% of rows are null — and the populated values are heavily right-skewed (skew 4.56, kurtosis 23.38) with a mean of 810,066 sitting well above the median of 551,447 and a max of 8,497,290. Only 1,415 distinct values across 10,302 rows suggests repeated state-level totals, and 62 outliers (3.9%) likely correspond to the largest states.

Treatment: Impute or filter the 84.65% nulls and log-transform before any regression.

anthropic:claude-opus-4-7 · confidence high
Out[91]:

saturn.columns["Total Adult Medicaid Enrollment"].stats

statvalue
n10,302
nulls8,721 (84.7%)
unique1,415
min 0
max 8.497e+06
mean 8.101e+05
median 551,447
std 1.269e+06
q1 168,141
q3 951,232
iqr 783,091
skew 4.556
kurtosis 23.38
n_outliers 62
outlier_rate 0.03922
zero_rate 0.001898
alert: high_skewskew=+4.56
alert: null_rate84.7% null
Fig 34.
Distribution of Total Adult Medicaid Enrollment. Vertical dash marks the median.
Show data table
Histogram bins for Total Adult Medicaid Enrollment (median: 551447.0).
bincount
0 – 2.179e+05490
2.179e+05 – 4.358e+05238
4.358e+05 – 6.536e+05209
6.536e+05 – 8.715e+05183
8.715e+05 – 1.089e+06203
1.089e+06 – 1.307e+0623
1.307e+06 – 1.525e+0680
1.525e+06 – 1.743e+0674
1.743e+06 – 1.961e+0619
1.961e+06 – 2.179e+060
2.179e+06 – 2.397e+060
2.397e+06 – 2.615e+060
2.615e+06 – 2.832e+060
2.832e+06 – 3.05e+060
3.05e+06 – 3.268e+060
3.268e+06 – 3.486e+060
3.486e+06 – 3.704e+060
3.704e+06 – 3.922e+060
3.922e+06 – 4.14e+0615
4.14e+06 – 4.358e+0616
4.358e+06 – 4.575e+060
4.575e+06 – 4.793e+060
4.793e+06 – 5.011e+060
5.011e+06 – 5.229e+060
5.229e+06 – 5.447e+060
5.447e+06 – 5.665e+060
5.665e+06 – 5.883e+060
5.883e+06 – 6.101e+060
6.101e+06 – 6.318e+060
6.318e+06 – 6.536e+060
6.536e+06 – 6.754e+060
6.754e+06 – 6.972e+060
6.972e+06 – 7.19e+060
7.19e+06 – 7.408e+060
7.408e+06 – 7.626e+060
7.626e+06 – 7.844e+060
7.844e+06 – 8.062e+060
8.062e+06 – 8.279e+062
8.279e+06 – 8.497e+0629

Total Adult Medicaid Enrollment - footnotes categorical metadata

This is a footnotes/caveats column accompanying Total Adult Medicaid Enrollment, holding methodological annotations rather than data values. It is 98.79% null with only 4 distinct notes across 10,302 rows; the dominant note (56% of non-nulls, 70 occurrences) flags that counts include anyone enrolled at any time in the month rather than a point-in-time count, while 3 rows admit data could not be provided due to system limitations.

Treatment: Keep as a qualitative caveat lookup; do not use as a model feature given the 98.79% null rate.

anthropic:claude-opus-4-7 · confidence high
Out[94]:

saturn.columns["Total Adult Medicaid Enrollment - footnotes"].stats

statvalue
n10,302
nulls10,177 (98.8%)
unique4
top_value Includes Individuals Enrolled At Any Time in Month (Not a Point-in-Time Count)
top_rate 0.56
cardinality 4
entropy 1.382
entropy_ratio 0.6908
alert: null_rate98.8% null
Fig 35.
Top values for Total Adult Medicaid Enrollment - footnotes.
Show data table
Top values for Total Adult Medicaid Enrollment - footnotes (4 unique shown, of 4 total).
valuecountshare
Includes Individuals Enrolled At Any Time in Month (Not a Point-in-Time Count)700.7%
Includes Retroactive Enrollments440.4%
Includes Individuals Enrolled At Any Time in Month (Not a Point-in-Time Count); Includes Limited-Benefit Enrollees; Includes Retroactive Enrollments80.1%
Unable to Provide Data due to System Limitations30.0%

Total Medicaid and CHIP Determinations Processed in Less than 24 Hours numeric feature

This column counts Medicaid/CHIP determinations processed in under 24 hours per reporting unit, with values ranging from 0 to 791,175 and a median of just 3,470. It is severely right-skewed (skew 6.44, kurtosis 47.28) with mean 22,240 dwarfing the median, 699 outliers (11.9%), and a 43.07% null rate that an analyst should not ignore. About 2.1% of records are zero, suggesting some agencies report no fast-track determinations.

Treatment: Impute or flag the 43% missing, then log-transform before any modelling to tame the skew.

anthropic:claude-opus-4-7 · confidence high
Out[97]:

saturn.columns["Total Medicaid and CHIP Determinations Processed in Less than 24 Hours"].stats

statvalue
n10,302
nulls4,437 (43.1%)
unique2,701
min 0
max 791,175
mean 2.224e+04
median 3,470
std 7.333e+04
q1 932
q3 12,172
iqr 11,240
skew 6.441
kurtosis 47.28
n_outliers 699
outlier_rate 0.1192
zero_rate 0.02148
alert: high_skewskew=+6.44
alert: outliers11.9% rows beyond 1.5 IQR
alert: null_rate43.1% null
Fig 36.
Distribution of Total Medicaid and CHIP Determinations Processed in Less than 24 Hours. Vertical dash marks the median.
Show data table
Histogram bins for Total Medicaid and CHIP Determinations Processed in Less than 24 Hours (median: 3470.0).
bincount
0 – 1.978e+044914
1.978e+04 – 3.956e+04349
3.956e+04 – 5.934e+04102
5.934e+04 – 7.912e+04136
7.912e+04 – 9.89e+0443
9.89e+04 – 1.187e+0569
1.187e+05 – 1.385e+0568
1.385e+05 – 1.582e+0535
1.582e+05 – 1.78e+0512
1.78e+05 – 1.978e+0514
1.978e+05 – 2.176e+054
2.176e+05 – 2.374e+058
2.374e+05 – 2.571e+052
2.571e+05 – 2.769e+058
2.769e+05 – 2.967e+056
2.967e+05 – 3.165e+054
3.165e+05 – 3.362e+054
3.362e+05 – 3.56e+052
3.56e+05 – 3.758e+058
3.758e+05 – 3.956e+052
3.956e+05 – 4.154e+054
4.154e+05 – 4.351e+054
4.351e+05 – 4.549e+054
4.549e+05 – 4.747e+052
4.747e+05 – 4.945e+050
4.945e+05 – 5.143e+0510
5.143e+05 – 5.34e+056
5.34e+05 – 5.538e+059
5.538e+05 – 5.736e+054
5.736e+05 – 5.934e+054
5.934e+05 – 6.132e+056
6.132e+05 – 6.329e+054
6.329e+05 – 6.527e+054
6.527e+05 – 6.725e+052
6.725e+05 – 6.923e+050
6.923e+05 – 7.121e+052
7.121e+05 – 7.318e+052
7.318e+05 – 7.516e+054
7.516e+05 – 7.714e+052
7.714e+05 – 7.912e+052

Total Medicaid and CHIP Determinations Processed in Less than 24 Hours - footnotes categorical metadata

Footnote/caveat field annotating data-quality issues with the 'Total Medicaid and CHIP Determinations Processed in Less than 24 Hours' metric. It is null 89.21% of the time, with only 20 distinct values across 10,302 rows; the most common note, appearing 420 times (37.77% of non-nulls), flags reporting at application rather than individual level. Several entries are semicolon-concatenated combinations of caveats, suggesting a multi-label field flattened into strings.

Treatment: Keep as a caveat flag; split on ';' into boolean indicators if you need to filter unreliable rows from the paired metric.

anthropic:claude-opus-4-7 · confidence high
Out[100]:

saturn.columns["Total Medicaid and CHIP Determinations Processed in Less than 24 Hours - footnotes"].stats

statvalue
n10,302
nulls9,190 (89.2%)
unique20
top_value Incorrectly reporting processing time at application level, as opposed to the individual level
top_rate 0.3777
cardinality 20
entropy 2.968
entropy_ratio 0.6867
alert: null_rate89.2% null
Fig 37.
Top values for Total Medicaid and CHIP Determinations Processed in Less than 24 Hours - footnotes.
Show data table
Top values for Total Medicaid and CHIP Determinations Processed in Less than 24 Hours - footnotes (20 unique shown, of 20 total).
valuecountshare
Incorrectly reporting processing time at application level, as opposed to the individual level4204.1%
Incorrectly includes redeterminations1771.7%
Does not include all MAGI determinations on applications1431.4%
Determinations conducted in less than 24 hours are reported under the 1-7 days category; Incorrectly reporting processing time at application level, as opposed to the individual level900.9%
Determinations conducted in less than 24 hours are reported under the 1-7 days category; Incorrectly reporting processing time at application level, as opposed to the individual level; Incorrectly includes redeterminations700.7%
Incorrectly includes redeterminations; Does not include all MAGI determinations on applications570.6%
Unable to provide data due to system limitations390.4%
Includes some non-MAGI determinations on applications240.2%
Includes some non-MAGI determinations on applications; Incorrectly includes redeterminations180.2%
Reflects incorrect start dates for processing time on some reopened applications100.1%
Determinations conducted in less than 24 hours are reported under the 1-7 days category; Incorrectly reporting processing time at application level, as opposed to the individual level; Does not include all MAGI determinations on applications100.1%
Includes some non-Medicaid or CHIP determinations on applications; Reflects incorrect start dates for processing time on some reopened applications80.1%
Incorrectly reporting processing time at application level, as opposed to the individual level; Does not include all MAGI determinations on applications80.1%
Includes some non-MAGI determinations on applications; Does not include all MAGI determinations on applications60.1%
Includes some non-MAGI determinations on applications; Incorrectly reporting processing time at application level, as opposed to the individual level60.1%
Determinations conducted in less than 24 hours are reported under the 1-7 days category60.1%
Incorrectly includes reinstatements60.1%
Incorrectly reporting processing time at application level, as opposed to the individual level; Incorrectly includes redeterminations60.1%
Determinations conducted in less than 24 hours are reported under the 1-7 days category; Incorrectly includes redeterminations60.1%
Includes some non-Medicaid or CHIP determinations on applications20.0%

Total Medicaid and CHIP Determinations Processed Between 24 Hours and 7 Days numeric feature

This column counts state-level Medicaid and CHIP eligibility determinations processed in the 24-hour to 7-day window. Values are heavily right-skewed (skew 4.39, kurtosis 29.5) with a median of 2,312 but a max of 133,996, and 10.3% of values flag as outliers — consistent with a few large states dominating volumes. Note that 43.1% of rows are null and 2.7% are exact zeros, so coverage is partial.

Treatment: Log-transform and impute or flag missingness before modelling.

anthropic:claude-opus-4-7 · confidence high
Out[103]:

saturn.columns["Total Medicaid and CHIP Determinations Processed Between 24 Hours and 7 Days"].stats

statvalue
n10,302
nulls4,437 (43.1%)
unique2,559
min 0
max 133,996
mean 5465
median 2,312
std 9481
q1 723
q3 5,802
iqr 5,079
skew 4.392
kurtosis 29.55
n_outliers 602
outlier_rate 0.1026
zero_rate 0.02677
alert: high_skewskew=+4.39
alert: outliers10.3% rows beyond 1.5 IQR
alert: null_rate43.1% null
Fig 38.
Distribution of Total Medicaid and CHIP Determinations Processed Between 24 Hours and 7 Days. Vertical dash marks the median.
Show data table
Histogram bins for Total Medicaid and CHIP Determinations Processed Between 24 Hours and 7 Days (median: 2312.0).
bincount
0 – 33503582
3350 – 67001036
6700 – 1.005e+04405
1.005e+04 – 1.34e+04240
1.34e+04 – 1.675e+04119
1.675e+04 – 2.01e+0474
2.01e+04 – 2.345e+04109
2.345e+04 – 2.68e+0493
2.68e+04 – 3.015e+0453
3.015e+04 – 3.35e+0431
3.35e+04 – 3.685e+0418
3.685e+04 – 4.02e+0419
4.02e+04 – 4.355e+0411
4.355e+04 – 4.69e+0410
4.69e+04 – 5.025e+047
5.025e+04 – 5.36e+049
5.36e+04 – 5.695e+0411
5.695e+04 – 6.03e+0412
6.03e+04 – 6.365e+042
6.365e+04 – 6.7e+048
6.7e+04 – 7.035e+042
7.035e+04 – 7.37e+044
7.37e+04 – 7.705e+040
7.705e+04 – 8.04e+040
8.04e+04 – 8.375e+044
8.375e+04 – 8.71e+042
8.71e+04 – 9.045e+040
9.045e+04 – 9.38e+040
9.38e+04 – 9.715e+040
9.715e+04 – 1.005e+050
1.005e+05 – 1.038e+052
1.038e+05 – 1.072e+050
1.072e+05 – 1.105e+050
1.105e+05 – 1.139e+050
1.139e+05 – 1.172e+050
1.172e+05 – 1.206e+050
1.206e+05 – 1.239e+050
1.239e+05 – 1.273e+050
1.273e+05 – 1.306e+050
1.306e+05 – 1.34e+052

Total Medicaid and CHIP Determinations Processed Between 24 Hours and 7 Days - footnotes categorical metadata

Footnote annotations describing data quality caveats for the Medicaid/CHIP 24h-7d determinations metric. The column is 89.21% null, leaving only ~1,111 populated rows across 21 distinct caveat strings, with the top note ('Incorrectly reporting processing time at application level...') covering 37.77% of the non-null entries. Many values are semicolon-concatenated combinations of caveats, indicating multiple data quality issues stacked per row.

Treatment: Keep as a qualitative caveat flag; split on ';' to derive boolean data-quality indicators rather than modelling directly.

anthropic:claude-opus-4-7 · confidence high
Out[106]:

saturn.columns["Total Medicaid and CHIP Determinations Processed Between 24 Hours and 7 Days - footnotes"].stats

statvalue
n10,302
nulls9,190 (89.2%)
unique21
top_value Incorrectly reporting processing time at application level, as opposed to the individual level
top_rate 0.3777
cardinality 21
entropy 3.026
entropy_ratio 0.689
alert: null_rate89.2% null
Fig 39.
Top values for Total Medicaid and CHIP Determinations Processed Between 24 Hours and 7 Days - footnotes.
Show data table
Top values for Total Medicaid and CHIP Determinations Processed Between 24 Hours and 7 Days - footnotes (20 unique shown, of 21 total).
valuecountshare
Incorrectly reporting processing time at application level, as opposed to the individual level4204.1%
Incorrectly includes redeterminations1771.7%
Does not include all MAGI determinations on applications1431.4%
Determinations conducted in less than 24 hours are reported under the 1-7 days category; Incorrectly reporting processing time at application level, as opposed to the individual level; MAGI determinations conducted in 6-7 days are reported under the 8-30 days category720.7%
Determinations conducted in less than 24 hours are reported under the 1-7 days category; Incorrectly reporting processing time at application level, as opposed to the individual level; Incorrectly includes redeterminations700.7%
Incorrectly includes redeterminations; Does not include all MAGI determinations on applications570.6%
Unable to provide data due to system limitations390.4%
Includes some non-MAGI determinations on applications240.2%
Includes some non-MAGI determinations on applications; Incorrectly includes redeterminations180.2%
Determinations conducted in less than 24 hours are reported under the 1-7 days category; Incorrectly reporting processing time at application level, as opposed to the individual level180.2%
Reflects incorrect start dates for processing time on some reopened applications100.1%
Determinations conducted in less than 24 hours are reported under the 1-7 days category; Incorrectly reporting processing time at application level, as opposed to the individual level; Does not include all MAGI determinations on applications100.1%
Includes some non-Medicaid or CHIP determinations on applications; Reflects incorrect start dates for processing time on some reopened applications80.1%
Incorrectly reporting processing time at application level, as opposed to the individual level; Does not include all MAGI determinations on applications80.1%
Includes some non-MAGI determinations on applications; Does not include all MAGI determinations on applications60.1%
Includes some non-MAGI determinations on applications; Incorrectly reporting processing time at application level, as opposed to the individual level60.1%
Determinations conducted in less than 24 hours are reported under the 1-7 days category60.1%
Incorrectly includes reinstatements60.1%
Incorrectly reporting processing time at application level, as opposed to the individual level; Incorrectly includes redeterminations60.1%
Determinations conducted in less than 24 hours are reported under the 1-7 days category; Incorrectly includes redeterminations60.1%

Total Medicaid and CHIP Determinations Processed Between 8 Days and 30 Days numeric feature

Numeric count of Medicaid/CHIP eligibility determinations processed in the 8-30 day window, likely reported per state-month. Distribution is heavily right-skewed (skew 3.98, kurtosis 19.86) with median 2528 but max 155529, and 10.2% of values flagged as outliers. Notably, 43.1% of rows are null, suggesting many reporting periods or jurisdictions did not submit this metric.

Treatment: Impute or filter the 43% nulls and log-transform before any modelling.

anthropic:claude-opus-4-7 · confidence high
Out[109]:

saturn.columns["Total Medicaid and CHIP Determinations Processed Between 8 Days and 30 Days"].stats

statvalue
n10,302
nulls4,437 (43.1%)
unique2,608
min 0
max 155,529
mean 7967
median 2,528
std 1.528e+04
q1 624
q3 7,866
iqr 7,242
skew 3.975
kurtosis 19.86
n_outliers 601
outlier_rate 0.1025
zero_rate 0.02643
alert: high_skewskew=+3.98
alert: outliers10.2% rows beyond 1.5 IQR
alert: null_rate43.1% null
Fig 40.
Distribution of Total Medicaid and CHIP Determinations Processed Between 8 Days and 30 Days. Vertical dash marks the median.
Show data table
Histogram bins for Total Medicaid and CHIP Determinations Processed Between 8 Days and 30 Days (median: 2528.0).
bincount
0 – 38883532
3888 – 7776853
7776 – 1.166e+04492
1.166e+04 – 1.555e+04228
1.555e+04 – 1.944e+04177
1.944e+04 – 2.333e+0462
2.333e+04 – 2.722e+0474
2.722e+04 – 3.111e+0464
3.111e+04 – 3.499e+0439
3.499e+04 – 3.888e+0448
3.888e+04 – 4.277e+0444
4.277e+04 – 4.666e+0443
4.666e+04 – 5.055e+0451
5.055e+04 – 5.444e+0431
5.444e+04 – 5.832e+0417
5.832e+04 – 6.221e+0411
6.221e+04 – 6.61e+048
6.61e+04 – 6.999e+046
6.999e+04 – 7.388e+046
7.388e+04 – 7.776e+0410
7.776e+04 – 8.165e+046
8.165e+04 – 8.554e+048
8.554e+04 – 8.943e+044
8.943e+04 – 9.332e+042
9.332e+04 – 9.721e+046
9.721e+04 – 1.011e+0510
1.011e+05 – 1.05e+0513
1.05e+05 – 1.089e+056
1.089e+05 – 1.128e+052
1.128e+05 – 1.166e+052
1.166e+05 – 1.205e+052
1.205e+05 – 1.244e+052
1.244e+05 – 1.283e+052
1.283e+05 – 1.322e+052
1.322e+05 – 1.361e+050
1.361e+05 – 1.4e+050
1.4e+05 – 1.439e+050
1.439e+05 – 1.478e+050
1.478e+05 – 1.516e+050
1.516e+05 – 1.555e+052

Total Medicaid and CHIP Determinations Processed Between 8 Days and 30 Days - footnotes categorical metadata

This is a footnotes/caveats column attached to a Medicaid & CHIP determination-processing metric, recording data-quality disclaimers per row. It is overwhelmingly empty (null_rate 0.8926), and among the 1,106 populated rows the 16 distinct notes are dominated by 'Incorrectly reporting processing time at application level, as opposed to the individual level' at 39.6%, with several entries being semicolon-concatenated combinations of base notes. The notes describe known reporting errors (redeterminations included, missing MAGI determinations, system limitations), so any downstream use of the associated metric should treat flagged rows as suspect.

Treatment: Keep as a data-quality flag; split on ';' into boolean caveat indicators rather than modelling as a single category.

anthropic:claude-opus-4-7 · confidence high
Out[112]:

saturn.columns["Total Medicaid and CHIP Determinations Processed Between 8 Days and 30 Days - footnotes"].stats

statvalue
n10,302
nulls9,196 (89.3%)
unique16
top_value Incorrectly reporting processing time at application level, as opposed to the individual level
top_rate 0.396
cardinality 16
entropy 2.818
entropy_ratio 0.7045
alert: null_rate89.3% null
Fig 41.
Top values for Total Medicaid and CHIP Determinations Processed Between 8 Days and 30 Days - footnotes.
Show data table
Top values for Total Medicaid and CHIP Determinations Processed Between 8 Days and 30 Days - footnotes (16 unique shown, of 16 total).
valuecountshare
Incorrectly reporting processing time at application level, as opposed to the individual level4384.3%
Incorrectly includes redeterminations1831.8%
Does not include all MAGI determinations on applications1431.4%
Incorrectly reporting processing time at application level, as opposed to the individual level; Incorrectly includes redeterminations760.7%
Incorrectly reporting processing time at application level, as opposed to the individual level; MAGI determinations conducted in 6-7 days are reported under the 8-30 days category720.7%
Incorrectly includes redeterminations; Does not include all MAGI determinations on applications570.6%
Unable to provide data due to system limitations390.4%
Includes some non-MAGI determinations on applications240.2%
Includes some non-MAGI determinations on applications; Incorrectly includes redeterminations180.2%
Incorrectly reporting processing time at application level, as opposed to the individual level; Does not include all MAGI determinations on applications180.2%
Reflects incorrect start dates for processing time on some reopened applications100.1%
Includes some non-Medicaid or CHIP determinations on applications; Reflects incorrect start dates for processing time on some reopened applications80.1%
Includes some non-MAGI determinations on applications; Does not include all MAGI determinations on applications60.1%
Includes some non-MAGI determinations on applications; Incorrectly reporting processing time at application level, as opposed to the individual level60.1%
Incorrectly includes reinstatements60.1%
Includes some non-Medicaid or CHIP determinations on applications20.0%

Total Medicaid and CHIP Determinations Processed between 31 days and 45 days numeric feature

This is a numeric operational metric counting Medicaid and CHIP determinations processed in a 31-45 day window, almost certainly reported per state and reporting period. The distribution is severely right-skewed (skew 4.90, kurtosis 31.95) with a median of 693 but a mean of 2917 and a max of 81475, and 12.75% of rows flagged as outliers. Notably, 43.07% of values are null and 6.63% are zero, so under half of rows carry a usable signal.

Treatment: Log-transform after imputing or flagging the 43% nulls before any modelling.

anthropic:claude-opus-4-7 · confidence high
Out[115]:

saturn.columns["Total Medicaid and CHIP Determinations Processed between 31 days and 45 days"].stats

statvalue
n10,302
nulls4,437 (43.1%)
unique1,923
min 0
max 81,475
mean 2917
median 693
std 6725
q1 106
q3 2,322
iqr 2,216
skew 4.899
kurtosis 31.95
n_outliers 748
outlier_rate 0.1275
zero_rate 0.06633
alert: high_skewskew=+4.90
alert: outliers12.8% rows beyond 1.5 IQR
alert: null_rate43.1% null
Fig 42.
Distribution of Total Medicaid and CHIP Determinations Processed between 31 days and 45 days. Vertical dash marks the median.
Show data table
Histogram bins for Total Medicaid and CHIP Determinations Processed between 31 days and 45 days (median: 693.0).
bincount
0 – 20374225
2037 – 4074690
4074 – 6111267
6111 – 8148154
8148 – 1.018e+0487
1.018e+04 – 1.222e+0468
1.222e+04 – 1.426e+0454
1.426e+04 – 1.63e+0445
1.63e+04 – 1.833e+0455
1.833e+04 – 2.037e+0435
2.037e+04 – 2.241e+0426
2.241e+04 – 2.444e+0426
2.444e+04 – 2.648e+0442
2.648e+04 – 2.852e+0414
2.852e+04 – 3.055e+0413
3.055e+04 – 3.259e+0412
3.259e+04 – 3.463e+044
3.463e+04 – 3.666e+040
3.666e+04 – 3.87e+040
3.87e+04 – 4.074e+048
4.074e+04 – 4.277e+040
4.277e+04 – 4.481e+044
4.481e+04 – 4.685e+046
4.685e+04 – 4.888e+042
4.888e+04 – 5.092e+044
5.092e+04 – 5.296e+042
5.296e+04 – 5.5e+046
5.5e+04 – 5.703e+042
5.703e+04 – 5.907e+046
5.907e+04 – 6.111e+042
6.111e+04 – 6.314e+040
6.314e+04 – 6.518e+040
6.518e+04 – 6.722e+042
6.722e+04 – 6.925e+040
6.925e+04 – 7.129e+040
7.129e+04 – 7.333e+042
7.333e+04 – 7.536e+040
7.536e+04 – 7.74e+040
7.74e+04 – 7.944e+040
7.944e+04 – 8.148e+042

Total Medicaid and CHIP Determinations Processed between 31 days and 45 days - footnotes categorical metadata

Footnote/caveat field annotating data quality issues for the 31-45 day Medicaid/CHIP determination processing metric. 89.26% of rows are null, and among the 15 distinct notes one dominates: 46.11% of non-null entries flag 'Incorrectly reporting processing time at application level, as opposed to the individual level', with semicolon-delimited combinations indicating multiple concurrent caveats per row.

Treatment: Keep as a data-quality flag; split on ';' into multi-hot indicators when auditing the paired metric.

anthropic:claude-opus-4-7 · confidence high
Out[118]:

saturn.columns["Total Medicaid and CHIP Determinations Processed between 31 days and 45 days - footnotes"].stats

statvalue
n10,302
nulls9,196 (89.3%)
unique15
top_value Incorrectly reporting processing time at application level, as opposed to the individual level
top_rate 0.4611
cardinality 15
entropy 2.547
entropy_ratio 0.652
alert: null_rate89.3% null
Fig 43.
Top values for Total Medicaid and CHIP Determinations Processed between 31 days and 45 days - footnotes.
Show data table
Top values for Total Medicaid and CHIP Determinations Processed between 31 days and 45 days - footnotes (15 unique shown, of 15 total).
valuecountshare
Incorrectly reporting processing time at application level, as opposed to the individual level5105.0%
Incorrectly includes redeterminations1831.8%
Does not include all MAGI determinations on applications1431.4%
Incorrectly reporting processing time at application level, as opposed to the individual level; Incorrectly includes redeterminations760.7%
Incorrectly includes redeterminations; Does not include all MAGI determinations on applications570.6%
Unable to provide data due to system limitations390.4%
Includes some non-MAGI determinations on applications240.2%
Includes some non-MAGI determinations on applications; Incorrectly includes redeterminations180.2%
Incorrectly reporting processing time at application level, as opposed to the individual level; Does not include all MAGI determinations on applications180.2%
Reflects incorrect start dates for processing time on some reopened applications100.1%
Includes some non-Medicaid or CHIP determinations on applications; Reflects incorrect start dates for processing time on some reopened applications80.1%
Includes some non-MAGI determinations on applications; Does not include all MAGI determinations on applications60.1%
Includes some non-MAGI determinations on applications; Incorrectly reporting processing time at application level, as opposed to the individual level60.1%
Incorrectly includes reinstatements60.1%
Includes some non-Medicaid or CHIP determinations on applications20.0%

Total Medicaid and CHIP Determinations Processed in More than 45 Days numeric feature

This is a state/period-level operational metric counting Medicaid and CHIP eligibility determinations that exceeded the 45-day processing target. The distribution is severely right-skewed (skew 5.26, kurtosis 37.4) with a median of 395 but a max of 106,943 and std of 8,135, and roughly 15.4% of values flagged as outliers. Notably, 43.1% of rows are null and another 10% are exact zeros, so most of the variance comes from a small tail of jurisdictions or reporting periods with large backlogs.

Treatment: Log1p-transform and add a missing-value indicator before modelling.

anthropic:claude-opus-4-7 · confidence high
Out[121]:

saturn.columns["Total Medicaid and CHIP Determinations Processed in More than 45 Days"].stats

statvalue
n10,302
nulls4,437 (43.1%)
unique1,691
min 0
max 106,943
mean 3028
median 395
std 8135
q1 90
q3 1,545
iqr 1,455
skew 5.264
kurtosis 37.42
n_outliers 902
outlier_rate 0.1538
zero_rate 0.1001
alert: high_skewskew=+5.26
alert: outliers15.4% rows beyond 1.5 IQR
alert: null_rate43.1% null
Fig 44.
Distribution of Total Medicaid and CHIP Determinations Processed in More than 45 Days. Vertical dash marks the median.
Show data table
Histogram bins for Total Medicaid and CHIP Determinations Processed in More than 45 Days (median: 395.0).
bincount
0 – 26744746
2674 – 5347361
5347 – 8021146
8021 – 1.069e+04138
1.069e+04 – 1.337e+0480
1.337e+04 – 1.604e+0477
1.604e+04 – 1.872e+0454
1.872e+04 – 2.139e+0445
2.139e+04 – 2.406e+0434
2.406e+04 – 2.674e+0431
2.674e+04 – 2.941e+0429
2.941e+04 – 3.208e+0418
3.208e+04 – 3.476e+0420
3.476e+04 – 3.743e+0418
3.743e+04 – 4.01e+048
4.01e+04 – 4.278e+044
4.278e+04 – 4.545e+048
4.545e+04 – 4.812e+046
4.812e+04 – 5.08e+048
5.08e+04 – 5.347e+0410
5.347e+04 – 5.615e+040
5.615e+04 – 5.882e+042
5.882e+04 – 6.149e+044
6.149e+04 – 6.417e+040
6.417e+04 – 6.684e+042
6.684e+04 – 6.951e+042
6.951e+04 – 7.219e+044
7.219e+04 – 7.486e+040
7.486e+04 – 7.753e+040
7.753e+04 – 8.021e+042
8.021e+04 – 8.288e+040
8.288e+04 – 8.555e+042
8.555e+04 – 8.823e+044
8.823e+04 – 9.09e+040
9.09e+04 – 9.358e+040
9.358e+04 – 9.625e+040
9.625e+04 – 9.892e+040
9.892e+04 – 1.016e+050
1.016e+05 – 1.043e+050
1.043e+05 – 1.069e+052

Total Medicaid and CHIP Determinations Processed in More than 45 Days - footnotes categorical metadata

Footnote/caveat field annotating data-quality issues on the '45+ day determinations' metric. 89.26% null suggests footnotes are only attached when a state reports a known issue; among the 1,104 populated rows, 46.11% flag the same problem ('Incorrectly reporting processing time at application level, as opposed to the individual level'), and several values are semicolon-concatenated combinations rather than atomic codes.

Treatment: Split on '; ' into a multi-label flag set and use as data-quality filters, not as a model feature.

anthropic:claude-opus-4-7 · confidence high
Out[124]:

saturn.columns["Total Medicaid and CHIP Determinations Processed in More than 45 Days - footnotes"].stats

statvalue
n10,302
nulls9,196 (89.3%)
unique15
top_value Incorrectly reporting processing time at application level, as opposed to the individual level
top_rate 0.4611
cardinality 15
entropy 2.547
entropy_ratio 0.652
alert: null_rate89.3% null
Fig 45.
Top values for Total Medicaid and CHIP Determinations Processed in More than 45 Days - footnotes.
Show data table
Top values for Total Medicaid and CHIP Determinations Processed in More than 45 Days - footnotes (15 unique shown, of 15 total).
valuecountshare
Incorrectly reporting processing time at application level, as opposed to the individual level5105.0%
Incorrectly includes redeterminations1831.8%
Does not include all MAGI determinations on applications1431.4%
Incorrectly reporting processing time at application level, as opposed to the individual level; Incorrectly includes redeterminations760.7%
Incorrectly includes redeterminations; Does not include all MAGI determinations on applications570.6%
Unable to provide data due to system limitations390.4%
Includes some non-MAGI determinations on applications240.2%
Includes some non-MAGI determinations on applications; Incorrectly includes redeterminations180.2%
Incorrectly reporting processing time at application level, as opposed to the individual level; Does not include all MAGI determinations on applications180.2%
Reflects incorrect start dates for processing time on some reopened applications100.1%
Includes some non-Medicaid or CHIP determinations on applications; Reflects incorrect start dates for processing time on some reopened applications80.1%
Includes some non-MAGI determinations on applications; Does not include all MAGI determinations on applications60.1%
Includes some non-MAGI determinations on applications; Incorrectly reporting processing time at application level, as opposed to the individual level60.1%
Incorrectly includes reinstatements60.1%
Includes some non-Medicaid or CHIP determinations on applications20.0%

Total Call Center Volume (Number of Calls) numeric feature

Numeric volume of calls handled by a call center, almost certainly an aggregate per row (e.g., monthly or per-center totals). Nearly 70% of rows are null (null_rate 0.6949), so this column is sparsely populated. The distribution is heavily right-skewed (skew 4.66, kurtosis 25.1) with a median of 73,754 but a max of 2,615,575 and 223 outliers (7.1%), indicating a small number of very high-volume reporters dominate the tail.

Treatment: Log-transform and impute or flag the 69% missing before modelling.

anthropic:claude-opus-4-7 · confidence high
Out[127]:

saturn.columns["Total Call Center Volume (Number of Calls)"].stats

statvalue
n10,302
nulls7,159 (69.5%)
unique1,592
min 5,750
max 2.616e+06
mean 1.723e+05
median 73,754
std 3.191e+05
q1 31,323
q3 180,553
iqr 149,230
skew 4.656
kurtosis 25.11
n_outliers 223
outlier_rate 0.07095
zero_rate 0
alert: high_skewskew=+4.66
alert: outliers7.1% rows beyond 1.5 IQR
alert: null_rate69.5% null
Fig 46.
Distribution of Total Call Center Volume (Number of Calls). Vertical dash marks the median.
Show data table
Histogram bins for Total Call Center Volume (Number of Calls) (median: 73754.0).
bincount
5750 – 7.1e+041556
7.1e+04 – 1.362e+05576
1.362e+05 – 2.015e+05313
2.015e+05 – 2.667e+05235
2.667e+05 – 3.32e+05167
3.32e+05 – 3.972e+0569
3.972e+05 – 4.625e+0526
4.625e+05 – 5.277e+056
5.277e+05 – 5.93e+0510
5.93e+05 – 6.582e+0523
6.582e+05 – 7.235e+0524
7.235e+05 – 7.887e+0525
7.887e+05 – 8.539e+056
8.539e+05 – 9.192e+0510
9.192e+05 – 9.844e+058
9.844e+05 – 1.05e+066
1.05e+06 – 1.115e+064
1.115e+06 – 1.18e+0610
1.18e+06 – 1.245e+066
1.245e+06 – 1.311e+064
1.311e+06 – 1.376e+060
1.376e+06 – 1.441e+060
1.441e+06 – 1.506e+062
1.506e+06 – 1.572e+062
1.572e+06 – 1.637e+060
1.637e+06 – 1.702e+064
1.702e+06 – 1.767e+060
1.767e+06 – 1.833e+066
1.833e+06 – 1.898e+063
1.898e+06 – 1.963e+068
1.963e+06 – 2.028e+066
2.028e+06 – 2.094e+064
2.094e+06 – 2.159e+062
2.159e+06 – 2.224e+062
2.224e+06 – 2.289e+066
2.289e+06 – 2.355e+062
2.355e+06 – 2.42e+062
2.42e+06 – 2.485e+062
2.485e+06 – 2.55e+064
2.55e+06 – 2.616e+064

Total Call Center Volume (Number of Calls) - footnotes categorical metadata

This column holds free-text footnotes qualifying the 'Total Call Center Volume' metric, explaining caveats like excluded after-hours calls or inclusion of other benefit programs. 71% of rows are null, and only 27 distinct footnote strings cover the remaining 2,993 records, with the top note appearing 601 times (20% of non-nulls). The values are semicolon-concatenated combinations of a small set of standard caveats, so this is metadata about measurement methodology rather than a feature.

Treatment: Keep as methodological annotation; if used in modelling, parse into binary flags for each underlying caveat rather than treating as a categorical.

anthropic:claude-opus-4-7 · confidence high
Out[130]:

saturn.columns["Total Call Center Volume (Number of Calls) - footnotes"].stats

statvalue
n10,302
nulls7,310 (71.0%)
unique27
top_value Does not include all calls received after business hours; Includes calls for other benefit programs
top_rate 0.2009
cardinality 27
entropy 3.742
entropy_ratio 0.787
alert: null_rate71.0% null
Fig 47.
Top values for Total Call Center Volume (Number of Calls) - footnotes.
Show data table
Top values for Total Call Center Volume (Number of Calls) - footnotes (20 unique shown, of 27 total).
valuecountshare
Does not include all calls received after business hours; Includes calls for other benefit programs6015.8%
Includes calls for other benefit programs4354.2%
Does not include all calls received after business hours4184.1%
Does not include all calls received by call centers; Does not include all calls received after business hours3032.9%
Includes calls for other benefit programs; Includes only calls transferred to a live agent1871.8%
Does not include all calls received by call centers; Does not include all calls received after business hours; Includes only calls transferred to a live agent1321.3%
Does not include all calls received by call centers1101.1%
Does not include all calls received by call centers; Includes calls for other benefit programs830.8%
Does not include all calls received by call centers; Does not include all calls received after business hours; Includes calls for other benefit programs740.7%
Does not include all calls received after business hours; Includes only calls transferred to a live agent630.6%
Includes state-based marketplace (SBM) data630.6%
Does not include all calls received after business hours; Includes calls for other benefit programs; Includes only calls transferred to a live agent630.6%
Does not include all calls received by call centers; Includes calls for other benefit programs; Includes state-based marketplace (SBM) data630.6%
Does not include all calls received by call centers; Does not include all calls received after business hours; Includes calls for other benefit programs; Includes state-based marketplace (SBM) data630.6%
Does not include all calls received by call centers; Includes only calls transferred to a live agent630.6%
Does not include all calls received by call centers; Does not include all calls received after business hours; Includes state-based marketplace (SBM) data630.6%
Does not operate a call center630.6%
Does not include all calls received after business hours; Includes calls for other benefit programs; Includes state-based marketplace (SBM) data630.6%
Does not include all calls received after business hours; Includes state-based marketplace (SBM) data600.6%
Did not report data because of technical reasons50.0%

Average Call Center Wait Time (Minutes) numeric feature

Average call-centre wait time in minutes, recorded for only about 30% of rows (null_rate 0.6947). Distribution is right-skewed (skew 1.78, kurtosis 3.29) with median 5 minutes versus mean 9.95 and a max of 72; 11.6% of observed values are exactly zero and 145 outliers (4.6%) sit above the q3+IQR fence. Heavy missingness suggests the metric is only captured for cases that involved a call.

Treatment: Impute or add a missingness indicator and log-transform before modelling.

anthropic:claude-opus-4-7 · confidence high
Out[133]:

saturn.columns["Average Call Center Wait Time (Minutes)"].stats

statvalue
n10,302
nulls7,157 (69.5%)
unique63
min 0
max 72
mean 9.945
median 5
std 12.2
q1 1
q3 15
iqr 14
skew 1.78
kurtosis 3.29
n_outliers 145
outlier_rate 0.0461
zero_rate 0.1161
alert: null_rate69.5% null
Fig 48.
Distribution of Average Call Center Wait Time (Minutes). Vertical dash marks the median.
Show data table
Histogram bins for Average Call Center Wait Time (Minutes) (median: 5.0).
bincount
0 – 1.8942
1.8 – 3.6417
3.6 – 5.4329
5.4 – 7.2188
7.2 – 980
9 – 10.8116
10.8 – 12.6168
12.6 – 14.4103
14.4 – 16.2106
16.2 – 1843
18 – 19.886
19.8 – 21.675
21.6 – 23.473
23.4 – 25.250
25.2 – 2720
27 – 28.844
28.8 – 30.648
30.6 – 32.447
32.4 – 34.238
34.2 – 3615
36 – 37.828
37.8 – 39.628
39.6 – 41.46
41.4 – 43.215
43.2 – 4512
45 – 46.814
46.8 – 48.610
48.6 – 50.42
50.4 – 52.24
52.2 – 542
54 – 55.80
55.8 – 57.610
57.6 – 59.412
59.4 – 61.23
61.2 – 632
63 – 64.82
64.8 – 66.64
66.6 – 68.40
68.4 – 70.20
70.2 – 723

Average Call Center Wait Time (Minutes) - footnotes categorical metadata

Free-form footnote annotations qualifying the 'Average Call Center Wait Time' metric, describing measurement caveats like callback handling, after-hours exclusions, and inclusion of other benefit programs. 69.73% of rows are null and the remaining values spread across 44 semicolon-concatenated combinations with very high entropy ratio (0.918), so even the most common footnote covers only 7.89% of non-nulls. The values are clearly composed from a small set of reusable phrases joined with semicolons rather than free prose.

Treatment: Split on '; ' and one-hot encode the underlying caveat phrases rather than treating each combination as a distinct category.

anthropic:claude-opus-4-7 · confidence high
Out[136]:

saturn.columns["Average Call Center Wait Time (Minutes) - footnotes"].stats

statvalue
n10,302
nulls7,184 (69.7%)
unique44
top_value Call centers offer callbacks; Includes calls for other benefit programs; Includes only calls transferred to a live agent
top_rate 0.0789
cardinality 44
entropy 5.013
entropy_ratio 0.9183
alert: null_rate69.7% null
Fig 49.
Top values for Average Call Center Wait Time (Minutes) - footnotes.
Show data table
Top values for Average Call Center Wait Time (Minutes) - footnotes (20 unique shown, of 44 total).
valuecountshare
Call centers offer callbacks; Includes calls for other benefit programs; Includes only calls transferred to a live agent2462.4%
Does not include all calls received after business hours; Includes only calls transferred to a live agent1841.8%
Includes calls for other benefit programs; Includes only calls transferred to a live agent1711.7%
Does not include all calls received after business hours; Includes calls for other benefit programs; Includes only calls transferred to a live agent1401.4%
Does not include all calls received by call centers; Does not include all calls received after business hours; Includes only calls transferred to a live agent1321.3%
Call centers offer callbacks; Does not include all calls received after business hours1281.2%
Does not include all calls received after business hours; Includes calls for other benefit programs1261.2%
Call centers offer callbacks; Does not include all calls received after business hours; Includes calls for other benefit programs; Includes only calls transferred to a live agent1261.2%
Callbacks are included; Call centers offer callbacks; Does not include all calls received after business hours; Includes only calls transferred to a live agent1261.2%
Callbacks are included; Call centers offer callbacks; Does not include all calls received by call centers; Does not include all calls received after business hours; Includes only calls transferred to a live agent1091.1%
Callbacks are included; Call centers offer callbacks; Does not include all calls received after business hours; Includes calls for other benefit programs991.0%
Call centers offer callbacks; Does not include all calls received after business hours; Includes only calls transferred to a live agent981.0%
Callbacks are included; Call centers offer callbacks; Does not include all calls received after business hours; Includes calls for other benefit programs; Includes only calls transferred to a live agent981.0%
Call centers offer callbacks; Does not include all calls received by call centers; Does not include all calls received after business hours; Includes only calls transferred to a live agent870.8%
Callbacks are included; Call centers offer callbacks; Does not include all calls received by call centers; Does not include all calls received after business hours; Includes calls for other benefit programs; Includes only calls transferred to a live agent700.7%
Call centers offer callbacks; Does not include all calls received by call centers690.7%
Call centers offer callbacks; Does not include all calls received after business hours; Includes only calls transferred to a live agent; Includes state-based marketplace (SBM) data630.6%
Callbacks are included; Call centers offer callbacks; Includes calls for other benefit programs; Includes only calls transferred to a live agent630.6%
Callbacks are included; Call centers offer callbacks; Includes calls for other benefit programs630.6%
Callbacks are included; Call centers offer callbacks; Does not include all calls received by call centers; Includes calls for other benefit programs; Includes only calls transferred to a live agent; Includes state-based marketplace (SBM) data630.6%

Average Call Center Abandonment Rate numeric feature

Numeric metric capturing the share of inbound calls abandoned at a contact center, expressed as a proportion between 0 and 0.652. Distribution is right-skewed (skew 1.22) with median 0.088 well below mean 0.132, and 54 outliers (1.7%) sit in the upper tail. The dominant concern is coverage: 69.49% of rows are null, so the column is populated for less than a third of records.

Treatment: Impute or add a missingness indicator before modelling, and consider a log or sqrt transform to tame the right skew.

anthropic:claude-opus-4-7 · confidence high
Out[139]:

saturn.columns["Average Call Center Abandonment Rate"].stats

statvalue
n10,302
nulls7,159 (69.5%)
unique408
min 0
max 0.652
mean 0.1321
median 0.088
std 0.1291
q1 0.024
q3 0.212
iqr 0.188
skew 1.222
kurtosis 1.146
n_outliers 54
outlier_rate 0.01718
zero_rate 0.0009545
alert: null_rate69.5% null
Fig 50.
Distribution of Average Call Center Abandonment Rate. Vertical dash marks the median.
Show data table
Histogram bins for Average Call Center Abandonment Rate (median: 0.088).
bincount
0 – 0.0163594
0.0163 – 0.0326328
0.0326 – 0.0489200
0.0489 – 0.0652193
0.0652 – 0.0815202
0.0815 – 0.0978129
0.0978 – 0.1141123
0.1141 – 0.1304120
0.1304 – 0.146792
0.1467 – 0.163106
0.163 – 0.179387
0.1793 – 0.1956104
0.1956 – 0.211979
0.2119 – 0.2282122
0.2282 – 0.2445108
0.2445 – 0.260868
0.2608 – 0.277165
0.2771 – 0.293440
0.2934 – 0.309739
0.3097 – 0.32645
0.326 – 0.342343
0.3423 – 0.358626
0.3586 – 0.374950
0.3749 – 0.391224
0.3912 – 0.407514
0.4075 – 0.423828
0.4238 – 0.440118
0.4401 – 0.456420
0.4564 – 0.472716
0.4727 – 0.4892
0.489 – 0.505310
0.5053 – 0.521610
0.5216 – 0.53798
0.5379 – 0.55424
0.5542 – 0.57058
0.5705 – 0.58680
0.5868 – 0.60314
0.6031 – 0.61942
0.6194 – 0.63576
0.6357 – 0.6526

Average Call Center Abandonment Rate - footnotes categorical metadata

This is a free-text footnote field qualifying the 'Average Call Center Abandonment Rate' metric, with 36 distinct semicolon-joined caveat combinations describing measurement scope (e.g., excluded after-hours calls, inclusion of other benefit programs, live-agent-only counts). It is missing for 69.73% of rows, and even the most common note covers only 12.54% of present values, with entropy ratio 0.833 indicating the remaining caveats are spread fairly evenly. The recurring phrases suggest these are reusable methodology disclaimers attached per reporter rather than free prose.

Treatment: Keep as documentation; if needed for modelling, split on ';' and one-hot the individual caveat flags.

anthropic:claude-opus-4-7 · confidence high
Out[142]:

saturn.columns["Average Call Center Abandonment Rate - footnotes"].stats

statvalue
n10,302
nulls7,184 (69.7%)
unique36
top_value Does not include all calls received by call centers; Does not include all calls received after business hours; Includes only calls transferred to a live agent
top_rate 0.1254
cardinality 36
entropy 4.307
entropy_ratio 0.8332
alert: null_rate69.7% null
Fig 51.
Top values for Average Call Center Abandonment Rate - footnotes.
Show data table
Top values for Average Call Center Abandonment Rate - footnotes (20 unique shown, of 36 total).
valuecountshare
Does not include all calls received by call centers; Does not include all calls received after business hours; Includes only calls transferred to a live agent3913.8%
Includes calls for other benefit programs; Includes only calls transferred to a live agent3543.4%
Does not include all calls received after business hours; Includes calls for other benefit programs; Includes only calls transferred to a live agent3083.0%
Does not include all calls received after business hours; Includes only calls transferred to a live agent2102.0%
Includes calls for other benefit programs2052.0%
Does not include all calls received after business hours2001.9%
Does not include all calls received after business hours; Includes calls for other benefit programs1701.7%
Does not include all calls received by call centers; Includes only calls transferred to a live agent1241.2%
Does not include all calls received after business hours; Includes only calls transferred to a live agent; Includes state-based marketplace (SBM) data1231.2%
Does not include all calls received by call centers; Does not include all calls received after business hours; Includes calls for other benefit programs; Includes only calls transferred to a live agent1131.1%
Does not include all calls received by call centers; Includes calls for other benefit programs; Includes only calls transferred to a live agent830.8%
Callbacks are included; Does not include all calls received after business hours; Includes calls for other benefit programs780.8%
Callbacks are included; Includes calls for other benefit programs; Includes only calls transferred to a live agent630.6%
Does not include all calls received by call centers; Includes calls for other benefit programs; Includes only calls transferred to a live agent; Includes state-based marketplace (SBM) data630.6%
Does not include all calls received by call centers; Does not include all calls received after business hours; Includes calls for other benefit programs; Includes state-based marketplace (SBM) data630.6%
Does not include all calls received by call centers; Does not include all calls received after business hours; Includes only calls transferred to a live agent; Includes state-based marketplace (SBM) data630.6%
Callbacks are included; Does not include all calls received after business hours; Includes only calls transferred to a live agent630.6%
Does not operate a call center630.6%
Callbacks are included; Does not include all calls received after business hours630.6%
Does not include all calls received after business hours; Includes calls for other benefit programs; Includes only calls transferred to a live agent; Includes state-based marketplace (SBM) data630.6%

How to cite

click to copy

BibTeX
@misc{saturn-cms-medicaid-2026,
  author       = {Steuber, Luke},
  title        = {Saturn reading: cms medicaid},
  year         ={2026},
  howpublished = {\url{https://dr.eamer.dev/saturn/view/cms-medicaid}},
  note         = {Profiled with saturn-dissect v0.2.0, prompt saturn-insight-v2, model anthropic:claude-opus-4-7},
}
APA
Steuber, L. (2026). Saturn reading: cms medicaid. Source: /home/coolhand/datasets/accessibility-atlas/cms_medicaid_enrollment_2026.csv. Profiled with saturn-dissect v0.2.0 (saturn-insight-v2, anthropic:claude-opus-4-7). Retrieved from https://dr.eamer.dev/saturn/view/cms-medicaid