saturn

/home/coolhand/html/datavis/data_trove/cache/county_health_rankings.parquet 3,222 rows sample n=3,222 seed 42 2026-05-01T16:58:05+00:00

Overview

Source/home/coolhand/html/datavis/data_trove/cache/county_health_rankings.parquet
Total rows3,222
Profiled sample3,222
Columns5
Generated2026-05-01T16:58:05+00:00

Insights opt-in

Model-generated narrative. These are opinions, not facts — the stats below are what saturn measured. Generated by: anthropic:claude-opus-4-7.

Dataset high anthropic:claude-opus-4-7

This dataset contains 3,222 rows of US county-level health data, with each row identified by a unique county name and FIPS code, plus three numeric measures: total population, uninsured population, and uninsured rate. The population fields are extremely right-skewed — total_pop ranges from 47 to nearly 9.87 million with a median of 25,328, and uninsured_pop shows similar skew (median 36, max 20,915), so a few large counties dominate. The uninsured_rate is the most analytically interesting field: it has a median of 0.12 but stretches up to 3.7, with about 17% of counties reporting zero, suggesting either small/edge cases or data quality issues worth investigating. Start by examining the distribution of uninsured_rate and how it relates to total_pop.

fips high anthropic:claude-opus-4-7

This is the U.S. county FIPS code: every one of the 3222 rows is unique with no nulls, and the value range (1001 to 72153) matches the standard 5-digit state+county encoding. The distribution is near-symmetric (skew 0.16) with no statistical outliers, consistent with an identifier rather than a measured quantity.

county_name high anthropic:claude-opus-4-7

This column lists US county names paired with their state (e.g., 'County, Texas'), with all 3222 values unique and no nulls. The token 'county,' appears 2999 times, suggesting ~223 rows use a different suffix (likely 'Parish' in Louisiana or 'Borough/Census Area' in Alaska). State frequencies match expectations, with Texas (256) leading — consistent with Texas having the most counties nationally.

total_pop high anthropic:claude-opus-4-7

This is a population count, almost certainly per geographic unit (likely US counties given n=3222), with values from 47 to 9,866,623 and a median of 25,328. The distribution is severely right-skewed (skew 13.38, kurtosis 298.69) with 453 outliers (14.06%), reflecting a few massive metros dwarfing thousands of small areas. Mean (102,232) sits far above the median, confirming the heavy tail.

uninsured_pop high anthropic:claude-opus-4-7

Likely a county- or tract-level count of uninsured residents, with 3222 rows and 584 unique values. The distribution is extremely right-skewed (skew 17.8, kurtosis 462.9): median is 36 while the max hits 20915 and the mean is 159.9, and 17.2% of rows are zero. About 11.4% of values (368) flag as outliers, consistent with a few very populous areas dominating.

uninsured_rate medium anthropic:claude-opus-4-7

Likely a per-record uninsured rate, expressed as a fraction (median 0.12, q3 0.25) but with a long tail reaching 3.7, which is implausible for a true rate and suggests mixed units or data entry errors. The distribution is severely right-skewed (skew 4.10, kurtosis 27.70) with 230 outliers (7.1%) and 17.5% exact zeros. No nulls across 3222 rows and only 152 unique values, hinting at rounded or binned source data.

Numeric correlation

fips numeric

rows3,222
null0 (0.0%)
unique3,222
min1,001
max72,153
mean31,378
median30,022
std16,300
q119,030
q346,104
iqr27,075
skew0.157
kurtosis-0.631
n_outliers0
outlier_rate0.000
zero_rate0.000

county_name text

100.0% of rows are unique strings
rows3,222
null0 (0.0%)
unique3,222
len_min16
len_max59
len_mean24.324
len_median24.000
len_p9531.000
word_mean3.248
word_median3.000
n_empty0
n_duplicates0
duplicate_rate0.000
vocab_size1,990
readability_flesch_mean10.284
emoji_rate0.000
url_rate0.000
one_word_rate0.000
allcaps_rate0.000
boilerplate_rate0.000
Sample values (first 10)
  1. Bibb County, Alabama
  2. Cheatham County, Tennessee
  3. Piute County, Utah
  4. Lamb County, Texas
  5. Martin County, Minnesota
  6. Sheridan County, Wyoming
  7. Chickasaw County, Mississippi
  8. Rockingham County, Virginia
  9. Liberty County, Texas
  10. Clark County, Arkansas

total_pop numeric

skew=+13.38 14.1% rows beyond 1.5 IQR
rows3,222
null0 (0.0%)
unique3,141
min47.000
max9,866,623
mean102,232
median25,328
std326,934
q110,611
q365,190
iqr54,579
skew13.377
kurtosis298.689
n_outliers453
outlier_rate0.141
zero_rate0.000

uninsured_pop numeric

skew=+17.81 11.4% rows beyond 1.5 IQR
rows3,222
null0 (0.0%)
unique584
min0.000
max20,915
mean159.945
median36.000
std627.163
q17.000
q3120.000
iqr113.000
skew17.811
kurtosis462.866
n_outliers368
outlier_rate0.114
zero_rate0.172

uninsured_rate numeric

skew=+4.10 7.1% rows beyond 1.5 IQR
rows3,222
null0 (0.0%)
unique152
min0.000
max3.700
mean0.200
median0.120
std0.283
q10.040
q30.250
iqr0.210
skew4.095
kurtosis27.703
n_outliers230
outlier_rate0.071
zero_rate0.175