saturn

/home/coolhand/datasets/us-inequality-atlas/disability/census_disability_by_county_2022.csv 3,222 rows sample n=3,222 seed 42 2026-05-01T17:36:35+00:00

Overview

Source/home/coolhand/datasets/us-inequality-atlas/disability/census_disability_by_county_2022.csv
Total rows3,222
Profiled sample3,222
Columns16
Generated2026-05-01T17:36:35+00:00

Insights opt-in

Model-generated narrative. These are opinions, not facts — the stats below are what saturn measured. Generated by: anthropic:claude-opus-4-7.

Dataset high anthropic:claude-opus-4-7

This dataset contains 2022 US Census disability counts for 3,222 counties, broken out by disability type (ambulatory, cognitive, hearing, vision, self-care, independent living) along with totals, a derived disability rate, and FIPS identifiers. Nearly every count column is heavily right-skewed (skew above 10) with substantial outliers — total_population alone ranges from 47 to 9.87M with a mean of ~102K but a median of just 25,328, so a handful of large counties dominate the raw counts. The disability_rate field is the most analyst-friendly view: it's bounded, less skewed (skew 2.17), and centers around a median of 1.07 with an IQR of 0.77–1.42. Start with disability_rate to compare counties on equal footing, then look at total_population to understand the size distribution before interpreting any raw disability counts.

fips high anthropic:claude-opus-4-7

This column is the FIPS county code: every one of the 3222 rows is unique and non-null, and the value range (1001 to 72153) matches the standard US state+county FIPS encoding. The distribution is near-symmetric (skew 0.16, kurtosis -0.63) with no outliers, which is expected for an identifier rather than a measured quantity. Treat it as a categorical key, not a number.

county_name high anthropic:claude-opus-4-7

Each of the 3,222 rows holds a unique county-and-state string (e.g., '... County, Texas'), averaging 24 characters and roughly 3 words. The token 'county,' appears 2,999 times, so a small minority of entries use a different suffix (parish, borough, census area). Texas (256), Virginia (189), and Georgia (159) lead the state distribution, consistent with a full U.S. county roster.

state_fips high anthropic:claude-opus-4-7

This is almost certainly the US state FIPS code: 52 unique integer values across 3222 rows, ranging from 1 to 72 with no nulls or zeros. The count of 52 (rather than 50) and a max of 72 indicate inclusion of DC and territories like Puerto Rico. Distribution is roughly uniform (skew 0.16, kurtosis -0.63), consistent with a categorical geographic identifier rather than a measurement.

county_fips high anthropic:claude-opus-4-7

This is the county-level portion of a FIPS code, stored as an integer from 1 to 840 across 3222 rows with no nulls and only 330 distinct values. The distribution is heavily right-skewed (skew 2.87, kurtosis 11.64) with 178 outliers (5.5%), which is expected since county codes are categorical identifiers and most values cluster low (median 79, Q3 133) while a few counties carry much larger codes. Treating this as a numeric feature would be misleading despite the numeric dtype.

total_population high anthropic:claude-opus-4-7

Likely a county- or region-level total population count across 3,222 rows with no nulls and 3,141 unique values. The distribution is extremely right-skewed (skew 13.38, kurtosis 298.69): the median is 25,328 but the mean is 102,232 and the max reaches 9,866,623, with 453 outliers (14.06%) flagged above the IQR fence.

disability_total high anthropic:claude-opus-4-7

A heavily right-skewed count of disability cases or claims, ranging from 0 to 69,705 with a median of just 298 but a mean of 1,043. Skew of 10.28 and kurtosis of 166.8 indicate an extreme long tail, with 404 outliers (12.5% of rows) and only 1.7% zeros. The std (2,906) dwarfs the IQR (689), so a small number of very large records dominate the distribution.

disability_rate high anthropic:claude-opus-4-7

Numeric disability_rate spanning 0.0 to 9.17 with a median of 1.07 and IQR 0.77-1.42, almost certainly a per-row rate or percentage. The distribution is heavily right-skewed (skew 2.17, kurtosis 15.24) with 117 outliers (3.6%) stretching well beyond the typical range, and 1.7% of rows sit at exactly zero. No nulls across 3,222 rows, and only 305 distinct values suggest rounding to two decimals.

no_disability high anthropic:claude-opus-4-7

Counts of people recorded as having no disability per geographic or administrative unit, ranging from 0 to 2,091,332 with a median of 5,607. The distribution is extremely right-skewed (skew 12.67, kurtosis 259.77) and the mean of 22,872 sits well above Q3 of 14,739, with 442 outliers (13.7%) flagging a long tail of very large units. Only one zero is present and there are no nulls, so the heavy tail—not missingness—is the dominant feature.

one_disability high anthropic:claude-opus-4-7

This column appears to be a count of people with one disability per geographic or administrative unit, ranging from 0 to 44,466 with a median of 217.5. The distribution is severely right-skewed (skew 9.45, kurtosis 139.4), with the mean (755.7) more than triple the median and 408 outliers (12.7% of rows) — consistent with a few very large units dominating a long tail of small ones. About 2.8% of rows are zero and there are no nulls.

two_plus_disabilities high anthropic:claude-opus-4-7

This column appears to be a count of people (likely per geographic unit) reporting two or more disabilities, ranging from 0 to 25,239 with a median of just 76. The distribution is extremely right-skewed (skew 12.57, kurtosis 253.95), with 11.67% of rows flagged as outliers and ~9% exact zeros, suggesting a few very large jurisdictions dominate while most are small. The mean (287.7) sits well above Q3 (222), confirming the long tail.

hearing_disability medium anthropic:claude-opus-4-7

This appears to be a count or population-style measure related to hearing disability, with all 3222 rows populated and 2314 distinct values ranging from 1 to 296898. The distribution is extremely right-skewed (skew 11.54, kurtosis 226.6) with a median of 1326 well below the mean of 4003, and 391 outliers (12.1%) inflate the tail. The min of 1 and absence of zeros suggest these are aggregated counts rather than individual indicators.

vision_disability high anthropic:claude-opus-4-7

Numeric counts of people with a vision disability per geographic or demographic unit, ranging from 0 to 346,901 with a median of 1,361. The distribution is extremely right-skewed (skew 12.29, kurtosis 254.79) and the mean of 4,246 sits well above Q3 of 3,291, with 380 outliers (11.8%) inflating the upper tail. Near-zero zero_rate (0.03%) and no nulls suggest clean population-style aggregates rather than survey responses.

cognitive_disability high anthropic:claude-opus-4-7

Likely a count of people with a cognitive disability per geographic or administrative unit, ranging from 0 to 413,990 with a median of 1,623. The distribution is severely right-skewed (skew 12.09, kurtosis 254.7) with 375 outliers (11.6% of rows) and a mean (5,142) more than triple the median, indicating a few very large units dominate. Near-zero null and zero rates suggest the count is reliably populated.

ambulatory_disability high anthropic:claude-opus-4-7

Counts of people with ambulatory disability per geographic unit, ranging from 3 to 548,175 with a median of 2,197. The distribution is severely right-skewed (skew 13.0, kurtosis 288.7) and 11.4% of rows are flagged as outliers, indicating a long tail of very large jurisdictions dominating the mean (6,497) versus the median.

self_care_disability high anthropic:claude-opus-4-7

Numeric counts of people with a self-care disability, likely aggregated per geographic or demographic unit given the 3222 rows and 1961 unique values. The distribution is severely right-skewed (skew 16.8, kurtosis 478.7) with a median of 772.5 but a max of 281,611, and 355 outliers (11.0% of rows) sit far above the Q3 of 1948.5. Near-zero null and zero rates suggest the field is consistently populated.

independent_living_disability high anthropic:claude-opus-4-7

Counts of people with an independent-living disability per geographic unit, ranging from 2 to 1,417,825 with a median of 3,135. The distribution is severely right-skewed (skew 14.09, kurtosis 329.97) and 13.8% of rows (445) flag as outliers, suggesting the column mixes small areas with very large aggregates. No nulls or zeros are present.

Numeric correlation

fips numeric

rows3,222
null0 (0.0%)
unique3,222
min1,001
max72,153
mean31,378
median30,022
std16,300
q119,030
q346,104
iqr27,075
skew0.157
kurtosis-0.631
n_outliers0
outlier_rate0.000
zero_rate0.000

county_name text

100.0% of rows are unique strings
rows3,222
null0 (0.0%)
unique3,222
len_min16
len_max59
len_mean24.324
len_median24.000
len_p9531.000
word_mean3.248
word_median3.000
n_empty0
n_duplicates0
duplicate_rate0.000
vocab_size1,990
readability_flesch_mean10.284
emoji_rate0.000
url_rate0.000
one_word_rate0.000
allcaps_rate0.000
boilerplate_rate0.000
Sample values (first 10)
  1. Bibb County, Alabama
  2. Cheatham County, Tennessee
  3. Piute County, Utah
  4. Lamb County, Texas
  5. Martin County, Minnesota
  6. Sheridan County, Wyoming
  7. Chickasaw County, Mississippi
  8. Rockingham County, Virginia
  9. Liberty County, Texas
  10. Clark County, Arkansas

state_fips numeric

rows3,222
null0 (0.0%)
unique52
min1.000
max72.000
mean31.275
median30.000
std16.285
q119.000
q346.000
iqr27.000
skew0.157
kurtosis-0.627
n_outliers0
outlier_rate0.000
zero_rate0.000

county_fips numeric

skew=+2.87 5.5% rows beyond 1.5 IQR
rows3,222
null0 (0.0%)
unique330
min1.000
max840.000
mean103.216
median79.000
std106.561
q135.000
q3133.000
iqr98.000
skew2.866
kurtosis11.640
n_outliers178
outlier_rate0.055
zero_rate0.000

total_population numeric

skew=+13.38 14.1% rows beyond 1.5 IQR
rows3,222
null0 (0.0%)
unique3,141
min47.000
max9,866,623
mean102,232
median25,328
std326,934
q110,611
q365,190
iqr54,579
skew13.377
kurtosis298.689
n_outliers453
outlier_rate0.141
zero_rate0.000

disability_total numeric

skew=+10.28 12.5% rows beyond 1.5 IQR
rows3,222
null0 (0.0%)
unique1,393
min0.000
max69,705
mean1,043
median298.000
std2,906
q1107.000
q3796.250
iqr689.250
skew10.283
kurtosis166.796
n_outliers404
outlier_rate0.125
zero_rate0.017

disability_rate numeric

skew=+2.17
rows3,222
null0 (0.0%)
unique305
min0.000
max9.170
mean1.145
median1.070
std0.622
q10.770
q31.420
iqr0.650
skew2.167
kurtosis15.238
n_outliers117
outlier_rate0.036
zero_rate0.017

no_disability numeric

skew=+12.67 13.7% rows beyond 1.5 IQR
rows3,222
null0 (0.0%)
unique2,955
min0.000
max2,091,332
mean22,872
median5,607
std73,289
q12,384
q314,739
iqr12,355
skew12.673
kurtosis259.770
n_outliers442
outlier_rate0.137
zero_rate3.10e-04

one_disability numeric

skew=+9.45 12.7% rows beyond 1.5 IQR
rows3,222
null0 (0.0%)
unique1,212
min0.000
max44,466
mean755.660
median217.500
std2,032
q176.250
q3586.750
iqr510.500
skew9.449
kurtosis139.402
n_outliers408
outlier_rate0.127
zero_rate0.028

two_plus_disabilities numeric

skew=+12.57 11.7% rows beyond 1.5 IQR
rows3,222
null0 (0.0%)
unique786
min0.000
max25,239
mean287.704
median76.000
std890.091
q121.000
q3222.000
iqr201.000
skew12.570
kurtosis253.949
n_outliers376
outlier_rate0.117
zero_rate0.090

hearing_disability numeric

skew=+11.54 12.1% rows beyond 1.5 IQR
rows3,222
null0 (0.0%)
unique2,314
min1.000
max296,898
mean4,003
median1,326
std10,685
q1579.250
q33,193
iqr2,614
skew11.542
kurtosis226.566
n_outliers391
outlier_rate0.121
zero_rate0.000

vision_disability numeric

skew=+12.29 11.8% rows beyond 1.5 IQR
rows3,222
null0 (0.0%)
unique2,349
min0.000
max346,901
mean4,246
median1,361
std12,047
q1567.000
q33,291
iqr2,724
skew12.291
kurtosis254.794
n_outliers380
outlier_rate0.118
zero_rate3.10e-04

cognitive_disability numeric

skew=+12.09 11.6% rows beyond 1.5 IQR
rows3,222
null0 (0.0%)
unique2,473
min0.000
max413,990
mean5,142
median1,623
std14,208
q1634.000
q34,173
iqr3,539
skew12.086
kurtosis254.723
n_outliers375
outlier_rate0.116
zero_rate3.10e-04

ambulatory_disability numeric

skew=+13.01 11.4% rows beyond 1.5 IQR
rows3,222
null0 (0.0%)
unique2,614
min3.000
max548,175
mean6,497
median2,197
std18,225
q1917.250
q35,261
iqr4,344
skew13.008
kurtosis288.662
n_outliers366
outlier_rate0.114
zero_rate0.000

self_care_disability numeric

skew=+16.82 11.0% rows beyond 1.5 IQR
rows3,222
null0 (0.0%)
unique1,961
min0.000
max281,611
mean2,504
median772.500
std8,061
q1307.000
q31,948
iqr1,642
skew16.822
kurtosis478.724
n_outliers355
outlier_rate0.110
zero_rate1.24e-03

independent_living_disability numeric

skew=+14.09 13.8% rows beyond 1.5 IQR
rows3,222
null0 (0.0%)
unique2,773
min2.000
max1,417,825
mean13,626
median3,135
std45,603
q11,242
q38,586
iqr7,344
skew14.086
kurtosis329.968
n_outliers445
outlier_rate0.138
zero_rate0.000