saturn·

disability census disability by county 2022

saturn notebook · generated 2026-05-01 Report Notebook

Overview

Source: /home/coolhand/datasets/us-inequality-atlas/disability/census_disability_by_county_2022.csv

Saturn profiled 3,222 rows across 16 columns. The stats below are deterministic and machine-readable; the prose is a language-model interpretation of those stats (opt-in, added after the fact, never sees raw rows).

[2]:
!pip install saturn-dissect
import subprocess
subprocess.run([
    "saturn", "analyze", "/home/coolhand/datasets/us-inequality-atlas/disability/census_disability_by_county_2022.csv",
    "--findings", "disability-census_disability_by_county_2022.json",
    "--llm", "anthropic:claude-opus-4-7",
])

Summary confidence: high

This dataset contains 2022 US Census disability counts for 3,222 counties, broken out by disability type (ambulatory, cognitive, hearing, vision, self-care, independent living) along with totals, a derived disability rate, and FIPS identifiers. Nearly every count column is heavily right-skewed (skew above 10) with substantial outliers — total_population alone ranges from 47 to 9.87M with a mean of ~102K but a median of just 25,328, so a handful of large counties dominate the raw counts. The disability_rate field is the most analyst-friendly view: it's bounded, less skewed (skew 2.17), and centers around a median of 1.07 with an IQR of 0.77–1.42. Start with disability_rate to compare counties on equal footing, then look at total_population to understand the size distribution before interpreting any raw disability counts.

citing: row_count · column_count · total_population · disability_rate · disability_total · ambulatory_disability · independent_living_disability · state_fips

Out[4]:

saturn.schema() · 16 columns

column kind n null% unique alerts
fips numeric 3,222 0.0% 3,222
county_name text 3,222 0.0% 3,222 near_unique
state_fips numeric 3,222 0.0% 52
county_fips numeric 3,222 0.0% 330 high_skew outliers
total_population numeric 3,222 0.0% 3,141 high_skew outliers
disability_total numeric 3,222 0.0% 1,393 high_skew outliers
disability_rate numeric 3,222 0.0% 305 high_skew
no_disability numeric 3,222 0.0% 2,955 high_skew outliers
one_disability numeric 3,222 0.0% 1,212 high_skew outliers
two_plus_disabilities numeric 3,222 0.0% 786 high_skew outliers
hearing_disability numeric 3,222 0.0% 2,314 high_skew outliers
vision_disability numeric 3,222 0.0% 2,349 high_skew outliers
cognitive_disability numeric 3,222 0.0% 2,473 high_skew outliers
ambulatory_disability numeric 3,222 0.0% 2,614 high_skew outliers
self_care_disability numeric 3,222 0.0% 1,961 high_skew outliers
independent_living_disability numeric 3,222 0.0% 2,773 high_skew outliers
Fig 1.
disability_rate · Most counties cluster near a 1.07 median rate; check the long tail out to 9.17 for unusual counties.
Show data table
Histogram bins for disability_rate (median: 1.07).
bincount
0 – 0.2293114
0.2293 – 0.4585143
0.4585 – 0.6878352
0.6878 – 0.917590
0.917 – 1.146634
1.146 – 1.376496
1.376 – 1.605362
1.605 – 1.834200
1.834 – 2.063118
2.063 – 2.29277
2.292 – 2.52245
2.522 – 2.75131
2.751 – 2.9822
2.98 – 3.2110
3.21 – 3.4397
3.439 – 3.6684
3.668 – 3.8973
3.897 – 4.1273
4.127 – 4.3563
4.356 – 4.5852
4.585 – 4.8140
4.814 – 5.0432
5.043 – 5.2732
5.273 – 5.5020
5.502 – 5.7310
5.731 – 5.9610
5.961 – 6.190
6.19 – 6.4190
6.419 – 6.6480
6.648 – 6.8780
6.878 – 7.1070
7.107 – 7.3360
7.336 – 7.5651
7.565 – 7.7950
7.795 – 8.0240
8.024 – 8.2530
8.253 – 8.4820
8.482 – 8.7120
8.712 – 8.9410
8.941 – 9.171
Fig 2.
total_population · Extreme right skew — a few mega-counties dwarf the median of ~25K residents and will distort any raw count comparisons.
Show data table
Histogram bins for total_population (median: 25328.0).
bincount
47 – 2.467e+052942
2.467e+05 – 4.934e+05137
4.934e+05 – 7.4e+0556
7.4e+05 – 9.867e+0539
9.867e+05 – 1.233e+0613
1.233e+06 – 1.48e+069
1.48e+06 – 1.727e+067
1.727e+06 – 1.973e+063
1.973e+06 – 2.22e+063
2.22e+06 – 2.467e+064
2.467e+06 – 2.713e+063
2.713e+06 – 2.96e+060
2.96e+06 – 3.207e+062
3.207e+06 – 3.453e+060
3.453e+06 – 3.7e+060
3.7e+06 – 3.947e+060
3.947e+06 – 4.193e+060
4.193e+06 – 4.44e+061
4.44e+06 – 4.687e+060
4.687e+06 – 4.933e+061
4.933e+06 – 5.18e+060
5.18e+06 – 5.427e+061
5.427e+06 – 5.673e+060
5.673e+06 – 5.92e+060
5.92e+06 – 6.167e+060
6.167e+06 – 6.413e+060
6.413e+06 – 6.66e+060
6.66e+06 – 6.907e+060
6.907e+06 – 7.153e+060
7.153e+06 – 7.4e+060
7.4e+06 – 7.647e+060
7.647e+06 – 7.893e+060
7.893e+06 – 8.14e+060
8.14e+06 – 8.387e+060
8.387e+06 – 8.633e+060
8.633e+06 – 8.88e+060
8.88e+06 – 9.127e+060
9.127e+06 – 9.373e+060
9.373e+06 – 9.62e+060
9.62e+06 – 9.867e+061
Fig 3.
disability_total · Same skewed shape as population; useful to confirm disability counts track population size.
Show data table
Histogram bins for disability_total (median: 298.0).
bincount
0 – 17432797
1743 – 3485223
3485 – 522871
5228 – 697042
6970 – 871330
8713 – 1.046e+0420
1.046e+04 – 1.22e+048
1.22e+04 – 1.394e+047
1.394e+04 – 1.568e+047
1.568e+04 – 1.743e+040
1.743e+04 – 1.917e+042
1.917e+04 – 2.091e+042
2.091e+04 – 2.265e+041
2.265e+04 – 2.44e+044
2.44e+04 – 2.614e+042
2.614e+04 – 2.788e+040
2.788e+04 – 2.962e+041
2.962e+04 – 3.137e+041
3.137e+04 – 3.311e+040
3.311e+04 – 3.485e+040
3.485e+04 – 3.66e+040
3.66e+04 – 3.834e+041
3.834e+04 – 4.008e+040
4.008e+04 – 4.182e+040
4.182e+04 – 4.357e+040
4.357e+04 – 4.531e+041
4.531e+04 – 4.705e+040
4.705e+04 – 4.879e+040
4.879e+04 – 5.054e+040
5.054e+04 – 5.228e+040
5.228e+04 – 5.402e+041
5.402e+04 – 5.576e+040
5.576e+04 – 5.751e+040
5.751e+04 – 5.925e+040
5.925e+04 – 6.099e+040
6.099e+04 – 6.273e+040
6.273e+04 – 6.448e+040
6.448e+04 – 6.622e+040
6.622e+04 – 6.796e+040
6.796e+04 – 6.97e+041
Fig 4.
ambulatory_disability · The largest single disability category by mean; inspect the outlier tail (366 outliers) before aggregating.
Show data table
Histogram bins for ambulatory_disability (median: 2197.0).
bincount
3 – 1.371e+042903
1.371e+04 – 2.741e+04167
2.741e+04 – 4.112e+0464
4.112e+04 – 5.482e+0440
5.482e+04 – 6.852e+0414
6.852e+04 – 8.223e+048
8.223e+04 – 9.593e+045
9.593e+04 – 1.096e+053
1.096e+05 – 1.233e+053
1.233e+05 – 1.37e+052
1.37e+05 – 1.508e+057
1.508e+05 – 1.645e+051
1.645e+05 – 1.782e+051
1.782e+05 – 1.919e+050
1.919e+05 – 2.056e+050
2.056e+05 – 2.193e+050
2.193e+05 – 2.33e+051
2.33e+05 – 2.467e+051
2.467e+05 – 2.604e+050
2.604e+05 – 2.741e+050
2.741e+05 – 2.878e+050
2.878e+05 – 3.015e+051
3.015e+05 – 3.152e+050
3.152e+05 – 3.289e+050
3.289e+05 – 3.426e+050
3.426e+05 – 3.563e+050
3.563e+05 – 3.7e+050
3.7e+05 – 3.837e+050
3.837e+05 – 3.974e+050
3.974e+05 – 4.111e+050
4.111e+05 – 4.248e+050
4.248e+05 – 4.385e+050
4.385e+05 – 4.522e+050
4.522e+05 – 4.659e+050
4.659e+05 – 4.797e+050
4.797e+05 – 4.934e+050
4.934e+05 – 5.071e+050
5.071e+05 – 5.208e+050
5.208e+05 – 5.345e+050
5.345e+05 – 5.482e+051
Fig 5.
state_fips · Shows how the 3,222 county rows distribute across the 52 state codes — useful for spotting state-level coverage imbalances.
Show data table
Histogram bins for state_fips (median: 30.0).
bincount
1 – 2.77597
2.775 – 4.5515
4.55 – 6.325133
6.325 – 8.164
8.1 – 9.8759
9.875 – 11.654
11.65 – 13.42226
13.42 – 15.25
15.2 – 16.9844
16.98 – 18.75194
18.75 – 20.52204
20.52 – 22.3184
22.3 – 24.0740
24.07 – 25.8514
25.85 – 27.62170
27.62 – 29.4197
29.4 – 31.17149
31.17 – 32.9517
32.95 – 34.7331
34.73 – 36.595
36.5 – 38.27153
38.27 – 40.05165
40.05 – 41.8236
41.82 – 43.667
43.6 – 45.3851
45.38 – 47.15161
47.15 – 48.92254
48.92 – 50.743
50.7 – 52.47133
52.47 – 54.2594
54.25 – 56.0295
56.02 – 57.80
57.8 – 59.570
59.57 – 61.350
61.35 – 63.120
63.12 – 64.90
64.9 – 66.670
66.67 – 68.450
68.45 – 70.220
70.22 – 7278
Fig 6.
Per-column null rate across the corpus. Columns are ordered by input position.
Show data table
Per-column null rate across the corpus.
columnkindnull %
fipsnumeric0.0%
county_nametext0.0%
state_fipsnumeric0.0%
county_fipsnumeric0.0%
total_populationnumeric0.0%
disability_totalnumeric0.0%
disability_ratenumeric0.0%
no_disabilitynumeric0.0%
one_disabilitynumeric0.0%
two_plus_disabilitiesnumeric0.0%
hearing_disabilitynumeric0.0%
vision_disabilitynumeric0.0%
cognitive_disabilitynumeric0.0%
ambulatory_disabilitynumeric0.0%
self_care_disabilitynumeric0.0%
independent_living_disabilitynumeric0.0%
Fig 7.
Pearson correlation across numeric columns (sampled, bounded).
Show data table
Pearson correlation across 12 numeric columns (values clipped to 2 decimals).
fipsstate_fipscounty_fipstotal_populationdisability_totaldisability_rateno_disabilityone_disabilitytwo_plus_disabilitieshearing_disabilityvision_disabilitycognitive_disability
fips+1.00+1.00+0.15-0.07-0.08-0.06-0.07-0.08-0.08-0.08-0.08-0.08
state_fips+1.00+1.00+0.14-0.07-0.08-0.06-0.07-0.08-0.08-0.08-0.08-0.08
county_fips+0.15+0.14+1.00-0.02-0.01+0.12-0.01-0.01-0.02-0.04-0.04-0.03
total_population-0.07-0.07-0.02+1.00+0.97-0.07+1.00+0.97+0.97+0.99+0.99+0.98
disability_total-0.08-0.08-0.01+0.97+1.00-0.02+0.98+1.00+0.99+0.98+0.98+0.99
disability_rate-0.06-0.06+0.12-0.07-0.02+1.00-0.07-0.01-0.03-0.07-0.07-0.05
no_disability-0.07-0.07-0.01+1.00+0.98-0.07+1.00+0.98+0.97+0.98+0.98+0.98
one_disability-0.08-0.08-0.01+0.97+1.00-0.01+0.98+1.00+0.98+0.97+0.97+0.98
two_plus_disabilities-0.08-0.08-0.02+0.97+0.99-0.03+0.97+0.98+1.00+0.98+0.98+0.99
hearing_disability-0.08-0.08-0.04+0.99+0.98-0.07+0.98+0.97+0.98+1.00+1.00+0.99
vision_disability-0.08-0.08-0.04+0.99+0.98-0.07+0.98+0.97+0.98+1.00+1.00+0.99
cognitive_disability-0.08-0.08-0.03+0.98+0.99-0.05+0.98+0.98+0.99+0.99+0.99+1.00

fips numeric identifier

This column is the FIPS county code: every one of the 3222 rows is unique and non-null, and the value range (1001 to 72153) matches the standard US state+county FIPS encoding. The distribution is near-symmetric (skew 0.16, kurtosis -0.63) with no outliers, which is expected for an identifier rather than a measured quantity. Treat it as a categorical key, not a number.

Treatment: Cast to zero-padded string and use as a join key to county-level data; do not feed as a numeric feature.

anthropic:claude-opus-4-7 · confidence high
Out[13]:

saturn.columns["fips"].stats

statvalue
n3,222
nulls0 (0.0%)
unique3,222
min 1,001
max 72,153
mean 3.138e+04
median 30,022
std 1.63e+04
q1 1.903e+04
q3 4.61e+04
iqr 27,075
skew 0.1574
kurtosis -0.6314
n_outliers 0
outlier_rate 0
zero_rate 0
Fig 8.
Distribution of fips. Vertical dash marks the median.
Show data table
Histogram bins for fips (median: 30022.0).
bincount
1001 – 278097
2780 – 455915
4559 – 6337133
6337 – 811659
8116 – 989514
9895 – 1.167e+044
1.167e+04 – 1.345e+04226
1.345e+04 – 1.523e+045
1.523e+04 – 1.701e+0449
1.701e+04 – 1.879e+04189
1.879e+04 – 2.057e+04204
2.057e+04 – 2.235e+04184
2.235e+04 – 2.413e+0439
2.413e+04 – 2.59e+0415
2.59e+04 – 2.768e+04170
2.768e+04 – 2.946e+04196
2.946e+04 – 3.124e+04150
3.124e+04 – 3.302e+0427
3.302e+04 – 3.48e+0421
3.48e+04 – 3.658e+0495
3.658e+04 – 3.836e+04153
3.836e+04 – 4.013e+04155
4.013e+04 – 4.191e+0446
4.191e+04 – 4.369e+0467
4.369e+04 – 4.547e+0451
4.547e+04 – 4.725e+04161
4.725e+04 – 4.903e+04268
4.903e+04 – 5.081e+0429
5.081e+04 – 5.259e+04133
5.259e+04 – 5.436e+0494
5.436e+04 – 5.614e+0495
5.614e+04 – 5.792e+040
5.792e+04 – 5.97e+040
5.97e+04 – 6.148e+040
6.148e+04 – 6.326e+040
6.326e+04 – 6.504e+040
6.504e+04 – 6.682e+040
6.682e+04 – 6.86e+040
6.86e+04 – 7.037e+040
7.037e+04 – 7.215e+0478

county_name text identifier

Each of the 3,222 rows holds a unique county-and-state string (e.g., '... County, Texas'), averaging 24 characters and roughly 3 words. The token 'county,' appears 2,999 times, so a small minority of entries use a different suffix (parish, borough, census area). Texas (256), Virginia (189), and Georgia (159) lead the state distribution, consistent with a full U.S. county roster.

Treatment: Split into county and state fields, then left-join on a FIPS lookup.

anthropic:claude-opus-4-7 · confidence high
Out[16]:

saturn.columns["county_name"].stats

statvalue
n3,222
nulls0 (0.0%)
unique3,222
len_min 16
len_max 59
len_mean 24.32
len_median 24
len_p95 31
word_mean 3.248
word_median 3
n_empty 0
n_duplicates 0
duplicate_rate 0
vocab_size 1,990
readability_flesch_mean 10.28
emoji_rate 0
url_rate 0
one_word_rate 0
allcaps_rate 0
boilerplate_rate 0
alert: near_unique100.0% of rows are unique strings
Fig 9.
Character-length distribution for county_name.
Show data table
Character-length distribution for county_name (mean: 24.324022346368714).
charscount
16 – 1726
17 – 1872
18 – 19121
19 – 20190
20 – 21264
21 – 22407
22 – 24420
24 – 25363
25 – 26320
26 – 27240
27 – 28231
28 – 29152
29 – 30139
30 – 31165
31 – 3241
32 – 3328
33 – 3416
34 – 3510
35 – 365
36 – 380
38 – 391
39 – 401
40 – 410
41 – 421
42 – 431
43 – 440
44 – 452
45 – 460
46 – 471
47 – 481
48 – 490
49 – 500
50 – 510
51 – 530
53 – 542
54 – 551
55 – 560
56 – 570
57 – 580
58 – 591

state_fips numeric foreign_key

This is almost certainly the US state FIPS code: 52 unique integer values across 3222 rows, ranging from 1 to 72 with no nulls or zeros. The count of 52 (rather than 50) and a max of 72 indicate inclusion of DC and territories like Puerto Rico. Distribution is roughly uniform (skew 0.16, kurtosis -0.63), consistent with a categorical geographic identifier rather than a measurement.

Treatment: Cast to categorical/string and join to a state lookup table; do not treat as a numeric feature.

anthropic:claude-opus-4-7 · confidence high
Out[19]:

saturn.columns["state_fips"].stats

statvalue
n3,222
nulls0 (0.0%)
unique52
min 1
max 72
mean 31.27
median 30
std 16.29
q1 19
q3 46
iqr 27
skew 0.1574
kurtosis -0.6267
n_outliers 0
outlier_rate 0
zero_rate 0
Fig 10.
Distribution of state_fips. Vertical dash marks the median.
Show data table
Histogram bins for state_fips (median: 30.0).
bincount
1 – 2.77597
2.775 – 4.5515
4.55 – 6.325133
6.325 – 8.164
8.1 – 9.8759
9.875 – 11.654
11.65 – 13.42226
13.42 – 15.25
15.2 – 16.9844
16.98 – 18.75194
18.75 – 20.52204
20.52 – 22.3184
22.3 – 24.0740
24.07 – 25.8514
25.85 – 27.62170
27.62 – 29.4197
29.4 – 31.17149
31.17 – 32.9517
32.95 – 34.7331
34.73 – 36.595
36.5 – 38.27153
38.27 – 40.05165
40.05 – 41.8236
41.82 – 43.667
43.6 – 45.3851
45.38 – 47.15161
47.15 – 48.92254
48.92 – 50.743
50.7 – 52.47133
52.47 – 54.2594
54.25 – 56.0295
56.02 – 57.80
57.8 – 59.570
59.57 – 61.350
61.35 – 63.120
63.12 – 64.90
64.9 – 66.670
66.67 – 68.450
68.45 – 70.220
70.22 – 7278

county_fips numeric foreign_key

This is the county-level portion of a FIPS code, stored as an integer from 1 to 840 across 3222 rows with no nulls and only 330 distinct values. The distribution is heavily right-skewed (skew 2.87, kurtosis 11.64) with 178 outliers (5.5%), which is expected since county codes are categorical identifiers and most values cluster low (median 79, Q3 133) while a few counties carry much larger codes. Treating this as a numeric feature would be misleading despite the numeric dtype.

Treatment: Cast to categorical and combine with state FIPS to join on full county identifier.

anthropic:claude-opus-4-7 · confidence high
Out[22]:

saturn.columns["county_fips"].stats

statvalue
n3,222
nulls0 (0.0%)
unique330
min 1
max 840
mean 103.2
median 79
std 106.6
q1 35
q3 133
iqr 98
skew 2.866
kurtosis 11.64
n_outliers 178
outlier_rate 0.05525
zero_rate 0
alert: high_skewskew=+2.87
alert: outliers5.5% rows beyond 1.5 IQR
Fig 11.
Distribution of county_fips. Vertical dash marks the median.
Show data table
Histogram bins for county_fips (median: 79.0).
bincount
1 – 21.98523
21.98 – 42.95418
42.95 – 63.93411
63.93 – 84.9345
84.9 – 105.9352
105.9 – 126.9281
126.9 – 147.8236
147.8 – 168.8168
168.8 – 189.8140
189.8 – 210.871
210.8 – 231.745
231.7 – 252.725
252.7 – 273.722
273.7 – 294.723
294.7 – 315.622
315.6 – 336.613
336.6 – 357.611
357.6 – 378.610
378.6 – 399.511
399.5 – 420.510
420.5 – 441.511
441.5 – 462.510
462.5 – 483.411
483.4 – 504.410
504.4 – 525.47
525.4 – 546.42
546.4 – 567.31
567.3 – 588.32
588.3 – 609.33
609.3 – 630.23
630.2 – 651.22
651.2 – 672.22
672.2 – 693.25
693.2 – 714.22
714.2 – 735.13
735.1 – 756.12
756.1 – 777.13
777.1 – 798.11
798.1 – 8192
819 – 8403

total_population numeric feature

Likely a county- or region-level total population count across 3,222 rows with no nulls and 3,141 unique values. The distribution is extremely right-skewed (skew 13.38, kurtosis 298.69): the median is 25,328 but the mean is 102,232 and the max reaches 9,866,623, with 453 outliers (14.06%) flagged above the IQR fence.

Treatment: log-transform before regression to tame the heavy right tail.

anthropic:claude-opus-4-7 · confidence high
Out[25]:

saturn.columns["total_population"].stats

statvalue
n3,222
nulls0 (0.0%)
unique3,141
min 47
max 9.867e+06
mean 1.022e+05
median 25,328
std 3.269e+05
q1 1.061e+04
q3 65,190
iqr 5.458e+04
skew 13.38
kurtosis 298.7
n_outliers 453
outlier_rate 0.1406
zero_rate 0
alert: high_skewskew=+13.38
alert: outliers14.1% rows beyond 1.5 IQR
Fig 12.
Distribution of total_population. Vertical dash marks the median.
Show data table
Histogram bins for total_population (median: 25328.0).
bincount
47 – 2.467e+052942
2.467e+05 – 4.934e+05137
4.934e+05 – 7.4e+0556
7.4e+05 – 9.867e+0539
9.867e+05 – 1.233e+0613
1.233e+06 – 1.48e+069
1.48e+06 – 1.727e+067
1.727e+06 – 1.973e+063
1.973e+06 – 2.22e+063
2.22e+06 – 2.467e+064
2.467e+06 – 2.713e+063
2.713e+06 – 2.96e+060
2.96e+06 – 3.207e+062
3.207e+06 – 3.453e+060
3.453e+06 – 3.7e+060
3.7e+06 – 3.947e+060
3.947e+06 – 4.193e+060
4.193e+06 – 4.44e+061
4.44e+06 – 4.687e+060
4.687e+06 – 4.933e+061
4.933e+06 – 5.18e+060
5.18e+06 – 5.427e+061
5.427e+06 – 5.673e+060
5.673e+06 – 5.92e+060
5.92e+06 – 6.167e+060
6.167e+06 – 6.413e+060
6.413e+06 – 6.66e+060
6.66e+06 – 6.907e+060
6.907e+06 – 7.153e+060
7.153e+06 – 7.4e+060
7.4e+06 – 7.647e+060
7.647e+06 – 7.893e+060
7.893e+06 – 8.14e+060
8.14e+06 – 8.387e+060
8.387e+06 – 8.633e+060
8.633e+06 – 8.88e+060
8.88e+06 – 9.127e+060
9.127e+06 – 9.373e+060
9.373e+06 – 9.62e+060
9.62e+06 – 9.867e+061

disability_total numeric feature

A heavily right-skewed count of disability cases or claims, ranging from 0 to 69,705 with a median of just 298 but a mean of 1,043. Skew of 10.28 and kurtosis of 166.8 indicate an extreme long tail, with 404 outliers (12.5% of rows) and only 1.7% zeros. The std (2,906) dwarfs the IQR (689), so a small number of very large records dominate the distribution.

Treatment: Apply a log1p transform before modelling to tame the extreme skew and outliers.

anthropic:claude-opus-4-7 · confidence high
Out[28]:

saturn.columns["disability_total"].stats

statvalue
n3,222
nulls0 (0.0%)
unique1,393
min 0
max 69,705
mean 1043
median 298
std 2906
q1 107
q3 796.2
iqr 689.2
skew 10.28
kurtosis 166.8
n_outliers 404
outlier_rate 0.1254
zero_rate 0.01676
alert: high_skewskew=+10.28
alert: outliers12.5% rows beyond 1.5 IQR
Fig 13.
Distribution of disability_total. Vertical dash marks the median.
Show data table
Histogram bins for disability_total (median: 298.0).
bincount
0 – 17432797
1743 – 3485223
3485 – 522871
5228 – 697042
6970 – 871330
8713 – 1.046e+0420
1.046e+04 – 1.22e+048
1.22e+04 – 1.394e+047
1.394e+04 – 1.568e+047
1.568e+04 – 1.743e+040
1.743e+04 – 1.917e+042
1.917e+04 – 2.091e+042
2.091e+04 – 2.265e+041
2.265e+04 – 2.44e+044
2.44e+04 – 2.614e+042
2.614e+04 – 2.788e+040
2.788e+04 – 2.962e+041
2.962e+04 – 3.137e+041
3.137e+04 – 3.311e+040
3.311e+04 – 3.485e+040
3.485e+04 – 3.66e+040
3.66e+04 – 3.834e+041
3.834e+04 – 4.008e+040
4.008e+04 – 4.182e+040
4.182e+04 – 4.357e+040
4.357e+04 – 4.531e+041
4.531e+04 – 4.705e+040
4.705e+04 – 4.879e+040
4.879e+04 – 5.054e+040
5.054e+04 – 5.228e+040
5.228e+04 – 5.402e+041
5.402e+04 – 5.576e+040
5.576e+04 – 5.751e+040
5.751e+04 – 5.925e+040
5.925e+04 – 6.099e+040
6.099e+04 – 6.273e+040
6.273e+04 – 6.448e+040
6.448e+04 – 6.622e+040
6.622e+04 – 6.796e+040
6.796e+04 – 6.97e+041

disability_rate numeric feature

Numeric disability_rate spanning 0.0 to 9.17 with a median of 1.07 and IQR 0.77-1.42, almost certainly a per-row rate or percentage. The distribution is heavily right-skewed (skew 2.17, kurtosis 15.24) with 117 outliers (3.6%) stretching well beyond the typical range, and 1.7% of rows sit at exactly zero. No nulls across 3,222 rows, and only 305 distinct values suggest rounding to two decimals.

Treatment: Apply a log or winsorising transform before modelling to tame the right tail.

anthropic:claude-opus-4-7 · confidence high
Out[31]:

saturn.columns["disability_rate"].stats

statvalue
n3,222
nulls0 (0.0%)
unique305
min 0
max 9.17
mean 1.145
median 1.07
std 0.6215
q1 0.77
q3 1.42
iqr 0.65
skew 2.167
kurtosis 15.24
n_outliers 117
outlier_rate 0.03631
zero_rate 0.01676
alert: high_skewskew=+2.17
Fig 14.
Distribution of disability_rate. Vertical dash marks the median.
Show data table
Histogram bins for disability_rate (median: 1.07).
bincount
0 – 0.2293114
0.2293 – 0.4585143
0.4585 – 0.6878352
0.6878 – 0.917590
0.917 – 1.146634
1.146 – 1.376496
1.376 – 1.605362
1.605 – 1.834200
1.834 – 2.063118
2.063 – 2.29277
2.292 – 2.52245
2.522 – 2.75131
2.751 – 2.9822
2.98 – 3.2110
3.21 – 3.4397
3.439 – 3.6684
3.668 – 3.8973
3.897 – 4.1273
4.127 – 4.3563
4.356 – 4.5852
4.585 – 4.8140
4.814 – 5.0432
5.043 – 5.2732
5.273 – 5.5020
5.502 – 5.7310
5.731 – 5.9610
5.961 – 6.190
6.19 – 6.4190
6.419 – 6.6480
6.648 – 6.8780
6.878 – 7.1070
7.107 – 7.3360
7.336 – 7.5651
7.565 – 7.7950
7.795 – 8.0240
8.024 – 8.2530
8.253 – 8.4820
8.482 – 8.7120
8.712 – 8.9410
8.941 – 9.171

no_disability numeric feature

Counts of people recorded as having no disability per geographic or administrative unit, ranging from 0 to 2,091,332 with a median of 5,607. The distribution is extremely right-skewed (skew 12.67, kurtosis 259.77) and the mean of 22,872 sits well above Q3 of 14,739, with 442 outliers (13.7%) flagging a long tail of very large units. Only one zero is present and there are no nulls, so the heavy tail—not missingness—is the dominant feature.

Treatment: Log-transform (log1p) before modelling to tame the skew and outliers.

anthropic:claude-opus-4-7 · confidence high
Out[34]:

saturn.columns["no_disability"].stats

statvalue
n3,222
nulls0 (0.0%)
unique2,955
min 0
max 2.091e+06
mean 2.287e+04
median 5,607
std 7.329e+04
q1 2384
q3 1.474e+04
iqr 12,355
skew 12.67
kurtosis 259.8
n_outliers 442
outlier_rate 0.1372
zero_rate 0.0003104
alert: high_skewskew=+12.67
alert: outliers13.7% rows beyond 1.5 IQR
Fig 15.
Distribution of no_disability. Vertical dash marks the median.
Show data table
Histogram bins for no_disability (median: 5607.0).
bincount
0 – 5.228e+042925
5.228e+04 – 1.046e+05145
1.046e+05 – 1.568e+0564
1.568e+05 – 2.091e+0526
2.091e+05 – 2.614e+0524
2.614e+05 – 3.137e+0512
3.137e+05 – 3.66e+057
3.66e+05 – 4.183e+052
4.183e+05 – 4.705e+053
4.705e+05 – 5.228e+052
5.228e+05 – 5.751e+053
5.751e+05 – 6.274e+052
6.274e+05 – 6.797e+052
6.797e+05 – 7.32e+051
7.32e+05 – 7.842e+050
7.842e+05 – 8.365e+050
8.365e+05 – 8.888e+050
8.888e+05 – 9.411e+050
9.411e+05 – 9.934e+050
9.934e+05 – 1.046e+061
1.046e+06 – 1.098e+060
1.098e+06 – 1.15e+061
1.15e+06 – 1.203e+060
1.203e+06 – 1.255e+061
1.255e+06 – 1.307e+060
1.307e+06 – 1.359e+060
1.359e+06 – 1.412e+060
1.412e+06 – 1.464e+060
1.464e+06 – 1.516e+060
1.516e+06 – 1.568e+060
1.568e+06 – 1.621e+060
1.621e+06 – 1.673e+060
1.673e+06 – 1.725e+060
1.725e+06 – 1.778e+060
1.778e+06 – 1.83e+060
1.83e+06 – 1.882e+060
1.882e+06 – 1.934e+060
1.934e+06 – 1.987e+060
1.987e+06 – 2.039e+060
2.039e+06 – 2.091e+061

one_disability numeric feature

This column appears to be a count of people with one disability per geographic or administrative unit, ranging from 0 to 44,466 with a median of 217.5. The distribution is severely right-skewed (skew 9.45, kurtosis 139.4), with the mean (755.7) more than triple the median and 408 outliers (12.7% of rows) — consistent with a few very large units dominating a long tail of small ones. About 2.8% of rows are zero and there are no nulls.

Treatment: Apply a log1p transform before modelling to tame the heavy right tail.

anthropic:claude-opus-4-7 · confidence high
Out[37]:

saturn.columns["one_disability"].stats

statvalue
n3,222
nulls0 (0.0%)
unique1,212
min 0
max 44,466
mean 755.7
median 217.5
std 2032
q1 76.25
q3 586.8
iqr 510.5
skew 9.449
kurtosis 139.4
n_outliers 408
outlier_rate 0.1266
zero_rate 0.02793
alert: high_skewskew=+9.45
alert: outliers12.7% rows beyond 1.5 IQR
Fig 16.
Distribution of one_disability. Vertical dash marks the median.
Show data table
Histogram bins for one_disability (median: 217.5).
bincount
0 – 11122744
1112 – 2223235
2223 – 333586
3335 – 444748
4447 – 555834
5558 – 667020
6670 – 778217
7782 – 88938
8893 – 1e+046
1e+04 – 1.112e+045
1.112e+04 – 1.223e+042
1.223e+04 – 1.334e+040
1.334e+04 – 1.445e+043
1.445e+04 – 1.556e+044
1.556e+04 – 1.667e+043
1.667e+04 – 1.779e+040
1.779e+04 – 1.89e+041
1.89e+04 – 2.001e+040
2.001e+04 – 2.112e+041
2.112e+04 – 2.223e+041
2.223e+04 – 2.334e+040
2.334e+04 – 2.446e+040
2.446e+04 – 2.557e+040
2.557e+04 – 2.668e+041
2.668e+04 – 2.779e+040
2.779e+04 – 2.89e+040
2.89e+04 – 3.001e+040
3.001e+04 – 3.113e+041
3.113e+04 – 3.224e+040
3.224e+04 – 3.335e+040
3.335e+04 – 3.446e+040
3.446e+04 – 3.557e+040
3.557e+04 – 3.668e+040
3.668e+04 – 3.78e+041
3.78e+04 – 3.891e+040
3.891e+04 – 4.002e+040
4.002e+04 – 4.113e+040
4.113e+04 – 4.224e+040
4.224e+04 – 4.335e+040
4.335e+04 – 4.447e+041

two_plus_disabilities numeric feature

This column appears to be a count of people (likely per geographic unit) reporting two or more disabilities, ranging from 0 to 25,239 with a median of just 76. The distribution is extremely right-skewed (skew 12.57, kurtosis 253.95), with 11.67% of rows flagged as outliers and ~9% exact zeros, suggesting a few very large jurisdictions dominate while most are small. The mean (287.7) sits well above Q3 (222), confirming the long tail.

Treatment: Apply a log1p transform before modelling to tame the heavy right tail.

anthropic:claude-opus-4-7 · confidence high
Out[40]:

saturn.columns["two_plus_disabilities"].stats

statvalue
n3,222
nulls0 (0.0%)
unique786
min 0
max 25,239
mean 287.7
median 76
std 890.1
q1 21
q3 222
iqr 201
skew 12.57
kurtosis 253.9
n_outliers 376
outlier_rate 0.1167
zero_rate 0.0897
alert: high_skewskew=+12.57
alert: outliers11.7% rows beyond 1.5 IQR
Fig 17.
Distribution of two_plus_disabilities. Vertical dash marks the median.
Show data table
Histogram bins for two_plus_disabilities (median: 76.0).
bincount
0 – 6312904
631 – 1262166
1262 – 189361
1893 – 252437
2524 – 315519
3155 – 37867
3786 – 441710
4417 – 50483
5048 – 56791
5679 – 63100
6310 – 69412
6941 – 75723
7572 – 82031
8203 – 88342
8834 – 94652
9465 – 1.01e+040
1.01e+04 – 1.073e+040
1.073e+04 – 1.136e+041
1.136e+04 – 1.199e+040
1.199e+04 – 1.262e+040
1.262e+04 – 1.325e+040
1.325e+04 – 1.388e+040
1.388e+04 – 1.451e+041
1.451e+04 – 1.514e+041
1.514e+04 – 1.577e+040
1.577e+04 – 1.641e+040
1.641e+04 – 1.704e+040
1.704e+04 – 1.767e+040
1.767e+04 – 1.83e+040
1.83e+04 – 1.893e+040
1.893e+04 – 1.956e+040
1.956e+04 – 2.019e+040
2.019e+04 – 2.082e+040
2.082e+04 – 2.145e+040
2.145e+04 – 2.208e+040
2.208e+04 – 2.272e+040
2.272e+04 – 2.335e+040
2.335e+04 – 2.398e+040
2.398e+04 – 2.461e+040
2.461e+04 – 2.524e+041

hearing_disability numeric feature

This appears to be a count or population-style measure related to hearing disability, with all 3222 rows populated and 2314 distinct values ranging from 1 to 296898. The distribution is extremely right-skewed (skew 11.54, kurtosis 226.6) with a median of 1326 well below the mean of 4003, and 391 outliers (12.1%) inflate the tail. The min of 1 and absence of zeros suggest these are aggregated counts rather than individual indicators.

Treatment: log-transform before regression to tame the heavy right tail.

anthropic:claude-opus-4-7 · confidence medium
Out[43]:

saturn.columns["hearing_disability"].stats

statvalue
n3,222
nulls0 (0.0%)
unique2,314
min 1
max 296,898
mean 4003
median 1,326
std 1.068e+04
q1 579.2
q3 3,193
iqr 2614
skew 11.54
kurtosis 226.6
n_outliers 391
outlier_rate 0.1214
zero_rate 0
alert: high_skewskew=+11.54
alert: outliers12.1% rows beyond 1.5 IQR
Fig 18.
Distribution of hearing_disability. Vertical dash marks the median.
Show data table
Histogram bins for hearing_disability (median: 1326.0).
bincount
1 – 74232853
7423 – 1.485e+04185
1.485e+04 – 2.227e+0478
2.227e+04 – 2.969e+0443
2.969e+04 – 3.711e+0423
3.711e+04 – 4.454e+046
4.454e+04 – 5.196e+049
5.196e+04 – 5.938e+046
5.938e+04 – 6.68e+044
6.68e+04 – 7.423e+044
7.423e+04 – 8.165e+042
8.165e+04 – 8.907e+042
8.907e+04 – 9.649e+041
9.649e+04 – 1.039e+050
1.039e+05 – 1.113e+052
1.113e+05 – 1.188e+050
1.188e+05 – 1.262e+050
1.262e+05 – 1.336e+051
1.336e+05 – 1.41e+050
1.41e+05 – 1.484e+050
1.484e+05 – 1.559e+050
1.559e+05 – 1.633e+051
1.633e+05 – 1.707e+050
1.707e+05 – 1.781e+051
1.781e+05 – 1.856e+050
1.856e+05 – 1.93e+050
1.93e+05 – 2.004e+050
2.004e+05 – 2.078e+050
2.078e+05 – 2.153e+050
2.153e+05 – 2.227e+050
2.227e+05 – 2.301e+050
2.301e+05 – 2.375e+050
2.375e+05 – 2.449e+050
2.449e+05 – 2.524e+050
2.524e+05 – 2.598e+050
2.598e+05 – 2.672e+050
2.672e+05 – 2.746e+050
2.746e+05 – 2.821e+050
2.821e+05 – 2.895e+050
2.895e+05 – 2.969e+051

vision_disability numeric feature

Numeric counts of people with a vision disability per geographic or demographic unit, ranging from 0 to 346,901 with a median of 1,361. The distribution is extremely right-skewed (skew 12.29, kurtosis 254.79) and the mean of 4,246 sits well above Q3 of 3,291, with 380 outliers (11.8%) inflating the upper tail. Near-zero zero_rate (0.03%) and no nulls suggest clean population-style aggregates rather than survey responses.

Treatment: Log-transform before regression to tame the heavy right tail.

anthropic:claude-opus-4-7 · confidence high
Out[46]:

saturn.columns["vision_disability"].stats

statvalue
n3,222
nulls0 (0.0%)
unique2,349
min 0
max 346,901
mean 4246
median 1,361
std 1.205e+04
q1 567
q3 3,291
iqr 2,724
skew 12.29
kurtosis 254.8
n_outliers 380
outlier_rate 0.1179
zero_rate 0.0003104
alert: high_skewskew=+12.29
alert: outliers11.8% rows beyond 1.5 IQR
Fig 19.
Distribution of vision_disability. Vertical dash marks the median.
Show data table
Histogram bins for vision_disability (median: 1361.0).
bincount
0 – 86732892
8673 – 1.735e+04162
1.735e+04 – 2.602e+0474
2.602e+04 – 3.469e+0438
3.469e+04 – 4.336e+0419
4.336e+04 – 5.204e+045
5.204e+04 – 6.071e+049
6.071e+04 – 6.938e+045
6.938e+04 – 7.805e+044
7.805e+04 – 8.673e+043
8.673e+04 – 9.54e+043
9.54e+04 – 1.041e+052
1.041e+05 – 1.127e+052
1.127e+05 – 1.214e+050
1.214e+05 – 1.301e+050
1.301e+05 – 1.388e+050
1.388e+05 – 1.474e+050
1.474e+05 – 1.561e+051
1.561e+05 – 1.648e+050
1.648e+05 – 1.735e+050
1.735e+05 – 1.821e+050
1.821e+05 – 1.908e+052
1.908e+05 – 1.995e+050
1.995e+05 – 2.081e+050
2.081e+05 – 2.168e+050
2.168e+05 – 2.255e+050
2.255e+05 – 2.342e+050
2.342e+05 – 2.428e+050
2.428e+05 – 2.515e+050
2.515e+05 – 2.602e+050
2.602e+05 – 2.688e+050
2.688e+05 – 2.775e+050
2.775e+05 – 2.862e+050
2.862e+05 – 2.949e+050
2.949e+05 – 3.035e+050
3.035e+05 – 3.122e+050
3.122e+05 – 3.209e+050
3.209e+05 – 3.296e+050
3.296e+05 – 3.382e+050
3.382e+05 – 3.469e+051

cognitive_disability numeric feature

Likely a count of people with a cognitive disability per geographic or administrative unit, ranging from 0 to 413,990 with a median of 1,623. The distribution is severely right-skewed (skew 12.09, kurtosis 254.7) with 375 outliers (11.6% of rows) and a mean (5,142) more than triple the median, indicating a few very large units dominate. Near-zero null and zero rates suggest the count is reliably populated.

Treatment: Apply a log or log1p transform before modelling to tame the heavy right tail.

anthropic:claude-opus-4-7 · confidence high
Out[49]:

saturn.columns["cognitive_disability"].stats

statvalue
n3,222
nulls0 (0.0%)
unique2,473
min 0
max 413,990
mean 5142
median 1,623
std 1.421e+04
q1 634
q3 4173
iqr 3539
skew 12.09
kurtosis 254.7
n_outliers 375
outlier_rate 0.1164
zero_rate 0.0003104
alert: high_skewskew=+12.09
alert: outliers11.6% rows beyond 1.5 IQR
Fig 20.
Distribution of cognitive_disability. Vertical dash marks the median.
Show data table
Histogram bins for cognitive_disability (median: 1623.0).
bincount
0 – 1.035e+042877
1.035e+04 – 2.07e+04179
2.07e+04 – 3.105e+0469
3.105e+04 – 4.14e+0439
4.14e+04 – 5.175e+0417
5.175e+04 – 6.21e+0410
6.21e+04 – 7.245e+049
7.245e+04 – 8.28e+045
8.28e+04 – 9.315e+040
9.315e+04 – 1.035e+056
1.035e+05 – 1.138e+053
1.138e+05 – 1.242e+053
1.242e+05 – 1.345e+051
1.345e+05 – 1.449e+050
1.449e+05 – 1.552e+050
1.552e+05 – 1.656e+050
1.656e+05 – 1.759e+050
1.759e+05 – 1.863e+050
1.863e+05 – 1.966e+052
1.966e+05 – 2.07e+051
2.07e+05 – 2.173e+050
2.173e+05 – 2.277e+050
2.277e+05 – 2.38e+050
2.38e+05 – 2.484e+050
2.484e+05 – 2.587e+050
2.587e+05 – 2.691e+050
2.691e+05 – 2.794e+050
2.794e+05 – 2.898e+050
2.898e+05 – 3.001e+050
3.001e+05 – 3.105e+050
3.105e+05 – 3.208e+050
3.208e+05 – 3.312e+050
3.312e+05 – 3.415e+050
3.415e+05 – 3.519e+050
3.519e+05 – 3.622e+050
3.622e+05 – 3.726e+050
3.726e+05 – 3.829e+050
3.829e+05 – 3.933e+050
3.933e+05 – 4.036e+050
4.036e+05 – 4.14e+051

ambulatory_disability numeric feature

Counts of people with ambulatory disability per geographic unit, ranging from 3 to 548,175 with a median of 2,197. The distribution is severely right-skewed (skew 13.0, kurtosis 288.7) and 11.4% of rows are flagged as outliers, indicating a long tail of very large jurisdictions dominating the mean (6,497) versus the median.

Treatment: Log-transform before modelling to tame the heavy right tail.

anthropic:claude-opus-4-7 · confidence high
Out[52]:

saturn.columns["ambulatory_disability"].stats

statvalue
n3,222
nulls0 (0.0%)
unique2,614
min 3
max 548,175
mean 6497
median 2,197
std 1.822e+04
q1 917.2
q3 5261
iqr 4,344
skew 13.01
kurtosis 288.7
n_outliers 366
outlier_rate 0.1136
zero_rate 0
alert: high_skewskew=+13.01
alert: outliers11.4% rows beyond 1.5 IQR
Fig 21.
Distribution of ambulatory_disability. Vertical dash marks the median.
Show data table
Histogram bins for ambulatory_disability (median: 2197.0).
bincount
3 – 1.371e+042903
1.371e+04 – 2.741e+04167
2.741e+04 – 4.112e+0464
4.112e+04 – 5.482e+0440
5.482e+04 – 6.852e+0414
6.852e+04 – 8.223e+048
8.223e+04 – 9.593e+045
9.593e+04 – 1.096e+053
1.096e+05 – 1.233e+053
1.233e+05 – 1.37e+052
1.37e+05 – 1.508e+057
1.508e+05 – 1.645e+051
1.645e+05 – 1.782e+051
1.782e+05 – 1.919e+050
1.919e+05 – 2.056e+050
2.056e+05 – 2.193e+050
2.193e+05 – 2.33e+051
2.33e+05 – 2.467e+051
2.467e+05 – 2.604e+050
2.604e+05 – 2.741e+050
2.741e+05 – 2.878e+050
2.878e+05 – 3.015e+051
3.015e+05 – 3.152e+050
3.152e+05 – 3.289e+050
3.289e+05 – 3.426e+050
3.426e+05 – 3.563e+050
3.563e+05 – 3.7e+050
3.7e+05 – 3.837e+050
3.837e+05 – 3.974e+050
3.974e+05 – 4.111e+050
4.111e+05 – 4.248e+050
4.248e+05 – 4.385e+050
4.385e+05 – 4.522e+050
4.522e+05 – 4.659e+050
4.659e+05 – 4.797e+050
4.797e+05 – 4.934e+050
4.934e+05 – 5.071e+050
5.071e+05 – 5.208e+050
5.208e+05 – 5.345e+050
5.345e+05 – 5.482e+051

self_care_disability numeric feature

Numeric counts of people with a self-care disability, likely aggregated per geographic or demographic unit given the 3222 rows and 1961 unique values. The distribution is severely right-skewed (skew 16.8, kurtosis 478.7) with a median of 772.5 but a max of 281,611, and 355 outliers (11.0% of rows) sit far above the Q3 of 1948.5. Near-zero null and zero rates suggest the field is consistently populated.

Treatment: Log-transform before modelling and consider winsorising the long upper tail.

anthropic:claude-opus-4-7 · confidence high
Out[55]:

saturn.columns["self_care_disability"].stats

statvalue
n3,222
nulls0 (0.0%)
unique1,961
min 0
max 281,611
mean 2504
median 772.5
std 8061
q1 307
q3 1948
iqr 1642
skew 16.82
kurtosis 478.7
n_outliers 355
outlier_rate 0.1102
zero_rate 0.001241
alert: high_skewskew=+16.82
alert: outliers11.0% rows beyond 1.5 IQR
Fig 22.
Distribution of self_care_disability. Vertical dash marks the median.
Show data table
Histogram bins for self_care_disability (median: 772.5).
bincount
0 – 70402987
7040 – 1.408e+04132
1.408e+04 – 2.112e+0447
2.112e+04 – 2.816e+0422
2.816e+04 – 3.52e+0410
3.52e+04 – 4.224e+044
4.224e+04 – 4.928e+043
4.928e+04 – 5.632e+045
5.632e+04 – 6.336e+044
6.336e+04 – 7.04e+042
7.04e+04 – 7.744e+042
7.744e+04 – 8.448e+040
8.448e+04 – 9.152e+041
9.152e+04 – 9.856e+041
9.856e+04 – 1.056e+050
1.056e+05 – 1.126e+050
1.126e+05 – 1.197e+051
1.197e+05 – 1.267e+050
1.267e+05 – 1.338e+050
1.338e+05 – 1.408e+050
1.408e+05 – 1.478e+050
1.478e+05 – 1.549e+050
1.549e+05 – 1.619e+050
1.619e+05 – 1.69e+050
1.69e+05 – 1.76e+050
1.76e+05 – 1.83e+050
1.83e+05 – 1.901e+050
1.901e+05 – 1.971e+050
1.971e+05 – 2.042e+050
2.042e+05 – 2.112e+050
2.112e+05 – 2.182e+050
2.182e+05 – 2.253e+050
2.253e+05 – 2.323e+050
2.323e+05 – 2.394e+050
2.394e+05 – 2.464e+050
2.464e+05 – 2.534e+050
2.534e+05 – 2.605e+050
2.605e+05 – 2.675e+050
2.675e+05 – 2.746e+050
2.746e+05 – 2.816e+051

independent_living_disability numeric feature

Counts of people with an independent-living disability per geographic unit, ranging from 2 to 1,417,825 with a median of 3,135. The distribution is severely right-skewed (skew 14.09, kurtosis 329.97) and 13.8% of rows (445) flag as outliers, suggesting the column mixes small areas with very large aggregates. No nulls or zeros are present.

Treatment: Log-transform before modelling and consider normalising by area population to control the heavy skew.

anthropic:claude-opus-4-7 · confidence high
Out[58]:

saturn.columns["independent_living_disability"].stats

statvalue
n3,222
nulls0 (0.0%)
unique2,773
min 2
max 1.418e+06
mean 1.363e+04
median 3,135
std 4.56e+04
q1 1242
q3 8586
iqr 7344
skew 14.09
kurtosis 330
n_outliers 445
outlier_rate 0.1381
zero_rate 0
alert: high_skewskew=+14.09
alert: outliers13.8% rows beyond 1.5 IQR
Fig 23.
Distribution of independent_living_disability. Vertical dash marks the median.
Show data table
Histogram bins for independent_living_disability (median: 3135.0).
bincount
2 – 3.545e+042956
3.545e+04 – 7.089e+04144
7.089e+04 – 1.063e+0540
1.063e+05 – 1.418e+0538
1.418e+05 – 1.772e+0514
1.772e+05 – 2.127e+054
2.127e+05 – 2.481e+056
2.481e+05 – 2.836e+053
2.836e+05 – 3.19e+057
3.19e+05 – 3.545e+052
3.545e+05 – 3.899e+051
3.899e+05 – 4.253e+052
4.253e+05 – 4.608e+051
4.608e+05 – 4.962e+050
4.962e+05 – 5.317e+050
5.317e+05 – 5.671e+050
5.671e+05 – 6.026e+051
6.026e+05 – 6.38e+050
6.38e+05 – 6.735e+051
6.735e+05 – 7.089e+050
7.089e+05 – 7.444e+051
7.444e+05 – 7.798e+050
7.798e+05 – 8.153e+050
8.153e+05 – 8.507e+050
8.507e+05 – 8.861e+050
8.861e+05 – 9.216e+050
9.216e+05 – 9.57e+050
9.57e+05 – 9.925e+050
9.925e+05 – 1.028e+060
1.028e+06 – 1.063e+060
1.063e+06 – 1.099e+060
1.099e+06 – 1.134e+060
1.134e+06 – 1.17e+060
1.17e+06 – 1.205e+060
1.205e+06 – 1.241e+060
1.241e+06 – 1.276e+060
1.276e+06 – 1.311e+060
1.311e+06 – 1.347e+060
1.347e+06 – 1.382e+060
1.382e+06 – 1.418e+061

How to cite

click to copy

BibTeX
@misc{saturn-disability-census-disability-by-county-2022-2026,
  author       = {Steuber, Luke},
  title        = {Saturn reading: disability census disability by county 2022},
  year         ={2026},
  howpublished = {\url{https://dr.eamer.dev/saturn/view/disability-census_disability_by_county_2022}},
  note         = {Profiled with saturn-dissect v0.2.0, prompt saturn-insight-v2, model anthropic:claude-opus-4-7},
}
APA
Steuber, L. (2026). Saturn reading: disability census disability by county 2022. Source: /home/coolhand/datasets/us-inequality-atlas/disability/census_disability_by_county_2022.csv. Profiled with saturn-dissect v0.2.0 (saturn-insight-v2, anthropic:claude-opus-4-7). Retrieved from https://dr.eamer.dev/saturn/view/disability-census_disability_by_county_2022