healthcare data county health rankings 20260121
Reading
This dataset covers 3,222 U.S. counties (one row per FIPS code) with population totals and uninsured counts and rates. Both total_pop and uninsured_pop are extremely right-skewed (skew 13.4 and 17.8) with hundreds of outliers, indicating a handful of very large counties dominate the raw counts — analysts should work in per-capita or log space. The uninsured_rate is the more comparable metric: median 0.12 with about 17.5% of counties reporting zero, and a long tail reaching 3.7 that warrants a data-quality check. The county_name field shows Texas, Virginia, and Georgia contributing the most counties, useful context for any state-level rollups.
citing: row_count · column_count · columns.total_pop.stats.skew · columns.total_pop.stats.median · columns.total_pop.stats.max · columns.total_pop.stats.n_outliers · columns.uninsured_pop.stats.skew · columns.uninsured_pop.stats.zero_rate · columns.uninsured_rate.stats.median · columns.uninsured_rate.stats.max · columns.uninsured_rate.stats.zero_rate · columns.county_name.top_words
Charts the summary said to look at first
Show data table
| bin | count |
|---|---|
| 47 – 2.467e+05 | 2942 |
| 2.467e+05 – 4.934e+05 | 137 |
| 4.934e+05 – 7.4e+05 | 56 |
| 7.4e+05 – 9.867e+05 | 39 |
| 9.867e+05 – 1.233e+06 | 13 |
| 1.233e+06 – 1.48e+06 | 9 |
| 1.48e+06 – 1.727e+06 | 7 |
| 1.727e+06 – 1.973e+06 | 3 |
| 1.973e+06 – 2.22e+06 | 3 |
| 2.22e+06 – 2.467e+06 | 4 |
| 2.467e+06 – 2.713e+06 | 3 |
| 2.713e+06 – 2.96e+06 | 0 |
| 2.96e+06 – 3.207e+06 | 2 |
| 3.207e+06 – 3.453e+06 | 0 |
| 3.453e+06 – 3.7e+06 | 0 |
| 3.7e+06 – 3.947e+06 | 0 |
| 3.947e+06 – 4.193e+06 | 0 |
| 4.193e+06 – 4.44e+06 | 1 |
| 4.44e+06 – 4.687e+06 | 0 |
| 4.687e+06 – 4.933e+06 | 1 |
| 4.933e+06 – 5.18e+06 | 0 |
| 5.18e+06 – 5.427e+06 | 1 |
| 5.427e+06 – 5.673e+06 | 0 |
| 5.673e+06 – 5.92e+06 | 0 |
| 5.92e+06 – 6.167e+06 | 0 |
| 6.167e+06 – 6.413e+06 | 0 |
| 6.413e+06 – 6.66e+06 | 0 |
| 6.66e+06 – 6.907e+06 | 0 |
| 6.907e+06 – 7.153e+06 | 0 |
| 7.153e+06 – 7.4e+06 | 0 |
| 7.4e+06 – 7.647e+06 | 0 |
| 7.647e+06 – 7.893e+06 | 0 |
| 7.893e+06 – 8.14e+06 | 0 |
| 8.14e+06 – 8.387e+06 | 0 |
| 8.387e+06 – 8.633e+06 | 0 |
| 8.633e+06 – 8.88e+06 | 0 |
| 8.88e+06 – 9.127e+06 | 0 |
| 9.127e+06 – 9.373e+06 | 0 |
| 9.373e+06 – 9.62e+06 | 0 |
| 9.62e+06 – 9.867e+06 | 1 |
Show data table
| bin | count |
|---|---|
| 0 – 0.0925 | 1403 |
| 0.0925 – 0.185 | 704 |
| 0.185 – 0.2775 | 403 |
| 0.2775 – 0.37 | 213 |
| 0.37 – 0.4625 | 158 |
| 0.4625 – 0.555 | 101 |
| 0.555 – 0.6475 | 65 |
| 0.6475 – 0.74 | 43 |
| 0.74 – 0.8325 | 27 |
| 0.8325 – 0.925 | 23 |
| 0.925 – 1.018 | 9 |
| 1.018 – 1.11 | 15 |
| 1.11 – 1.202 | 14 |
| 1.202 – 1.295 | 5 |
| 1.295 – 1.387 | 7 |
| 1.387 – 1.48 | 7 |
| 1.48 – 1.573 | 5 |
| 1.573 – 1.665 | 2 |
| 1.665 – 1.758 | 4 |
| 1.758 – 1.85 | 1 |
| 1.85 – 1.942 | 1 |
| 1.942 – 2.035 | 1 |
| 2.035 – 2.127 | 2 |
| 2.127 – 2.22 | 2 |
| 2.22 – 2.312 | 1 |
| 2.312 – 2.405 | 0 |
| 2.405 – 2.498 | 0 |
| 2.498 – 2.59 | 1 |
| 2.59 – 2.683 | 0 |
| 2.683 – 2.775 | 1 |
| 2.775 – 2.868 | 0 |
| 2.868 – 2.96 | 1 |
| 2.96 – 3.052 | 1 |
| 3.052 – 3.145 | 0 |
| 3.145 – 3.237 | 1 |
| 3.237 – 3.33 | 0 |
| 3.33 – 3.422 | 0 |
| 3.422 – 3.515 | 0 |
| 3.515 – 3.607 | 0 |
| 3.607 – 3.7 | 1 |
Show data table
| bin | count |
|---|---|
| 0 – 522.9 | 3022 |
| 522.9 – 1046 | 124 |
| 1046 – 1569 | 32 |
| 1569 – 2092 | 16 |
| 2092 – 2614 | 7 |
| 2614 – 3137 | 5 |
| 3137 – 3660 | 5 |
| 3660 – 4183 | 2 |
| 4183 – 4706 | 0 |
| 4706 – 5229 | 1 |
| 5229 – 5752 | 2 |
| 5752 – 6274 | 1 |
| 6274 – 6797 | 0 |
| 6797 – 7320 | 0 |
| 7320 – 7843 | 0 |
| 7843 – 8366 | 1 |
| 8366 – 8889 | 1 |
| 8889 – 9412 | 0 |
| 9412 – 9935 | 0 |
| 9935 – 1.046e+04 | 0 |
| 1.046e+04 – 1.098e+04 | 0 |
| 1.098e+04 – 1.15e+04 | 2 |
| 1.15e+04 – 1.203e+04 | 0 |
| 1.203e+04 – 1.255e+04 | 0 |
| 1.255e+04 – 1.307e+04 | 0 |
| 1.307e+04 – 1.359e+04 | 0 |
| 1.359e+04 – 1.412e+04 | 0 |
| 1.412e+04 – 1.464e+04 | 0 |
| 1.464e+04 – 1.516e+04 | 0 |
| 1.516e+04 – 1.569e+04 | 0 |
| 1.569e+04 – 1.621e+04 | 0 |
| 1.621e+04 – 1.673e+04 | 0 |
| 1.673e+04 – 1.725e+04 | 0 |
| 1.725e+04 – 1.778e+04 | 0 |
| 1.778e+04 – 1.83e+04 | 0 |
| 1.83e+04 – 1.882e+04 | 0 |
| 1.882e+04 – 1.935e+04 | 0 |
| 1.935e+04 – 1.987e+04 | 0 |
| 1.987e+04 – 2.039e+04 | 0 |
| 2.039e+04 – 2.092e+04 | 1 |
Show data table
| chars | count |
|---|---|
| 16 – 17 | 26 |
| 17 – 18 | 72 |
| 18 – 19 | 121 |
| 19 – 20 | 190 |
| 20 – 21 | 264 |
| 21 – 22 | 407 |
| 22 – 24 | 420 |
| 24 – 25 | 363 |
| 25 – 26 | 320 |
| 26 – 27 | 240 |
| 27 – 28 | 231 |
| 28 – 29 | 152 |
| 29 – 30 | 139 |
| 30 – 31 | 165 |
| 31 – 32 | 41 |
| 32 – 33 | 28 |
| 33 – 34 | 16 |
| 34 – 35 | 10 |
| 35 – 36 | 5 |
| 36 – 38 | 0 |
| 38 – 39 | 1 |
| 39 – 40 | 1 |
| 40 – 41 | 0 |
| 41 – 42 | 1 |
| 42 – 43 | 1 |
| 43 – 44 | 0 |
| 44 – 45 | 2 |
| 45 – 46 | 0 |
| 46 – 47 | 1 |
| 47 – 48 | 1 |
| 48 – 49 | 0 |
| 49 – 50 | 0 |
| 50 – 51 | 0 |
| 51 – 53 | 0 |
| 53 – 54 | 2 |
| 54 – 55 | 1 |
| 55 – 56 | 0 |
| 56 – 57 | 0 |
| 57 – 58 | 0 |
| 58 – 59 | 1 |
Schema
5 columns| Alerts | ||||
|---|---|---|---|---|
| fips | text | 0.0% | 3,222 |
near_unique
one_word
allcaps
short_text
|
| county_name | text | 0.0% | 3,222 |
near_unique
|
| total_pop | numeric | 0.0% | 3,141 |
high_skew
outliers
|
| uninsured_pop | numeric | 0.0% | 584 |
high_skew
outliers
|
| uninsured_rate | numeric | 0.0% | 152 |
high_skew
outliers
|
fips
text identifier near_unique one_word allcaps short_textThis column is a 5-character FIPS code identifying U.S. counties, with every one of the 3222 rows holding a unique value (n_unique equals n) and zero nulls. Lengths are uniformly 5 (min, median, max all 5), values are single tokens (one_word_rate 1.0), and the leading samples like 01001, 01003, 01005 match Alabama county FIPS prefixes. It functions as a primary key rather than a feature. Treatment: Use as a join key to county-level reference tables; do not feed into a model as a feature.
- n
- 3,222
- nulls
- 0 (0.0%)
- unique
- 3,222
- len_min
- 5
- len_max
- 5
- len_mean
- 5
- len_median
- 5
- len_p95
- 5
- word_mean
- 1
- word_median
- 1
- n_empty
- 0
- n_duplicates
- 0
- duplicate_rate
- 0
- vocab_size
- 3,222
- readability_flesch_mean
- 121.2
- emoji_rate
- 0
- url_rate
- 0
- one_word_rate
- 1
- allcaps_rate
- 1
- boilerplate_rate
- 0
county_name
text identifier near_uniqueThis column holds U.S. county identifiers, likely formatted as 'County Name County, State' given that 'county,' appears in 2,999 of 3,222 rows and state names like Texas (256), Virginia (189), and Georgia (159) dominate the top tokens. Every one of the 3,222 rows is unique with zero nulls and zero duplicates, consistent with a canonical roster of U.S. counties. String lengths cluster tightly (min 16, median 24, max 59) and average 3.25 words, so formatting is highly regular. Treatment: Use as a join key to state/county references; do not feed raw into a model.
- n
- 3,222
- nulls
- 0 (0.0%)
- unique
- 3,222
- len_min
- 16
- len_max
- 59
- len_mean
- 24.32
- len_median
- 24
- len_p95
- 31
- word_mean
- 3.248
- word_median
- 3
- n_empty
- 0
- n_duplicates
- 0
- duplicate_rate
- 0
- vocab_size
- 1,990
- readability_flesch_mean
- 10.28
- emoji_rate
- 0
- url_rate
- 0
- one_word_rate
- 0
- allcaps_rate
- 0
- boilerplate_rate
- 0
total_pop
numeric feature high_skew outliersLikely a county- or region-level total population count: 3222 rows with 3141 unique values, no nulls, integer-scale magnitudes from 47 up to 9,866,623. The distribution is extremely right-skewed (skew 13.38, kurtosis 298.69) with median 25,328 far below mean 102,232, and 14.06% of rows flagged as outliers. The std of 326,933 dwarfs the IQR of 54,579, consistent with a few massive metros pulling the tail. Treatment: log-transform before regression to tame the heavy right tail.
- n
- 3,222
- nulls
- 0 (0.0%)
- unique
- 3,141
- min
- 47
- max
- 9.867e+06
- mean
- 1.022e+05
- median
- 25,328
- std
- 3.269e+05
- q1
- 1.061e+04
- q3
- 65,190
- iqr
- 5.458e+04
- skew
- 13.38
- kurtosis
- 298.7
- n_outliers
- 453
- outlier_rate
- 0.1406
- zero_rate
- 0
uninsured_pop
numeric feature high_skew outliersCounts of uninsured people per record, likely aggregated to a geographic unit (3222 rows hints at US counties). The distribution is brutally right-skewed: median is 36 but the mean is 159.9 and the max hits 20915, with skew 17.8 and kurtosis 462.9. Roughly 17.2% of rows are zero and 11.4% flag as outliers, so a handful of large jurisdictions dominate the totals. Treatment: Apply a log1p transform and consider normalising by population before modelling.
- n
- 3,222
- nulls
- 0 (0.0%)
- unique
- 584
- min
- 0
- max
- 20,915
- mean
- 159.9
- median
- 36
- std
- 627.2
- q1
- 7
- q3
- 120
- iqr
- 113
- skew
- 17.81
- kurtosis
- 462.9
- n_outliers
- 368
- outlier_rate
- 0.1142
- zero_rate
- 0.1723
uninsured_rate
numeric feature high_skew outliersThis looks like a per-record uninsured rate, ranging from 0.0 to 3.7 with a median of 0.12 and IQR of 0.21. The distribution is severely right-skewed (skew 4.10, kurtosis 27.70) with 230 outliers (7.14%) and 17.54% exact zeros, and the max of 3.7 is implausible if this is meant to be a proportion bounded at 1. Treatment: Validate the >1 values against the expected [0,1] range, then log- or logit-transform after winsorising before modelling.
- n
- 3,222
- nulls
- 0 (0.0%)
- unique
- 152
- min
- 0
- max
- 3.7
- mean
- 0.2002
- median
- 0.12
- std
- 0.2829
- q1
- 0.04
- q3
- 0.25
- iqr
- 0.21
- skew
- 4.095
- kurtosis
- 27.7
- n_outliers
- 230
- outlier_rate
- 0.07138
- zero_rate
- 0.1754