rent burden
Reading
This dataset contains 3,222 rows of U.S. county-level rent burden statistics. Each row is identified by a county name and FIPS code and reports total renters plus the share of renters paying 30%+ or 50%+ of income on rent. Total renters is extremely right-skewed (skew 15.8, max 1,810,929 vs. median 2,579.5), so a handful of large urban counties dominate the distribution and warrant separate treatment. The rent-burden percentages are better behaved: averaged across counties (unweighted), about 36.4% of renters are cost-burdened at the 30%+ threshold and 17.4% at the 50%+ threshold. The 30%+ share is mildly left-skewed (skew -0.57) and the 50%+ share is nearly symmetric (skew 0.05). The most useful first look is to compare the two rent-burden distributions and isolate the outlier counties on total_renters.
citing: row_count · column_count · columns.total_renters.stats · columns.pct_rent_burdened_30plus.stats · columns.pct_rent_burdened_50plus.stats · columns.county_name.stats
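The averages quoted in the summary count every county once, regardless of size. Because total_renters is so skewed, a renter-weighted average can differ noticeably. The sketch below contrasts the two; the sample rows and the function names are made up for illustration and are not taken from the dataset.

```python
# Hypothetical sample rows: (total_renters, pct_rent_burdened_30plus).
# The values are invented for illustration only.
rows = [
    (1_000, 30.0),
    (2_500, 35.0),
    (500_000, 50.0),
]


def unweighted_mean(rows):
    """Average of county percentages, each county counted once."""
    return sum(pct for _, pct in rows) / len(rows)


def renter_weighted_mean(rows):
    """Average of county percentages, weighted by each county's renter count."""
    total = sum(n for n, _ in rows)
    return sum(n * pct for n, pct in rows) / total


print(round(unweighted_mean(rows), 2))
print(round(renter_weighted_mean(rows), 2))
```

In this toy sample the single large county pulls the weighted figure well above the unweighted one, which is the kind of gap worth checking before quoting a national rate from county-level rows.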
Charts the summary said to look at first
Histogram of total_renters (40 equal-width bins)
| bin | count |
|---|---|
| 28 – 4.53e+04 | 3019 |
| 4.53e+04 – 9.057e+04 | 109 |
| 9.057e+04 – 1.358e+05 | 38 |
| 1.358e+05 – 1.811e+05 | 17 |
| 1.811e+05 – 2.264e+05 | 11 |
| 2.264e+05 – 2.717e+05 | 9 |
| 2.717e+05 – 3.169e+05 | 5 |
| 3.169e+05 – 3.622e+05 | 0 |
| 3.622e+05 – 4.075e+05 | 2 |
| 4.075e+05 – 4.528e+05 | 2 |
| 4.528e+05 – 4.98e+05 | 3 |
| 4.98e+05 – 5.433e+05 | 1 |
| 5.433e+05 – 5.886e+05 | 1 |
| 5.886e+05 – 6.338e+05 | 1 |
| 6.338e+05 – 6.791e+05 | 0 |
| 6.791e+05 – 7.244e+05 | 1 |
| 7.244e+05 – 7.697e+05 | 1 |
| 7.697e+05 – 8.149e+05 | 0 |
| 8.149e+05 – 8.602e+05 | 0 |
| 8.602e+05 – 9.055e+05 | 1 |
| 9.055e+05 – 9.508e+05 | 0 |
| 9.508e+05 – 9.96e+05 | 0 |
| 9.96e+05 – 1.041e+06 | 0 |
| 1.041e+06 – 1.087e+06 | 0 |
| 1.087e+06 – 1.132e+06 | 0 |
| 1.132e+06 – 1.177e+06 | 0 |
| 1.177e+06 – 1.222e+06 | 0 |
| 1.222e+06 – 1.268e+06 | 0 |
| 1.268e+06 – 1.313e+06 | 0 |
| 1.313e+06 – 1.358e+06 | 0 |
| 1.358e+06 – 1.403e+06 | 0 |
| 1.403e+06 – 1.449e+06 | 0 |
| 1.449e+06 – 1.494e+06 | 0 |
| 1.494e+06 – 1.539e+06 | 0 |
| 1.539e+06 – 1.585e+06 | 0 |
| 1.585e+06 – 1.63e+06 | 0 |
| 1.63e+06 – 1.675e+06 | 0 |
| 1.675e+06 – 1.72e+06 | 0 |
| 1.72e+06 – 1.766e+06 | 0 |
| 1.766e+06 – 1.811e+06 | 1 |
Histogram of pct_rent_burdened_30plus (40 equal-width bins)
| bin | count |
|---|---|
| 0 – 1.624 | 9 |
| 1.624 – 3.248 | 5 |
| 3.248 – 4.872 | 3 |
| 4.872 – 6.496 | 5 |
| 6.496 – 8.12 | 9 |
| 8.12 – 9.744 | 13 |
| 9.744 – 11.37 | 11 |
| 11.37 – 12.99 | 16 |
| 12.99 – 14.62 | 26 |
| 14.62 – 16.24 | 19 |
| 16.24 – 17.86 | 35 |
| 17.86 – 19.49 | 43 |
| 19.49 – 21.11 | 52 |
| 21.11 – 22.74 | 52 |
| 22.74 – 24.36 | 73 |
| 24.36 – 25.98 | 99 |
| 25.98 – 27.61 | 109 |
| 27.61 – 29.23 | 116 |
| 29.23 – 30.86 | 132 |
| 30.86 – 32.48 | 159 |
| 32.48 – 34.1 | 189 |
| 34.1 – 35.73 | 209 |
| 35.73 – 37.35 | 227 |
| 37.35 – 38.98 | 239 |
| 38.98 – 40.6 | 205 |
| 40.6 – 42.22 | 209 |
| 42.22 – 43.85 | 210 |
| 43.85 – 45.47 | 190 |
| 45.47 – 47.1 | 131 |
| 47.1 – 48.72 | 114 |
| 48.72 – 50.34 | 118 |
| 50.34 – 51.97 | 69 |
| 51.97 – 53.59 | 51 |
| 53.59 – 55.22 | 34 |
| 55.22 – 56.84 | 24 |
| 56.84 – 58.46 | 6 |
| 58.46 – 60.09 | 3 |
| 60.09 – 61.71 | 2 |
| 61.71 – 63.34 | 3 |
| 63.34 – 64.96 | 3 |
Histogram of pct_rent_burdened_50plus (40 equal-width bins, same bin edges as the 30%+ histogram)
| bin | count |
|---|---|
| 0 – 1.624 | 42 |
| 1.624 – 3.248 | 27 |
| 3.248 – 4.872 | 34 |
| 4.872 – 6.496 | 63 |
| 6.496 – 8.12 | 102 |
| 8.12 – 9.744 | 148 |
| 9.744 – 11.37 | 163 |
| 11.37 – 12.99 | 214 |
| 12.99 – 14.62 | 242 |
| 14.62 – 16.24 | 310 |
| 16.24 – 17.86 | 315 |
| 17.86 – 19.49 | 332 |
| 19.49 – 21.11 | 335 |
| 21.11 – 22.74 | 264 |
| 22.74 – 24.36 | 219 |
| 24.36 – 25.98 | 150 |
| 25.98 – 27.61 | 99 |
| 27.61 – 29.23 | 64 |
| 29.23 – 30.86 | 39 |
| 30.86 – 32.48 | 20 |
| 32.48 – 34.1 | 21 |
| 34.1 – 35.73 | 9 |
| 35.73 – 37.35 | 2 |
| 37.35 – 38.98 | 3 |
| 38.98 – 40.6 | 1 |
| 40.6 – 42.22 | 1 |
| 42.22 – 43.85 | 1 |
| 43.85 – 45.47 | 0 |
| 45.47 – 47.1 | 1 |
| 47.1 – 48.72 | 0 |
| 48.72 – 50.34 | 0 |
| 50.34 – 51.97 | 0 |
| 51.97 – 53.59 | 0 |
| 53.59 – 55.22 | 0 |
| 55.22 – 56.84 | 0 |
| 56.84 – 58.46 | 0 |
| 58.46 – 60.09 | 0 |
| 60.09 – 61.71 | 0 |
| 61.71 – 63.34 | 0 |
| 63.34 – 64.96 | 1 |
Histogram of fips (40 equal-width bins)
| bin | count |
|---|---|
| 1001 – 2780 | 97 |
| 2780 – 4559 | 15 |
| 4559 – 6337 | 133 |
| 6337 – 8116 | 59 |
| 8116 – 9895 | 14 |
| 9895 – 1.167e+04 | 4 |
| 1.167e+04 – 1.345e+04 | 226 |
| 1.345e+04 – 1.523e+04 | 5 |
| 1.523e+04 – 1.701e+04 | 49 |
| 1.701e+04 – 1.879e+04 | 189 |
| 1.879e+04 – 2.057e+04 | 204 |
| 2.057e+04 – 2.235e+04 | 184 |
| 2.235e+04 – 2.413e+04 | 39 |
| 2.413e+04 – 2.59e+04 | 15 |
| 2.59e+04 – 2.768e+04 | 170 |
| 2.768e+04 – 2.946e+04 | 196 |
| 2.946e+04 – 3.124e+04 | 150 |
| 3.124e+04 – 3.302e+04 | 27 |
| 3.302e+04 – 3.48e+04 | 21 |
| 3.48e+04 – 3.658e+04 | 95 |
| 3.658e+04 – 3.836e+04 | 153 |
| 3.836e+04 – 4.013e+04 | 155 |
| 4.013e+04 – 4.191e+04 | 46 |
| 4.191e+04 – 4.369e+04 | 67 |
| 4.369e+04 – 4.547e+04 | 51 |
| 4.547e+04 – 4.725e+04 | 161 |
| 4.725e+04 – 4.903e+04 | 268 |
| 4.903e+04 – 5.081e+04 | 29 |
| 5.081e+04 – 5.259e+04 | 133 |
| 5.259e+04 – 5.436e+04 | 94 |
| 5.436e+04 – 5.614e+04 | 95 |
| 5.614e+04 – 5.792e+04 | 0 |
| 5.792e+04 – 5.97e+04 | 0 |
| 5.97e+04 – 6.148e+04 | 0 |
| 6.148e+04 – 6.326e+04 | 0 |
| 6.326e+04 – 6.504e+04 | 0 |
| 6.504e+04 – 6.682e+04 | 0 |
| 6.682e+04 – 6.86e+04 | 0 |
| 6.86e+04 – 7.037e+04 | 0 |
| 7.037e+04 – 7.215e+04 | 78 |
Schema
5 columns

| Column | Type | Nulls | Unique | Alerts |
|---|---|---|---|---|
| fips | numeric | 0.0% | 3,222 | |
| county_name | text | 0.0% | 3,222 | near_unique |
| total_renters | numeric | 0.0% | 2,709 | high_skew, outliers |
| pct_rent_burdened_30plus | numeric | 0.0% | 2,146 | |
| pct_rent_burdened_50plus | numeric | 0.0% | 1,769 | |
fips
numeric identifier
This is the county FIPS code (state and county digits combined), used as a unique geographic identifier: all 3,222 rows have a distinct value and none are null. The range from 1001 to 72153 and the low skew (0.157) reflect the standard FIPS numbering across U.S. states and territories rather than a meaningful numeric distribution. Summary statistics such as the mean (31,377) and std (16,299) are therefore misleading; the values are categorical codes. Because the column is stored as a number, leading zeros have been dropped (1001 stands for 01001). Treatment: Cast to a zero-padded five-character string and use it as a join key against geographic reference tables; do not model it as numeric.
- n: 3,222
- nulls: 0 (0.0%)
- unique: 3,222
- min: 1,001
- max: 72,153
- mean: 3.138e+04
- median: 30,022
- std: 1.63e+04
- q1: 1.903e+04
- q3: 4.61e+04
- iqr: 27,075
- skew: 0.1574
- kurtosis: -0.6314
- n_outliers: 0
- outlier_rate: 0
- zero_rate: 0
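Casting fips to a join key can be sketched as follows. County FIPS codes are five characters (two state digits followed by three county digits), so a value stored as the number 1001 needs its leading zero restored. The function names here are illustrative, not part of any existing pipeline.

```python
def fips_to_key(fips):
    """Render a numeric county FIPS code as a zero-padded 5-character string."""
    return f"{int(fips):05d}"


def state_part(fips):
    """First two characters of the padded code: the state (or territory) FIPS."""
    return fips_to_key(fips)[:2]


# The minimum and maximum values reported for the fips column.
print(fips_to_key(1001), fips_to_key(72153))
print(state_part(1001), state_part(72153))
```

Padding before joining matters because reference tables typically store the code as text, and "1001" would not match "01001".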
county_name
text identifier · near_unique
This column holds fully qualified U.S. county names (e.g., 'X County, State'); all 3,222 values are unique and none are null. The token 'county,' appears 2,999 times, so about 223 entries do not follow that exact pattern. These are likely Louisiana parishes, Alaska boroughs, independent cities, or Puerto Rico municipios (the fips range extends to 72153), and are worth checking. Token frequencies for state names look as expected, with 'Texas' (256) leading; that count is slightly above the state's 254 counties, probably because the token also matches counties named Texas in other states. Treatment: Split into county and state fields, and left-join on FIPS rather than on this string.
- n: 3,222
- nulls: 0 (0.0%)
- unique: 3,222
- len_min: 16
- len_max: 59
- len_mean: 24.32
- len_median: 24
- len_p95: 31
- word_mean: 3.248
- word_median: 3
- n_empty: 0
- n_duplicates: 0
- duplicate_rate: 0
- vocab_size: 1,990
- readability_flesch_mean: 10.28
- emoji_rate: 0
- url_rate: 0
- one_word_rate: 0
- allcaps_rate: 0
- boilerplate_rate: 0
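A minimal sketch of the split suggested for county_name, assuming every value uses ", " between the county part and the state part as in the 'X County, State' pattern. The helper names and the sample strings are illustrative only.

```python
def split_county_name(name):
    """Split 'X County, State' on the last ', ' into (county, state)."""
    county, sep, state = name.rpartition(", ")
    if not sep:
        raise ValueError(f"no ', ' separator in {name!r}")
    return county, state


def has_county_suffix(county):
    """True when the county part ends with the word 'County'."""
    return county.lower().endswith(" county")


# Illustrative names in the assumed format.
samples = ["Autauga County, Alabama", "Orleans Parish, Louisiana"]
for s in samples:
    county, state = split_county_name(s)
    print(county, "|", state, "|", has_county_suffix(county))
```

Filtering on `has_county_suffix` is one way to list the entries that do not follow the 'County' pattern so they can be reviewed by hand.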
total_renters
numeric feature · high_skew · outliers
This is the count of renters in each county, ranging from 28 to 1,810,929 with a median of 2,579.5. The distribution is extremely right-skewed (skew 15.82, kurtosis 398.15): the mean (13,851) is more than five times the median, and 449 outliers (13.9%) inflate the std to 55,351. There are no nulls or zeros, and 2,709 unique values across 3,222 rows indicate that a minority of totals repeat. Treatment: Log-transform before modelling to tame the heavy right tail.
- n: 3,222
- nulls: 0 (0.0%)
- unique: 2,709
- min: 28
- max: 1.811e+06
- mean: 1.385e+04
- median: 2580
- std: 5.535e+04
- q1: 1004
- q3: 7396
- iqr: 6,392
- skew: 15.82
- kurtosis: 398.2
- n_outliers: 449
- outlier_rate: 0.1394
- zero_rate: 0
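The report does not say how outliers were flagged or which logarithm to use. The sketch below assumes the common Tukey rule (1.5 × IQR beyond the quartiles) and uses `math.log1p`; both choices are assumptions, and a plain natural log would work equally well here since the column has no zeros.

```python
import math

# Quartiles reported for total_renters in the stats above.
Q1, Q3 = 1004, 7396


def tukey_fences(q1, q3, k=1.5):
    """Lower and upper outlier fences: k * IQR beyond each quartile."""
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr


def log_renters(n):
    """log(1 + n), which compresses the heavy right tail of the counts."""
    return math.log1p(n)


lower, upper = tukey_fences(Q1, Q3)
print(lower, upper)

# The reported minimum and maximum on the log scale.
print(log_renters(28), log_renters(1_810_929))
```

Under this rule, any county with more than 16,984 renters falls above the upper fence, while the lower fence is negative and so flags nothing. On the log scale the full range shrinks from five orders of magnitude to roughly 3.4 through 14.4.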
pct_rent_burdened_30plus
numeric feature
This column captures the percentage of renters in each county spending 30%+ of income on rent, across 3,222 records with no nulls. Values span 0 to 64.96 with a median of 37.36 and quartiles of 30.67 and 43.48 (IQR 12.81); the distribution is mildly left-skewed (-0.57) and tightly clustered (std 10.0). About 0.25% of values (8 rows) are exact zeros and 58 rows (1.8%) are flagged as outliers, but the distribution is otherwise well behaved. Treatment: Use directly as a continuous feature; no transform is needed given the bounded, only mildly skewed distribution.
- n: 3,222
- nulls: 0 (0.0%)
- unique: 2,146
- min: 0
- max: 64.96
- mean: 36.44
- median: 37.36
- std: 10.01
- q1: 30.67
- q3: 43.48
- iqr: 12.81
- skew: -0.5673
- kurtosis: 0.5032
- n_outliers: 58
- outlier_rate: 0.018
- zero_rate: 0.002483
pct_rent_burdened_50plus
numeric feature
This column is the percentage of renters in each county spending 50%+ of income on rent. Values span 0 to 64.96 with mean 17.35 and median 17.62, a near-symmetric distribution (skew 0.05) with modest tails (kurtosis 0.98). Only 0.93% of values (30 rows) are zero and 1.46% (47 rows) are flagged as outliers, so the signal is clean and ready to use. Treatment: Use as-is as a continuous feature; no transform is needed given the symmetry.
- n: 3,222
- nulls: 0 (0.0%)
- unique: 1,769
- min: 0
- max: 64.96
- mean: 17.35
- median: 17.62
- std: 6.577
- q1: 13.07
- q3: 21.63
- iqr: 8.557
- skew: 0.05436
- kurtosis: 0.9823
- n_outliers: 47
- outlier_rate: 0.01459
- zero_rate: 0.009311
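Renters paying 50%+ of income on rent are a subset of those paying 30%+, so within any one county the 50%+ share should never exceed the 30%+ share. Both columns report the same maximum (64.96), which makes this a cheap sanity check to run before using either field. The sketch below assumes rows are available as dictionaries keyed by the column names; the sample values are invented.

```python
def find_inconsistent(rows, tol=1e-9):
    """Return rows where the 50%+ share exceeds the 30%+ share."""
    return [
        r for r in rows
        if r["pct_rent_burdened_50plus"] > r["pct_rent_burdened_30plus"] + tol
    ]


# Hypothetical rows for illustration; the second one violates the constraint.
sample = [
    {"fips": "01001", "pct_rent_burdened_30plus": 37.4, "pct_rent_burdened_50plus": 17.6},
    {"fips": "99999", "pct_rent_burdened_30plus": 20.0, "pct_rent_burdened_50plus": 25.0},
    {"fips": "99998", "pct_rent_burdened_30plus": 0.0, "pct_rent_burdened_50plus": 0.0},
]

bad = find_inconsistent(sample)
print([r["fips"] for r in bad])
```

An empty result on the real data would confirm that the two percentages are internally consistent; any hits would point to rows worth inspecting.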