saturn·

rent burden

source /home/coolhand/html/datavis/data_trove/cache/rent_burden.parquet 3,222 rows 5 columns profiled 2026-05-01 raw JSON static .html .ipynb Report Notebook

Reading

dataset summary · high confidence anthropic:claude-opus-4-7

This dataset contains 3,222 rows of U.S. county-level rent burden statistics, with each row identified by a county name and FIPS code and described by total renters and the share of renters paying 30%+ or 50%+ of income on rent. Total renters is extremely skewed (skew 15.8, max 1,810,929 vs. median 2,579.5), so a handful of large urban counties dominate the distribution and warrant separate treatment. Rent-burden percentages are more well-behaved: about 36.4% of renters per county are cost-burdened at the 30%+ threshold and 17.4% at the 50%+ threshold on average, both fairly symmetric. The most useful first look is comparing the two rent-burden distributions and isolating the outlier counties on total_renters.

citing: row_count · column_count · columns.total_renters.stats · columns.pct_rent_burdened_30plus.stats · columns.pct_rent_burdened_50plus.stats · columns.county_name.stats

Schema

5 columns
Per-column summary. Click column name to jump to its detail.
Alerts
fips numeric 0.0% 3,222
county_name text 0.0% 3,222
near_unique
total_renters numeric 0.0% 2,709
high_skew outliers
pct_rent_burdened_30plus numeric 0.0% 2,146
pct_rent_burdened_50plus numeric 0.0% 1,769

fips

numeric identifier
This is the FIPS county/state code, used as a unique geographic identifier — every one of the 3,222 rows has a distinct value with no nulls. The range from 1001 to 72153 and the low skew (0.157) reflect the standard FIPS numbering across U.S. states and territories rather than a meaningful numeric distribution. Treating these as numbers (mean 31,377, std 16,299) is misleading; they are categorical codes. Treatment: Cast to string and use as a join key on geographic reference tables; do not model as numeric. high · anthropic:claude-opus-4-7
n
3,222
nulls
0 (0.0%)
unique
3,222
min
1,001
max
72,153
mean
3.138e+04
median
30,022
std
1.63e+04
q1
1.903e+04
q3
4.61e+04
iqr
27,075
skew
0.1574
kurtosis
-0.6314
n_outliers
0
outlier_rate
0
zero_rate
0

county_name

text identifier near_unique
This column holds fully-qualified US county names (e.g., 'X County, State'), with 3222 rows all unique and zero nulls. The token 'county,' appears 2999 times, suggesting ~223 entries don't follow that exact pattern — likely Louisiana parishes, Alaska boroughs, or independent cities worth checking. State frequencies match expectations, with Texas (256) leading. Treatment: Split into county and state fields and left-join on FIPS rather than this string. high · anthropic:claude-opus-4-7
n
3,222
nulls
0 (0.0%)
unique
3,222
len_min
16
len_max
59
len_mean
24.32
len_median
24
len_p95
31
word_mean
3.248
word_median
3
n_empty
0
n_duplicates
0
duplicate_rate
0
vocab_size
1,990
readability_flesch_mean
10.28
emoji_rate
0
url_rate
0
one_word_rate
0
allcaps_rate
0
boilerplate_rate
0

total_renters

numeric feature high_skew outliers
This is a numeric count of renters per record, ranging from 28 to 1,810,929 with a median of 2,579.5 — likely an aggregate count at some geographic or entity level. The distribution is extremely right-skewed (skew 15.82, kurtosis 398.15) with the mean (13,851) over five times the median, and 449 outliers (13.9%) inflate the std to 55,351. No nulls or zeros, and 2,709 unique values across 3,222 rows suggest minor repetition but largely distinct totals. Treatment: log-transform before modelling to tame the heavy right tail. high · anthropic:claude-opus-4-7
n
3,222
nulls
0 (0.0%)
unique
2,709
min
28
max
1.811e+06
mean
1.385e+04
median
2580
std
5.535e+04
q1
1004
q3
7396
iqr
6,392
skew
15.82
kurtosis
398.2
n_outliers
449
outlier_rate
0.1394
zero_rate
0

pct_rent_burdened_30plus

numeric feature
This column captures the percentage of households spending 30%+ of income on rent, reported per row across 3,222 records with no nulls. Values span 0 to 64.96 with a median of 37.36 and IQR of 30.67-43.48, mildly left-skewed (-0.57) and tightly clustered (std 10.0). About 0.25% are exact zeros and 58 rows (1.8%) flag as outliers, but the distribution is otherwise well-behaved and ready to use as-is. Treatment: Use directly as a continuous feature; no transform needed given the near-symmetric, bounded distribution. high · anthropic:claude-opus-4-7
n
3,222
nulls
0 (0.0%)
unique
2,146
min
0
max
64.96
mean
36.44
median
37.36
std
10.01
q1
30.67
q3
43.48
iqr
12.81
skew
-0.5673
kurtosis
0.5032
n_outliers
58
outlier_rate
0.018
zero_rate
0.002483

pct_rent_burdened_50plus

numeric feature
Likely the percentage of households spending 50%+ of income on rent at some geographic unit (e.g., county). Values span 0 to 64.96 with mean 17.35 and median 17.62, a near-symmetric distribution (skew 0.05) and modest tails (kurtosis 0.98). Only 0.93% are zero and 1.46% flagged as outliers, so the signal is clean and ready to use. Treatment: Use as-is as a continuous feature; no transform needed given symmetry. high · anthropic:claude-opus-4-7
n
3,222
nulls
0 (0.0%)
unique
1,769
min
0
max
64.96
mean
17.35
median
17.62
std
6.577
q1
13.07
q3
21.63
iqr
8.557
skew
0.05436
kurtosis
0.9823
n_outliers
47
outlier_rate
0.01459
zero_rate
0.009311