saturn·

us housing affordability crisis housing crisis merged

saturn notebook · generated 2026-05-01 Report Notebook

Overview

Source: /home/coolhand/datasets/us-housing-affordability-crisis/housing_crisis_merged.csv

Saturn profiled 3,222 rows across 16 columns. The stats below are deterministic and machine-readable; the prose is a language-model interpretation of those stats (opt-in, added after the fact, never sees raw rows).

[2]:
!pip install saturn-dissect
import subprocess
subprocess.run([
    "saturn", "analyze", "/home/coolhand/datasets/us-housing-affordability-crisis/housing_crisis_merged.csv",
    "--findings", "us-housing-affordability-crisis-housing_crisis_merged.json",
    "--llm", "anthropic:claude-opus-4-7",
])

Summary confidence: high

This dataset covers 3,222 U.S. counties with 16 columns describing rental affordability — rents, incomes, renter shares, and burden percentages — keyed by FIPS and county name. Several numeric fields (annual_rent, median_gross_rent, median_household_income, rent_to_income_ratio) carry impossible negative sentinel values like -666666666 and -7999999992, which drag means deeply negative and produce skew around -17 to -57; these need cleaning before any analysis. The affordability_category field is also extremely imbalanced — 3,192 of 3,222 counties are labeled 'Affordable' (top_rate 0.99), so it offers little discriminatory signal as-is. The cleaner fields to start with are pct_rent_burdened_30plus (median 37.36%), pct_rent_burdened_50plus (median 17.62%), and pct_renter (median 26.07%), which look well-behaved and tell the real affordability story.

citing: annual_rent.stats.min · annual_rent.stats.skew · median_household_income.stats.min · median_household_income.stats.skew · rent_to_income_ratio.stats.skew · affordability_category.stats.top_rate · affordability_category.top_values · pct_rent_burdened_30plus.stats.median · pct_rent_burdened_50plus.stats.median · pct_renter.stats.median · row_count · column_count

Out[4]:

saturn.schema() · 16 columns

column kind n null% unique alerts
fips numeric 3,222 0.0% 3,222
county_name text 3,222 0.0% 3,222 near_unique
total_renters numeric 3,222 0.0% 2,709 high_skew outliers
pct_rent_burdened_30plus numeric 3,222 0.0% 2,146
pct_rent_burdened_50plus numeric 3,222 0.0% 1,769
median_gross_rent numeric 3,222 0.0% 984 high_skew outliers
median_household_income numeric 3,222 0.0% 3,099 high_skew outliers
total_housing_units numeric 3,222 0.0% 3,074 high_skew outliers
owner_occupied numeric 3,222 0.0% 3,001 high_skew outliers
renter_occupied numeric 3,222 0.0% 2,709 high_skew outliers
pct_renter numeric 3,222 0.0% 1,925
annual_rent numeric 3,222 0.0% 984 high_skew outliers
rent_to_income_ratio numeric 3,222 0.0% 1,278 high_skew
affordability_category categorical 3,222 0.0% 3 imbalance
hours_at_min_wage_for_rent numeric 3,222 0.0% 230 high_skew outliers
weeks_at_min_wage_for_rent numeric 3,222 0.0% 72 high_skew outliers
Fig 1.
pct_rent_burdened_30plus · Distribution of counties by share of renters paying 30%+ of income on rent — a clean view of affordability stress.
Show data table
Histogram bins for pct_rent_burdened_30plus (median: 37.36).
bincount
0 – 1.6249
1.624 – 3.2485
3.248 – 4.8723
4.872 – 6.4965
6.496 – 8.129
8.12 – 9.74413
9.744 – 11.3711
11.37 – 12.9916
12.99 – 14.6226
14.62 – 16.2419
16.24 – 17.8635
17.86 – 19.4943
19.49 – 21.1152
21.11 – 22.7452
22.74 – 24.3673
24.36 – 25.9899
25.98 – 27.61109
27.61 – 29.23116
29.23 – 30.86132
30.86 – 32.48159
32.48 – 34.1189
34.1 – 35.73209
35.73 – 37.35227
37.35 – 38.98239
38.98 – 40.6205
40.6 – 42.22209
42.22 – 43.85210
43.85 – 45.47190
45.47 – 47.1131
47.1 – 48.72114
48.72 – 50.34118
50.34 – 51.9769
51.97 – 53.5951
53.59 – 55.2234
55.22 – 56.8424
56.84 – 58.466
58.46 – 60.093
60.09 – 61.712
61.71 – 63.343
63.34 – 64.963
Fig 2.
pct_rent_burdened_50plus · Severe rent burden across counties; watch the right tail for the worst-affected places.
Show data table
Histogram bins for pct_rent_burdened_50plus (median: 17.62).
bincount
0 – 1.62442
1.624 – 3.24827
3.248 – 4.87234
4.872 – 6.49663
6.496 – 8.12102
8.12 – 9.744148
9.744 – 11.37163
11.37 – 12.99214
12.99 – 14.62242
14.62 – 16.24310
16.24 – 17.86315
17.86 – 19.49332
19.49 – 21.11335
21.11 – 22.74264
22.74 – 24.36219
24.36 – 25.98150
25.98 – 27.6199
27.61 – 29.2364
29.23 – 30.8639
30.86 – 32.4820
32.48 – 34.121
34.1 – 35.739
35.73 – 37.352
37.35 – 38.983
38.98 – 40.61
40.6 – 42.221
42.22 – 43.851
43.85 – 45.470
45.47 – 47.11
47.1 – 48.720
48.72 – 50.340
50.34 – 51.970
51.97 – 53.590
53.59 – 55.220
55.22 – 56.840
56.84 – 58.460
58.46 – 60.090
60.09 – 61.710
61.71 – 63.340
63.34 – 64.961
Fig 3.
pct_renter · How renter-heavy each county is; most cluster near 26% but a long right tail reaches 100%.
Show data table
Histogram bins for pct_renter (median: 26.07).
bincount
3.01 – 5.4351
5.435 – 7.8593
7.859 – 10.289
10.28 – 12.7126
12.71 – 15.1363
15.13 – 17.56156
17.56 – 19.98316
19.98 – 22.41371
22.41 – 24.83450
24.83 – 27.26419
27.26 – 29.68357
29.68 – 32.11301
32.11 – 34.53203
34.53 – 36.96169
36.96 – 39.38115
39.38 – 41.8175
41.81 – 44.2356
44.23 – 46.6645
46.66 – 49.0825
49.08 – 51.515
51.5 – 53.9311
53.93 – 56.3510
56.35 – 58.788
58.78 – 61.24
61.2 – 63.634
63.63 – 66.051
66.05 – 68.481
68.48 – 70.93
70.9 – 73.331
73.33 – 75.751
75.75 – 78.180
78.18 – 80.61
80.6 – 83.030
83.03 – 85.451
85.45 – 87.880
87.88 – 90.30
90.3 – 92.730
92.73 – 95.150
95.15 – 97.580
97.58 – 1001
Fig 4.
affordability_category · Shows the extreme imbalance — nearly all counties fall into 'Affordable', flagging a likely labeling issue.
Show data table
Top values for affordability_category (3 unique shown, of 3 total).
valuecountshare
Affordable319299.1%
Moderately Burdened290.9%
Extremely Burdened10.0%
Fig 5.
median_gross_rent · Median rent per county; expect to filter out negative sentinel values before plotting.
Show data table
Histogram bins for median_gross_rent (median: 817.5).
bincount
-6.667e+08 – -6.5e+0810
-6.5e+08 – -6.333e+080
-6.333e+08 – -6.167e+080
-6.167e+08 – -6e+080
-6e+08 – -5.833e+080
-5.833e+08 – -5.667e+080
-5.667e+08 – -5.5e+080
-5.5e+08 – -5.333e+080
-5.333e+08 – -5.167e+080
-5.167e+08 – -5e+080
-5e+08 – -4.833e+080
-4.833e+08 – -4.667e+080
-4.667e+08 – -4.5e+080
-4.5e+08 – -4.333e+080
-4.333e+08 – -4.167e+080
-4.167e+08 – -4e+080
-4e+08 – -3.833e+080
-3.833e+08 – -3.667e+080
-3.667e+08 – -3.5e+080
-3.5e+08 – -3.333e+080
-3.333e+08 – -3.167e+080
-3.167e+08 – -3e+080
-3e+08 – -2.833e+080
-2.833e+08 – -2.667e+080
-2.667e+08 – -2.5e+080
-2.5e+08 – -2.333e+080
-2.333e+08 – -2.167e+080
-2.167e+08 – -2e+080
-2e+08 – -1.833e+080
-1.833e+08 – -1.667e+080
-1.667e+08 – -1.5e+080
-1.5e+08 – -1.333e+080
-1.333e+08 – -1.167e+080
-1.167e+08 – -1e+080
-1e+08 – -8.333e+070
-8.333e+07 – -6.666e+070
-6.666e+07 – -5e+070
-5e+07 – -3.333e+070
-3.333e+07 – -1.666e+070
-1.666e+07 – 28053212
Fig 6.
Per-column null rate across the corpus. Columns are ordered by input position.
Show data table
Per-column null rate across the corpus.
columnkindnull %
fipsnumeric0.0%
county_nametext0.0%
total_rentersnumeric0.0%
pct_rent_burdened_30plusnumeric0.0%
pct_rent_burdened_50plusnumeric0.0%
median_gross_rentnumeric0.0%
median_household_incomenumeric0.0%
total_housing_unitsnumeric0.0%
owner_occupiednumeric0.0%
renter_occupiednumeric0.0%
pct_renternumeric0.0%
annual_rentnumeric0.0%
rent_to_income_rationumeric0.0%
affordability_categorycategorical0.0%
hours_at_min_wage_for_rentnumeric0.0%
weeks_at_min_wage_for_rentnumeric0.0%
Fig 7.
Pearson correlation across numeric columns (sampled, bounded).
Show data table
Pearson correlation across 12 numeric columns (values clipped to 2 decimals).
fipstotal_renterspct_rent_burdened_30pluspct_rent_burdened_50plusmedian_gross_rentmedian_household_incometotal_housing_unitsowner_occupiedrenter_occupiedpct_renterannual_rentrent_to_income_ratio
fips+1.00-0.06-0.16-0.10-0.06-0.07-0.06-0.06-0.06-0.10-0.06-0.06
total_renters-0.06+1.00+0.23+0.20+0.02+0.23+0.99+0.96+1.00+0.22+0.02+0.02
pct_rent_burdened_30plus-0.16+0.23+1.00+0.82+0.19+0.21+0.26+0.28+0.23+0.19+0.19+0.19
pct_rent_burdened_50plus-0.10+0.20+0.82+1.00+0.16+0.12+0.23+0.25+0.20+0.22+0.16+0.16
median_gross_rent-0.06+0.02+0.19+0.16+1.00+0.05+0.02+0.02+0.02-0.13+1.00+1.00
median_household_income-0.07+0.23+0.21+0.12+0.05+1.00+0.28+0.31+0.23-0.09+0.05+0.05
total_housing_units-0.06+0.99+0.26+0.23+0.02+0.28+1.00+0.99+0.99+0.19+0.02+0.02
owner_occupied-0.06+0.96+0.28+0.25+0.02+0.31+0.99+1.00+0.96+0.16+0.02+0.02
renter_occupied-0.06+1.00+0.23+0.20+0.02+0.23+0.99+0.96+1.00+0.22+0.02+0.02
pct_renter-0.10+0.22+0.19+0.22-0.13-0.09+0.19+0.16+0.22+1.00-0.13-0.14
annual_rent-0.06+0.02+0.19+0.16+1.00+0.05+0.02+0.02+0.02-0.13+1.00+1.00
rent_to_income_ratio-0.06+0.02+0.19+0.16+1.00+0.05+0.02+0.02+0.02-0.14+1.00+1.00

fips numeric identifier

This is the U.S. county FIPS code, a 5-digit geographic identifier where the leading digits encode the state. Every one of the 3222 rows is unique with no nulls, and the range 1001 to 72153 spans Alabama through Puerto Rico, consistent with a full national county roster. Distribution stats (mean 31377.89, skew 0.157) are not meaningful here since the values are categorical codes, not quantities.

Treatment: Treat as a categorical key; left-join on this code to bring in geographic attributes rather than using it as a numeric feature.

anthropic:claude-opus-4-7 · confidence high
Out[13]:

saturn.columns["fips"].stats

statvalue
n3,222
nulls0 (0.0%)
unique3,222
min 1,001
max 72,153
mean 3.138e+04
median 30,022
std 1.63e+04
q1 1.903e+04
q3 4.61e+04
iqr 27,075
skew 0.1574
kurtosis -0.6314
n_outliers 0
outlier_rate 0
zero_rate 0
Fig 8.
Distribution of fips. Vertical dash marks the median.
Show data table
Histogram bins for fips (median: 30022.0).
bincount
1001 – 278097
2780 – 455915
4559 – 6337133
6337 – 811659
8116 – 989514
9895 – 1.167e+044
1.167e+04 – 1.345e+04226
1.345e+04 – 1.523e+045
1.523e+04 – 1.701e+0449
1.701e+04 – 1.879e+04189
1.879e+04 – 2.057e+04204
2.057e+04 – 2.235e+04184
2.235e+04 – 2.413e+0439
2.413e+04 – 2.59e+0415
2.59e+04 – 2.768e+04170
2.768e+04 – 2.946e+04196
2.946e+04 – 3.124e+04150
3.124e+04 – 3.302e+0427
3.302e+04 – 3.48e+0421
3.48e+04 – 3.658e+0495
3.658e+04 – 3.836e+04153
3.836e+04 – 4.013e+04155
4.013e+04 – 4.191e+0446
4.191e+04 – 4.369e+0467
4.369e+04 – 4.547e+0451
4.547e+04 – 4.725e+04161
4.725e+04 – 4.903e+04268
4.903e+04 – 5.081e+0429
5.081e+04 – 5.259e+04133
5.259e+04 – 5.436e+0494
5.436e+04 – 5.614e+0495
5.614e+04 – 5.792e+040
5.792e+04 – 5.97e+040
5.97e+04 – 6.148e+040
6.148e+04 – 6.326e+040
6.326e+04 – 6.504e+040
6.504e+04 – 6.682e+040
6.682e+04 – 6.86e+040
6.86e+04 – 7.037e+040
7.037e+04 – 7.215e+0478

county_name text identifier

This column holds U.S. county identifiers, formatted as 'County, State' strings — 'county,' appears in 2999 of 3222 rows and the top state tokens (texas 256, virginia 189, georgia 159) match the U.S. counties-by-state distribution. Every one of the 3222 values is unique with zero nulls or duplicates, and lengths cluster tightly (min 16, median 24, max 59), consistent with a clean canonical name field. The 223 rows not containing 'county,' likely reflect Louisiana parishes, Alaska boroughs, or independent cities rather than data quality issues.

Treatment: Use as a join key against county-level reference tables; do not feed raw into models.

anthropic:claude-opus-4-7 · confidence high
Out[16]:

saturn.columns["county_name"].stats

statvalue
n3,222
nulls0 (0.0%)
unique3,222
len_min 16
len_max 59
len_mean 24.32
len_median 24
len_p95 31
word_mean 3.248
word_median 3
n_empty 0
n_duplicates 0
duplicate_rate 0
vocab_size 1,990
readability_flesch_mean 10.28
emoji_rate 0
url_rate 0
one_word_rate 0
allcaps_rate 0
boilerplate_rate 0
alert: near_unique100.0% of rows are unique strings
Fig 9.
Character-length distribution for county_name.
Show data table
Character-length distribution for county_name (mean: 24.324022346368714).
charscount
16 – 1726
17 – 1872
18 – 19121
19 – 20190
20 – 21264
21 – 22407
22 – 24420
24 – 25363
25 – 26320
26 – 27240
27 – 28231
28 – 29152
29 – 30139
30 – 31165
31 – 3241
32 – 3328
33 – 3416
34 – 3510
35 – 365
36 – 380
38 – 391
39 – 401
40 – 410
41 – 421
42 – 431
43 – 440
44 – 452
45 – 460
46 – 471
47 – 481
48 – 490
49 – 500
50 – 510
51 – 530
53 – 542
54 – 551
55 – 560
56 – 570
57 – 580
58 – 591

total_renters numeric feature

This is a numeric count of renters per record, ranging from 28 to 1,810,929 with a median of 2,579.5 — likely an aggregate at some geography or entity level rather than a per-unit measure. The distribution is severely right-skewed (skew 15.82, kurtosis 398.15) with 449 outliers (14.0% outlier rate) and a mean (13,851) more than five times the median, indicating a few very large populations dominate. No nulls or zeros, and 2,709 unique values out of 3,222 rows suggest most records carry distinct counts.

Treatment: log-transform before regression to tame the extreme right skew.

anthropic:claude-opus-4-7 · confidence high
Out[19]:

saturn.columns["total_renters"].stats

statvalue
n3,222
nulls0 (0.0%)
unique2,709
min 28
max 1.811e+06
mean 1.385e+04
median 2580
std 5.535e+04
q1 1004
q3 7396
iqr 6,392
skew 15.82
kurtosis 398.2
n_outliers 449
outlier_rate 0.1394
zero_rate 0
alert: high_skewskew=+15.82
alert: outliers13.9% rows beyond 1.5 IQR
Fig 10.
Distribution of total_renters. Vertical dash marks the median.
Show data table
Histogram bins for total_renters (median: 2579.5).
bincount
28 – 4.53e+043019
4.53e+04 – 9.057e+04109
9.057e+04 – 1.358e+0538
1.358e+05 – 1.811e+0517
1.811e+05 – 2.264e+0511
2.264e+05 – 2.717e+059
2.717e+05 – 3.169e+055
3.169e+05 – 3.622e+050
3.622e+05 – 4.075e+052
4.075e+05 – 4.528e+052
4.528e+05 – 4.98e+053
4.98e+05 – 5.433e+051
5.433e+05 – 5.886e+051
5.886e+05 – 6.338e+051
6.338e+05 – 6.791e+050
6.791e+05 – 7.244e+051
7.244e+05 – 7.697e+051
7.697e+05 – 8.149e+050
8.149e+05 – 8.602e+050
8.602e+05 – 9.055e+051
9.055e+05 – 9.508e+050
9.508e+05 – 9.96e+050
9.96e+05 – 1.041e+060
1.041e+06 – 1.087e+060
1.087e+06 – 1.132e+060
1.132e+06 – 1.177e+060
1.177e+06 – 1.222e+060
1.222e+06 – 1.268e+060
1.268e+06 – 1.313e+060
1.313e+06 – 1.358e+060
1.358e+06 – 1.403e+060
1.403e+06 – 1.449e+060
1.449e+06 – 1.494e+060
1.494e+06 – 1.539e+060
1.539e+06 – 1.585e+060
1.585e+06 – 1.63e+060
1.63e+06 – 1.675e+060
1.675e+06 – 1.72e+060
1.72e+06 – 1.766e+060
1.766e+06 – 1.811e+061

pct_rent_burdened_30plus numeric feature

This appears to be the percentage of renter households spending 30%+ of income on rent, reported per geographic unit (likely county-level given n=3222). Values span 0 to 64.96 with a median of 37.36 and IQR of roughly 30.67–43.48, indicating rent burden is widespread rather than rare. The mild left skew (-0.57) and 58 outliers (1.8%) suggest a few areas with unusually low burden pull the tail down, while 0.25% report exactly zero.

Treatment: Use as-is as a continuous feature; no transformation needed given near-symmetric distribution.

anthropic:claude-opus-4-7 · confidence high
Out[22]:

saturn.columns["pct_rent_burdened_30plus"].stats

statvalue
n3,222
nulls0 (0.0%)
unique2,146
min 0
max 64.96
mean 36.44
median 37.36
std 10.01
q1 30.67
q3 43.48
iqr 12.81
skew -0.5673
kurtosis 0.5032
n_outliers 58
outlier_rate 0.018
zero_rate 0.002483
Fig 11.
Distribution of pct_rent_burdened_30plus. Vertical dash marks the median.
Show data table
Histogram bins for pct_rent_burdened_30plus (median: 37.36).
bincount
0 – 1.6249
1.624 – 3.2485
3.248 – 4.8723
4.872 – 6.4965
6.496 – 8.129
8.12 – 9.74413
9.744 – 11.3711
11.37 – 12.9916
12.99 – 14.6226
14.62 – 16.2419
16.24 – 17.8635
17.86 – 19.4943
19.49 – 21.1152
21.11 – 22.7452
22.74 – 24.3673
24.36 – 25.9899
25.98 – 27.61109
27.61 – 29.23116
29.23 – 30.86132
30.86 – 32.48159
32.48 – 34.1189
34.1 – 35.73209
35.73 – 37.35227
37.35 – 38.98239
38.98 – 40.6205
40.6 – 42.22209
42.22 – 43.85210
43.85 – 45.47190
45.47 – 47.1131
47.1 – 48.72114
48.72 – 50.34118
50.34 – 51.9769
51.97 – 53.5951
53.59 – 55.2234
55.22 – 56.8424
56.84 – 58.466
58.46 – 60.093
60.09 – 61.712
61.71 – 63.343
63.34 – 64.963

pct_rent_burdened_50plus numeric feature

This is a county-level (or similar geographic) numeric feature giving the percentage of households spending 50%+ of income on rent — severely rent-burdened. Values span 0 to 64.96 with a near-symmetric distribution (skew 0.05) centered at a median of 17.62%, and the IQR of 8.56 around a mean of 17.35 indicates a tight, well-behaved spread. Only 47 outliers (1.46%) and a 0.93% zero rate, with no nulls across 3,222 rows, suggest clean and complete coverage.

Treatment: Use as-is in modelling; standardize if combining with other scaled features.

anthropic:claude-opus-4-7 · confidence high
Out[25]:

saturn.columns["pct_rent_burdened_50plus"].stats

statvalue
n3,222
nulls0 (0.0%)
unique1,769
min 0
max 64.96
mean 17.35
median 17.62
std 6.577
q1 13.07
q3 21.63
iqr 8.557
skew 0.05436
kurtosis 0.9823
n_outliers 47
outlier_rate 0.01459
zero_rate 0.009311
Fig 12.
Distribution of pct_rent_burdened_50plus. Vertical dash marks the median.
Show data table
Histogram bins for pct_rent_burdened_50plus (median: 17.62).
bincount
0 – 1.62442
1.624 – 3.24827
3.248 – 4.87234
4.872 – 6.49663
6.496 – 8.12102
8.12 – 9.744148
9.744 – 11.37163
11.37 – 12.99214
12.99 – 14.62242
14.62 – 16.24310
16.24 – 17.86315
17.86 – 19.49332
19.49 – 21.11335
21.11 – 22.74264
22.74 – 24.36219
24.36 – 25.98150
25.98 – 27.6199
27.61 – 29.2364
29.23 – 30.8639
30.86 – 32.4820
32.48 – 34.121
34.1 – 35.739
35.73 – 37.352
37.35 – 38.983
38.98 – 40.61
40.6 – 42.221
42.22 – 43.851
43.85 – 45.470
45.47 – 47.11
47.1 – 48.720
48.72 – 50.340
50.34 – 51.970
51.97 – 53.590
53.59 – 55.220
55.22 – 56.840
56.84 – 58.460
58.46 – 60.090
60.09 – 61.710
61.71 – 63.340
63.34 – 64.961

median_gross_rent numeric feature

Likely the median gross rent (in dollars) for each row's geography, with a typical value near $817.5 and an interquartile range of $718–$978. The data is contaminated by sentinel values: the minimum is -666666666 and the mean of -2,068,220 with std ~3.7e7 is impossible for rent, producing extreme negative skew (-17.87) and kurtosis (317.2). 235 outliers (7.3%) are flagged, but the central distribution looks plausible once sentinels are removed.

Treatment: Replace sentinel negatives (e.g., -666666666) with NaN before any modelling or aggregation.

anthropic:claude-opus-4-7 · confidence high
Out[28]:

saturn.columns["median_gross_rent"].stats

statvalue
n3,222
nulls0 (0.0%)
unique984
min -6.667e+08
max 2,805
mean -2.068e+06
median 817.5
std 3.709e+07
q1 718
q3 978
iqr 260
skew -17.87
kurtosis 317.2
n_outliers 235
outlier_rate 0.07294
zero_rate 0
alert: high_skewskew=-17.87
alert: outliers7.3% rows beyond 1.5 IQR
Fig 13.
Distribution of median_gross_rent. Vertical dash marks the median.
Show data table
Histogram bins for median_gross_rent (median: 817.5).
bincount
-6.667e+08 – -6.5e+0810
-6.5e+08 – -6.333e+080
-6.333e+08 – -6.167e+080
-6.167e+08 – -6e+080
-6e+08 – -5.833e+080
-5.833e+08 – -5.667e+080
-5.667e+08 – -5.5e+080
-5.5e+08 – -5.333e+080
-5.333e+08 – -5.167e+080
-5.167e+08 – -5e+080
-5e+08 – -4.833e+080
-4.833e+08 – -4.667e+080
-4.667e+08 – -4.5e+080
-4.5e+08 – -4.333e+080
-4.333e+08 – -4.167e+080
-4.167e+08 – -4e+080
-4e+08 – -3.833e+080
-3.833e+08 – -3.667e+080
-3.667e+08 – -3.5e+080
-3.5e+08 – -3.333e+080
-3.333e+08 – -3.167e+080
-3.167e+08 – -3e+080
-3e+08 – -2.833e+080
-2.833e+08 – -2.667e+080
-2.667e+08 – -2.5e+080
-2.5e+08 – -2.333e+080
-2.333e+08 – -2.167e+080
-2.167e+08 – -2e+080
-2e+08 – -1.833e+080
-1.833e+08 – -1.667e+080
-1.667e+08 – -1.5e+080
-1.5e+08 – -1.333e+080
-1.333e+08 – -1.167e+080
-1.167e+08 – -1e+080
-1e+08 – -8.333e+070
-8.333e+07 – -6.666e+070
-6.666e+07 – -5e+070
-5e+07 – -3.333e+070
-3.333e+07 – -1.666e+070
-1.666e+07 – 28053212

median_household_income numeric feature

County-level median household income in dollars, with 3099 unique values across 3222 rows and no nulls. The distribution is contaminated by sentinel values: the minimum is -666666666 and the mean of -144603 is implausible against a median of 60458.5, driving extreme skew (-56.74) and kurtosis (3216.99). Once sentinels are removed, the IQR (51814.75 to 70376.25) looks like a typical US income distribution, but 188 outliers (5.83%) remain flagged.

Treatment: Replace -666666666 sentinels with nulls, then consider log-transform or winsorization before modelling.

anthropic:claude-opus-4-7 · confidence high
Out[31]:

saturn.columns["median_household_income"].stats

statvalue
n3,222
nulls0 (0.0%)
unique3,099
min -6.667e+08
max 170,463
mean -1.446e+05
median 6.046e+04
std 1.175e+07
q1 5.181e+04
q3 7.038e+04
iqr 1.856e+04
skew -56.74
kurtosis 3217
n_outliers 188
outlier_rate 0.05835
zero_rate 0
alert: high_skewskew=-56.74
alert: outliers5.8% rows beyond 1.5 IQR
Fig 14.
Distribution of median_household_income. Vertical dash marks the median.
Show data table
Histogram bins for median_household_income (median: 60458.5).
bincount
-6.667e+08 – -6.5e+081
-6.5e+08 – -6.333e+080
-6.333e+08 – -6.167e+080
-6.167e+08 – -6e+080
-6e+08 – -5.833e+080
-5.833e+08 – -5.666e+080
-5.666e+08 – -5.5e+080
-5.5e+08 – -5.333e+080
-5.333e+08 – -5.166e+080
-5.166e+08 – -5e+080
-5e+08 – -4.833e+080
-4.833e+08 – -4.666e+080
-4.666e+08 – -4.499e+080
-4.499e+08 – -4.333e+080
-4.333e+08 – -4.166e+080
-4.166e+08 – -3.999e+080
-3.999e+08 – -3.833e+080
-3.833e+08 – -3.666e+080
-3.666e+08 – -3.499e+080
-3.499e+08 – -3.332e+080
-3.332e+08 – -3.166e+080
-3.166e+08 – -2.999e+080
-2.999e+08 – -2.832e+080
-2.832e+08 – -2.666e+080
-2.666e+08 – -2.499e+080
-2.499e+08 – -2.332e+080
-2.332e+08 – -2.166e+080
-2.166e+08 – -1.999e+080
-1.999e+08 – -1.832e+080
-1.832e+08 – -1.665e+080
-1.665e+08 – -1.499e+080
-1.499e+08 – -1.332e+080
-1.332e+08 – -1.165e+080
-1.165e+08 – -9.986e+070
-9.986e+07 – -8.318e+070
-8.318e+07 – -6.651e+070
-6.651e+07 – -4.984e+070
-4.984e+07 – -3.317e+070
-3.317e+07 – -1.65e+070
-1.65e+07 – 1.705e+053221

total_housing_units numeric feature

Counts of total housing units across 3,222 rows with no nulls and 3,074 distinct values, consistent with one record per geographic area (e.g., county). The distribution is severely right-skewed (skew 12.05, kurtosis 240.5): the median is 10,021 but the mean is 39,403 and the max reaches 3,363,093, with 443 outliers (13.7%) above the IQR fence. Range spans 32 to 3.36M, indicating a mix of very small and very large jurisdictions.

Treatment: log-transform before regression to tame the heavy right tail.

anthropic:claude-opus-4-7 · confidence high
Out[34]:

saturn.columns["total_housing_units"].stats

statvalue
n3,222
nulls0 (0.0%)
unique3,074
min 32
max 3.363e+06
mean 3.94e+04
median 10,021
std 1.201e+05
q1 4211
q3 25,939
iqr 2.173e+04
skew 12.05
kurtosis 240.5
n_outliers 443
outlier_rate 0.1375
zero_rate 0
alert: high_skewskew=+12.05
alert: outliers13.7% rows beyond 1.5 IQR
Fig 15.
Distribution of total_housing_units. Vertical dash marks the median.
Show data table
Histogram bins for total_housing_units (median: 10021.0).
bincount
32 – 8.411e+042907
8.411e+04 – 1.682e+05153
1.682e+05 – 2.523e+0562
2.523e+05 – 3.363e+0538
3.363e+05 – 4.204e+0522
4.204e+05 – 5.045e+056
5.045e+05 – 5.886e+0511
5.886e+05 – 6.726e+055
6.726e+05 – 7.567e+055
7.567e+05 – 8.408e+053
8.408e+05 – 9.249e+051
9.249e+05 – 1.009e+063
1.009e+06 – 1.093e+061
1.093e+06 – 1.177e+061
1.177e+06 – 1.261e+060
1.261e+06 – 1.345e+060
1.345e+06 – 1.429e+060
1.429e+06 – 1.513e+060
1.513e+06 – 1.597e+060
1.597e+06 – 1.682e+061
1.682e+06 – 1.766e+061
1.766e+06 – 1.85e+060
1.85e+06 – 1.934e+060
1.934e+06 – 2.018e+060
2.018e+06 – 2.102e+061
2.102e+06 – 2.186e+060
2.186e+06 – 2.27e+060
2.27e+06 – 2.354e+060
2.354e+06 – 2.438e+060
2.438e+06 – 2.522e+060
2.522e+06 – 2.606e+060
2.606e+06 – 2.69e+060
2.69e+06 – 2.775e+060
2.775e+06 – 2.859e+060
2.859e+06 – 2.943e+060
2.943e+06 – 3.027e+060
3.027e+06 – 3.111e+060
3.111e+06 – 3.195e+060
3.195e+06 – 3.279e+060
3.279e+06 – 3.363e+061

owner_occupied numeric feature

Likely a count of owner-occupied housing units per geographic area, given the integer-like range from 0 to 1,552,164 and median of 7,325.5. The distribution is severely right-skewed (skew 9.52, kurtosis 146.9) with 429 outliers (13.3%) and a mean (25,551.7) far above the median, indicating a few very large areas dominate. Near-unique values (3,001 of 3,222) and effectively no zeros (0.03%) suggest one row per area rather than a categorical flag.

Treatment: log-transform before regression to tame the heavy right tail.

anthropic:claude-opus-4-7 · confidence high
Out[37]:

saturn.columns["owner_occupied"].stats

statvalue
n3,222
nulls0 (0.0%)
unique3,001
min 0
max 1.552e+06
mean 2.555e+04
median 7326
std 6.755e+04
q1 3148
q3 1.886e+04
iqr 1.572e+04
skew 9.516
kurtosis 146.9
n_outliers 429
outlier_rate 0.1331
zero_rate 0.0003104
alert: high_skewskew=+9.52
alert: outliers13.3% rows beyond 1.5 IQR
Fig 16.
Distribution of owner_occupied. Vertical dash marks the median.
Show data table
Histogram bins for owner_occupied (median: 7325.5).
bincount
0 – 3.88e+042761
3.88e+04 – 7.761e+04225
7.761e+04 – 1.164e+0578
1.164e+05 – 1.552e+0552
1.552e+05 – 1.94e+0536
1.94e+05 – 2.328e+0520
2.328e+05 – 2.716e+0510
2.716e+05 – 3.104e+0510
3.104e+05 – 3.492e+056
3.492e+05 – 3.88e+056
3.88e+05 – 4.268e+053
4.268e+05 – 4.656e+053
4.656e+05 – 5.045e+054
5.045e+05 – 5.433e+052
5.433e+05 – 5.821e+050
5.821e+05 – 6.209e+051
6.209e+05 – 6.597e+051
6.597e+05 – 6.985e+050
6.985e+05 – 7.373e+050
7.373e+05 – 7.761e+050
7.761e+05 – 8.149e+050
8.149e+05 – 8.537e+050
8.537e+05 – 8.925e+050
8.925e+05 – 9.313e+051
9.313e+05 – 9.701e+050
9.701e+05 – 1.009e+060
1.009e+06 – 1.048e+060
1.048e+06 – 1.087e+061
1.087e+06 – 1.125e+060
1.125e+06 – 1.164e+060
1.164e+06 – 1.203e+061
1.203e+06 – 1.242e+060
1.242e+06 – 1.281e+060
1.281e+06 – 1.319e+060
1.319e+06 – 1.358e+060
1.358e+06 – 1.397e+060
1.397e+06 – 1.436e+060
1.436e+06 – 1.475e+060
1.475e+06 – 1.513e+060
1.513e+06 – 1.552e+061

renter_occupied numeric feature

Counts of renter-occupied housing units per record, almost certainly aggregated by geography given the spread from 28 to 1,810,929. The distribution is extremely right-skewed (skew 15.82, kurtosis 398.15) with a median of 2,579.5 sitting far below the mean of 13,851, and 449 outliers (14% of rows) likely representing dense urban areas. Near-unique values (2,709 of 3,222) and zero null/zero rate suggest a clean per-area tally rather than a categorical feature.

Treatment: log-transform before regression to tame the heavy right tail.

anthropic:claude-opus-4-7 · confidence high
Out[40]:

saturn.columns["renter_occupied"].stats

statvalue
n3,222
nulls0 (0.0%)
unique2,709
min 28
max 1.811e+06
mean 1.385e+04
median 2580
std 5.535e+04
q1 1004
q3 7396
iqr 6,392
skew 15.82
kurtosis 398.2
n_outliers 449
outlier_rate 0.1394
zero_rate 0
alert: high_skewskew=+15.82
alert: outliers13.9% rows beyond 1.5 IQR
Fig 17.
Distribution of renter_occupied. Vertical dash marks the median.
Show data table
Histogram bins for renter_occupied (median: 2579.5).
bincount
28 – 4.53e+043019
4.53e+04 – 9.057e+04109
9.057e+04 – 1.358e+0538
1.358e+05 – 1.811e+0517
1.811e+05 – 2.264e+0511
2.264e+05 – 2.717e+059
2.717e+05 – 3.169e+055
3.169e+05 – 3.622e+050
3.622e+05 – 4.075e+052
4.075e+05 – 4.528e+052
4.528e+05 – 4.98e+053
4.98e+05 – 5.433e+051
5.433e+05 – 5.886e+051
5.886e+05 – 6.338e+051
6.338e+05 – 6.791e+050
6.791e+05 – 7.244e+051
7.244e+05 – 7.697e+051
7.697e+05 – 8.149e+050
8.149e+05 – 8.602e+050
8.602e+05 – 9.055e+051
9.055e+05 – 9.508e+050
9.508e+05 – 9.96e+050
9.96e+05 – 1.041e+060
1.041e+06 – 1.087e+060
1.087e+06 – 1.132e+060
1.132e+06 – 1.177e+060
1.177e+06 – 1.222e+060
1.222e+06 – 1.268e+060
1.268e+06 – 1.313e+060
1.313e+06 – 1.358e+060
1.358e+06 – 1.403e+060
1.403e+06 – 1.449e+060
1.449e+06 – 1.494e+060
1.494e+06 – 1.539e+060
1.539e+06 – 1.585e+060
1.585e+06 – 1.63e+060
1.63e+06 – 1.675e+060
1.675e+06 – 1.72e+060
1.72e+06 – 1.766e+060
1.766e+06 – 1.811e+061

pct_renter numeric feature

Numeric share-of-renters feature spanning 3.01 to 100.0 with mean 27.35 and median 26.07, suggesting county- or tract-level renter percentages. The distribution is right-skewed (skew 1.32, kurtosis 4.41) with 88 high-side outliers (2.7%) pulling toward 100, while the IQR is a tight 10.02 around the mid-20s. No nulls or zeros, and 1925 unique values across 3222 rows indicate granular but not unique measurements.

Treatment: Consider a log or Yeo-Johnson transform before regression to tame the right tail.

anthropic:claude-opus-4-7 · confidence high
Out[43]:

saturn.columns["pct_renter"].stats

statvalue
n3,222
nulls0 (0.0%)
unique1,925
min 3.01
max 100
mean 27.35
median 26.07
std 8.564
q1 21.64
q3 31.66
iqr 10.02
skew 1.317
kurtosis 4.412
n_outliers 88
outlier_rate 0.02731
zero_rate 0
Fig 18.
Distribution of pct_renter. Vertical dash marks the median.
Show data table
Histogram bins for pct_renter (median: 26.07).
bincount
3.01 – 5.4351
5.435 – 7.8593
7.859 – 10.289
10.28 – 12.7126
12.71 – 15.1363
15.13 – 17.56156
17.56 – 19.98316
19.98 – 22.41371
22.41 – 24.83450
24.83 – 27.26419
27.26 – 29.68357
29.68 – 32.11301
32.11 – 34.53203
34.53 – 36.96169
36.96 – 39.38115
39.38 – 41.8175
41.81 – 44.2356
44.23 – 46.6645
46.66 – 49.0825
49.08 – 51.515
51.5 – 53.9311
53.93 – 56.3510
56.35 – 58.788
58.78 – 61.24
61.2 – 63.634
63.63 – 66.051
66.05 – 68.481
68.48 – 70.93
70.9 – 73.331
73.33 – 75.751
75.75 – 78.180
78.18 – 80.61
80.6 – 83.030
83.03 – 85.451
85.45 – 87.880
87.88 – 90.30
90.3 – 92.730
92.73 – 95.150
95.15 – 97.580
97.58 – 1001

annual_rent numeric feature

This is an annual rent figure in currency units, with a typical tenant paying between 8616 and 11736 (median 9810). However, the minimum of -7999999992 drags the mean to -24818640 and produces extreme skew (-17.87) and kurtosis (317.20), indicating sentinel values or sign errors masquerading as rents. 235 outliers (7.3%) sit outside the IQR fence, so the tail is not a single rogue record.

Treatment: Clip or null negative sentinels, then log-transform before modelling.

anthropic:claude-opus-4-7 · confidence high
Out[46]:

saturn.columns["annual_rent"].stats

statvalue
n3,222
nulls0 (0.0%)
unique984
min -8e+09
max 33,660
mean -2.482e+07
median 9,810
std 4.451e+08
q1 8,616
q3 11,736
iqr 3,120
skew -17.87
kurtosis 317.2
n_outliers 235
outlier_rate 0.07294
zero_rate 0
alert: high_skewskew=-17.87
alert: outliers7.3% rows beyond 1.5 IQR
Fig 19.
Distribution of annual_rent. Vertical dash marks the median.
Show data table
Histogram bins for annual_rent (median: 9810.0).
bincount
-8e+09 – -7.8e+0910
-7.8e+09 – -7.6e+090
-7.6e+09 – -7.4e+090
-7.4e+09 – -7.2e+090
-7.2e+09 – -7e+090
-7e+09 – -6.8e+090
-6.8e+09 – -6.6e+090
-6.6e+09 – -6.4e+090
-6.4e+09 – -6.2e+090
-6.2e+09 – -6e+090
-6e+09 – -5.8e+090
-5.8e+09 – -5.6e+090
-5.6e+09 – -5.4e+090
-5.4e+09 – -5.2e+090
-5.2e+09 – -5e+090
-5e+09 – -4.8e+090
-4.8e+09 – -4.6e+090
-4.6e+09 – -4.4e+090
-4.4e+09 – -4.2e+090
-4.2e+09 – -4e+090
-4e+09 – -3.8e+090
-3.8e+09 – -3.6e+090
-3.6e+09 – -3.4e+090
-3.4e+09 – -3.2e+090
-3.2e+09 – -3e+090
-3e+09 – -2.8e+090
-2.8e+09 – -2.6e+090
-2.6e+09 – -2.4e+090
-2.4e+09 – -2.2e+090
-2.2e+09 – -2e+090
-2e+09 – -1.8e+090
-1.8e+09 – -1.6e+090
-1.6e+09 – -1.4e+090
-1.4e+09 – -1.2e+090
-1.2e+09 – -1e+090
-1e+09 – -8e+080
-8e+08 – -6e+080
-6e+08 – -4e+080
-4e+08 – -2e+080
-2e+08 – 3.366e+043212

rent_to_income_ratio numeric feature

This is a numeric feature meant to capture rent as a proportion (or percentage) of income, with the bulk of values clustered tightly between q1 15.07 and q3 19.39 around a median of 17.05. However, the column is badly corrupted: the minimum is -24,357,569.09 driving a mean of -37,244.13 and a std of 752,361.70, with skew -22.74 and kurtosis 570.21. Negative ratios are nonsensical here and 114 outliers (3.54%) plus a max of 1200 suggest data entry or unit errors rather than genuine variation.

Treatment: Clip or filter to plausible non-negative ratios and investigate the extreme negatives before any modelling.

anthropic:claude-opus-4-7 · confidence high
Out[49]:

saturn.columns["rent_to_income_ratio"].stats

statvalue
n3,222
nulls0 (0.0%)
unique1,278
min -2.436e+07
max 1,200
mean -3.724e+04
median 17.05
std 7.524e+05
q1 15.07
q3 19.39
iqr 4.317
skew -22.74
kurtosis 570.2
n_outliers 114
outlier_rate 0.03538
zero_rate 0
alert: high_skewskew=-22.74
Fig 20.
Distribution of rent_to_income_ratio. Vertical dash marks the median.
Show data table
Histogram bins for rent_to_income_ratio (median: 17.05).
bincount
-2.436e+07 – -2.375e+071
-2.375e+07 – -2.314e+070
-2.314e+07 – -2.253e+070
-2.253e+07 – -2.192e+070
-2.192e+07 – -2.131e+070
-2.131e+07 – -2.07e+070
-2.07e+07 – -2.009e+070
-2.009e+07 – -1.949e+070
-1.949e+07 – -1.888e+070
-1.888e+07 – -1.827e+070
-1.827e+07 – -1.766e+070
-1.766e+07 – -1.705e+071
-1.705e+07 – -1.644e+070
-1.644e+07 – -1.583e+070
-1.583e+07 – -1.522e+070
-1.522e+07 – -1.461e+071
-1.461e+07 – -1.401e+070
-1.401e+07 – -1.34e+071
-1.34e+07 – -1.279e+071
-1.279e+07 – -1.218e+070
-1.218e+07 – -1.157e+070
-1.157e+07 – -1.096e+071
-1.096e+07 – -1.035e+070
-1.035e+07 – -9.742e+061
-9.742e+06 – -9.133e+060
-9.133e+06 – -8.524e+060
-8.524e+06 – -7.915e+060
-7.915e+06 – -7.306e+061
-7.306e+06 – -6.697e+061
-6.697e+06 – -6.088e+060
-6.088e+06 – -5.48e+060
-5.48e+06 – -4.871e+060
-4.871e+06 – -4.262e+060
-4.262e+06 – -3.653e+060
-3.653e+06 – -3.044e+060
-3.044e+06 – -2.435e+060
-2.435e+06 – -1.826e+060
-1.826e+06 – -1.217e+060
-1.217e+06 – -6.078e+050
-6.078e+05 – 12003213

affordability_category categorical label

A categorical affordability bucket with three levels: Affordable, Moderately Burdened, and Extremely Burdened. The distribution is extremely imbalanced — 'Affordable' covers 3192 of 3222 rows (top_rate 0.9907), leaving only 29 'Moderately Burdened' and a single 'Extremely Burdened' record. Entropy ratio of 0.049 confirms there is almost no information in this column as-is.

Treatment: Collapse to a binary affordable-vs-burdened flag or drop; near-constant for modelling.

anthropic:claude-opus-4-7 · confidence high
Out[52]:

saturn.columns["affordability_category"].stats

statvalue
n3,222
nulls0 (0.0%)
unique3
top_value Affordable
top_rate 0.9907
cardinality 3
entropy 0.07815
entropy_ratio 0.04931
alert: imbalancetop value is 99.1% of rows
Fig 21.
Top values for affordability_category.
Show data table
Top values for affordability_category (3 unique shown, of 3 total).
valuecountshare
Affordable319299.1%
Moderately Burdened290.9%
Extremely Burdened10.0%

hours_at_min_wage_for_rent numeric feature

This column appears to capture how many minimum-wage hours are needed to cover rent, with a typical value around 113 hours (IQR 99–135). However, the data is corrupted by extreme negative values: the minimum is -91,954,023 and the mean is -285,271, despite a max of 387, producing severe negative skew (-17.87) and kurtosis of 317. About 7.2% of rows (232) are flagged as outliers, suggesting sentinel codes or data-entry errors masquerading as numeric values.

Treatment: Filter out negative sentinel values before use, then consider a log or robust transform.

anthropic:claude-opus-4-7 · confidence high
Out[55]:

saturn.columns["hours_at_min_wage_for_rent"].stats

statvalue
n3,222
nulls0 (0.0%)
unique230
min -9.195e+07
max 387
mean -2.853e+05
median 113
std 5.116e+06
q1 99
q3 135
iqr 36
skew -17.87
kurtosis 317.2
n_outliers 232
outlier_rate 0.072
zero_rate 0
alert: high_skewskew=-17.87
alert: outliers7.2% rows beyond 1.5 IQR
Fig 22.
Distribution of hours_at_min_wage_for_rent. Vertical dash marks the median.
Show data table
Histogram bins for hours_at_min_wage_for_rent (median: 113.0).
bincount
-9.195e+07 – -8.966e+0710
-8.966e+07 – -8.736e+070
-8.736e+07 – -8.506e+070
-8.506e+07 – -8.276e+070
-8.276e+07 – -8.046e+070
-8.046e+07 – -7.816e+070
-7.816e+07 – -7.586e+070
-7.586e+07 – -7.356e+070
-7.356e+07 – -7.126e+070
-7.126e+07 – -6.897e+070
-6.897e+07 – -6.667e+070
-6.667e+07 – -6.437e+070
-6.437e+07 – -6.207e+070
-6.207e+07 – -5.977e+070
-5.977e+07 – -5.747e+070
-5.747e+07 – -5.517e+070
-5.517e+07 – -5.287e+070
-5.287e+07 – -5.057e+070
-5.057e+07 – -4.828e+070
-4.828e+07 – -4.598e+070
-4.598e+07 – -4.368e+070
-4.368e+07 – -4.138e+070
-4.138e+07 – -3.908e+070
-3.908e+07 – -3.678e+070
-3.678e+07 – -3.448e+070
-3.448e+07 – -3.218e+070
-3.218e+07 – -2.988e+070
-2.988e+07 – -2.759e+070
-2.759e+07 – -2.529e+070
-2.529e+07 – -2.299e+070
-2.299e+07 – -2.069e+070
-2.069e+07 – -1.839e+070
-1.839e+07 – -1.609e+070
-1.609e+07 – -1.379e+070
-1.379e+07 – -1.149e+070
-1.149e+07 – -9.195e+060
-9.195e+06 – -6.896e+060
-6.896e+06 – -4.597e+060
-4.597e+06 – -2.298e+060
-2.298e+06 – 3873212

weeks_at_min_wage_for_rent numeric feature

This column appears to capture how many weeks of full-time minimum-wage work are needed to cover rent, with a typical value around 2.8 weeks (IQR 2.5-3.4). However, the data is severely corrupted: the minimum is -2,298,850.6 and the mean is -7,131.79, while the max is only 9.7, producing extreme skew (-17.87) and kurtosis (317.20) with 232 outliers (7.2% of rows). Negative weeks are nonsensical for this metric, suggesting sentinel values or data-entry errors masquerading as numbers.

Treatment: Filter out negative values as invalid before any modelling, then consider winsorizing the upper tail.

anthropic:claude-opus-4-7 · confidence high
Out[58]:

saturn.columns["weeks_at_min_wage_for_rent"].stats

statvalue
n3,222
nulls0 (0.0%)
unique72
min -2.299e+06
max 9.7
mean -7132
median 2.8
std 1.279e+05
q1 2.5
q3 3.4
iqr 0.9
skew -17.87
kurtosis 317.2
n_outliers 232
outlier_rate 0.072
zero_rate 0
alert: high_skewskew=-17.87
alert: outliers7.2% rows beyond 1.5 IQR
Fig 23.
Distribution of weeks_at_min_wage_for_rent. Vertical dash marks the median.
Show data table
Histogram bins for weeks_at_min_wage_for_rent (median: 2.8).
bincount
-2.299e+06 – -2.241e+0610
-2.241e+06 – -2.184e+060
-2.184e+06 – -2.126e+060
-2.126e+06 – -2.069e+060
-2.069e+06 – -2.011e+060
-2.011e+06 – -1.954e+060
-1.954e+06 – -1.897e+060
-1.897e+06 – -1.839e+060
-1.839e+06 – -1.782e+060
-1.782e+06 – -1.724e+060
-1.724e+06 – -1.667e+060
-1.667e+06 – -1.609e+060
-1.609e+06 – -1.552e+060
-1.552e+06 – -1.494e+060
-1.494e+06 – -1.437e+060
-1.437e+06 – -1.379e+060
-1.379e+06 – -1.322e+060
-1.322e+06 – -1.264e+060
-1.264e+06 – -1.207e+060
-1.207e+06 – -1.149e+060
-1.149e+06 – -1.092e+060
-1.092e+06 – -1.034e+060
-1.034e+06 – -9.77e+050
-9.77e+05 – -9.195e+050
-9.195e+05 – -8.621e+050
-8.621e+05 – -8.046e+050
-8.046e+05 – -7.471e+050
-7.471e+05 – -6.896e+050
-6.896e+05 – -6.322e+050
-6.322e+05 – -5.747e+050
-5.747e+05 – -5.172e+050
-5.172e+05 – -4.598e+050
-4.598e+05 – -4.023e+050
-4.023e+05 – -3.448e+050
-3.448e+05 – -2.873e+050
-2.873e+05 – -2.299e+050
-2.299e+05 – -1.724e+050
-1.724e+05 – -1.149e+050
-1.149e+05 – -5.746e+040
-5.746e+04 – 9.73212

How to cite

click to copy

BibTeX
@misc{saturn-us-housing-affordability-crisis-housing-crisis-merged-2026,
  author       = {Steuber, Luke},
  title        = {Saturn reading: us housing affordability crisis housing crisis merged},
  year         ={2026},
  howpublished = {\url{https://dr.eamer.dev/saturn/view/us-housing-affordability-crisis-housing_crisis_merged}},
  note         = {Profiled with saturn-dissect v0.2.0, prompt saturn-insight-v2, model anthropic:claude-opus-4-7},
}
APA
Steuber, L. (2026). Saturn reading: us housing affordability crisis housing crisis merged. Source: /home/coolhand/datasets/us-housing-affordability-crisis/housing_crisis_merged.csv. Profiled with saturn-dissect v0.2.0 (saturn-insight-v2, anthropic:claude-opus-4-7). Retrieved from https://dr.eamer.dev/saturn/view/us-housing-affordability-crisis-housing_crisis_merged