saturn

/home/coolhand/html/datavis/data_trove/economic/housing/nyc/nyc_housing_metrics_merged.csv 2,327 rows sample n=2,327 seed 42 2026-06-21T23:33:50+00:00

Overview

Source/home/coolhand/html/datavis/data_trove/economic/housing/nyc/nyc_housing_metrics_merged.csv
Total rows2,327
Profiled sample2,327
Columns23
Generated2026-06-21T23:33:50+00:00
Show data table
Per-column null rate across the corpus.
columnkindnull %
total_renter_householdsnumeric0.0%
rent_30_to_34_9_pctnumeric0.0%
rent_35_to_39_9_pctnumeric0.0%
rent_40_to_49_9_pctnumeric0.0%
rent_50_pct_or_morenumeric0.0%
NAMEtext0.0%
statenumeric0.0%
countynumeric0.0%
tractnumeric0.0%
county_namecategorical0.0%
moderate_burdennumeric0.0%
severe_burdennumeric0.0%
pct_moderate_burdennumeric4.4%
pct_severe_burdennumeric4.4%
rent_burdenednumeric0.0%
pct_rent_burdenednumeric4.4%
median_gross_rentnumeric0.0%
median_household_incomenumeric0.0%
total_householdsnumeric0.0%
owner_occupiednumeric0.0%
renter_occupiednumeric0.0%
pct_owner_occupiednumeric4.1%
pct_renter_occupiednumeric4.1%

Insights opt-in

Model-generated narrative. These are opinions, not facts — the stats below are what saturn measured. Generated by: anthropic:default.

Dataset high anthropic:default

This dataset covers housing affordability metrics for 2,327 census tracts across New York City's five boroughs, with variables spanning rent burden, household income, gross rent, and tenure type. The most urgent data quality issue is that both `median_gross_rent` and `median_household_income` contain extreme negative sentinel values (min of -666,666,666), which wildly distort their means and standard deviations — these columns must be filtered or recoded before any analysis. Substantively, rent burden is the headline story: the median tract has 50% of renter households paying more than 30% of income on rent (`pct_rent_burdened` median = 50.0), and severe burden (≥50% of income) affects a median of 26.2% of renters per tract. Brooklyn leads in tract count (805 tracts, 34.6% of the dataset), followed by Queens (725) and the Bronx (361), so borough-level comparisons are feasible but uneven in sample size.

median_gross_rent high anthropic:default

This column represents median gross rent (likely in USD) across 2,327 geographic units, with a healthy central range of $1,441–$2,049 (IQR) and a median of $1,735. The mean of –$41.5M and minimum of –$666,666,666 are severe red flags: a sentinel/placeholder value (e.g., –666666666) has been used for missing or N/A records, which is driving the extreme negative skew (–3.62), kurtosis of 11.1, and the 289 outliers (12.4% of rows). These corrupted values must be treated as nulls before any analysis.

median_household_income high anthropic:default

This column represents median household income, likely drawn from census or demographic data, with a plausible central distribution (median $76,833, IQR $49,117–$102,360) that aligns with typical US household income ranges. However, the column is severely corrupted by sentinel/error values: a minimum of -666,666,666 drags the mean to -$36,017,397 and produces extreme negative skew (-3.94) and kurtosis of 13.5. With 208 outliers (8.9% of rows) and a max capped at $250,001 (suggesting a top-coded value), a meaningful subset of records contain invalid or placeholder negative values that must be treated before any analysis.

rent_30_to_34_9_pct high anthropic:default

This column represents a count (or weighted count) of households paying 30–34.9% of income toward rent — a standard housing cost-burden threshold bracket. The distribution is severely right-skewed (skew=2.76, kurtosis=13.86), with a median of 51 but a mean of 83 and a max of 1205, indicating a long tail driven by high-density geographies. About 16.2% of rows are zero (areas with no such households or suppressed data), and 124 observations (~5.3%) are flagged as outliers, suggesting a mix of very small and very large geographic units in the dataset.

owner_occupied high anthropic:default

This column represents a count of owner-occupied housing units per geographic area (e.g., census tract or block group), with values ranging from 0 to 3,052 and a mean of ~465. The distribution is notably right-skewed (skew = 1.76) with high kurtosis (4.25), and 143 outliers (~6.1% of rows) pull the tail well above the median of 371. A 7.2% zero rate is worth investigating — these may be areas with no owner-occupied units or data gaps.

NAME high anthropic:default

This column contains fully-qualified U.S. Census tract names for New York City, following the pattern 'Census Tract [number], [County] County; New York' — every one of the 2,327 rows carries this format. All 2,327 values are unique with zero nulls or duplicates, and the extremely tight string-length range (min 38, max 46, mean 41.6) confirms a highly templated naming convention. The top words show 'census', 'tract', and 'county;' each appear exactly 2,327 times, while borough-level counties (Kings 805, Queens 725, Bronx 361, Richmond 126) account for NYC's five boroughs. The vocabulary size of 1,539 relative to 2,327 rows reflects the shared template words plus varying tract numbers and county names.

rent_35_to_39_9_pct high anthropic:default

This column represents the count (or estimated count) of renter households paying 35–39.9% of their income on rent — a standard housing cost-burden bracket used in Census/ACS tabulations. The distribution is severely right-skewed (skew 2.40, kurtosis 9.27), with a median of 35 but a mean of 58 and a maximum of 633, indicating that most geographic units have low counts while a small number of populous areas drive extreme values. Nearly 20% of rows are zero, suggesting many small geographies have no households in this rent-burden band, and 110 observations (4.7%) qualify as outliers. The IQR of 73 against a median of 35 confirms substantial spread relative to the typical value.

rent_40_to_49_9_pct high anthropic:default

This column represents a count (or estimate) of renter households spending 40–49.9% of their income on rent within some geographic unit, likely a census tract or block group. The distribution is heavily right-skewed (skew=2.14, kurtosis=7.14) with a median of 49 but a mean of 74.68 and a maximum of 740, indicating a small number of high-density areas pull the mean far above the typical value. About 15.6% of rows are zero—geographic units with no households in this rent-burden band—and 111 observations (4.8%) are flagged as outliers, consistent with urban concentration effects.

state high anthropic:default

This column appears to encode a categorical 'state' as a numeric code, but every single one of its 2,327 non-null rows holds the identical value of 36.0 — making it a zero-variance constant. This is a strong signal that the dataset was filtered or exported for a single state (code 36, which corresponds to New York in U.S. FIPS encoding, though that interpretation is not confirmed by the evidence). With n_unique = 1 and std = 0.0, this column carries no predictive or discriminative information whatsoever.

tract high anthropic:default

This column contains U.S. Census tract codes stored as integers, serving as a geographic identifier rather than a true numeric measure. The values range from 100 to 990100 with 1,530 unique codes across 2,327 rows, meaning roughly 34% of tract codes appear more than once — consistent with multiple records per tract. The extreme skew (10.14) and kurtosis (189.82) reflect the non-uniform geographic distribution of census tracts and the integer encoding scheme (e.g., tract 101 vs. tract 9901), not a meaningful numeric distribution; 63 outliers at the high end likely represent tracts in high-numbered FIPS areas.

county high anthropic:default

Despite being named 'county', this column is stored as a numeric type with only 5 distinct values (5, 47, 81, and two others within the min–max range of 5–85), strongly suggesting it is a numeric county FIPS code or a coded categorical identifier rather than a free-text name. The concentration of values around Q1=47 and Q3=81 with zero outliers and zero nulls indicates a clean, bounded categorical encoding. The mild negative skew (−0.72) and flat kurtosis (−0.45) suggest the five codes are distributed somewhat unevenly, with lower-numbered counties less represented. Analysts should treat the 5 numeric values as nominal category labels, not ordinal or continuous quantities.

moderate_burden medium anthropic:default

This column appears to represent a count or dollar-amount measure of 'moderate burden' — likely a financial, time, or resource burden metric per entity (e.g., household, case, or geographic unit). The distribution is heavily right-skewed (skew=1.93) with high excess kurtosis (6.05), meaning a long upper tail pulls the mean (216.1) well above the median (159.0), and 86 outliers reach as high as 1732. The IQR of 247 spans 64 to 311, while ~6.4% of rows are zero, suggesting a meaningful minority of entities report no burden at all.

rent_50_pct_or_more high anthropic:default

This column likely represents a count of renter households spending 50% or more of their income on rent (severe rent burden), probably aggregated at a geographic unit such as a census tract or ZIP code. The distribution is right-skewed (skew 1.60) with a mean of 253 and a median of 184, indicating many lower-burden areas alongside a long tail of high-burden zones reaching up to 1,918. Kurtosis of 3.44 and 87 outliers (3.7% of rows) confirm a heavy upper tail, and ~6.3% of records are zero, representing areas with no severely rent-burdened households.

rent_burdened medium anthropic:default

This column likely represents a count of rent-burdened households (those spending >30% of income on rent) per geographic unit such as a census tract or ZIP code. The distribution is right-skewed (skew 1.49, kurtosis 3.00) with a mean of 469 well above the median of 358, indicating a long tail driven by 82 outliers reaching up to 3,153 — consistent with dense urban areas pulling the upper end. The IQR spans 164.5 to 670, suggesting high variability across units, and 4.7% of rows are zero, plausibly reflecting areas with negligible renter populations.

renter_occupied high anthropic:default

This column represents the count of renter-occupied housing units, likely aggregated at a census tract or similar geographic level. The distribution is right-skewed (skew 1.59) with a mean of 946 well above the median of 726, indicating many mid-range areas pulled up by a long upper tail reaching 8,209. Kurtosis of 4.63 and 69 outliers (3.0%) confirm a heavy tail of high-density rental areas. A 4.4% zero rate is plausible for owner-dominated or low-density geographies.

severe_burden medium anthropic:default

This column appears to be a quantitative burden score or count (e.g., disease burden, cost burden, or case load) measured per some unit, given its name 'severe_burden' and integer-like range of 0–1918. The distribution is notably right-skewed (skew = 1.60) with excess kurtosis (3.44), meaning a long upper tail pulls the mean (253.2) well above the median (184.0); 87 outliers (3.7% of rows) extend toward the maximum of 1918. About 6.3% of values are exactly zero, which may represent genuine absence of burden or a data-quality artifact worth investigating.

total_households high anthropic:default

This column represents a count of total households per geographic or administrative unit, likely at a census-tract or zip-code level. The distribution is right-skewed (skew = 1.479) with leptokurtic tails (kurtosis = 4.38), meaning a minority of units contain disproportionately large household counts — confirmed by 70 outliers reaching up to 8,209 while the median sits at 1,252. A notable 4.1% of rows carry a zero value, which may indicate uninhabited areas, data gaps, or boundary artifacts worth investigating before modelling.

total_renter_households high anthropic:default

This column counts the total number of renter-occupied households per geographic unit (likely census tract or ZIP code), with values ranging from 0 to 8,209. The distribution is notably right-skewed (skew 1.59) with excess kurtosis of 4.63, indicating a long tail of high-density rental areas pulling the mean (946) well above the median (726). About 4.4% of records are zero, suggesting some areas have no renters at all, and 69 outliers (~3%) represent unusually dense rental markets that may warrant separate treatment.

pct_owner_occupied high anthropic:default

This column represents the percentage of owner-occupied housing units for geographic areas (e.g., census tracts or zip codes), ranging from 0% to 100% with a mean of 37.5% and median of 34.4%. The distribution is surprisingly flat and broad — the IQR alone spans 39.7 percentage points (Q1=16.4, Q3=56.1) — indicating high variability across areas rather than a clustered norm. Negative kurtosis (−0.85) confirms a platykurtic, spread-out distribution with no heavy tails or outliers. A 3.2% zero rate may reflect areas that are entirely renter-occupied or institutionalized populations, which is worth investigating before modelling.

pct_renter_occupied high anthropic:default

This column represents the percentage of housing units that are renter-occupied, likely derived from census or housing survey data at some geographic unit (e.g., tract, ZIP, or neighborhood). The distribution is notably broad — IQR spans 39.7 points (43.9 to 83.6) with a std of 25.65 — indicating high variability in renter rates across observations. The mean (62.5) and median (65.6) both skew toward higher renter shares, suggesting this dataset may oversample urban or higher-density areas where renting is more common. The near-platykurtic shape (kurtosis −0.85) and mild negative skew confirm a relatively flat, left-leaning distribution with no outliers flagged.

pct_moderate_burden high anthropic:default

This column represents the percentage of households (or units) experiencing moderate housing cost burden, likely defined as spending 30–50% of income on housing. The distribution is right-skewed (skew = 1.51) with a heavy tail and high kurtosis (6.70), meaning most observations cluster around a median of 21.8% but a notable minority push toward the maximum of 100.0—59 outliers exist at the upper extreme. The null rate of 4.38% and zero rate of 2.11% are modest but worth flagging as potential data gaps or areas with genuinely zero burden. With only 461 unique values across 2,327 rows, values appear to be rounded or binned percentages rather than continuous measurements.

pct_rent_burdened high anthropic:default

This column represents the percentage of renters who are rent-burdened (typically defined as spending ≥30% of income on rent) across 2,327 geographic or housing units. Surprisingly, the distribution is almost perfectly symmetric around the median of 50.0 and mean of 49.87, with near-zero skew (-0.038) and moderate IQ range of 17.9 — this is unusually bell-shaped for a percentage metric, which normally skews in one direction. The full 0–100 range is present, 62 outliers exist at the tails, and the near-zero rate is negligible at 0.36%, suggesting very few units with no rent burden at all.

pct_severe_burden high anthropic:default

This column represents the percentage of some population or housing units experiencing severe cost burden (likely housing costs exceeding 50% of income, a standard HUD metric). Values range from 0 to 100 with a mean of 27.1% and median of 26.2%, indicating a fairly symmetric distribution with mild right skew (0.57) and modest kurtosis (1.22). Notably, 518 unique values across 2,327 rows suggests aggregated geographic or demographic groupings rather than raw microdata. The 4.38% null rate and 30 outliers (reaching 100%) warrant attention but are not extreme.

county_name high anthropic:default

This column contains the five boroughs of New York City, functioning as a geographic region label for each record. With only 5 unique values across 2,327 rows and zero nulls, it is clean and complete. Brooklyn (Kings) is the most frequent borough at 34.6% (805 records), while Staten Island (Richmond) is the least represented at 126 records — a roughly 6:1 imbalance worth noting for any borough-level modelling. Entropy ratio of 0.898 confirms the distribution is moderately spread but not uniform.

Numeric correlation

Show data table
Pearson correlation across 12 numeric columns (values clipped to 2 decimals).
total_renter_householdsrent_30_to_34_9_pctrent_35_to_39_9_pctrent_40_to_49_9_pctrent_50_pct_or_morestatecountytractmoderate_burdensevere_burdenpct_moderate_burdenpct_severe_burden
total_renter_households+1.00+0.76+0.73+0.76+0.84+nan-0.18-0.23+0.89+0.84-0.03+0.07
rent_30_to_34_9_pct+0.76+1.00+0.55+0.57+0.56+nan-0.14-0.18+0.87+0.56-0.04+0.09
rent_35_to_39_9_pct+0.73+0.55+1.00+0.60+0.61+nan-0.13-0.12+0.81+0.61-0.01+0.04
rent_40_to_49_9_pct+0.76+0.57+0.60+1.00+0.62+nan-0.15-0.15+0.85+0.62-0.02+0.03
rent_50_pct_or_more+0.84+0.56+0.61+0.62+1.00+nan-0.32-0.21+0.70+1.00+0.03+0.11
state+nan+nan+nan+nan+nan+nan+nan+nan+nan+nan+nan+nan
county-0.18-0.14-0.13-0.15-0.32+nan+1.00+0.18-0.16-0.32-0.14-0.07
tract-0.23-0.18-0.12-0.15-0.21+nan+0.18+1.00-0.18-0.21-0.04-0.04
moderate_burden+0.89+0.87+0.81+0.85+0.70+nan-0.16-0.18+1.00+0.70-0.03+0.06
severe_burden+0.84+0.56+0.61+0.62+1.00+nan-0.32-0.21+0.70+1.00+0.03+0.11
pct_moderate_burden-0.03-0.04-0.01-0.02+0.03+nan-0.14-0.04-0.03+0.03+1.00-0.26
pct_severe_burden+0.07+0.09+0.04+0.03+0.11+nan-0.07-0.04+0.06+0.11-0.26+1.00

total_renter_households numeric

rows2,327
null0 (0.0%)
unique1,418
min0.000
max8,209
mean946.145
median726.000
std815.372
q1346.000
q31,357
iqr1,011
skew1.595
kurtosis4.627
n_outliers69
outlier_rate0.030
zero_rate0.044
Show data table
Histogram bins for total_renter_households (median: 726.0).
bincount
0 – 205.2349
205.2 – 410.4358
410.4 – 615.7292
615.7 – 820.9268
820.9 – 1026207
1026 – 1231175
1231 – 1437168
1437 – 1642110
1642 – 1847100
1847 – 205268
2052 – 225763
2257 – 246342
2463 – 266836
2668 – 287322
2873 – 307819
3078 – 328417
3284 – 34896
3489 – 36945
3694 – 38994
3899 – 41046
4104 – 43105
4310 – 45153
4515 – 47201
4720 – 49250
4925 – 51311
5131 – 53361
5336 – 55410
5541 – 57460
5746 – 59520
5952 – 61570
6157 – 63620
6362 – 65670
6567 – 67720
6772 – 69780
6978 – 71830
7183 – 73880
7388 – 75930
7593 – 77990
7799 – 80040
8004 – 82091

rent_30_to_34_9_pct numeric

skew=+2.76 5.3% rows beyond 1.5 IQR
rows2,327
null0 (0.0%)
unique355
min0.000
max1,205
mean83.050
median51.000
std100.320
q115.000
q3116.000
iqr101.000
skew2.755
kurtosis13.860
n_outliers124
outlier_rate0.053
zero_rate0.162
Show data table
Histogram bins for rent_30_to_34_9_pct (median: 51.0).
bincount
0 – 30.12836
30.12 – 60.25444
60.25 – 90.38275
90.38 – 120.5217
120.5 – 150.6152
150.6 – 180.8105
180.8 – 210.990
210.9 – 24148
241 – 271.141
271.1 – 301.225
301.2 – 331.417
331.4 – 361.521
361.5 – 391.615
391.6 – 421.89
421.8 – 451.911
451.9 – 4825
482 – 512.12
512.1 – 542.22
542.2 – 572.44
572.4 – 602.51
602.5 – 632.60
632.6 – 662.81
662.8 – 692.91
692.9 – 7230
723 – 753.11
753.1 – 783.22
783.2 – 813.40
813.4 – 843.50
843.5 – 873.61
873.6 – 903.80
903.8 – 933.90
933.9 – 9640
964 – 994.10
994.1 – 10240
1024 – 10540
1054 – 10840
1084 – 11150
1115 – 11450
1145 – 11750
1175 – 12051

rent_35_to_39_9_pct numeric

skew=+2.40
rows2,327
null0 (0.0%)
unique270
min0.000
max633.000
mean58.351
median35.000
std69.848
q110.000
q383.000
iqr73.000
skew2.395
kurtosis9.275
n_outliers110
outlier_rate0.047
zero_rate0.196
Show data table
Histogram bins for rent_35_to_39_9_pct (median: 35.0).
bincount
0 – 15.82719
15.82 – 31.65371
31.65 – 47.47261
47.47 – 63.3212
63.3 – 79.12163
79.12 – 94.95103
94.95 – 110.897
110.8 – 126.677
126.6 – 142.477
142.4 – 158.258
158.2 – 174.132
174.1 – 189.941
189.9 – 205.720
205.7 – 221.518
221.5 – 237.415
237.4 – 253.210
253.2 – 26914
269 – 284.85
284.8 – 300.73
300.7 – 316.58
316.5 – 332.36
332.3 – 348.12
348.1 – 3642
364 – 379.82
379.8 – 395.61
395.6 – 411.41
411.4 – 427.32
427.3 – 443.12
443.1 – 458.90
458.9 – 474.80
474.8 – 490.61
490.6 – 506.40
506.4 – 522.20
522.2 – 5380
538 – 553.90
553.9 – 569.72
569.7 – 585.50
585.5 – 601.41
601.4 – 617.20
617.2 – 6331

rent_40_to_49_9_pct numeric

skew=+2.14
rows2,327
null0 (0.0%)
unique322
min0.000
max740.000
mean74.676
median49.000
std83.794
q114.000
q3106.000
iqr92.000
skew2.137
kurtosis7.139
n_outliers111
outlier_rate0.048
zero_rate0.156
Show data table
Histogram bins for rent_40_to_49_9_pct (median: 49.0).
bincount
0 – 18.5671
18.5 – 37306
37 – 55.5270
55.5 – 74201
74 – 92.5196
92.5 – 111134
111 – 129.5102
129.5 – 14886
148 – 166.578
166.5 – 18563
185 – 203.539
203.5 – 22238
222 – 240.526
240.5 – 25927
259 – 277.516
277.5 – 29616
296 – 314.513
314.5 – 33311
333 – 351.55
351.5 – 3703
370 – 388.55
388.5 – 4073
407 – 425.54
425.5 – 4441
444 – 462.54
462.5 – 4810
481 – 499.51
499.5 – 5182
518 – 536.50
536.5 – 5550
555 – 573.52
573.5 – 5920
592 – 610.51
610.5 – 6291
629 – 647.51
647.5 – 6660
666 – 684.50
684.5 – 7030
703 – 721.50
721.5 – 7401

rent_50_pct_or_more numeric

rows2,327
null0 (0.0%)
unique706
min0.000
max1,918
mean253.181
median184.000
std236.597
q182.000
q3360.000
iqr278.000
skew1.603
kurtosis3.435
n_outliers87
outlier_rate0.037
zero_rate0.063
Show data table
Histogram bins for rent_50_pct_or_more (median: 184.0).
bincount
0 – 47.95368
47.95 – 95.9293
95.9 – 143.9290
143.9 – 191.8249
191.8 – 239.8186
239.8 – 287.7175
287.7 – 335.7122
335.7 – 383.6114
383.6 – 431.6101
431.6 – 479.583
479.5 – 527.562
527.5 – 575.445
575.4 – 623.448
623.4 – 671.331
671.3 – 719.241
719.2 – 767.228
767.2 – 815.215
815.2 – 863.112
863.1 – 911.114
911.1 – 9597
959 – 10079
1007 – 10558
1055 – 11039
1103 – 11513
1151 – 11994
1199 – 12475
1247 – 12951
1295 – 13432
1343 – 13910
1391 – 14380
1438 – 14860
1486 – 15340
1534 – 15820
1582 – 16301
1630 – 16780
1678 – 17260
1726 – 17740
1774 – 18220
1822 – 18700
1870 – 19181

NAME text

100.0% of rows are unique strings
rows2,327
null0 (0.0%)
unique2,327
len_min38
len_max46
len_mean41.649
len_median41.000
len_p9546.000
word_mean7.133
word_median7.000
n_empty0
n_duplicates0
duplicate_rate0.000
vocab_size1,539
readability_flesch_mean91.451
emoji_rate0.000
url_rate0.000
one_word_rate0.000
allcaps_rate0.000
boilerplate_rate0.000
Show data table
Character-length distribution for NAME (mean: 41.64890416845724).
charscount
38 – 387
38 – 380
38 – 390
39 – 390
39 – 390
39 – 39104
39 – 390
39 – 400
40 – 400
40 – 400
40 – 40785
40 – 400
40 – 410
41 – 410
41 – 410
41 – 41447
41 – 410
41 – 420
42 – 420
42 – 420
42 – 42200
42 – 420
42 – 430
43 – 430
43 – 430
43 – 43378
43 – 430
43 – 440
44 – 440
44 – 440
44 – 44190
44 – 440
44 – 450
45 – 450
45 – 450
45 – 4582
45 – 450
45 – 460
46 – 460
46 – 46134
Sample values (first 10)
  1. Census Tract 4; Bronx County; New York
  2. Census Tract 399.01; Queens County; New York
  3. Census Tract 779.08; Queens County; New York
  4. Census Tract 613.02; Queens County; New York
  5. Census Tract 780; Kings County; New York
  6. Census Tract 156.02; Richmond County; New York
  7. Census Tract 848; Kings County; New York
  8. Census Tract 1008.04; Queens County; New York
  9. Census Tract 618; Queens County; New York
  10. Census Tract 145; Bronx County; New York

state numeric

only one distinct value
rows2,327
null0 (0.0%)
unique1
min36.000
max36.000
mean36.000
median36.000
std0.000
q136.000
q336.000
iqr0.000
skew0.000
kurtosis0.000
n_outliers0
outlier_rate0.000
zero_rate0.000
Show data table
Histogram bins for state (median: 36.0).
bincount
35.5 – 35.520
35.52 – 35.550
35.55 – 35.580
35.58 – 35.60
35.6 – 35.620
35.62 – 35.650
35.65 – 35.670
35.67 – 35.70
35.7 – 35.730
35.73 – 35.750
35.75 – 35.770
35.77 – 35.80
35.8 – 35.830
35.83 – 35.850
35.85 – 35.880
35.88 – 35.90
35.9 – 35.920
35.92 – 35.950
35.95 – 35.980
35.98 – 360
36 – 36.022327
36.02 – 36.050
36.05 – 36.080
36.08 – 36.10
36.1 – 36.120
36.12 – 36.150
36.15 – 36.170
36.17 – 36.20
36.2 – 36.230
36.23 – 36.250
36.25 – 36.270
36.27 – 36.30
36.3 – 36.330
36.33 – 36.350
36.35 – 36.380
36.38 – 36.40
36.4 – 36.420
36.42 – 36.450
36.45 – 36.480
36.48 – 36.50

county numeric

rows2,327
null0 (0.0%)
unique5
min5.000
max85.000
mean55.000
median47.000
std25.969
q147.000
q381.000
iqr34.000
skew-0.720
kurtosis-0.453
n_outliers0
outlier_rate0.000
zero_rate0.000
Show data table
Histogram bins for county (median: 47.0).
bincount
5 – 7361
7 – 90
9 – 110
11 – 130
13 – 150
15 – 170
17 – 190
19 – 210
21 – 230
23 – 250
25 – 270
27 – 290
29 – 310
31 – 330
33 – 350
35 – 370
37 – 390
39 – 410
41 – 430
43 – 450
45 – 470
47 – 49805
49 – 510
51 – 530
53 – 550
55 – 570
57 – 590
59 – 610
61 – 63310
63 – 650
65 – 670
67 – 690
69 – 710
71 – 730
73 – 750
75 – 770
77 – 790
79 – 810
81 – 83725
83 – 85126

tract numeric

skew=+10.14
rows2,327
null0 (0.0%)
unique1,530
min100.000
max990,100
mean42,252
median30,100
std48,265
q115,200
q357,900
iqr42,700
skew10.143
kurtosis189.824
n_outliers63
outlier_rate0.027
zero_rate0.000
Show data table
Histogram bins for tract (median: 30100.0).
bincount
100 – 2.485e+04982
2.485e+04 – 4.96e+04617
4.96e+04 – 7.435e+04329
7.435e+04 – 9.91e+04197
9.91e+04 – 1.238e+05145
1.238e+05 – 1.486e+0537
1.486e+05 – 1.734e+0517
1.734e+05 – 1.981e+050
1.981e+05 – 2.228e+050
2.228e+05 – 2.476e+050
2.476e+05 – 2.724e+050
2.724e+05 – 2.971e+050
2.971e+05 – 3.218e+050
3.218e+05 – 3.466e+050
3.466e+05 – 3.714e+050
3.714e+05 – 3.961e+050
3.961e+05 – 4.208e+050
4.208e+05 – 4.456e+050
4.456e+05 – 4.704e+050
4.704e+05 – 4.951e+050
4.951e+05 – 5.198e+050
5.198e+05 – 5.446e+050
5.446e+05 – 5.694e+050
5.694e+05 – 5.941e+050
5.941e+05 – 6.188e+050
6.188e+05 – 6.436e+050
6.436e+05 – 6.684e+050
6.684e+05 – 6.931e+050
6.931e+05 – 7.178e+050
7.178e+05 – 7.426e+050
7.426e+05 – 7.674e+050
7.674e+05 – 7.921e+050
7.921e+05 – 8.168e+050
8.168e+05 – 8.416e+050
8.416e+05 – 8.664e+050
8.664e+05 – 8.911e+050
8.911e+05 – 9.158e+050
9.158e+05 – 9.406e+050
9.406e+05 – 9.654e+050
9.654e+05 – 9.901e+053

county_name categorical

rows2,327
null0 (0.0%)
unique5
top_valueBrooklyn (Kings)
top_rate0.346
cardinality5
entropy2.086
entropy_ratio0.898
Show data table
Top values for county_name (5 unique shown, of 5 total).
valuecountshare
Brooklyn (Kings)80534.6%
Queens72531.2%
Bronx36115.5%
Manhattan (New York)31013.3%
Staten Island (Richmond)1265.4%
Top values (rank 1–20)
  1. Brooklyn (Kings) — 805
  2. Queens — 725
  3. Bronx — 361
  4. Manhattan (New York) — 310
  5. Staten Island (Richmond) — 126

moderate_burden numeric

rows2,327
null0 (0.0%)
unique639
min0.000
max1,732
mean216.076
median159.000
std210.384
q164.000
q3311.000
iqr247.000
skew1.934
kurtosis6.052
n_outliers86
outlier_rate0.037
zero_rate0.064
Show data table
Histogram bins for moderate_burden (median: 159.0).
bincount
0 – 43.3431
43.3 – 86.6317
86.6 – 129.9256
129.9 – 173.2245
173.2 – 216.5190
216.5 – 259.8149
259.8 – 303.1137
303.1 – 346.4109
346.4 – 389.7105
389.7 – 43389
433 – 476.361
476.3 – 519.653
519.6 – 562.928
562.9 – 606.233
606.2 – 649.528
649.5 – 692.817
692.8 – 736.111
736.1 – 779.416
779.4 – 822.77
822.7 – 86610
866 – 909.36
909.3 – 952.69
952.6 – 995.92
995.9 – 10390
1039 – 10823
1082 – 11266
1126 – 11691
1169 – 12120
1212 – 12561
1256 – 12991
1299 – 13421
1342 – 13860
1386 – 14290
1429 – 14720
1472 – 15160
1516 – 15593
1559 – 16020
1602 – 16451
1645 – 16890
1689 – 17321

severe_burden numeric

rows2,327
null0 (0.0%)
unique706
min0.000
max1,918
mean253.181
median184.000
std236.597
q182.000
q3360.000
iqr278.000
skew1.603
kurtosis3.435
n_outliers87
outlier_rate0.037
zero_rate0.063
Show data table
Histogram bins for severe_burden (median: 184.0).
bincount
0 – 47.95368
47.95 – 95.9293
95.9 – 143.9290
143.9 – 191.8249
191.8 – 239.8186
239.8 – 287.7175
287.7 – 335.7122
335.7 – 383.6114
383.6 – 431.6101
431.6 – 479.583
479.5 – 527.562
527.5 – 575.445
575.4 – 623.448
623.4 – 671.331
671.3 – 719.241
719.2 – 767.228
767.2 – 815.215
815.2 – 863.112
863.1 – 911.114
911.1 – 9597
959 – 10079
1007 – 10558
1055 – 11039
1103 – 11513
1151 – 11994
1199 – 12475
1247 – 12951
1295 – 13432
1343 – 13910
1391 – 14380
1438 – 14860
1486 – 15340
1534 – 15820
1582 – 16301
1630 – 16780
1678 – 17260
1726 – 17740
1774 – 18220
1822 – 18700
1870 – 19181

pct_moderate_burden numeric

rows2,327
null102 (4.4%)
unique461
min0.000
max100.000
mean22.744
median21.800
std11.359
q115.900
q328.200
iqr12.300
skew1.509
kurtosis6.704
n_outliers59
outlier_rate0.027
zero_rate0.021
Show data table
Histogram bins for pct_moderate_burden (median: 21.8).
bincount
0 – 2.555
2.5 – 524
5 – 7.549
7.5 – 1078
10 – 12.5108
12.5 – 15160
15 – 17.5213
17.5 – 20251
20 – 22.5238
22.5 – 25240
25 – 27.5193
27.5 – 30172
30 – 32.5129
32.5 – 3579
35 – 37.571
37.5 – 4041
40 – 42.531
42.5 – 4524
45 – 47.514
47.5 – 5012
50 – 52.56
52.5 – 554
55 – 57.54
57.5 – 606
60 – 62.50
62.5 – 653
65 – 67.53
67.5 – 705
70 – 72.50
72.5 – 751
75 – 77.51
77.5 – 801
80 – 82.51
82.5 – 851
85 – 87.50
87.5 – 901
90 – 92.52
92.5 – 951
95 – 97.50
97.5 – 1003

pct_severe_burden numeric

rows2,327
null102 (4.4%)
unique518
min0.000
max100.000
mean27.124
median26.200
std12.677
q118.700
q334.600
iqr15.900
skew0.566
kurtosis1.222
n_outliers30
outlier_rate0.013
zero_rate0.020
Show data table
Histogram bins for pct_severe_burden (median: 26.2).
bincount
0 – 2.545
2.5 – 514
5 – 7.541
7.5 – 1053
10 – 12.594
12.5 – 15115
15 – 17.5131
17.5 – 20160
20 – 22.5170
22.5 – 25188
25 – 27.5188
27.5 – 30168
30 – 32.5173
32.5 – 35157
35 – 37.5115
37.5 – 4097
40 – 42.573
42.5 – 4562
45 – 47.544
47.5 – 5035
50 – 52.529
52.5 – 5519
55 – 57.518
57.5 – 6012
60 – 62.56
62.5 – 654
65 – 67.54
67.5 – 702
70 – 72.51
72.5 – 751
75 – 77.51
77.5 – 801
80 – 82.50
82.5 – 851
85 – 87.51
87.5 – 900
90 – 92.51
92.5 – 950
95 – 97.50
97.5 – 1001

rent_burdened numeric

rows2,327
null0 (0.0%)
unique1,013
min0.000
max3,153
mean469.258
median358.000
std415.279
q1164.500
q3670.000
iqr505.500
skew1.494
kurtosis3.005
n_outliers82
outlier_rate0.035
zero_rate0.047
Show data table
Histogram bins for rent_burdened (median: 358.0).
bincount
0 – 78.83310
78.83 – 157.7256
157.7 – 236.5264
236.5 – 315.3231
315.3 – 394.1190
394.1 – 473180
473 – 551.8147
551.8 – 630.6113
630.6 – 709.4108
709.4 – 788.275
788.2 – 867.191
867.1 – 945.973
945.9 – 102557
1025 – 110439
1104 – 118241
1182 – 126123
1261 – 134026
1340 – 141920
1419 – 149819
1498 – 157611
1576 – 165516
1655 – 17346
1734 – 18135
1813 – 18926
1892 – 19714
1971 – 20492
2049 – 21283
2128 – 22075
2207 – 22860
2286 – 23650
2365 – 24441
2444 – 25221
2522 – 26012
2601 – 26801
2680 – 27590
2759 – 28380
2838 – 29170
2917 – 29950
2995 – 30740
3074 – 31531

pct_rent_burdened numeric

rows2,327
null102 (4.4%)
unique596
min0.000
max100.000
mean49.867
median50.000
std14.615
q140.900
q358.800
iqr17.900
skew-0.038
kurtosis0.785
n_outliers62
outlier_rate0.028
zero_rate3.60e-03
Show data table
Histogram bins for pct_rent_burdened (median: 50.0).
bincount
0 – 2.58
2.5 – 53
5 – 7.51
7.5 – 105
10 – 12.57
12.5 – 158
15 – 17.512
17.5 – 2014
20 – 22.514
22.5 – 2524
25 – 27.535
27.5 – 3042
30 – 32.553
32.5 – 3580
35 – 37.591
37.5 – 40119
40 – 42.5129
42.5 – 45144
45 – 47.5146
47.5 – 50177
50 – 52.5139
52.5 – 55178
55 – 57.5162
57.5 – 60131
60 – 62.5117
62.5 – 6597
65 – 67.560
67.5 – 7057
70 – 72.554
72.5 – 7528
75 – 77.520
77.5 – 8025
80 – 82.58
82.5 – 854
85 – 87.512
87.5 – 905
90 – 92.55
92.5 – 953
95 – 97.50
97.5 – 1008

median_gross_rent numeric

skew=-3.62 12.4% rows beyond 1.5 IQR
rows2,327
null0 (0.0%)
unique1,232
min-666,666,666
max3,501
mean-41,539,609
median1,735
std161,182,639
q11,442
q32,049
iqr607.500
skew-3.621
kurtosis11.115
n_outliers289
outlier_rate0.124
zero_rate0.000
Show data table
Histogram bins for median_gross_rent (median: 1735.0).
bincount
-6.667e+08 – -6.5e+08145
-6.5e+08 – -6.333e+080
-6.333e+08 – -6.167e+080
-6.167e+08 – -6e+080
-6e+08 – -5.833e+080
-5.833e+08 – -5.667e+080
-5.667e+08 – -5.5e+080
-5.5e+08 – -5.333e+080
-5.333e+08 – -5.167e+080
-5.167e+08 – -5e+080
-5e+08 – -4.833e+080
-4.833e+08 – -4.667e+080
-4.667e+08 – -4.5e+080
-4.5e+08 – -4.333e+080
-4.333e+08 – -4.167e+080
-4.167e+08 – -4e+080
-4e+08 – -3.833e+080
-3.833e+08 – -3.667e+080
-3.667e+08 – -3.5e+080
-3.5e+08 – -3.333e+080
-3.333e+08 – -3.167e+080
-3.167e+08 – -3e+080
-3e+08 – -2.833e+080
-2.833e+08 – -2.667e+080
-2.667e+08 – -2.5e+080
-2.5e+08 – -2.333e+080
-2.333e+08 – -2.167e+080
-2.167e+08 – -2e+080
-2e+08 – -1.833e+080
-1.833e+08 – -1.667e+080
-1.667e+08 – -1.5e+080
-1.5e+08 – -1.333e+080
-1.333e+08 – -1.167e+080
-1.167e+08 – -1e+080
-1e+08 – -8.333e+070
-8.333e+07 – -6.666e+070
-6.666e+07 – -5e+070
-5e+07 – -3.333e+070
-3.333e+07 – -1.666e+070
-1.666e+07 – 35012182

median_household_income numeric

skew=-3.94 8.9% rows beyond 1.5 IQR
rows2,327
null0 (0.0%)
unique2,106
min-666,666,666
max250,001
mean-36,017,397
median76,833
std150,923,372
q153,242
q3102,360
iqr49,117
skew-3.940
kurtosis13.525
n_outliers208
outlier_rate0.089
zero_rate0.000
Show data table
Histogram bins for median_household_income (median: 76833.0).
bincount
-6.667e+08 – -6.5e+08126
-6.5e+08 – -6.333e+080
-6.333e+08 – -6.166e+080
-6.166e+08 – -6e+080
-6e+08 – -5.833e+080
-5.833e+08 – -5.666e+080
-5.666e+08 – -5.5e+080
-5.5e+08 – -5.333e+080
-5.333e+08 – -5.166e+080
-5.166e+08 – -4.999e+080
-4.999e+08 – -4.833e+080
-4.833e+08 – -4.666e+080
-4.666e+08 – -4.499e+080
-4.499e+08 – -4.332e+080
-4.332e+08 – -4.166e+080
-4.166e+08 – -3.999e+080
-3.999e+08 – -3.832e+080
-3.832e+08 – -3.666e+080
-3.666e+08 – -3.499e+080
-3.499e+08 – -3.332e+080
-3.332e+08 – -3.165e+080
-3.165e+08 – -2.999e+080
-2.999e+08 – -2.832e+080
-2.832e+08 – -2.665e+080
-2.665e+08 – -2.498e+080
-2.498e+08 – -2.332e+080
-2.332e+08 – -2.165e+080
-2.165e+08 – -1.998e+080
-1.998e+08 – -1.832e+080
-1.832e+08 – -1.665e+080
-1.665e+08 – -1.498e+080
-1.498e+08 – -1.331e+080
-1.331e+08 – -1.165e+080
-1.165e+08 – -9.979e+070
-9.979e+07 – -8.311e+070
-8.311e+07 – -6.644e+070
-6.644e+07 – -4.977e+070
-4.977e+07 – -3.31e+070
-3.31e+07 – -1.642e+070
-1.642e+07 – 2.5e+052201

total_households numeric

rows2,327
null0 (0.0%)
unique1,495
min0.000
max8,209
mean1,411
median1,252
std923.255
q1773.500
q31,850
iqr1,076
skew1.479
kurtosis4.377
n_outliers70
outlier_rate0.030
zero_rate0.041
Show data table
Histogram bins for total_households (median: 1252.0).
bincount
0 – 205.2123
205.2 – 410.441
410.4 – 615.7203
615.7 – 820.9272
820.9 – 1026269
1026 – 1231237
1231 – 1437215
1437 – 1642221
1642 – 1847162
1847 – 2052134
2052 – 225794
2257 – 2463101
2463 – 266866
2668 – 287339
2873 – 307835
3078 – 328424
3284 – 348922
3489 – 36948
3694 – 38997
3899 – 41049
4104 – 431013
4310 – 45159
4515 – 47205
4720 – 49255
4925 – 51313
5131 – 53362
5336 – 55412
5541 – 57460
5746 – 59521
5952 – 61571
6157 – 63620
6362 – 65670
6567 – 67721
6772 – 69782
6978 – 71830
7183 – 73880
7388 – 75930
7593 – 77990
7799 – 80040
8004 – 82091

owner_occupied numeric

6.1% rows beyond 1.5 IQR
rows2,327
null0 (0.0%)
unique1,001
min0.000
max3,052
mean464.600
median371.000
std422.558
q1177.000
q3608.000
iqr431.000
skew1.761
kurtosis4.254
n_outliers143
outlier_rate0.061
zero_rate0.072
Show data table
Histogram bins for owner_occupied (median: 371.0).
bincount
0 – 76.3343
76.3 – 152.6175
152.6 – 228.9191
228.9 – 305.2236
305.2 – 381.5258
381.5 – 457.8245
457.8 – 534.1167
534.1 – 610.4134
610.4 – 686.798
686.7 – 76369
763 – 839.361
839.3 – 915.649
915.6 – 991.953
991.9 – 106843
1068 – 114433
1144 – 122121
1221 – 129720
1297 – 137328
1373 – 145016
1450 – 152618
1526 – 16029
1602 – 167913
1679 – 17559
1755 – 18318
1831 – 19085
1908 – 19842
1984 – 20604
2060 – 21363
2136 – 22133
2213 – 22891
2289 – 23652
2365 – 24422
2442 – 25183
2518 – 25940
2594 – 26703
2670 – 27470
2747 – 28231
2823 – 28990
2899 – 29760
2976 – 30521

renter_occupied numeric

rows2,327
null0 (0.0%)
unique1,418
min0.000
max8,209
mean946.145
median726.000
std815.372
q1346.000
q31,357
iqr1,011
skew1.595
kurtosis4.627
n_outliers69
outlier_rate0.030
zero_rate0.044
Show data table
Histogram bins for renter_occupied (median: 726.0).
bincount
0 – 205.2349
205.2 – 410.4358
410.4 – 615.7292
615.7 – 820.9268
820.9 – 1026207
1026 – 1231175
1231 – 1437168
1437 – 1642110
1642 – 1847100
1847 – 205268
2052 – 225763
2257 – 246342
2463 – 266836
2668 – 287322
2873 – 307819
3078 – 328417
3284 – 34896
3489 – 36945
3694 – 38994
3899 – 41046
4104 – 43105
4310 – 45153
4515 – 47201
4720 – 49250
4925 – 51311
5131 – 53361
5336 – 55410
5541 – 57460
5746 – 59520
5952 – 61570
6157 – 63620
6362 – 65670
6567 – 67720
6772 – 69780
6978 – 71830
7183 – 73880
7388 – 75930
7593 – 77990
7799 – 80040
8004 – 82091

pct_owner_occupied numeric

rows2,327
null96 (4.1%)
unique823
min0.000
max100.000
mean37.513
median34.400
std25.651
q116.400
q356.100
iqr39.700
skew0.395
kurtosis-0.854
n_outliers0
outlier_rate0.000
zero_rate0.032
Show data table
Histogram bins for pct_owner_occupied (median: 34.4).
bincount
0 – 2.5141
2.5 – 586
5 – 7.571
7.5 – 1063
10 – 12.586
12.5 – 1565
15 – 17.572
17.5 – 2088
20 – 22.598
22.5 – 2588
25 – 27.579
27.5 – 3067
30 – 32.570
32.5 – 3556
35 – 37.576
37.5 – 4068
40 – 42.559
42.5 – 4572
45 – 47.558
47.5 – 5054
50 – 52.570
52.5 – 5559
55 – 57.555
57.5 – 6039
60 – 62.544
62.5 – 6540
65 – 67.552
67.5 – 7034
70 – 72.540
72.5 – 7547
75 – 77.532
77.5 – 8048
80 – 82.531
82.5 – 8527
85 – 87.526
87.5 – 9024
90 – 92.523
92.5 – 958
95 – 97.56
97.5 – 1009

pct_renter_occupied numeric

rows2,327
null96 (4.1%)
unique823
min0.000
max100.000
mean62.487
median65.600
std25.651
q143.900
q383.600
iqr39.700
skew-0.395
kurtosis-0.854
n_outliers0
outlier_rate0.000
zero_rate2.69e-03
Show data table
Histogram bins for pct_renter_occupied (median: 65.6).
bincount
0 – 2.59
2.5 – 56
5 – 7.57
7.5 – 1023
10 – 12.524
12.5 – 1526
15 – 17.527
17.5 – 2032
20 – 22.547
22.5 – 2531
25 – 27.548
27.5 – 3040
30 – 32.534
32.5 – 3550
35 – 37.539
37.5 – 4046
40 – 42.541
42.5 – 4553
45 – 47.557
47.5 – 5072
50 – 52.554
52.5 – 5554
55 – 57.573
57.5 – 6062
60 – 62.569
62.5 – 6572
65 – 67.560
67.5 – 7065
70 – 72.572
72.5 – 7575
75 – 77.591
77.5 – 8094
80 – 82.592
82.5 – 8573
85 – 87.564
87.5 – 9083
90 – 92.565
92.5 – 9571
95 – 97.583
97.5 – 100147