This dataset contains 2,327 New York City census tracts with housing tenure breakdowns across 10 columns, covering owner- and renter-occupied household counts and percentages by county. Brooklyn (Kings) leads with 805 tracts (34.6% of rows), followed by Queens (725) and Bronx (361), while Staten Island has just 126. Renting dominates citywide: the mean share of renter-occupied households is 62.5% versus 37.5% owner-occupied, and renter counts are right-skewed with a long tail up to 8,209 per tract. Worth a closer look: the strong skew in raw household counts (owner_occupied skew 1.76, renter_occupied skew 1.59) and the ~4% null rate in the percentage columns. Note that 'state' is constant (all 36) and can be ignored.
saturn
/home/coolhand/html/datavis/data_trove/data/urban/nyc_housing/nyc_tenure_by_tract.csv 2,327 rows sample n=2,327 seed 42 2026-05-01T17:16:48+00:00
Overview
| Source | /home/coolhand/html/datavis/data_trove/data/urban/nyc_housing/nyc_tenure_by_tract.csv |
| Total rows | 2,327 |
| Profiled sample | 2,327 |
| Columns | 10 |
| Generated | 2026-05-01T17:16:48+00:00 |
Insights opt-in
Model-generated narrative. These are opinions, not facts — the stats below are what saturn measured. Generated by: anthropic:claude-opus-4-7.
Counts of households per geographic unit, ranging from 0 to 8209 with a median of 1252 and mean of 1410.7. The distribution is right-skewed (skew 1.48, kurtosis 4.38) with 70 outliers (3.0%) on the high end, and 4.1% of rows are zeros which may indicate uninhabited or unreported areas.
Despite the boolean-sounding name, owner_occupied is an integer count ranging 0–3052 with 1001 distinct values and a mean of 464.6 versus a median of 371, suggesting a per-area tally of owner-occupied units rather than a flag. The distribution is right-skewed (skew 1.76, kurtosis 4.25) with 143 outliers (6.1%) and 7.2% exact zeros. No nulls are present.
This column reports the count of renter-occupied units per record, ranging from 0 to 8209 with a mean of 946 and median of 726. The distribution is right-skewed (skew 1.59, kurtosis 4.63) with 4.4% zeros and 69 outliers (2.97%) in the upper tail. No nulls and 1418 unique values across 2327 rows suggest a per-area aggregate count rather than a per-unit flag.
This column holds fully-qualified Census tract names for New York City, with every one of the 2327 rows unique and non-null. Lengths cluster tightly between 38 and 46 characters and every record contains the tokens 'new', 'york', 'census', 'tract', and 'county;', with the borough breakdown skewed toward Kings (805) and Queens (725) over Bronx (361) and Richmond (126) — Manhattan/New York County appears absent from the top words, which is worth checking. With n_unique == n, this is effectively a row identifier rather than a feature.
The column 'state' is numeric but holds the single value 36 across all 2327 rows, with zero variance and only one unique value. It carries no information for analysis and is flagged constant.
Encoded as numeric but only 5 distinct values across 2327 rows (min 5, max 85, median 47), this is almost certainly a categorical county code stored as an integer. The distribution is left-skewed (skew -0.72) with mean 55 sitting above median 47, suggesting one or two higher-numbered codes dominate. No nulls or outliers reported.
Census tract codes stored as integers, ranging from 100 to 990100 across 1530 distinct values in 2327 rows. The skew of 10.14 and kurtosis of 189.8 are artefacts of the tract numbering scheme rather than a real distribution — these are categorical identifiers, not measurements. 63 outliers (2.7%) reflect tracts with unusually high numeric codes, not anomalous data.
This column lists New York City borough/county names across 2327 rows, with all 5 NYC boroughs represented and no nulls. Distribution is fairly even (entropy ratio 0.898), though Brooklyn (Kings) leads at 34.6% (805) and Staten Island (Richmond) trails at 126. The parenthetical county names suggest the source schema uses formal county labels rather than borough-only naming.
Numeric column on a 0-100 scale (min 0.0, max 100.0) capturing the percentage of owner-occupied housing per record. The distribution is wide and flattish (std 25.65, kurtosis -0.85) with mean 37.51 just above median 34.4, and a broad IQR from 16.4 to 56.1, indicating most areas are minority owner-occupied. About 4.13% of rows are null and 3.23% are exactly zero, which may represent fully-rental areas worth flagging.
Numeric share variable bounded between 0 and 100 (mean 62.49, median 65.6) — almost certainly the percentage of renter-occupied housing units in each row. The distribution is wide (std 25.65, IQR 39.7) and slightly left-skewed (skew -0.39, kurtosis -0.85), so values cluster toward the high end with a long tail of owner-dominated areas. About 4.13% of rows are null and only 0.27% are exact zeros; no outliers were flagged given the natural 0–100 bounds.
Numeric correlation
total_households numeric
owner_occupied numeric
renter_occupied numeric
NAME text
Sample values (first 10)
- Census Tract 4; Bronx County; New York
- Census Tract 399.01; Queens County; New York
- Census Tract 779.08; Queens County; New York
- Census Tract 613.02; Queens County; New York
- Census Tract 780; Kings County; New York
- Census Tract 156.02; Richmond County; New York
- Census Tract 848; Kings County; New York
- Census Tract 1008.04; Queens County; New York
- Census Tract 618; Queens County; New York
- Census Tract 145; Bronx County; New York
state numeric
county numeric
tract numeric
county_name categorical
Top values (rank 1–20)
- Brooklyn (Kings) — 805
- Queens — 725
- Bronx — 361
- Manhattan (New York) — 310
- Staten Island (Richmond) — 126