Summary confidence: high
This dataset covers 3,222 U.S. counties with 16 columns describing rental affordability — rents, incomes, renter shares, and burden percentages — keyed by FIPS and county name. Several numeric fields (annual_rent, median_gross_rent, median_household_income, rent_to_income_ratio) carry impossible negative sentinel values like -666666666 and -7999999992, which drag means deeply negative and produce skew around -17 to -57; these need cleaning before any analysis. The affordability_category field is also extremely imbalanced — 3,192 of 3,222 counties are labeled 'Affordable' (top_rate 0.99), so it offers little discriminatory signal as-is. The cleaner fields to start with are pct_rent_burdened_30plus (median 37.36%), pct_rent_burdened_50plus (median 17.62%), and pct_renter (median 26.07%), which look well-behaved and tell the real affordability story.
citing: annual_rent.stats.min · annual_rent.stats.skew · median_household_income.stats.min · median_household_income.stats.skew · rent_to_income_ratio.stats.skew · affordability_category.stats.top_rate · affordability_category.top_values · pct_rent_burdened_30plus.stats.median · pct_rent_burdened_50plus.stats.median · pct_renter.stats.median · row_count · column_count