saturn

/home/coolhand/html/datavis/data_trove/cache/poverty_data.parquet 3,222 rows sample n=3,222 seed 42 2026-05-01T16:54:37+00:00

Overview

Source/home/coolhand/html/datavis/data_trove/cache/poverty_data.parquet
Total rows3,222
Profiled sample3,222
Columns3
Generated2026-05-01T16:54:37+00:00

Insights opt-in

Model-generated narrative. These are opinions, not facts — the stats below are what saturn measured. Generated by: anthropic:claude-opus-4-7.

Dataset high anthropic:claude-opus-4-7

This dataset contains 3,222 U.S. counties with three columns: a county name, a FIPS code identifier, and a poverty rate. Each row is unique by county_name and fips, so the analytical signal lives almost entirely in poverty_rate. Poverty rate ranges from 1.6% to 66.32% with a mean of 15.1% and median of 13.55%, and it is right-skewed (skew 2.10) with 137 high-end outliers (~4.25% of counties). That long upper tail is the first thing worth a closer look, since a small number of counties have poverty rates several times the national median.

fips high anthropic:claude-opus-4-7

This is the U.S. county FIPS code, a 5-digit geographic identifier where the leading 1-2 digits encode state. All 3222 values are unique with zero nulls, and the range 1001 to 72153 spans Alabama through Puerto Rico, consistent with a complete county roster. Treating it as numeric is misleading despite the clean distribution (skew 0.16, no outliers) since the magnitudes are categorical codes, not measurements.

county_name high anthropic:claude-opus-4-7

This column holds fully-qualified US county names (e.g., ', '), with every one of the 3222 rows unique and zero nulls. The token 'county,' appears 2999 times, suggesting ~223 entries don't follow the 'X County, State' pattern (likely parishes in Louisiana, boroughs in Alaska, or independent cities). Texas (256), Virginia (189), and Georgia (159) lead the state distribution, consistent with the actual US county counts.

poverty_rate high anthropic:claude-opus-4-7

Numeric poverty rate (likely percent of population below the poverty line) across 3222 rows with no nulls and 1719 unique values. Distribution is right-skewed (skew 2.10, kurtosis 6.89) with median 13.55 and mean 15.10, ranging from 1.6 to 66.32, and 137 outliers (4.25%) sit above the upper whisker. The long upper tail suggests a small set of high-poverty units pulling the mean above the median.

Numeric correlation

fips numeric

rows3,222
null0 (0.0%)
unique3,222
min1,001
max72,153
mean31,378
median30,022
std16,300
q119,030
q346,104
iqr27,075
skew0.157
kurtosis-0.631
n_outliers0
outlier_rate0.000
zero_rate0.000

county_name text

100.0% of rows are unique strings
rows3,222
null0 (0.0%)
unique3,222
len_min16
len_max59
len_mean24.324
len_median24.000
len_p9531.000
word_mean3.248
word_median3.000
n_empty0
n_duplicates0
duplicate_rate0.000
vocab_size1,990
readability_flesch_mean10.284
emoji_rate0.000
url_rate0.000
one_word_rate0.000
allcaps_rate0.000
boilerplate_rate0.000
Sample values (first 10)
  1. Bibb County, Alabama
  2. Cheatham County, Tennessee
  3. Piute County, Utah
  4. Lamb County, Texas
  5. Martin County, Minnesota
  6. Sheridan County, Wyoming
  7. Chickasaw County, Mississippi
  8. Rockingham County, Virginia
  9. Liberty County, Texas
  10. Clark County, Arkansas

poverty_rate numeric

skew=+2.10
rows3,222
null0 (0.0%)
unique1,719
min1.600
max66.320
mean15.100
median13.550
std7.706
q110.160
q317.910
iqr7.750
skew2.096
kurtosis6.891
n_outliers137
outlier_rate0.043
zero_rate0.000