saturn·

food deserts vehicle access

saturn notebook · generated 2026-05-01 Report Notebook

Overview

Source: /home/coolhand/html/datavis/data_trove/data/urban/food_deserts/vehicle_access.csv

Saturn profiled 3,222 rows across 9 columns. The stats below are deterministic and machine-readable; the prose is a language-model interpretation of those stats (opt-in, added after the fact, never sees raw rows).

[2]:
!pip install saturn-dissect
import subprocess
subprocess.run([
    "saturn", "analyze", "/home/coolhand/html/datavis/data_trove/data/urban/food_deserts/vehicle_access.csv",
    "--findings", "food_deserts-vehicle_access.json",
    "--llm", "anthropic:claude-opus-4-7",
])

Summary confidence: high

This dataset covers vehicle access for 3,222 US counties (one row per county, identified by FIPS code and name) across 9 columns, with no missing values. The household and no-vehicle counts are extremely right-skewed — `no_vehicle_total` has a median of 580 but a max of 601,621, and `total_households` ranges from 32 up to roughly 3.36 million — so a handful of large urban counties dominate the absolute totals. The more comparable signal is `no_vehicle_pct`, which has a median of 5.41% but stretches up to 85.94%, flagging a small set of counties with extreme transit dependence worth investigating first. State coverage looks complete (52 distinct state codes), so geographic breakdowns should be straightforward.

citing: row_count · column_count · columns.no_vehicle_total.stats · columns.no_vehicle_pct.stats · columns.total_households.stats · columns.state.n_unique · columns.name.n_unique · columns.fips.n_unique

Out[4]:

saturn.schema() · 9 columns

column kind n null% unique alerts
name text 3,222 0.0% 3,222 near_unique
total_households numeric 3,222 0.0% 3,074 high_skew outliers
no_vehicle_owner numeric 3,222 0.0% 1,176 high_skew outliers
no_vehicle_renter numeric 3,222 0.0% 1,517 high_skew outliers
state numeric 3,222 0.0% 52
county numeric 3,222 0.0% 330 high_skew outliers
fips numeric 3,222 0.0% 3,222
no_vehicle_total numeric 3,222 0.0% 1,823 high_skew outliers
no_vehicle_pct numeric 3,222 0.0% 1,065 high_skew
Fig 1.
no_vehicle_pct · Distribution of the share of households without a vehicle per county — watch the long right tail above ~20%.
Show data table
Histogram bins for no_vehicle_pct (median: 5.41).
bincount
0 – 2.148161
2.148 – 4.297823
4.297 – 6.4451091
6.445 – 8.594630
8.594 – 10.74283
10.74 – 12.89111
12.89 – 15.0461
15.04 – 17.1923
17.19 – 19.348
19.34 – 21.483
21.48 – 23.634
23.63 – 25.782
25.78 – 27.933
27.93 – 30.082
30.08 – 32.232
32.23 – 34.382
34.38 – 36.522
36.52 – 38.672
38.67 – 40.820
40.82 – 42.971
42.97 – 45.120
45.12 – 47.271
47.27 – 49.420
49.42 – 51.560
51.56 – 53.710
53.71 – 55.861
55.86 – 58.011
58.01 – 60.160
60.16 – 62.312
62.31 – 64.450
64.45 – 66.61
66.6 – 68.750
68.75 – 70.90
70.9 – 73.050
73.05 – 75.20
75.2 – 77.350
77.35 – 79.491
79.49 – 81.640
81.64 – 83.790
83.79 – 85.941
Fig 2.
no_vehicle_total · Absolute count of no-vehicle households is extremely skewed; a few counties exceed 100k while the median is just 580.
Show data table
Histogram bins for no_vehicle_total (median: 580.0).
bincount
0 – 1.504e+043108
1.504e+04 – 3.008e+0462
3.008e+04 – 4.512e+0419
4.512e+04 – 6.016e+049
6.016e+04 – 7.52e+048
7.52e+04 – 9.024e+042
9.024e+04 – 1.053e+054
1.053e+05 – 1.203e+053
1.203e+05 – 1.354e+050
1.354e+05 – 1.504e+050
1.504e+05 – 1.654e+050
1.654e+05 – 1.805e+050
1.805e+05 – 1.955e+051
1.955e+05 – 2.106e+050
2.106e+05 – 2.256e+050
2.256e+05 – 2.406e+050
2.406e+05 – 2.557e+050
2.557e+05 – 2.707e+050
2.707e+05 – 2.858e+050
2.858e+05 – 3.008e+052
3.008e+05 – 3.159e+050
3.159e+05 – 3.309e+051
3.309e+05 – 3.459e+050
3.459e+05 – 3.61e+050
3.61e+05 – 3.76e+051
3.76e+05 – 3.911e+050
3.911e+05 – 4.061e+050
4.061e+05 – 4.211e+050
4.211e+05 – 4.362e+050
4.362e+05 – 4.512e+050
4.512e+05 – 4.663e+050
4.663e+05 – 4.813e+050
4.813e+05 – 4.963e+050
4.963e+05 – 5.114e+050
5.114e+05 – 5.264e+050
5.264e+05 – 5.415e+050
5.415e+05 – 5.565e+051
5.565e+05 – 5.715e+050
5.715e+05 – 5.866e+050
5.866e+05 – 6.016e+051
Fig 3.
total_households · County household sizes span five orders of magnitude — useful context before comparing raw no-vehicle totals.
Show data table
Histogram bins for total_households (median: 10021.0).
bincount
32 – 8.411e+042907
8.411e+04 – 1.682e+05153
1.682e+05 – 2.523e+0562
2.523e+05 – 3.363e+0538
3.363e+05 – 4.204e+0522
4.204e+05 – 5.045e+056
5.045e+05 – 5.886e+0511
5.886e+05 – 6.726e+055
6.726e+05 – 7.567e+055
7.567e+05 – 8.408e+053
8.408e+05 – 9.249e+051
9.249e+05 – 1.009e+063
1.009e+06 – 1.093e+061
1.093e+06 – 1.177e+061
1.177e+06 – 1.261e+060
1.261e+06 – 1.345e+060
1.345e+06 – 1.429e+060
1.429e+06 – 1.513e+060
1.513e+06 – 1.597e+060
1.597e+06 – 1.682e+061
1.682e+06 – 1.766e+061
1.766e+06 – 1.85e+060
1.85e+06 – 1.934e+060
1.934e+06 – 2.018e+060
2.018e+06 – 2.102e+061
2.102e+06 – 2.186e+060
2.186e+06 – 2.27e+060
2.27e+06 – 2.354e+060
2.354e+06 – 2.438e+060
2.438e+06 – 2.522e+060
2.522e+06 – 2.606e+060
2.606e+06 – 2.69e+060
2.69e+06 – 2.775e+060
2.775e+06 – 2.859e+060
2.859e+06 – 2.943e+060
2.943e+06 – 3.027e+060
3.027e+06 – 3.111e+060
3.111e+06 – 3.195e+060
3.195e+06 – 3.279e+060
3.279e+06 – 3.363e+061
Fig 4.
state · Counties per state code — confirms geographic coverage across all 52 state-level FIPS values.
Show data table
Histogram bins for state (median: 30.0).
bincount
1 – 2.77597
2.775 – 4.5515
4.55 – 6.325133
6.325 – 8.164
8.1 – 9.8759
9.875 – 11.654
11.65 – 13.42226
13.42 – 15.25
15.2 – 16.9844
16.98 – 18.75194
18.75 – 20.52204
20.52 – 22.3184
22.3 – 24.0740
24.07 – 25.8514
25.85 – 27.62170
27.62 – 29.4197
29.4 – 31.17149
31.17 – 32.9517
32.95 – 34.7331
34.73 – 36.595
36.5 – 38.27153
38.27 – 40.05165
40.05 – 41.8236
41.82 – 43.667
43.6 – 45.3851
45.38 – 47.15161
47.15 – 48.92254
48.92 – 50.743
50.7 – 52.47133
52.47 – 54.2594
54.25 – 56.0295
56.02 – 57.80
57.8 – 59.570
59.57 – 61.350
61.35 – 63.120
63.12 – 64.90
64.9 – 66.670
66.67 – 68.450
68.45 – 70.220
70.22 – 7278
Fig 5.
no_vehicle_renter · Renter households without vehicles skew even harder than owners; helpful for spotting urban renter-heavy counties.
Show data table
Histogram bins for no_vehicle_renter (median: 351.0).
bincount
0 – 1.22e+043112
1.22e+04 – 2.441e+0463
2.441e+04 – 3.661e+0416
3.661e+04 – 4.881e+048
4.881e+04 – 6.102e+048
6.102e+04 – 7.322e+042
7.322e+04 – 8.543e+041
8.543e+04 – 9.763e+045
9.763e+04 – 1.098e+050
1.098e+05 – 1.22e+050
1.22e+05 – 1.342e+051
1.342e+05 – 1.464e+050
1.464e+05 – 1.586e+050
1.586e+05 – 1.709e+050
1.709e+05 – 1.831e+050
1.831e+05 – 1.953e+050
1.953e+05 – 2.075e+050
2.075e+05 – 2.197e+050
2.197e+05 – 2.319e+051
2.319e+05 – 2.441e+051
2.441e+05 – 2.563e+050
2.563e+05 – 2.685e+050
2.685e+05 – 2.807e+050
2.807e+05 – 2.929e+052
2.929e+05 – 3.051e+050
3.051e+05 – 3.173e+050
3.173e+05 – 3.295e+050
3.295e+05 – 3.417e+050
3.417e+05 – 3.539e+050
3.539e+05 – 3.661e+050
3.661e+05 – 3.783e+050
3.783e+05 – 3.905e+050
3.905e+05 – 4.027e+050
4.027e+05 – 4.149e+050
4.149e+05 – 4.271e+050
4.271e+05 – 4.393e+050
4.393e+05 – 4.515e+050
4.515e+05 – 4.637e+050
4.637e+05 – 4.759e+051
4.759e+05 – 4.881e+051
Fig 6.
Per-column null rate across the corpus. Columns are ordered by input position.
Show data table
Per-column null rate across the corpus.
columnkindnull %
nametext0.0%
total_householdsnumeric0.0%
no_vehicle_ownernumeric0.0%
no_vehicle_renternumeric0.0%
statenumeric0.0%
countynumeric0.0%
fipsnumeric0.0%
no_vehicle_totalnumeric0.0%
no_vehicle_pctnumeric0.0%
Fig 7.
Pearson correlation across numeric columns (sampled, bounded).
Show data table
Pearson correlation across 8 numeric columns (values clipped to 2 decimals).
total_householdsno_vehicle_ownerno_vehicle_renterstatecountyfipsno_vehicle_totalno_vehicle_pct
total_households+1.00+0.92+0.90-0.06-0.02-0.06+0.91+0.10
no_vehicle_owner+0.92+1.00+0.97-0.05-0.03-0.06+0.98+0.16
no_vehicle_renter+0.90+0.97+1.00-0.05-0.02-0.05+1.00+0.15
state-0.06-0.05-0.05+1.00+0.14+1.00-0.05+0.04
county-0.02-0.03-0.02+0.14+1.00+0.15-0.02+0.11
fips-0.06-0.06-0.05+1.00+0.15+1.00-0.05+0.04
no_vehicle_total+0.91+0.98+1.00-0.05-0.02-0.05+1.00+0.15
no_vehicle_pct+0.10+0.16+0.15+0.04+0.11+0.04+0.15+1.00

name text identifier

This column appears to hold US county names with state suffixes — 2,999 of 3,222 rows contain the token 'county,' and the remaining top words are state names (texas, virginia, georgia, north carolina, dakota, kentucky, missouri). Every value is unique (n_unique=3222, duplicate_rate=0.0) with no nulls, and lengths are tightly clustered (mean 24.3, min 16, max 59, p95 31), consistent with 'X County, State' formatting. The near_unique alert confirms this behaves as a row identifier rather than a categorical feature.

Treatment: Use as a row label or join key on county; do not one-hot encode.

anthropic:claude-opus-4-7 · confidence high
Out[13]:

saturn.columns["name"].stats

statvalue
n3,222
nulls0 (0.0%)
unique3,222
len_min 16
len_max 59
len_mean 24.32
len_median 24
len_p95 31
word_mean 3.248
word_median 3
n_empty 0
n_duplicates 0
duplicate_rate 0
vocab_size 1,990
readability_flesch_mean 10.28
emoji_rate 0
url_rate 0
one_word_rate 0
allcaps_rate 0
boilerplate_rate 0
alert: near_unique100.0% of rows are unique strings
Fig 8.
Character-length distribution for name.
Show data table
Character-length distribution for name (mean: 24.324022346368714).
charscount
16 – 1726
17 – 1872
18 – 19121
19 – 20190
20 – 21264
21 – 22407
22 – 24420
24 – 25363
25 – 26320
26 – 27240
27 – 28231
28 – 29152
29 – 30139
30 – 31165
31 – 3241
32 – 3328
33 – 3416
34 – 3510
35 – 365
36 – 380
38 – 391
39 – 401
40 – 410
41 – 421
42 – 431
43 – 440
44 – 452
45 – 460
46 – 471
47 – 481
48 – 490
49 – 500
50 – 510
51 – 530
53 – 542
54 – 551
55 – 560
56 – 570
57 – 580
58 – 591

total_households numeric feature

Likely a count of households per geographic unit (e.g., county or tract), with 3222 rows, 3074 unique values, and no nulls or zeros. The distribution is extremely right-skewed (skew 12.05, kurtosis 240.5): the median is 10021 while the mean is 39402.86 and the max reaches 3363093, roughly 28x the standard deviation above the mean. Saturn flags 443 outliers (13.7%), consistent with a few very large jurisdictions dominating the tail.

Treatment: Log-transform before regression to tame the skew and outliers.

anthropic:claude-opus-4-7 · confidence high
Out[16]:

saturn.columns["total_households"].stats

statvalue
n3,222
nulls0 (0.0%)
unique3,074
min 32
max 3.363e+06
mean 3.94e+04
median 10,021
std 1.201e+05
q1 4211
q3 25,939
iqr 2.173e+04
skew 12.05
kurtosis 240.5
n_outliers 443
outlier_rate 0.1375
zero_rate 0
alert: high_skewskew=+12.05
alert: outliers13.7% rows beyond 1.5 IQR
Fig 9.
Distribution of total_households. Vertical dash marks the median.
Show data table
Histogram bins for total_households (median: 10021.0).
bincount
32 – 8.411e+042907
8.411e+04 – 1.682e+05153
1.682e+05 – 2.523e+0562
2.523e+05 – 3.363e+0538
3.363e+05 – 4.204e+0522
4.204e+05 – 5.045e+056
5.045e+05 – 5.886e+0511
5.886e+05 – 6.726e+055
6.726e+05 – 7.567e+055
7.567e+05 – 8.408e+053
8.408e+05 – 9.249e+051
9.249e+05 – 1.009e+063
1.009e+06 – 1.093e+061
1.093e+06 – 1.177e+061
1.177e+06 – 1.261e+060
1.261e+06 – 1.345e+060
1.345e+06 – 1.429e+060
1.429e+06 – 1.513e+060
1.513e+06 – 1.597e+060
1.597e+06 – 1.682e+061
1.682e+06 – 1.766e+061
1.766e+06 – 1.85e+060
1.85e+06 – 1.934e+060
1.934e+06 – 2.018e+060
2.018e+06 – 2.102e+061
2.102e+06 – 2.186e+060
2.186e+06 – 2.27e+060
2.27e+06 – 2.354e+060
2.354e+06 – 2.438e+060
2.438e+06 – 2.522e+060
2.522e+06 – 2.606e+060
2.606e+06 – 2.69e+060
2.69e+06 – 2.775e+060
2.775e+06 – 2.859e+060
2.859e+06 – 2.943e+060
2.943e+06 – 3.027e+060
3.027e+06 – 3.111e+060
3.111e+06 – 3.195e+060
3.195e+06 – 3.279e+060
3.279e+06 – 3.363e+061

no_vehicle_owner numeric feature

Likely a count of non-vehicle-owners aggregated per row (e.g., per geography or unit), ranging from 0 to 113,473 with a median of just 214. The distribution is severely right-skewed (skew 18.55, kurtosis 433.5) with 360 outliers (11.2%) and a std (3777.8) nearly 5x the mean (820.8), signalling a heavy tail dominated by a few extreme rows. Only 1.2% of rows are zero and there are no nulls, so the column is densely populated but dispersed across 1,176 unique values.

Treatment: Apply a log1p transform before modelling to tame the heavy right tail.

anthropic:claude-opus-4-7 · confidence high
Out[19]:

saturn.columns["no_vehicle_owner"].stats

statvalue
n3,222
nulls0 (0.0%)
unique1,176
min 0
max 113,473
mean 820.8
median 214
std 3778
q1 81
q3 548.8
iqr 467.8
skew 18.55
kurtosis 433.5
n_outliers 360
outlier_rate 0.1117
zero_rate 0.01179
alert: high_skewskew=+18.55
alert: outliers11.2% rows beyond 1.5 IQR
Fig 10.
Distribution of no_vehicle_owner. Vertical dash marks the median.
Show data table
Histogram bins for no_vehicle_owner (median: 214.0).
bincount
0 – 28373068
2837 – 567482
5674 – 851033
8510 – 1.135e+0410
1.135e+04 – 1.418e+0413
1.418e+04 – 1.702e+043
1.702e+04 – 1.986e+042
1.986e+04 – 2.269e+042
2.269e+04 – 2.553e+042
2.553e+04 – 2.837e+041
2.837e+04 – 3.121e+040
3.121e+04 – 3.404e+040
3.404e+04 – 3.688e+040
3.688e+04 – 3.972e+040
3.972e+04 – 4.255e+040
4.255e+04 – 4.539e+040
4.539e+04 – 4.823e+041
4.823e+04 – 5.106e+040
5.106e+04 – 5.39e+040
5.39e+04 – 5.674e+040
5.674e+04 – 5.957e+041
5.957e+04 – 6.241e+040
6.241e+04 – 6.525e+040
6.525e+04 – 6.808e+040
6.808e+04 – 7.092e+041
7.092e+04 – 7.376e+040
7.376e+04 – 7.659e+040
7.659e+04 – 7.943e+041
7.943e+04 – 8.227e+040
8.227e+04 – 8.51e+041
8.51e+04 – 8.794e+040
8.794e+04 – 9.078e+040
9.078e+04 – 9.362e+040
9.362e+04 – 9.645e+040
9.645e+04 – 9.929e+040
9.929e+04 – 1.021e+050
1.021e+05 – 1.05e+050
1.05e+05 – 1.078e+050
1.078e+05 – 1.106e+050
1.106e+05 – 1.135e+051

no_vehicle_renter numeric feature

A heavily right-skewed numeric count, plausibly the number of renters without a vehicle per record/area. The median is 351 but the mean is 2483 and the max reaches 488148, with skew 20.7 and kurtosis 517.5 driving 436 outliers (13.5%). About 1.5% of rows are zero and there are no nulls, so the long tail — not missingness — dominates the distribution.

Treatment: Apply a log1p transform before modelling to tame the extreme skew and outliers.

anthropic:claude-opus-4-7 · confidence medium
Out[22]:

saturn.columns["no_vehicle_renter"].stats

statvalue
n3,222
nulls0 (0.0%)
unique1,517
min 0
max 488,148
mean 2483
median 351
std 1.646e+04
q1 125.2
q3 987.8
iqr 862.5
skew 20.69
kurtosis 517.5
n_outliers 436
outlier_rate 0.1353
zero_rate 0.01459
alert: high_skewskew=+20.69
alert: outliers13.5% rows beyond 1.5 IQR
Fig 11.
Distribution of no_vehicle_renter. Vertical dash marks the median.
Show data table
Histogram bins for no_vehicle_renter (median: 351.0).
bincount
0 – 1.22e+043112
1.22e+04 – 2.441e+0463
2.441e+04 – 3.661e+0416
3.661e+04 – 4.881e+048
4.881e+04 – 6.102e+048
6.102e+04 – 7.322e+042
7.322e+04 – 8.543e+041
8.543e+04 – 9.763e+045
9.763e+04 – 1.098e+050
1.098e+05 – 1.22e+050
1.22e+05 – 1.342e+051
1.342e+05 – 1.464e+050
1.464e+05 – 1.586e+050
1.586e+05 – 1.709e+050
1.709e+05 – 1.831e+050
1.831e+05 – 1.953e+050
1.953e+05 – 2.075e+050
2.075e+05 – 2.197e+050
2.197e+05 – 2.319e+051
2.319e+05 – 2.441e+051
2.441e+05 – 2.563e+050
2.563e+05 – 2.685e+050
2.685e+05 – 2.807e+050
2.807e+05 – 2.929e+052
2.929e+05 – 3.051e+050
3.051e+05 – 3.173e+050
3.173e+05 – 3.295e+050
3.295e+05 – 3.417e+050
3.417e+05 – 3.539e+050
3.539e+05 – 3.661e+050
3.661e+05 – 3.783e+050
3.783e+05 – 3.905e+050
3.905e+05 – 4.027e+050
4.027e+05 – 4.149e+050
4.149e+05 – 4.271e+050
4.271e+05 – 4.393e+050
4.393e+05 – 4.515e+050
4.515e+05 – 4.637e+050
4.637e+05 – 4.759e+051
4.759e+05 – 4.881e+051

state numeric feature

Despite being typed as numeric, this column holds 52 distinct integer values between 1 and 72 with no nulls or zeros, which matches a FIPS-style state code encoding (50 states plus DC and territories) rather than a true measurement. The near-uniform spread (mean 31.27, median 30, std 16.29, skew 0.16) and absence of outliers reinforce that these are categorical identifiers, not quantities.

Treatment: Cast to categorical and one-hot or target-encode; do not use as a continuous variable.

anthropic:claude-opus-4-7 · confidence high
Out[25]:

saturn.columns["state"].stats

statvalue
n3,222
nulls0 (0.0%)
unique52
min 1
max 72
mean 31.27
median 30
std 16.29
q1 19
q3 46
iqr 27
skew 0.1574
kurtosis -0.6267
n_outliers 0
outlier_rate 0
zero_rate 0
Fig 12.
Distribution of state. Vertical dash marks the median.
Show data table
Histogram bins for state (median: 30.0).
bincount
1 – 2.77597
2.775 – 4.5515
4.55 – 6.325133
6.325 – 8.164
8.1 – 9.8759
9.875 – 11.654
11.65 – 13.42226
13.42 – 15.25
15.2 – 16.9844
16.98 – 18.75194
18.75 – 20.52204
20.52 – 22.3184
22.3 – 24.0740
24.07 – 25.8514
25.85 – 27.62170
27.62 – 29.4197
29.4 – 31.17149
31.17 – 32.9517
32.95 – 34.7331
34.73 – 36.595
36.5 – 38.27153
38.27 – 40.05165
40.05 – 41.8236
41.82 – 43.667
43.6 – 45.3851
45.38 – 47.15161
47.15 – 48.92254
48.92 – 50.743
50.7 – 52.47133
52.47 – 54.2594
54.25 – 56.0295
56.02 – 57.80
57.8 – 59.570
59.57 – 61.350
61.35 – 63.120
63.12 – 64.90
64.9 – 66.670
66.67 – 68.450
68.45 – 70.220
70.22 – 7278

county numeric identifier

Despite the name 'county', the column is stored numerically with 330 unique values across 3222 rows and no nulls, suggesting it holds county FIPS codes or similar integer identifiers rather than a measured quantity. The distribution is heavily right-skewed (skew 2.87, kurtosis 11.6) with a median of 79 but a max of 840 and 178 outliers (5.5%), which is expected for code-like values but misleading if treated as a continuous feature.

Treatment: Treat as a categorical code (e.g., FIPS) and one-hot or target-encode rather than using as a numeric feature.

anthropic:claude-opus-4-7 · confidence high
Out[28]:

saturn.columns["county"].stats

statvalue
n3,222
nulls0 (0.0%)
unique330
min 1
max 840
mean 103.2
median 79
std 106.6
q1 35
q3 133
iqr 98
skew 2.866
kurtosis 11.64
n_outliers 178
outlier_rate 0.05525
zero_rate 0
alert: high_skewskew=+2.87
alert: outliers5.5% rows beyond 1.5 IQR
Fig 13.
Distribution of county. Vertical dash marks the median.
Show data table
Histogram bins for county (median: 79.0).
bincount
1 – 21.98523
21.98 – 42.95418
42.95 – 63.93411
63.93 – 84.9345
84.9 – 105.9352
105.9 – 126.9281
126.9 – 147.8236
147.8 – 168.8168
168.8 – 189.8140
189.8 – 210.871
210.8 – 231.745
231.7 – 252.725
252.7 – 273.722
273.7 – 294.723
294.7 – 315.622
315.6 – 336.613
336.6 – 357.611
357.6 – 378.610
378.6 – 399.511
399.5 – 420.510
420.5 – 441.511
441.5 – 462.510
462.5 – 483.411
483.4 – 504.410
504.4 – 525.47
525.4 – 546.42
546.4 – 567.31
567.3 – 588.32
588.3 – 609.33
609.3 – 630.23
630.2 – 651.22
651.2 – 672.22
672.2 – 693.25
693.2 – 714.22
714.2 – 735.13
735.1 – 756.12
756.1 – 777.13
777.1 – 798.11
798.1 – 8192
819 – 8403

fips numeric identifier

This is the FIPS county code: every one of the 3222 rows is unique with no nulls, and the value range (1001 to 72153) matches the standard 5-digit state+county FIPS encoding. Distribution stats (mean 31377, median 30022, near-zero skew) are essentially meaningless here since the codes are categorical identifiers, not quantities. No outliers flagged, consistent with a clean geographic key.

Treatment: Treat as a categorical geographic key; left-join on this code rather than using it as a numeric feature.

anthropic:claude-opus-4-7 · confidence high
Out[31]:

saturn.columns["fips"].stats

statvalue
n3,222
nulls0 (0.0%)
unique3,222
min 1,001
max 72,153
mean 3.138e+04
median 30,022
std 1.63e+04
q1 1.903e+04
q3 4.61e+04
iqr 27,075
skew 0.1574
kurtosis -0.6314
n_outliers 0
outlier_rate 0
zero_rate 0
Fig 14.
Distribution of fips. Vertical dash marks the median.
Show data table
Histogram bins for fips (median: 30022.0).
bincount
1001 – 278097
2780 – 455915
4559 – 6337133
6337 – 811659
8116 – 989514
9895 – 1.167e+044
1.167e+04 – 1.345e+04226
1.345e+04 – 1.523e+045
1.523e+04 – 1.701e+0449
1.701e+04 – 1.879e+04189
1.879e+04 – 2.057e+04204
2.057e+04 – 2.235e+04184
2.235e+04 – 2.413e+0439
2.413e+04 – 2.59e+0415
2.59e+04 – 2.768e+04170
2.768e+04 – 2.946e+04196
2.946e+04 – 3.124e+04150
3.124e+04 – 3.302e+0427
3.302e+04 – 3.48e+0421
3.48e+04 – 3.658e+0495
3.658e+04 – 3.836e+04153
3.836e+04 – 4.013e+04155
4.013e+04 – 4.191e+0446
4.191e+04 – 4.369e+0467
4.369e+04 – 4.547e+0451
4.547e+04 – 4.725e+04161
4.725e+04 – 4.903e+04268
4.903e+04 – 5.081e+0429
5.081e+04 – 5.259e+04133
5.259e+04 – 5.436e+0494
5.436e+04 – 5.614e+0495
5.614e+04 – 5.792e+040
5.792e+04 – 5.97e+040
5.97e+04 – 6.148e+040
6.148e+04 – 6.326e+040
6.326e+04 – 6.504e+040
6.504e+04 – 6.682e+040
6.682e+04 – 6.86e+040
6.86e+04 – 7.037e+040
7.037e+04 – 7.215e+0478

no_vehicle_total numeric feature

Counts of vehicles (totals) per record, ranging from 0 to 601,621 with a median of 580 but a mean of 3,304. The distribution is extremely right-skewed (skew 20.26, kurtosis 501.27) with 407 outliers (12.6%) and a std (20,050) far exceeding the IQR (1,331.75), indicating a few enormous records dominate.

Treatment: Apply a log1p transform and consider winsorising before modelling.

anthropic:claude-opus-4-7 · confidence high
Out[34]:

saturn.columns["no_vehicle_total"].stats

statvalue
n3,222
nulls0 (0.0%)
unique1,823
min 0
max 601,621
mean 3304
median 580
std 2.005e+04
q1 223
q3 1555
iqr 1332
skew 20.26
kurtosis 501.3
n_outliers 407
outlier_rate 0.1263
zero_rate 0.003724
alert: high_skewskew=+20.26
alert: outliers12.6% rows beyond 1.5 IQR
Fig 15.
Distribution of no_vehicle_total. Vertical dash marks the median.
Show data table
Histogram bins for no_vehicle_total (median: 580.0).
bincount
0 – 1.504e+043108
1.504e+04 – 3.008e+0462
3.008e+04 – 4.512e+0419
4.512e+04 – 6.016e+049
6.016e+04 – 7.52e+048
7.52e+04 – 9.024e+042
9.024e+04 – 1.053e+054
1.053e+05 – 1.203e+053
1.203e+05 – 1.354e+050
1.354e+05 – 1.504e+050
1.504e+05 – 1.654e+050
1.654e+05 – 1.805e+050
1.805e+05 – 1.955e+051
1.955e+05 – 2.106e+050
2.106e+05 – 2.256e+050
2.256e+05 – 2.406e+050
2.406e+05 – 2.557e+050
2.557e+05 – 2.707e+050
2.707e+05 – 2.858e+050
2.858e+05 – 3.008e+052
3.008e+05 – 3.159e+050
3.159e+05 – 3.309e+051
3.309e+05 – 3.459e+050
3.459e+05 – 3.61e+050
3.61e+05 – 3.76e+051
3.76e+05 – 3.911e+050
3.911e+05 – 4.061e+050
4.061e+05 – 4.211e+050
4.211e+05 – 4.362e+050
4.362e+05 – 4.512e+050
4.512e+05 – 4.663e+050
4.663e+05 – 4.813e+050
4.813e+05 – 4.963e+050
4.963e+05 – 5.114e+050
5.114e+05 – 5.264e+050
5.264e+05 – 5.415e+050
5.415e+05 – 5.565e+051
5.565e+05 – 5.715e+050
5.715e+05 – 5.866e+050
5.866e+05 – 6.016e+051

no_vehicle_pct numeric feature

Likely the percentage of households (or similar units) with no vehicle, recorded for 3,222 rows with no nulls and values bounded between 0.0 and 85.94. The distribution is tightly clustered (median 5.41, IQR 3.38) but extremely right-skewed (skew 6.98, kurtosis 86.23), with 140 outliers (4.35%) pulling the mean to 6.20 well above the median. Only 0.37% are exact zeros, so true absence is rare; the long tail is the surprise here.

Treatment: Apply a log1p or winsorising transform before modelling to tame the heavy right tail.

anthropic:claude-opus-4-7 · confidence high
Out[37]:

saturn.columns["no_vehicle_pct"].stats

statvalue
n3,222
nulls0 (0.0%)
unique1,065
min 0
max 85.94
mean 6.197
median 5.41
std 4.538
q1 3.98
q3 7.36
iqr 3.38
skew 6.976
kurtosis 86.23
n_outliers 140
outlier_rate 0.04345
zero_rate 0.003724
alert: high_skewskew=+6.98
Fig 16.
Distribution of no_vehicle_pct. Vertical dash marks the median.
Show data table
Histogram bins for no_vehicle_pct (median: 5.41).
bincount
0 – 2.148161
2.148 – 4.297823
4.297 – 6.4451091
6.445 – 8.594630
8.594 – 10.74283
10.74 – 12.89111
12.89 – 15.0461
15.04 – 17.1923
17.19 – 19.348
19.34 – 21.483
21.48 – 23.634
23.63 – 25.782
25.78 – 27.933
27.93 – 30.082
30.08 – 32.232
32.23 – 34.382
34.38 – 36.522
36.52 – 38.672
38.67 – 40.820
40.82 – 42.971
42.97 – 45.120
45.12 – 47.271
47.27 – 49.420
49.42 – 51.560
51.56 – 53.710
53.71 – 55.861
55.86 – 58.011
58.01 – 60.160
60.16 – 62.312
62.31 – 64.450
64.45 – 66.61
66.6 – 68.750
68.75 – 70.90
70.9 – 73.050
73.05 – 75.20
75.2 – 77.350
77.35 – 79.491
79.49 – 81.640
81.64 – 83.790
83.79 – 85.941

How to cite

click to copy

BibTeX
@misc{saturn-food-deserts-vehicle-access-2026,
  author       = {Steuber, Luke},
  title        = {Saturn reading: food deserts vehicle access},
  year         ={2026},
  howpublished = {\url{https://dr.eamer.dev/saturn/view/food_deserts-vehicle_access}},
  note         = {Profiled with saturn-dissect v0.2.0, prompt saturn-insight-v2, model anthropic:claude-opus-4-7},
}
APA
Steuber, L. (2026). Saturn reading: food deserts vehicle access. Source: /home/coolhand/html/datavis/data_trove/data/urban/food_deserts/vehicle_access.csv. Profiled with saturn-dissect v0.2.0 (saturn-insight-v2, anthropic:claude-opus-4-7). Retrieved from https://dr.eamer.dev/saturn/view/food_deserts-vehicle_access