saturn·

nationwide 2020 election

source /home/coolhand/html/datavis/data_trove/data/geographic/nationwide/2020_election.csv 3,152 rows 10 columns profiled 2026-05-01 raw JSON static .html .ipynb Report Notebook

Reading

dataset summary · high confidence anthropic:claude-opus-4-7

This dataset covers 3,152 U.S. counties from a 2020 election results file, with 10 columns spanning vote totals, party percentages, and geographic identifiers. Vote-count fields (total_votes, votes_dem, votes_gop, diff) are extremely right-skewed with high kurtosis and 12-16% outlier rates, reflecting a few massive-population counties dominating the raw counts — worth inspecting on a log scale. The percentage fields tell a cleaner story: per_gop has a median of 0.68 versus per_dem's 0.30, and per_point_diff is negatively skewed, indicating Republican margins in most counties despite Democrats winning the national popular vote. State coverage is led by Texas (254 counties), Georgia (159), and Virginia (133), so any state-level aggregation should account for that imbalance.

citing: row_count · column_count · columns · kinds

Schema

10 columns
Per-column summary. Click column name to jump to its detail.
Alerts
state_name categorical 0.0% 51
county_fips numeric 0.0% 3,152
county_name text 0.0% 1,887
short_text duplicates
votes_gop numeric 0.0% 2,965
high_skew outliers
votes_dem numeric 0.0% 2,762
high_skew outliers
total_votes numeric 0.0% 3,049
high_skew outliers
diff numeric 0.0% 2,906
high_skew outliers
per_gop numeric 0.0% 3,151
per_dem numeric 0.0% 3,151
per_point_diff numeric 0.0% 3,152

state_name

categorical
n
3,152
nulls
0 (0.0%)
unique
51
top_value
Texas
top_rate
0.08058
cardinality
51
entropy
5.279
entropy_ratio
0.9306

county_fips

numeric
n
3,152
nulls
0 (0.0%)
unique
3,152
min
1,001
max
56,045
mean
3.03e+04
median
29,166
std
1.521e+04
q1
1.816e+04
q3
4.508e+04
iqr
26,913
skew
-0.07763
kurtosis
-1.104
n_outliers
0
outlier_rate
0
zero_rate
0

county_name

text short_text duplicates
n
3,152
nulls
0 (0.0%)
unique
1,887
len_min
10
len_max
27
len_mean
13.9
len_median
14
len_p95
17
word_mean
2.063
word_median
2
n_empty
0
n_duplicates
1,265
duplicate_rate
0.4013
vocab_size
1,880
readability_flesch_mean
35.12
emoji_rate
0
url_rate
0
one_word_rate
0
allcaps_rate
0
boilerplate_rate
0

votes_gop

numeric high_skew outliers
n
3,152
nulls
0 (0.0%)
unique
2,965
min
60
max
1.146e+06
mean
2.354e+04
median
8,123
std
5.404e+04
q1
3662
q3
2.051e+04
iqr
1.685e+04
skew
8.684
kurtosis
122.7
n_outliers
387
outlier_rate
0.1228
zero_rate
0

votes_dem

numeric high_skew outliers
n
3,152
nulls
0 (0.0%)
unique
2,762
min
4
max
3.029e+06
mean
2.578e+04
median
3,690
std
9.693e+04
q1
1320
q3
1.194e+04
iqr
10,625
skew
14.13
kurtosis
340.1
n_outliers
468
outlier_rate
0.1485
zero_rate
0

total_votes

numeric high_skew outliers
n
3,152
nulls
0 (0.0%)
unique
3,049
min
66
max
4.263e+06
mean
5.026e+04
median
1.234e+04
std
1.494e+05
q1
5415
q3
3.33e+04
iqr
2.789e+04
skew
11.88
kurtosis
245.2
n_outliers
437
outlier_rate
0.1386
zero_rate
0

diff

numeric high_skew outliers
n
3,152
nulls
0 (0.0%)
unique
2,906
min
-1.883e+06
max
119,005
mean
-2239
median
2,984
std
5.587e+04
q1
816.8
q3
7286
iqr
6470
skew
-17.7
kurtosis
485.4
n_outliers
499
outlier_rate
0.1583
zero_rate
0

per_gop

numeric
n
3,152
nulls
0 (0.0%)
unique
3,151
min
0.05397
max
0.9618
mean
0.6478
median
0.6817
std
0.162
q1
0.5541
q3
0.7738
iqr
0.2196
skew
-0.792
kurtosis
0.1515
n_outliers
44
outlier_rate
0.01396
zero_rate
0

per_dem

numeric
n
3,152
nulls
0 (0.0%)
unique
3,151
min
0.03091
max
0.9215
mean
0.3339
median
0.3002
std
0.1599
q1
0.21
q3
0.4258
iqr
0.2159
skew
0.8178
kurtosis
0.2146
n_outliers
46
outlier_rate
0.01459
zero_rate
0

per_point_diff

numeric
n
3,152
nulls
0 (0.0%)
unique
3,152
min
-0.8675
max
0.9309
mean
0.314
median
0.3797
std
0.3214
q1
0.1294
q3
0.5644
iqr
0.435
skew
-0.8011
kurtosis
0.178
n_outliers
46
outlier_rate
0.01459
zero_rate
0