saturn

/home/coolhand/html/datavis/data_trove/demographic/veterans/military_firearm_veterans.csv 49 rows sample n=49 seed 42 2026-06-22T00:04:14+00:00

Overview

Source/home/coolhand/html/datavis/data_trove/demographic/veterans/military_firearm_veterans.csv
Total rows49
Profiled sample49
Columns5
Generated2026-06-22T00:04:14+00:00
Show data table
Per-column null rate across the corpus.
columnkindnull %
NAMEcategorical0.0%
statecategorical0.0%
veteran_populationnumeric0.0%
total_populationnumeric0.0%
veteran_percentagenumeric0.0%

Insights opt-in

Model-generated narrative. These are opinions, not facts — the stats below are what saturn measured. Generated by: anthropic:default.

Dataset high anthropic:default

This dataset contains U.S. state-level veteran population statistics for 49 states, including total population, veteran counts, and the percentage of the population that are veterans. The most important signal is in veteran_percentage, which is extremely right-skewed (skew: 5.79) with 8 outliers and a max of 277.05 — far above the median of 5.08 — suggesting a small number of states have dramatically elevated veteran shares worth investigating. Total population and veteran population both distribute relatively evenly across states with no outliers, meaning the percentage anomalies are not simply a function of small population size.

veteran_percentage high anthropic:default

This column represents veteran percentage figures across 49 fully distinct, non-null records — likely a rate or share (%) of veterans in some population unit such as counties, districts, or organizations. The distribution is severely right-skewed (skew = 5.79, kurtosis = 34.86): the median is only 5.08 while the mean is pulled to 14.92 by extreme outliers. Most strikingly, the maximum value is 277.05, which exceeds 100% and is physically impossible for a percentage, strongly suggesting data quality issues — either unit errors (e.g., per-thousand rates mixed with percentages) or data entry errors among the 8 flagged outliers (outlier rate = 16.3%).

NAME high anthropic:default

This column contains U.S. state names, with 49 unique values across 49 rows and zero nulls — consistent with a near-complete list of U.S. states (50 states minus one, or possibly including a territory/district instead). Every value appears exactly once (top_rate = 0.0204, i.e., 1/49), yielding a maximum entropy_ratio of 1.0, which flags as a long-tail alert but is actually a perfectly uniform identifier distribution. The missing 50th entry is worth investigating — one state may be absent or replaced by a non-standard entry.

state high anthropic:default

This column contains US state names, with exactly 49 rows and 49 unique values — meaning every row is a distinct state (notably, only 49 of the 50 US states are present, so one state is missing from the dataset). The entropy ratio of 1.0 and uniform frequency of 1 per state confirm perfect cardinality with zero repetition, making this a de facto identifier rather than a grouping variable. The 'long_tail' alert is technically a misnomer here: all values occur exactly once, so there is no skew whatsoever.

total_population high anthropic:default

This column represents total population counts for 49 distinct geographic units (likely US states or similar administrative regions), with every row populated and fully unique. Values span from 548,984 to 39,227,468 with a mean of ~18.8M and median of ~19.6M, suggesting a reasonably balanced distribution — the near-zero skew (0.10) and platykurtic kurtosis (-1.16) confirm a flat, spread-out distribution rather than a peaked one. The wide IQR of ~20.4M and std of ~12.2M reflect the natural size heterogeneity across regions, and notably there are zero detected outliers despite this spread.

veteran_population high anthropic:default

This column represents the veteran population count for each of 49 geographic units (likely U.S. states or territories, given the count). All 49 rows are unique and non-null, consistent with one record per jurisdiction. The distribution is remarkably symmetric (skew ≈ -0.04) and platykurtic (kurtosis ≈ -1.34), meaning values are spread broadly and uniformly across a wide range from 61,090 to 1,786,891 with no outliers — an unusual flatness for a population variable, which typically skews right.

Numeric correlation

Show data table
Pearson correlation across 3 numeric columns (values clipped to 2 decimals).
veteran_populationtotal_populationveteran_percentage
veteran_population+1.00+0.08+0.26
total_population+0.08+1.00-0.41
veteran_percentage+0.26-0.41+1.00

NAME categorical

49 singleton categories
rows49
null0 (0.0%)
unique49
top_valueCalifornia
top_rate0.020
cardinality49
entropy5.615
entropy_ratio1.000
Show data table
Top values for NAME (20 unique shown, of 49 total).
valuecountshare
California12.0%
Texas12.0%
Florida12.0%
New York12.0%
Pennsylvania12.0%
Ohio12.0%
Illinois12.0%
Georgia12.0%
North Carolina12.0%
Michigan12.0%
Virginia12.0%
Washington12.0%
Arizona12.0%
Massachusetts12.0%
Tennessee12.0%
Indiana12.0%
Missouri12.0%
Maryland12.0%
Wisconsin12.0%
Colorado12.0%
Top values (rank 1–20)
  1. California — 1
  2. Texas — 1
  3. Florida — 1
  4. New York — 1
  5. Pennsylvania — 1
  6. Ohio — 1
  7. Illinois — 1
  8. Georgia — 1
  9. North Carolina — 1
  10. Michigan — 1
  11. Virginia — 1
  12. Washington — 1
  13. Arizona — 1
  14. Massachusetts — 1
  15. Tennessee — 1
  16. Indiana — 1
  17. Missouri — 1
  18. Maryland — 1
  19. Wisconsin — 1
  20. Colorado — 1

state categorical

49 singleton categories
rows49
null0 (0.0%)
unique49
top_valueCalifornia
top_rate0.020
cardinality49
entropy5.615
entropy_ratio1.000
Show data table
Top values for state (20 unique shown, of 49 total).
valuecountshare
California12.0%
Texas12.0%
Florida12.0%
New York12.0%
Pennsylvania12.0%
Ohio12.0%
Illinois12.0%
Georgia12.0%
North Carolina12.0%
Michigan12.0%
Virginia12.0%
Washington12.0%
Arizona12.0%
Massachusetts12.0%
Tennessee12.0%
Indiana12.0%
Missouri12.0%
Maryland12.0%
Wisconsin12.0%
Colorado12.0%
Top values (rank 1–20)
  1. California — 1
  2. Texas — 1
  3. Florida — 1
  4. New York — 1
  5. Pennsylvania — 1
  6. Ohio — 1
  7. Illinois — 1
  8. Georgia — 1
  9. North Carolina — 1
  10. Michigan — 1
  11. Virginia — 1
  12. Washington — 1
  13. Arizona — 1
  14. Massachusetts — 1
  15. Tennessee — 1
  16. Indiana — 1
  17. Missouri — 1
  18. Maryland — 1
  19. Wisconsin — 1
  20. Colorado — 1

veteran_population numeric

rows49
null0 (0.0%)
unique49
min61,090
max1,786,891
mean844,592
median932,756
std530,217
q1298,167
q31,282,752
iqr984,585
skew-0.044
kurtosis-1.344
n_outliers0
outlier_rate0.000
zero_rate0.000
Show data table
Histogram bins for veteran_population (median: 932756.0).
bincount
6.109e+04 – 3.076e+0513
3.076e+05 – 5.542e+054
5.542e+05 – 8.007e+055
8.007e+05 – 1.047e+067
1.047e+06 – 1.294e+068
1.294e+06 – 1.54e+068
1.54e+06 – 1.787e+064

total_population numeric

rows49
null0 (0.0%)
unique49
min548,984
max39,227,468
mean18,763,551
median19,582,629
std12,207,014
q17,056,168
q327,439,239
iqr20,383,071
skew0.101
kurtosis-1.158
n_outliers0
outlier_rate0.000
zero_rate0.000
Show data table
Histogram bins for total_population (median: 19582629.0).
bincount
5.49e+05 – 6.074e+0611
6.074e+06 – 1.16e+075
1.16e+07 – 1.713e+074
1.713e+07 – 2.265e+0710
2.265e+07 – 2.818e+077
2.818e+07 – 3.37e+072
3.37e+07 – 3.923e+0710

veteran_percentage numeric

skew=+5.79 16.3% rows beyond 1.5 IQR
rows49
null0 (0.0%)
unique49
min0.220
max277.050
mean14.920
median5.080
std40.366
q11.700
q37.830
iqr6.130
skew5.789
kurtosis34.861
n_outliers8
outlier_rate0.163
zero_rate0.000
Show data table
Histogram bins for veteran_percentage (median: 5.08).
bincount
0.22 – 39.7745
39.77 – 79.313
79.31 – 118.90
118.9 – 158.40
158.4 – 1980
198 – 237.50
237.5 – 277.11