saturn·

data trove veteran employment statistics

saturn notebook · generated 2026-06-22 Report Notebook

Overview

Source: /home/coolhand/html/datavis/data_trove/demographic/veterans/military_firearm_spouse_employment.csv

Saturn profiled 15 rows across 3 columns. The stats below are deterministic and machine-readable; the prose is a language-model interpretation of those stats (opt-in, added after the fact, never sees raw rows).

[2]:
!pip install saturn-dissect
import subprocess
subprocess.run([
    "saturn", "analyze", "/home/coolhand/html/datavis/data_trove/demographic/veterans/military_firearm_spouse_employment.csv",
    "--findings", "data-trove-veteran-employment-statistics.json",
    "--llm", "anthropic:default",
])

Summary confidence: medium

This dataset captures spouse employment indicators — unemployment rate and labor force participation — across 15 U.S. states, likely in the context of military or veteran households. The most notable signal is in spouse unemployment rate, which ranges widely from 7.35% to 16.28% with a right skew and one flagged outlier at the high end, suggesting at least one state has a notably worse outcome for spouses. By contrast, spouse labor force participation is tightly clustered between 66.8% and 73.4% with no outliers, meaning most states see similar engagement levels even when unemployment varies — worth investigating whether high-unemployment states are simply retaining more job-seekers in the labor force.

citing: spouse_unemployment_rate.stats.min · spouse_unemployment_rate.stats.max · spouse_unemployment_rate.stats.mean · spouse_unemployment_rate.n_outliers · spouse_unemployment_rate.alerts · spouse_labor_force_participation.stats.min · spouse_labor_force_participation.stats.max · spouse_labor_force_participation.n_outliers · row_count · column_count

Fig 1.
spouse_unemployment_rate · Look for the right-skewed distribution and the single high outlier pulling the mean above the median.
Show data table
Histogram bins for spouse_unemployment_rate (median: 9.53).
bincount
7.35 – 9.1366
9.136 – 10.925
10.92 – 12.711
12.71 – 14.492
14.49 – 16.281
Fig 2.
spouse_labor_force_participation · Notice how tightly packed participation rates are across states, contrasting sharply with unemployment variability.
Show data table
Histogram bins for spouse_labor_force_participation (median: 69.8).
bincount
66.8 – 68.122
68.12 – 69.444
69.44 – 70.764
70.76 – 72.083
72.08 – 73.42
Fig 3.
state · Use this as a reference axis to compare unemployment and participation rates side-by-side across the 15 states.
Show data table
Top values for state (15 unique shown, of 15 total).
valuecountshare
Arizona16.7%
Georgia16.7%
Texas16.7%
North Carolina16.7%
California16.7%
Virginia16.7%
Florida16.7%
Washington16.7%
Colorado16.7%
Kentucky16.7%
Hawaii16.7%
South Carolina16.7%
Oklahoma16.7%
Kansas16.7%
Alaska16.7%
Fig 4.
Per-column null rate across the corpus. Columns are ordered by input position.
Show data table
Per-column null rate across the corpus.
columnkindnull %
statecategorical0.0%
spouse_unemployment_ratenumeric0.0%
spouse_labor_force_participationnumeric0.0%
Fig 5.
Pearson correlation across numeric columns (sampled, bounded).
Show data table
Pearson correlation across 2 numeric columns (values clipped to 2 decimals).
spouse_unemployment_ratespouse_labor_force_participation
spouse_unemployment_rate+1.00-0.01
spouse_labor_force_participation-0.01+1.00

state categorical label

This column contains US state names, functioning as a geographic label across 15 distinct states. With n=15 and n_unique=15, every row has a unique state value — the dataset appears to contain exactly one record per state with no repetition whatsoever. Entropy ratio of ~1.0 confirms perfectly uniform distribution across all 15 states, each appearing exactly once (top_rate=0.067 ≈ 1/15). The 'long_tail' alert is a statistical artifact of the uniform distribution rather than a true concentration issue.

Treatment: Use as a grouping or join key; with only 15 rows (one per state), this dataset is likely a summary table — verify before any aggregation or modelling.

anthropic:default · confidence high
Out[11]:

saturn.columns["state"].stats

statvalue
n15
nulls0 (0.0%)
unique15
top_value Arizona
top_rate 0.06667
cardinality 15
entropy 3.907
entropy_ratio 1
alert: long_tail15 singleton categories
Fig 6.
Top values for state.
Show data table
Top values for state (15 unique shown, of 15 total).
valuecountshare
Arizona16.7%
Georgia16.7%
Texas16.7%
North Carolina16.7%
California16.7%
Virginia16.7%
Florida16.7%
Washington16.7%
Colorado16.7%
Kentucky16.7%
Hawaii16.7%
South Carolina16.7%
Oklahoma16.7%
Kansas16.7%
Alaska16.7%

spouse_unemployment_rate numeric feature

This column records the unemployment rate associated with a respondent's spouse, expressed as a percentage. With only 15 rows and 15 unique values, every observation is distinct. The distribution is moderately right-skewed (skew = 1.04) with one flagged outlier at the maximum value of 16.28, which sits notably above the Q3 of 11.135 and drives the mean (10.19) above the median (9.53). The dataset is extremely small, limiting any statistical conclusions.

Treatment: Investigate the outlier at 16.28 for data entry error; if valid, consider robust scaling before modelling given the small sample and skew.

anthropic:default · confidence medium
Out[14]:

saturn.columns["spouse_unemployment_rate"].stats

statvalue
n15
nulls0 (0.0%)
unique15
min 7.35
max 16.28
mean 10.19
median 9.53
std 2.591
q1 8.285
q3 11.13
iqr 2.85
skew 1.045
kurtosis 0.1972
n_outliers 1
outlier_rate 0.06667
zero_rate 0
alert: outliers6.7% rows beyond 1.5 IQR
Fig 7.
Distribution of spouse_unemployment_rate. Vertical dash marks the median.
Show data table
Histogram bins for spouse_unemployment_rate (median: 9.53).
bincount
7.35 – 9.1366
9.136 – 10.925
10.92 – 12.711
12.71 – 14.492
14.49 – 16.281

spouse_labor_force_participation numeric feature

This column captures the labor force participation rate of spouses, likely expressed as a percentage, recorded across 15 distinct observations (possibly countries, regions, or time periods). The distribution is remarkably tight — the full range spans only 66.8 to 73.4 (a 6.6-point spread) with a standard deviation of just 1.76 and an IQR of 2.4, suggesting these records represent a homogeneous group or a narrow slice of a larger dataset. No outliers, no nulls, and near-zero skew (0.22) indicate an unusually clean and well-behaved variable.

Treatment: Use as-is in regression or clustering; the narrow range (66.8–73.4) means normalization may be needed if combined with broader-scale features.

anthropic:default · confidence medium
Out[17]:

saturn.columns["spouse_labor_force_participation"].stats

statvalue
n15
nulls0 (0.0%)
unique15
min 66.8
max 73.4
mean 69.89
median 69.8
std 1.762
q1 68.6
q3 71
iqr 2.4
skew 0.2249
kurtosis -0.5287
n_outliers 0
outlier_rate 0
zero_rate 0
Fig 8.
Distribution of spouse_labor_force_participation. Vertical dash marks the median.
Show data table
Histogram bins for spouse_labor_force_participation (median: 69.8).
bincount
66.8 – 68.122
68.12 – 69.444
69.44 – 70.764
70.76 – 72.083
72.08 – 73.42

How to cite

click to copy

BibTeX
@misc{saturn-data-trove-veteran-employment-statistics-2026,
  author       = {Steuber, Luke},
  title        = {Saturn reading: data trove veteran employment statistics},
  year         ={2026},
  howpublished = {\url{https://dr.eamer.dev/saturn/view/data-trove-veteran-employment-statistics}},
  note         = {Profiled with saturn-dissect v0.2.0, prompt saturn-insight-v2, model anthropic:default},
}
APA
Steuber, L. (2026). Saturn reading: data trove veteran employment statistics. Source: /home/coolhand/html/datavis/data_trove/demographic/veterans/military_firearm_spouse_employment.csv. Profiled with saturn-dissect v0.2.0 (saturn-insight-v2, anthropic:default). Retrieved from https://dr.eamer.dev/saturn/view/data-trove-veteran-employment-statistics