data trove veteran employment statistics
Reading
This dataset captures spouse employment indicators — unemployment rate and labor force participation — across 15 U.S. states, likely in the context of military or veteran households. The most notable signal is in spouse unemployment rate, which ranges widely from 7.35% to 16.28% with a right skew and one flagged outlier at the high end, suggesting at least one state has a notably worse outcome for spouses. By contrast, spouse labor force participation is tightly clustered between 66.8% and 73.4% with no outliers, meaning most states see similar engagement levels even when unemployment varies — worth investigating whether high-unemployment states are simply retaining more job-seekers in the labor force.
citing: spouse_unemployment_rate.stats.min · spouse_unemployment_rate.stats.max · spouse_unemployment_rate.stats.mean · spouse_unemployment_rate.n_outliers · spouse_unemployment_rate.alerts · spouse_labor_force_participation.stats.min · spouse_labor_force_participation.stats.max · spouse_labor_force_participation.n_outliers · row_count · column_count
Charts the summary said to look at first
Show data table
| bin | count |
|---|---|
| 7.35 – 9.136 | 6 |
| 9.136 – 10.92 | 5 |
| 10.92 – 12.71 | 1 |
| 12.71 – 14.49 | 2 |
| 14.49 – 16.28 | 1 |
Show data table
| bin | count |
|---|---|
| 66.8 – 68.12 | 2 |
| 68.12 – 69.44 | 4 |
| 69.44 – 70.76 | 4 |
| 70.76 – 72.08 | 3 |
| 72.08 – 73.4 | 2 |
Show data table
| value | count | share |
|---|---|---|
| Arizona | 1 | 6.7% |
| Georgia | 1 | 6.7% |
| Texas | 1 | 6.7% |
| North Carolina | 1 | 6.7% |
| California | 1 | 6.7% |
| Virginia | 1 | 6.7% |
| Florida | 1 | 6.7% |
| Washington | 1 | 6.7% |
| Colorado | 1 | 6.7% |
| Kentucky | 1 | 6.7% |
| Hawaii | 1 | 6.7% |
| South Carolina | 1 | 6.7% |
| Oklahoma | 1 | 6.7% |
| Kansas | 1 | 6.7% |
| Alaska | 1 | 6.7% |
Schema
3 columns| Alerts | ||||
|---|---|---|---|---|
| state | categorical | 0.0% | 15 |
long_tail
|
| spouse_unemployment_rate | numeric | 0.0% | 15 |
outliers
|
| spouse_labor_force_participation | numeric | 0.0% | 15 |
|
state
categorical label long_tailThis column contains US state names, functioning as a geographic label across 15 distinct states. With n=15 and n_unique=15, every row has a unique state value — the dataset appears to contain exactly one record per state with no repetition whatsoever. Entropy ratio of ~1.0 confirms perfectly uniform distribution across all 15 states, each appearing exactly once (top_rate=0.067 ≈ 1/15). The 'long_tail' alert is a statistical artifact of the uniform distribution rather than a true concentration issue. Treatment: Use as a grouping or join key; with only 15 rows (one per state), this dataset is likely a summary table — verify before any aggregation or modelling.
- n
- 15
- nulls
- 0 (0.0%)
- unique
- 15
- top_value
- Arizona
- top_rate
- 0.06667
- cardinality
- 15
- entropy
- 3.907
- entropy_ratio
- 1
spouse_unemployment_rate
numeric feature outliersThis column records the unemployment rate associated with a respondent's spouse, expressed as a percentage. With only 15 rows and 15 unique values, every observation is distinct. The distribution is moderately right-skewed (skew = 1.04) with one flagged outlier at the maximum value of 16.28, which sits notably above the Q3 of 11.135 and drives the mean (10.19) above the median (9.53). The dataset is extremely small, limiting any statistical conclusions. Treatment: Investigate the outlier at 16.28 for data entry error; if valid, consider robust scaling before modelling given the small sample and skew.
- n
- 15
- nulls
- 0 (0.0%)
- unique
- 15
- min
- 7.35
- max
- 16.28
- mean
- 10.19
- median
- 9.53
- std
- 2.591
- q1
- 8.285
- q3
- 11.13
- iqr
- 2.85
- skew
- 1.045
- kurtosis
- 0.1972
- n_outliers
- 1
- outlier_rate
- 0.06667
- zero_rate
- 0
spouse_labor_force_participation
numeric featureThis column captures the labor force participation rate of spouses, likely expressed as a percentage, recorded across 15 distinct observations (possibly countries, regions, or time periods). The distribution is remarkably tight — the full range spans only 66.8 to 73.4 (a 6.6-point spread) with a standard deviation of just 1.76 and an IQR of 2.4, suggesting these records represent a homogeneous group or a narrow slice of a larger dataset. No outliers, no nulls, and near-zero skew (0.22) indicate an unusually clean and well-behaved variable. Treatment: Use as-is in regression or clustering; the narrow range (66.8–73.4) means normalization may be needed if combined with broader-scale features.
- n
- 15
- nulls
- 0 (0.0%)
- unique
- 15
- min
- 66.8
- max
- 73.4
- mean
- 69.89
- median
- 69.8
- std
- 1.762
- q1
- 68.6
- q3
- 71
- iqr
- 2.4
- skew
- 0.2249
- kurtosis
- -0.5287
- n_outliers
- 0
- outlier_rate
- 0
- zero_rate
- 0