saturn

/home/coolhand/datasets/accessibility-atlas/who_hale_long.csv 4,070 rows sample n=4,070 seed 42 2026-05-01T17:01:32+00:00

Overview

Source: /home/coolhand/datasets/accessibility-atlas/who_hale_long.csv
Total rows: 4,070
Profiled sample: 4,070
Columns: 4
Generated: 2026-05-01T17:01:32+00:00

Insights opt-in

Model-generated narrative. These are opinions, not facts — the stats below are what saturn measured. Generated by: anthropic:claude-opus-4-7.

Dataset high anthropic:claude-opus-4-7

This dataset contains 4,070 rows of WHO Healthy Life Expectancy (HALE) data spanning 185 countries, 6 regions, and 22 years (2000 to 2021). The panel is balanced: each country contributes 22 yearly observations (185 × 22 = 4,070), so the country_code distribution is uniform and not informative on its own. The most interesting variable is hale_years, which ranges from 35.3 to 73.8 with a mean of 61.0 and a median of 63.1. Its distribution is left-skewed (skew = -0.82), indicating a long tail of country-year observations with notably lower healthy life expectancy. Regional coverage is uneven, with Europe (1,100 rows) and Africa (1,034) dominating while South-East Asia contributes only 220 rows. Start by examining the hale_years distribution and how it breaks down by region.
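One way to start on the suggested regional breakdown is sketched below. It assumes the CSV has a header row with the column names shown in this report (`region`, `hale_years`) and is UTF-8 encoded; `hale_by_region` and `load_rows` are hypothetical helper names, and the demo rows are made up for illustration rather than taken from the dataset.

```python
import csv
import statistics
from collections import defaultdict


def load_rows(path):
    """Read the long-format CSV into a list of dicts keyed by column name."""
    with open(path, newline="", encoding="utf-8") as fh:
        return list(csv.DictReader(fh))


def hale_by_region(rows):
    """Group hale_years by region and return count, mean, and median per region."""
    groups = defaultdict(list)
    for row in rows:
        groups[row["region"]].append(float(row["hale_years"]))
    return {
        region: {
            "n": len(values),
            "mean": statistics.fmean(values),
            "median": statistics.median(values),
        }
        for region, values in groups.items()
    }


# Made-up rows for illustration only; these are not values from the dataset.
demo = [
    {"region": "Europe", "hale_years": "68.0"},
    {"region": "Europe", "hale_years": "70.0"},
    {"region": "Africa", "hale_years": "52.0"},
]
summary = hale_by_region(demo)
```

On the real file, `hale_by_region(load_rows(path))` would return one entry per region. Comparing each region's mean with its median is a quick check on whether the left skew comes from a few regions or appears in all of them.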

country_code high anthropic:claude-opus-4-7

This column holds ISO 3166-1 alpha-3 country codes, with 185 unique values across 4,070 rows and zero nulls. The distribution is perfectly uniform: each of the 20 listed top values appears exactly 22 times and entropy_ratio is 1.000, which is consistent with a balanced panel structure (185 countries × 22 periods = 4,070 rows).
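The panel reading above is an inference from counts, so it is worth verifying directly. The sketch below checks that no country-year pair is duplicated and that every country covers every year. `panel_shape` is a hypothetical helper, and the input here is a synthetic stand-in with the shape the report describes, not the real data.

```python
from collections import Counter


def panel_shape(pairs):
    """Return (n_countries, n_years, balanced) for (country_code, year) pairs."""
    pairs = list(pairs)
    per_country = Counter(code for code, _ in pairs)
    years = {year for _, year in pairs}
    balanced = (
        len(set(pairs)) == len(pairs)  # no duplicate country-year rows
        and all(n == len(years) for n in per_country.values())  # full coverage
    )
    return len(per_country), len(years), balanced


# Synthetic stand-in: 185 placeholder codes (not real ISO values) x 22 years.
codes = [f"C{i:03d}" for i in range(185)]
pairs = [(code, year) for code in codes for year in range(2000, 2022)]
n_countries, n_years, balanced = panel_shape(pairs)
```

With the real file, the pairs would come from the `country_code` and `year` columns. A `False` result would mean the uniform counts hide gaps or duplicates.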

region high anthropic:claude-opus-4-7

This is a categorical region field with 6 distinct values matching WHO regional groupings (Europe, Africa, Americas, Eastern Mediterranean, Western Pacific, South-East Asia) and no nulls across 4,070 rows. The distribution is moderately balanced (entropy ratio 0.936), with Europe leading at 27.0% (1,100 rows) and South-East Asia trailing at 5.4% (220 rows). The WHO-style labels suggest this dataset is sourced from or aligned with WHO global health data.
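The entropy figures can be reproduced from the region counts listed in this report. The sketch below assumes Shannon entropy in bits (log base 2), normalised by the log of the number of categories. That choice matches the reported numbers, but the report does not state how saturn defines them, and `entropy_ratio` is a hypothetical helper name.

```python
import math


def entropy_ratio(counts):
    """Return (entropy in bits, entropy / max possible entropy) for category counts."""
    total = sum(counts)
    probs = [c / total for c in counts if c > 0]
    entropy = -sum(p * math.log2(p) for p in probs)
    return entropy, entropy / math.log2(len(probs))


# Row counts per region as listed in the report's top-values table.
region_counts = [1100, 1034, 748, 484, 484, 220]
entropy, ratio = entropy_ratio(region_counts)
print(f"entropy={entropy:.3f} ratio={ratio:.3f}")
```

A ratio of 1.0 means all categories are equally frequent, which is the case for country_code. Values below 1.0 measure how far the mix is from uniform.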

year high anthropic:claude-opus-4-7

This column captures the calendar year, spanning 2000 to 2021 with 22 distinct integer values across 4,070 rows and no nulls. The distribution is perfectly symmetric (skew 0.0, mean equals median at 2010.5) with negative kurtosis (-1.20), which is what a uniform spread across years produces and is consistent with a balanced panel rather than a concentration in any period. No outliers are flagged.
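The link between these statistics and a uniform spread can be checked numerically. The sketch below computes the moments of 22 equally weighted years. It assumes every year has the same number of rows, and that the reported std uses the sample (n - 1) estimator over 4,070 rows; the second assumption is a guess that happens to match the third decimal.

```python
import math
import statistics

years = list(range(2000, 2022))  # 22 equally weighted years (assumed balanced)
n = len(years)

mean = statistics.fmean(years)
pop_std = statistics.pstdev(years)

# Excess kurtosis from the second and fourth central moments.
m2 = sum((y - mean) ** 2 for y in years) / n
m4 = sum((y - mean) ** 4 for y in years) / n
excess_kurtosis = m4 / m2**2 - 3

# Rescale to a sample standard deviation over all rows (assumed estimator).
total_rows = 4070
sample_std = pop_std * math.sqrt(total_rows / (total_rows - 1))

print(mean, round(sample_std, 3), round(excess_kurtosis, 3))
```

The results agree with the reported mean of 2010.5, std of 6.345, and kurtosis of -1.205, so the year column carries no information beyond the panel design.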

hale_years high anthropic:claude-opus-4-7

Healthy life expectancy (HALE) in years, spanning 35.3 to 73.8 with a mean of 61.03 and a median of 63.1 across 4,070 rows. The distribution is left-skewed (skew -0.82): a long tail of low-HALE observations pulls the mean below the median, while the middle half of the values lies between Q1 = 56.3 and Q3 = 66.4. Only 1.1% of values (45 rows) are flagged as outliers and there are no nulls, so the column is clean and ready to use.
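The report does not say which rule produces the outlier flags. As a rough sketch, assuming the common Tukey rule of 1.5 × IQR beyond the quartiles, the fences implied by the reported Q1 and Q3 can be computed as follows (`tukey_fences` is a hypothetical helper name):

```python
def tukey_fences(q1, q3, k=1.5):
    """Return (lower, upper) outlier fences at k * IQR beyond the quartiles."""
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr


# Quartiles and extremes as reported for hale_years.
q1, q3 = 56.3, 66.4
observed_min, observed_max = 35.3, 73.8

lower, upper = tukey_fences(q1, q3)
has_low_outliers = observed_min < lower    # minimum falls below the lower fence
has_high_outliers = observed_max > upper   # maximum stays inside the upper fence
print(f"fences: {lower:.2f} to {upper:.2f}")
```

Under that assumption the fences are roughly 41.15 and 81.55, so any flagged values would lie only in the low tail. That matches the left skew, but it should be confirmed against the rule saturn actually applies.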

Numeric correlation

country_code categorical

rows: 4,070
null: 0 (0.0%)
unique: 185
top_value: AFG
top_rate: 5.41e-03
cardinality: 185
entropy: 7.531
entropy_ratio: 1.000
Top values (rank 1–20)
  1. AFG — 22
  2. AGO — 22
  3. ALB — 22
  4. ARE — 22
  5. ARG — 22
  6. ARM — 22
  7. ATG — 22
  8. AUS — 22
  9. AUT — 22
  10. AZE — 22
  11. BDI — 22
  12. BEL — 22
  13. BEN — 22
  14. BFA — 22
  15. BGD — 22
  16. BGR — 22
  17. BHR — 22
  18. BHS — 22
  19. BIH — 22
  20. BLR — 22

region categorical

rows: 4,070
null: 0 (0.0%)
unique: 6
top_value: Europe
top_rate: 0.270
cardinality: 6
entropy: 2.420
entropy_ratio: 0.936
Top values (rank 1–20)
  1. Europe — 1,100
  2. Africa — 1,034
  3. Americas — 748
  4. Eastern Mediterranean — 484
  5. Western Pacific — 484
  6. South-East Asia — 220

year numeric

rows: 4,070
null: 0 (0.0%)
unique: 22
min: 2000
max: 2021
mean: 2010.5
median: 2010.5
std: 6.345
q1: 2005
q3: 2016
iqr: 11.000
skew: 0.000
kurtosis: -1.205
n_outliers: 0
outlier_rate: 0.000
zero_rate: 0.000

hale_years numeric

rows: 4,070
null: 0 (0.0%)
unique: 345
min: 35.300
max: 73.800
mean: 61.030
median: 63.100
std: 7.344
q1: 56.300
q3: 66.400
iqr: 10.100
skew: -0.824
kurtosis: 0.027
n_outliers: 45
outlier_rate: 0.011
zero_rate: 0.000