.cache who yld global

saturn notebook · generated 2026-05-01

Overview

Source: /home/coolhand/html/datavis/data_trove/data/accessibility/.cache_who/yld_global.xlsx#Notes

Saturn profiled 196 rows across 2 columns. The stats below are deterministic and machine-readable; the prose is a language-model interpretation of those stats (opt-in, added after the fact, never sees raw rows).

[2]:
!pip install saturn-dissect
import subprocess
subprocess.run([
    "saturn", "analyze", "/home/coolhand/html/datavis/data_trove/data/accessibility/.cache_who/yld_global.xlsx#Notes",
    "--findings", ".cache_who-yld_global.json",
    "--llm", "anthropic:claude-opus-4-7",
])

Summary confidence: high

This is a small 'Notes' sheet (196 rows, 2 columns) extracted from a WHO Global Health Estimates 2021 workbook on years lost due to disability (YLDs). It is essentially metadata and a country list rather than a tabular dataset: the unnamed first column is 96.94% null with only 6 distinct header/note strings, while the second column holds 190 distinct values, each occurring once, dominated by country names. The most useful check is to read through the second column's values and confirm that, apart from a few leading metadata strings, it is the list of countries included in the estimates. Treat this sheet as documentation; the real burden-of-disease numbers live on other sheets of the source workbook.

citing: row_count · column_count · columns[0].null_rate · columns[0].n_unique · columns[1].n_unique · columns[1].top_value · columns[1].null_rate
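
The cited keys above read like dotted paths into the findings file. As a minimal sketch, assuming the JSON written by `--findings` mirrors those paths (its real schema is not shown in this report), a small resolver can pull each cited value. The `findings` dict here is a hypothetical excerpt filled in from the numbers reported on this page, and `resolve` is an illustrative helper, not part of saturn-dissect.

```python
import re

# Hypothetical excerpt: the real layout of .cache_who-yld_global.json is an assumption.
findings = {
    "row_count": 196,
    "column_count": 2,
    "columns": [
        {"name": "__UNNAMED__0", "null_rate": 0.9694, "n_unique": 6},
        {
            "name": "GLOBAL HEALTH ESTIMATES 2021 SUMMARY TABLES:",
            "null_rate": 0.0306,
            "n_unique": 190,
            "top_value": "GLOBAL YLDs BY CAUSE, AGE AND SEX, 2000-2021",
        },
    ],
}

# Matches either a key name or a bracketed list index; dots are skipped.
_STEP = re.compile(r"([A-Za-z_][A-Za-z0-9_]*)|\[(\d+)\]")


def resolve(doc, path):
    """Follow a dotted path with optional [index] steps, e.g. 'columns[0].null_rate'."""
    node = doc
    for key, index in _STEP.findall(path):
        node = node[key] if key else node[int(index)]
    return node


cited = (
    "row_count · column_count · columns[0].null_rate · columns[0].n_unique · "
    "columns[1].n_unique · columns[1].top_value · columns[1].null_rate"
)
values = {path: resolve(findings, path) for path in cited.split(" · ")}
for path, value in values.items():
    print(f"{path} = {value!r}")
```

With a real findings file, `findings` would come from `json.load` instead of the literal above; a `KeyError` or `IndexError` from `resolve` would then indicate that the assumed layout does not match.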

Fig 1.
GLOBAL HEALTH ESTIMATES 2021 SUMMARY TABLES: · Distribution of text length reveals short country names versus longer header/citation strings.
Top values for GLOBAL HEALTH ESTIMATES 2021 SUMMARY TABLES: (20 unique shown, of 190 total).
value | count | share
GLOBAL YLDs BY CAUSE, AGE AND SEX, 2000-2021 | 1 | 0.5%
July 2024 | 1 | 0.5%
World Health Organization | 1 | 0.5%
Geneva, Switzerland | 1 | 0.5%
https://www.who.int/data/gho/data/themes/mortality-and-global-health-estimates | 1 | 0.5%
Afghanistan | 1 | 0.5%
Albania | 1 | 0.5%
Algeria | 1 | 0.5%
Angola | 1 | 0.5%
Antigua and Barbuda | 1 | 0.5%
Argentina | 1 | 0.5%
Armenia | 1 | 0.5%
Australia | 1 | 0.5%
Austria | 1 | 0.5%
Azerbaijan | 1 | 0.5%
Bahamas | 1 | 0.5%
Bahrain | 1 | 0.5%
Bangladesh | 1 | 0.5%
Barbados | 1 | 0.5%
Belarus | 1 | 0.5%
Fig 2.
__UNNAMED__0 · Shows how overwhelmingly null this column is — only 6 non-null note entries out of 196 rows.
Top values for __UNNAMED__0 (6 unique shown, of 6 total).
value | count | share
Global Health Estimates 2021 Summary Tables This workbook contains summary burden of disease estimates from the WHO Global Health Estimates (GHE). The estimates are based on analysis of latest available national information on levels of mortality and cause distributions as of the end of 2023 together with latest available information from WHO programs for causes of public health importance. Data, methods and cause categories are described in a Technical Paper (1) available on the WHO website. Population estimates are from the 2024 revision of the UN World Population Prospects (2). This spreadsheet includes point estimates for years lost due to disability (YLDs), globally, by cause, age and sex, for the years 2000, 2010, 2015, 2019, 2020 and 2021. Documentation, country-level and regional-level summary tables are available on the WHO website ( https://www.who.int/data/gho/data/themes/mortality-and-global-health-estimates ). Depending on the available data sources, the cause-specific estimates will have quite substantial uncertainty ranges. Due to changes in data and some methods, these estimates are not comparable to previously-released WHO estimates. The preparation of these statistics was undertaken by the WHO Department of Data and Analytics, in collaboration with WHO technical programs. For further queries, please send an email to healthstat@who.int . References: (1) WHO methods and data sources for global burden of disease 2000-2021. Global Health Estimates Technical Paper WHO/DDI/DNA/GHE/2020.3.Geneva: World Health Organization; 2024 (https://www.who.int/docs/default-source/gho-documents/global-health-estimates/GlobalBurden_method_2000_2021.pdf). (2) World Population Prospects: The 2024 revision. New York: United Nations, Department of Economic and Social Affairs, Population Division; 2024 (https://esa.un.org/unpd/wpp/). | 1 | 0.5%
Recommended citation: | 1 | 0.5%
Global Health Estimates 2021: Disease burden by Cause, Age, Sex, by Country and by Region, 2000-2021. Geneva, World Health Organization; 2024. | 1 | 0.5%
List of Countries | 1 | 0.5%
Note: WHO Member States with a population of less than 90,000 in 2021 were not included in the analysis. | 1 | 0.5%
Countries, areas or territories included | 1 | 0.5%
Fig 3.
GLOBAL HEALTH ESTIMATES 2021 SUMMARY TABLES: · Top values are all frequency 1, confirming this column is essentially a unique list (mostly countries).
Top values for GLOBAL HEALTH ESTIMATES 2021 SUMMARY TABLES: (20 unique shown, of 190 total).
value | count | share
GLOBAL YLDs BY CAUSE, AGE AND SEX, 2000-2021 | 1 | 0.5%
July 2024 | 1 | 0.5%
World Health Organization | 1 | 0.5%
Geneva, Switzerland | 1 | 0.5%
https://www.who.int/data/gho/data/themes/mortality-and-global-health-estimates | 1 | 0.5%
Afghanistan | 1 | 0.5%
Albania | 1 | 0.5%
Algeria | 1 | 0.5%
Angola | 1 | 0.5%
Antigua and Barbuda | 1 | 0.5%
Argentina | 1 | 0.5%
Armenia | 1 | 0.5%
Australia | 1 | 0.5%
Austria | 1 | 0.5%
Azerbaijan | 1 | 0.5%
Bahamas | 1 | 0.5%
Bahrain | 1 | 0.5%
Bangladesh | 1 | 0.5%
Barbados | 1 | 0.5%
Belarus | 1 | 0.5%
Fig 4.
Per-column null rate across the corpus. Columns are ordered by input position.
column | kind | null %
__UNNAMED__0 | categorical | 96.9%
GLOBAL HEALTH ESTIMATES 2021 SUMMARY TABLES: | categorical | 3.1%

__UNNAMED__0 categorical metadata

This unnamed column is almost certainly the leading text/header block from a WHO Global Health Estimates 2021 spreadsheet rather than a true data field — values include the workbook description, a 'Recommended citation:' label, and 'List of Countries'. Of 196 rows, 96.94% are null and only 6 distinct strings appear, each occurring once, so it carries no analytic signal. The presence of a multi-paragraph documentation blob as the top value confirms this is spreadsheet preamble that leaked in during ingest.

Treatment: Drop; this is spreadsheet header/preamble text, not a column.

anthropic:claude-opus-4-7 · confidence high
Out[10]:

saturn.columns["__UNNAMED__0"].stats

stat value
n 196
nulls 190 (96.9%)
unique 6
top_value Global Health Estimates 2021 Summary Tables This workbook contains summary burden of disease estimates from the WHO Global Health Estimates (GHE). The estimates are based on analysis of latest available national information on levels of mortality and cause distributions as of the end of 2023 together with latest available information from WHO programs for causes of public health importance. Data, methods and cause categories are described in a Technical Paper (1) available on the WHO website. Population estimates are from the 2024 revision of the UN World Population Prospects (2). This spreadsheet includes point estimates for years lost due to disability (YLDs), globally, by cause, age and sex, for the years 2000, 2010, 2015, 2019, 2020 and 2021. Documentation, country-level and regional-level summary tables are available on the WHO website ( https://www.who.int/data/gho/data/themes/mortality-and-global-health-estimates ). Depending on the available data sources, the cause-specific estimates will have quite substantial uncertainty ranges. Due to changes in data and some methods, these estimates are not comparable to previously-released WHO estimates. The preparation of these statistics was undertaken by the WHO Department of Data and Analytics, in collaboration with WHO technical programs. For further queries, please send an email to healthstat@who.int . References: (1) WHO methods and data sources for global burden of disease 2000-2021. Global Health Estimates Technical Paper WHO/DDI/DNA/GHE/2020.3.Geneva: World Health Organization; 2024 (https://www.who.int/docs/default-source/gho-documents/global-health-estimates/GlobalBurden_method_2000_2021.pdf). (2) World Population Prospects: The 2024 revision. New York: United Nations, Department of Economic and Social Affairs, Population Division; 2024 (https://esa.un.org/unpd/wpp/).
top_rate 0.1667
cardinality 6
entropy 2.585
entropy_ratio 1
alert: long_tail 6 singleton categories
alert: null_rate 96.9% null
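
The reported stats can be cross-checked by hand. The sketch below assumes that `entropy` is Shannon entropy in bits over the non-null values, that `top_rate` is the top count divided by the non-null count, and that `entropy_ratio` is entropy divided by its maximum for that many categories. These definitions are inferred from the numbers on this page (they reproduce them), not taken from saturn-dissect's documentation, and `categorical_stats` is an illustrative helper.

```python
import math


def categorical_stats(n_rows, counts):
    """Summary stats for a categorical column.

    n_rows: total rows, including nulls.
    counts: number of occurrences of each distinct non-null value.
    """
    non_null = sum(counts)
    probs = [c / non_null for c in counts]
    entropy = -sum(p * math.log2(p) for p in probs)  # Shannon entropy, bits
    max_entropy = math.log2(len(counts)) if len(counts) > 1 else 0.0
    return {
        "null_rate": (n_rows - non_null) / n_rows,
        "top_rate": max(counts) / non_null,
        "entropy": entropy,
        "entropy_ratio": entropy / max_entropy if max_entropy else 0.0,
    }


# __UNNAMED__0: 196 rows, 6 distinct values that each occur once.
stats = categorical_stats(196, [1] * 6)
for name, value in stats.items():
    print(f"{name}: {value:.4f}")
```

Because every non-null value occurs exactly once, the entropy equals log2 of the number of distinct values and the ratio is 1. Calling `categorical_stats(196, [1] * 190)` applies the same check to a column with 190 singleton values.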
Fig 5.
Top values for __UNNAMED__0.
Top values for __UNNAMED__0 (6 unique shown, of 6 total).
value | count | share
Global Health Estimates 2021 Summary Tables This workbook contains summary burden of disease estimates from the WHO Global Health Estimates (GHE). The estimates are based on analysis of latest available national information on levels of mortality and cause distributions as of the end of 2023 together with latest available information from WHO programs for causes of public health importance. Data, methods and cause categories are described in a Technical Paper (1) available on the WHO website. Population estimates are from the 2024 revision of the UN World Population Prospects (2). This spreadsheet includes point estimates for years lost due to disability (YLDs), globally, by cause, age and sex, for the years 2000, 2010, 2015, 2019, 2020 and 2021. Documentation, country-level and regional-level summary tables are available on the WHO website ( https://www.who.int/data/gho/data/themes/mortality-and-global-health-estimates ). Depending on the available data sources, the cause-specific estimates will have quite substantial uncertainty ranges. Due to changes in data and some methods, these estimates are not comparable to previously-released WHO estimates. The preparation of these statistics was undertaken by the WHO Department of Data and Analytics, in collaboration with WHO technical programs. For further queries, please send an email to healthstat@who.int . References: (1) WHO methods and data sources for global burden of disease 2000-2021. Global Health Estimates Technical Paper WHO/DDI/DNA/GHE/2020.3.Geneva: World Health Organization; 2024 (https://www.who.int/docs/default-source/gho-documents/global-health-estimates/GlobalBurden_method_2000_2021.pdf). (2) World Population Prospects: The 2024 revision. New York: United Nations, Department of Economic and Social Affairs, Population Division; 2024 (https://esa.un.org/unpd/wpp/). | 1 | 0.5%
Recommended citation: | 1 | 0.5%
Global Health Estimates 2021: Disease burden by Cause, Age, Sex, by Country and by Region, 2000-2021. Geneva, World Health Organization; 2024. | 1 | 0.5%
List of Countries | 1 | 0.5%
Note: WHO Member States with a population of less than 90,000 in 2021 were not included in the analysis. | 1 | 0.5%
Countries, areas or territories included | 1 | 0.5%

GLOBAL HEALTH ESTIMATES 2021 SUMMARY TABLES: categorical identifier

This appears to be a malformed header column from a WHO Global Health Estimates 2021 summary table, where document metadata (title, date, publisher, URL) has been concatenated with country names into a single field. With 190 unique values across 196 rows and a maximum frequency of 1 (top_rate 0.005), it is effectively a free-text identifier rather than a categorical feature. Entropy ratio of 1.0 confirms every populated value is distinct, and a 3.06% null rate suggests a few stray blank rows.

Treatment: Drop or re-parse: this column conflates report metadata with country labels and is near-unique.

anthropic:claude-opus-4-7 · confidence high
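
One way to act on the "re-parse" treatment is to separate the leading report metadata from the country labels. This is a rough sketch under a loud assumption: the five metadata strings (title, date, publisher, place, URL) come first in row order, as they do in the top-values listing, and everything after them is a country name. The function name `split_notes_column` and the `n_metadata` parameter are hypothetical, and the sample values are a short excerpt rather than the full column.

```python
def split_notes_column(values, n_metadata=5):
    """Split a Notes-sheet column into (metadata strings, country names).

    Assumes the first n_metadata non-blank values are report metadata
    and the rest are country labels. Verify this against the sheet itself.
    """
    non_blank = [str(v).strip() for v in values if v is not None and str(v).strip()]
    return non_blank[:n_metadata], non_blank[n_metadata:]


# Excerpt of the column in the order shown in the top-values table;
# None stands in for a blank row.
sample = [
    "GLOBAL YLDs BY CAUSE, AGE AND SEX, 2000-2021",
    "July 2024",
    "World Health Organization",
    "Geneva, Switzerland",
    "https://www.who.int/data/gho/data/themes/mortality-and-global-health-estimates",
    None,
    "Afghanistan",
    "Albania",
    "Algeria",
]

metadata, countries = split_notes_column(sample)
print(metadata)
print(countries)
```

A positional split is brittle by design: if the sheet layout changes, the count of metadata rows has to be rechecked rather than trusted.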
Out[13]:

saturn.columns["GLOBAL HEALTH ESTIMATES 2021 SUMMARY TABLES:"].stats

stat value
n 196
nulls 6 (3.1%)
unique 190
top_value GLOBAL YLDs BY CAUSE, AGE AND SEX, 2000-2021
top_rate 0.005263
cardinality 190
entropy 7.57
entropy_ratio 1
alert: long_tail 190 singleton categories
Fig 6.
Top values for GLOBAL HEALTH ESTIMATES 2021 SUMMARY TABLES:.
Top values for GLOBAL HEALTH ESTIMATES 2021 SUMMARY TABLES: (20 unique shown, of 190 total).
value | count | share
GLOBAL YLDs BY CAUSE, AGE AND SEX, 2000-2021 | 1 | 0.5%
July 2024 | 1 | 0.5%
World Health Organization | 1 | 0.5%
Geneva, Switzerland | 1 | 0.5%
https://www.who.int/data/gho/data/themes/mortality-and-global-health-estimates | 1 | 0.5%
Afghanistan | 1 | 0.5%
Albania | 1 | 0.5%
Algeria | 1 | 0.5%
Angola | 1 | 0.5%
Antigua and Barbuda | 1 | 0.5%
Argentina | 1 | 0.5%
Armenia | 1 | 0.5%
Australia | 1 | 0.5%
Austria | 1 | 0.5%
Azerbaijan | 1 | 0.5%
Bahamas | 1 | 0.5%
Bahrain | 1 | 0.5%
Bangladesh | 1 | 0.5%
Barbados | 1 | 0.5%
Belarus | 1 | 0.5%

How to cite

BibTeX
@misc{saturn-.cache-who-yld-global-2026,
  author       = {Steuber, Luke},
  title        = {Saturn reading: .cache who yld global},
  year         = {2026},
  howpublished = {\url{https://dr.eamer.dev/saturn/view/.cache_who-yld_global}},
  note         = {Profiled with saturn-dissect v0.2.0, prompt saturn-insight-v2, model anthropic:claude-opus-4-7},
}
APA
Steuber, L. (2026). Saturn reading: .cache who yld global. Source: /home/coolhand/html/datavis/data_trove/data/accessibility/.cache_who/yld_global.xlsx#Notes. Profiled with saturn-dissect v0.2.0 (saturn-insight-v2, anthropic:claude-opus-4-7). Retrieved from https://dr.eamer.dev/saturn/view/.cache_who-yld_global