saturn
/home/coolhand/servers/diachronica/data_raw/wals_language.csv 3,573 rows sample n=3,573 seed 42 2026-05-01T17:52:07+00:00
Overview
| Source | /home/coolhand/servers/diachronica/data_raw/wals_language.csv |
| Total rows | 3,573 |
| Profiled sample | 3,573 |
| Columns | 17 |
| Generated | 2026-05-01T17:52:07+00:00 |
Insights opt-in
Model-generated narrative. These are opinions, not facts — the stats below are what saturn measured. Generated by: anthropic:claude-opus-4-7.
Errors during insight pass (18)
dataset:__global__:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacGsv59tCMcNmTCSFGmJ'}column:ID:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacGsvaun1aSDxETPBuJj'}column:Name:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacGsw99QVV9hP7u9U3Bq'}column:Macroarea:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacGswfemh6twvutVRr24'}column:Latitude:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacGsxDtCotu8uTyoJwCU'}column:Longitude:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacGsxjeKqvDrF4EVt7Lb'}column:Glottocode:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacGsyDR2GyWfviyxMT7U'}column:ISO639P3code:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacGszfyPtr3rge8nax8d'}column:Family:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacGt1DU2a9tTWcULDzTn'}column:Subfamily:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacGt1nCge2xRm4p7LPYD'}column:Genus:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacGt2JhYeEaQYaSTt6R2'}column:GenusIcon:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacGt2oU91AZQpbZjSqcE'}column:ISO_codes:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacGt3U9uA8ChTBEWe8fG'}column:Samples_100:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacGt438sKtDkSTPE3ct7'}column:Samples_200:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacGt4UBCVcVVukzj5DRE'}column:Country_ID:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacGt51g2qiTWEApSLcZH'}column:Source:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacGt5e7yLWHsqCD3y361'}column:Parent_ID:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacGt6MHgbcSqLFBdB24q'}
Numeric correlation
ID text
100.0% of rows are unique strings
100.0% rows are a single word
95th-percentile length under 20 chars
rows3,573
null0 (0.0%)
unique3,573
len_min2
len_max36
len_mean5.982
len_median3.000
len_p9517.000
word_mean1.000
word_median1.000
n_empty0
n_duplicates0
duplicate_rate0.000
vocab_size3,573
readability_flesch_mean61.577
emoji_rate0.000
url_rate0.000
one_word_rate1.000
allcaps_rate0.000
boilerplate_rate0.000
Sample values (first 10)
- abd
- genus-araucanian
- genus-misumalpan
- subfamily-palaihnihan
- mmp
- family-tunica
- mrj
- genus-northwestcaucasian
- genus-huavean
- arg
Name text
80.0% rows are a single word
95th-percentile length under 20 chars
rows3,573
null0 (0.0%)
unique3,198
len_min2
len_max46
len_mean8.705
len_median7.000
len_p9519.000
word_mean1.258
word_median1.000
n_empty0
n_duplicates375
duplicate_rate0.105
vocab_size3,383
readability_flesch_mean48.158
emoji_rate0.000
url_rate0.000
one_word_rate0.800
allcaps_rate0.000
boilerplate_rate0.000
Sample values (first 10)
- Abidji
- Araucanian
- Misumalpan
- Palaihnihan
- Mampruli
- Tunica
- Mirniny
- Northwest Caucasian
- Huavean
- Arabic (Gulf)
Macroarea categorical
25.5% null
rows3,573
null911 (25.5%)
unique6
top_valueEurasia
top_rate0.248
cardinality6
entropy2.459
entropy_ratio0.951
Top values (rank 1–20)
- Eurasia — 659
- Africa — 606
- Papunesia — 560
- North America — 396
- South America — 258
- Australia — 183
Latitude numeric
25.5% null
rows3,573
null911 (25.5%)
unique887
min-55.000
max71.250
mean11.880
median8.292
std22.722
q1-5.000
q328.000
iqr33.000
skew0.356
kurtosis-0.502
n_outliers1
outlier_rate3.76e-04
zero_rate2.25e-03
Longitude numeric
25.5% null
rows3,573
null911 (25.5%)
unique1,360
min-178.167
max179.167
mean35.172
median34.792
std89.352
q1-45.750
q3121.000
iqr166.750
skew-0.326
kurtosis-1.047
n_outliers0
outlier_rate0.000
zero_rate1.50e-03
Glottocode text
100.0% rows are a single word
26.0% null
95th-percentile length under 20 chars
rows3,573
null928 (26.0%)
unique2,502
len_min8
len_max8
len_mean8.000
len_median8.000
len_p958.000
word_mean1.000
word_median1.000
n_empty0
n_duplicates143
duplicate_rate0.054
vocab_size2,502
readability_flesch_mean92.879
emoji_rate0.000
url_rate0.000
one_word_rate1.000
allcaps_rate0.000
boilerplate_rate0.000
Sample values (first 10)
- chad1249
- nyor1246
- musl1236
- taga1270
- kuni1267
- yukp1241
- libe1247
- tuva1244
- tali1258
- yane1238
ISO639P3code text
100.0% rows are a single word
26.8% null
95th-percentile length under 20 chars
rows3,573
null959 (26.8%)
unique2,442
len_min3
len_max3
len_mean3.000
len_median3.000
len_p953.000
word_mean1.000
word_median1.000
n_empty0
n_duplicates172
duplicate_rate0.066
vocab_size2,442
readability_flesch_mean119.528
emoji_rate0.000
url_rate0.000
one_word_rate1.000
allcaps_rate0.000
boilerplate_rate0.000
Sample values (first 10)
- shu
- nyo
- ttt
- tgl
- kup
- yup
- kpk
- tvl
- tlj
- adx
Family categorical
25.5% null
rows3,573
null911 (25.5%)
unique254
top_valueNiger-Congo
top_rate0.122
cardinality254
entropy5.631
entropy_ratio0.705
Top values (rank 1–20)
- Niger-Congo — 324
- Austronesian — 324
- Indo-European — 176
- Sino-Tibetan — 146
- Afro-Asiatic — 145
- Pama-Nyungan — 121
- Trans-New Guinea — 98
- other — 72
- Altaic — 65
- Oto-Manguean — 56
- Austro-Asiatic — 48
- Eastern Sudanic — 47
- Uto-Aztecan — 44
- Algic — 31
- Mayan — 30
- Arawakan — 29
- Nakh-Daghestanian — 28
- Mande — 28
- Uralic — 27
- Hokan — 26
Subfamily categorical
74.5% null
rows3,573
null2,662 (74.5%)
unique32
top_valueBenue-Congo
top_rate0.220
cardinality32
entropy3.856
entropy_ratio0.771
Top values (rank 1–20)
- Benue-Congo — 200
- Eastern Malayo-Polynesian — 159
- Tibeto-Burman — 139
- Chadic — 47
- Mon-Khmer — 38
- Adamawa-Ubangi — 30
- Gur — 27
- Daghestanian — 25
- Cushitic — 24
- Finno-Ugric — 21
- Kwa — 20
- North-Central Atlantic — 20
- Nilotic — 19
- Mixtecan — 18
- Omotic — 15
- Kainantu-Goroka — 14
- Madang — 13
- Awyu-Ok — 10
- Surmic — 10
- Je — 9
Genus categorical
25.5% null
rows3,573
null911 (25.5%)
unique625
top_valueOceanic
top_rate0.056
cardinality625
entropy7.950
entropy_ratio0.856
Top values (rank 1–20)
- Oceanic — 149
- Bantu — 141
- Indic — 50
- Western Pama-Nyungan — 49
- Semitic — 43
- Turkic — 41
- Sign Languages — 40
- Bodic — 40
- Germanic — 39
- Northern Pama-Nyungan — 33
- Creoles and Pidgins — 32
- Mayan — 30
- Algonquian — 29
- Central Malayo-Polynesian — 29
- Iranian — 26
- Romance — 24
- Biu-Mandara — 24
- Southeastern Pama-Nyungan — 23
- Dravidian — 23
- Malayo-Sumbawan — 22
GenusIcon categorical
601 singleton categories
82.5% null
rows3,573
null2,948 (82.5%)
unique613
top_valuec688033
top_rate3.20e-03
cardinality613
entropy9.249
entropy_ratio0.999
Top values (rank 1–20)
- c688033 — 2
- c803E33 — 2
- c804733 — 2
- c807D33 — 2
- c806233 — 2
- c805033 — 2
- c7A8033 — 2
- c805933 — 2
- c807433 — 2
- c806B33 — 2
- c718033 — 2
- c803533 — 2
- cCC8C51 — 1
- cCC6851 — 1
- cCC7E51 — 1
- c8FCC51 — 1
- cCC8051 — 1
- c528033 — 1
- cCC9F51 — 1
- cCCB551 — 1
ISO_codes text
99.0% rows are a single word
26.1% null
95th-percentile length under 20 chars
rows3,573
null933 (26.1%)
unique2,468
len_min3
len_max7
len_mean3.039
len_median3.000
len_p953.000
word_mean1.010
word_median1.000
n_empty0
n_duplicates172
duplicate_rate0.065
vocab_size2,486
readability_flesch_mean117.413
emoji_rate0.000
url_rate0.000
one_word_rate0.990
allcaps_rate0.000
boilerplate_rate0.000
Sample values (first 10)
- shu
- nyo
- dto
- sps
- kpx
- yux
- kff
- tue
- tlj
- ame
Samples_100 categorical
25.5% null
top value is 96.2% of rows
rows3,573
null911 (25.5%)
unique2
top_valueFalse
top_rate0.962
cardinality2
entropy0.231
entropy_ratio0.231
Top values (rank 1–20)
- False — 2,562
- True — 100
Samples_200 categorical
25.5% null
rows3,573
null911 (25.5%)
unique2
top_valueFalse
top_rate0.925
cardinality2
entropy0.385
entropy_ratio0.385
Top values (rank 1–20)
- False — 2,462
- True — 200
Country_ID categorical
25.7% null
rows3,573
null918 (25.7%)
unique337
top_valuePG
top_rate0.081
cardinality337
entropy6.314
entropy_ratio0.752
Top values (rank 1–20)
- PG — 214
- AU — 185
- US — 177
- ID — 177
- IN — 120
- MX — 120
- RU — 89
- NG — 66
- BR — 66
- CN — 54
- CD — 49
- CM — 46
- CA — 45
- CO — 39
- ET — 36
- PH — 36
- PE — 35
- NP — 32
- TZ — 28
- VU — 28
Source text
45.5% rows are a single word
30.1% null
rows3,573
null1,074 (30.1%)
unique2,373
len_min7
len_max452
len_mean42.071
len_median25.000
len_p95135.000
word_mean2.854
word_median2.000
n_empty0
n_duplicates126
duplicate_rate0.050
vocab_size5,899
readability_flesch_mean21.332
emoji_rate0.000
url_rate0.000
one_word_rate0.455
allcaps_rate0.000
boilerplate_rate0.000
Sample values (first 10)
- Abu-Absi-1995
- Grenoble-1992
- Goldstein-1991
- Ross-2002g
- Bacelar-2004
- Hanes-1952 de-Vegamian-1978
- Laanest-1982 Leskinen-1984 Raun-1964b Rjagoev-1993
- Haas-1940 Haas-1953 Nichols-1992 Swanton-1919 Swanton-1921
- Ross-2002h
- Duff-Tripp-1997 Fast-1953 Wise-1958 Wise-1978 Wise-1986 Wise-1990
Parent_ID categorical
501 singleton categories
rows3,573
null254 (7.1%)
unique911
top_valuegenus-oceanic
top_rate0.045
cardinality911
entropy8.554
entropy_ratio0.870
Top values (rank 1–20)
- genus-oceanic — 149
- genus-bantu — 141
- genus-indic — 50
- genus-westernpamanyungan — 49
- genus-semitic — 43
- genus-turkic — 41
- genus-signlanguages — 40
- genus-bodic — 40
- genus-germanic — 39
- genus-northernpamanyungan — 33
- genus-creolesandpidgins — 32
- genus-mayan — 30
- family-austronesian — 30
- genus-algonquian — 29
- genus-centralmalayopolynesian — 29
- genus-iranian — 26
- family-transnewguinea — 25
- genus-romance — 24
- genus-biumandara — 24
- genus-southeasternpamanyungan — 23