saturn
/home/coolhand/html/datavis/data_trove/cache/glottolog_languages.parquet 27,037 rows sample n=27,037 seed 42 2026-05-01T18:05:46+00:00
Overview
| Source | /home/coolhand/html/datavis/data_trove/cache/glottolog_languages.parquet |
| Total rows | 27,037 |
| Profiled sample | 27,037 |
| Columns | 15 |
| Generated | 2026-05-01T18:05:46+00:00 |
Insights opt-in
Model-generated narrative. These are opinions, not facts — the stats below are what saturn measured. Generated by: anthropic:claude-opus-4-7.
Errors during insight pass (16)
dataset:__global__:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacHvHK1ycaZQKNjFCrzk'}column:ID:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacHvHmoGJDHy3YYa1oeh'}column:Name:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacHvJL2rM61Wq1rt9dGy'}column:Macroarea:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacHvJyyVeBwd6VDKPRzn'}column:Latitude:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacHvKXibpCb8w19DrBfA'}column:Longitude:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacHvL1z5LPi7Ug8LTqJV'}column:Glottocode:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacHvLXzJEGW4zPXt6cYP'}column:ISO639P3code:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacHvM4VcgRdnUKLjAdBx'}column:Level:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacHvMajyoCpBx8MC85xp'}column:Countries:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacHvN7FHLEnxGPXDFGvY'}column:Family_ID:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacHvNc1bFTsWjhBtP5y8'}column:Language_ID:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacHvPFxKekZu3Qr62pzX'}column:Closest_ISO369P3code:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacHvPjix2xpJ9npVdmFk'}column:First_Year_Of_Documentation:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacHvQKT758xC1cYSdyg4'}column:Last_Year_Of_Documentation:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacHvQswFtY2thexE486S'}column:Is_Isolate:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacHvRJUpw1YHGqcqtNYE'}
Numeric correlation
ID text
100.0% of rows are unique strings
100.0% rows are a single word
95th-percentile length under 20 chars
rows27,037
null0 (0.0%)
unique27,037
len_min8
len_max8
len_mean8.000
len_median8.000
len_p958.000
word_mean1.000
word_median1.000
n_empty0
n_duplicates0
duplicate_rate0.000
vocab_size20,000
readability_flesch_mean92.033
emoji_rate0.000
url_rate0.000
one_word_rate1.000
allcaps_rate0.000
boilerplate_rate0.000
Sample values (first 10)
- east1459
- tarp1240
- kech1244
- kona1243
- jehm1239
- east2441
- apal1256
- mala1473
- bauk1238
- land1262
Name text
100.0% of rows are unique strings
66.7% rows are a single word
rows27,037
null0 (0.0%)
unique27,037
len_min1
len_max109
len_mean10.439
len_median8.000
len_p9523.000
word_mean1.444
word_median1.000
n_empty0
n_duplicates0
duplicate_rate0.000
vocab_size18,126
readability_flesch_mean29.907
emoji_rate0.000
url_rate0.000
one_word_rate0.667
allcaps_rate1.48e-04
boilerplate_rate0.000
Sample values (first 10)
- East Bird's Head
- Tarpia
- Kechi
- Konawe
- Jeh Mang Ram
- East Lagoon
- Apali
- Mala
- Baukan
- Land Dayak (Retired)
Macroarea categorical
rows27,037
null224 (0.8%)
unique30
top_valueEurasia
top_rate0.301
cardinality30
entropy2.271
entropy_ratio0.463
Top values (rank 1–20)
- Eurasia — 8,060
- Africa — 8,020
- Papunesia — 6,326
- North America — 1,782
- South America — 1,524
- Australia — 919
- Africa;Eurasia — 29
- Eurasia;Papunesia — 22
- Africa;Eurasia;North America;Papunesia;South America — 18
- Africa;Australia;Eurasia;North America;Papunesia;South America — 17
- North America;South America — 15
- Eurasia;North America — 12
- Africa;North America — 12
- Eurasia;South America — 11
- Eurasia;Papunesia;South America — 8
- Africa;Eurasia;Papunesia;South America — 7
- Eurasia;North America;South America — 5
- Eurasia;North America;Papunesia;South America — 4
- Africa;Australia;Eurasia;North America;Papunesia — 3
- Papunesia;South America — 3
Latitude numeric
rows27,037
null479 (1.8%)
unique13,231
min-55.275
max73.135
mean11.590
median8.527
std20.570
q1-3.747
q326.000
iqr29.747
skew0.421
kurtosis-0.191
n_outliers48
outlier_rate1.81e-03
zero_rate0.000
Longitude numeric
rows27,037
null479 (1.8%)
unique13,203
min-178.785
max179.431
mean51.824
median44.065
std74.046
q19.225
q3119.394
iqr110.168
skew-0.468
kurtosis-0.452
n_outliers51
outlier_rate1.92e-03
zero_rate0.000
Glottocode text
100.0% of rows are unique strings
100.0% rows are a single word
95th-percentile length under 20 chars
rows27,037
null0 (0.0%)
unique27,037
len_min8
len_max8
len_mean8.000
len_median8.000
len_p958.000
word_mean1.000
word_median1.000
n_empty0
n_duplicates0
duplicate_rate0.000
vocab_size20,000
readability_flesch_mean92.033
emoji_rate0.000
url_rate0.000
one_word_rate1.000
allcaps_rate0.000
boilerplate_rate0.000
Sample values (first 10)
- east1459
- tarp1240
- kech1244
- kona1243
- jehm1239
- east2441
- apal1256
- mala1473
- bauk1238
- land1262
ISO639P3code text
100.0% of rows are unique strings
100.0% rows are a single word
69.7% null
95th-percentile length under 20 chars
rows27,037
null18,857 (69.7%)
unique8,180
len_min3
len_max3
len_mean3.000
len_median3.000
len_p953.000
word_mean1.000
word_median1.000
n_empty0
n_duplicates0
duplicate_rate0.000
vocab_size8,180
readability_flesch_mean119.105
emoji_rate0.000
url_rate0.000
one_word_rate1.000
allcaps_rate0.000
boilerplate_rate0.000
Sample values (first 10)
- aqp
- lak
- avd
- kak
- fam
- kri
- gwt
- nnz
- zac
- wlg
Level categorical
rows27,037
null0 (0.0%)
unique3
top_valuedialect
top_rate0.503
cardinality3
entropy1.468
entropy_ratio0.927
Top values (rank 1–20)
- dialect — 13,593
- language — 8,612
- family — 4,832
Countries categorical
66.4% null
rows27,037
null17,956 (66.4%)
unique737
top_valuePG
top_rate0.100
cardinality737
entropy6.562
entropy_ratio0.689
Top values (rank 1–20)
- PG — 905
- ID — 708
- NG — 512
- AU — 476
- IN — 402
- MX — 316
- CN — 315
- BR — 277
- US — 255
- CM — 205
- PH — 188
- CD — 162
- VU — 129
- RU — 104
- TZ — 103
- PE — 102
- MY — 88
- TD — 88
- NP — 82
- CO — 80
Family_ID categorical
rows27,037
null429 (1.6%)
unique297
top_valueatla1278
top_rate0.183
cardinality297
entropy4.938
entropy_ratio0.601
Top values (rank 1–20)
- atla1278 — 4,861
- aust1307 — 4,108
- indo1319 — 3,173
- sino1245 — 1,926
- afro1255 — 1,458
- nucl1709 — 834
- pama1250 — 642
- aust1305 — 526
- otom1299 — 385
- book1242 — 382
- sign1238 — 343
- mand1469 — 322
- drav1251 — 281
- turk1311 — 273
- cent2225 — 267
- taik1256 — 261
- ural1272 — 236
- nilo1247 — 235
- nakh1245 — 190
- araw1281 — 188
Language_ID text
100.0% rows are a single word
49.7% null
95th-percentile length under 20 chars
77.1% duplicate strings
rows27,037
null13,444 (49.7%)
unique3,110
len_min8
len_max8
len_mean8.000
len_median8.000
len_p958.000
word_mean1.000
word_median1.000
n_empty0
n_duplicates10,483
duplicate_rate0.771
vocab_size3,110
readability_flesch_mean86.534
emoji_rate0.000
url_rate0.000
one_word_rate1.000
allcaps_rate0.000
boilerplate_rate0.000
Sample values (first 10)
- kuna1268
- mixt1426
- mewa1249
- meke1243
- gban1260
- stan1288
- ande1247
- amah1245
- kuan1248
- foii1241
Closest_ISO369P3code text
100.0% rows are a single word
21.3% null
95th-percentile length under 20 chars
61.6% duplicate strings
rows27,037
null5,754 (21.3%)
unique8,180
len_min3
len_max3
len_mean3.000
len_median3.000
len_p953.000
word_mean1.000
word_median1.000
n_empty0
n_duplicates13,103
duplicate_rate0.616
vocab_size7,877
readability_flesch_mean117.413
emoji_rate0.000
url_rate0.000
one_word_rate1.000
allcaps_rate0.000
boilerplate_rate0.000
Sample values (first 10)
- xbn
- ebu
- ksl
- beb
- vam
- nso
- kzp
- bns
- arg
- yag
First_Year_Of_Documentation numeric
99.2% null
rows27,037
null26,822 (99.2%)
unique114
min-2,100
max1,932
mean673.730
median711.000
std1,055
q1-300.000
q31,710
iqr2,010
skew-0.458
kurtosis-0.921
n_outliers0
outlier_rate0.000
zero_rate0.000
Last_Year_Of_Documentation numeric
96.0% null
skew=-3.35
15.9% rows beyond 1.5 IQR
rows27,037
null25,969 (96.0%)
unique269
min-3,100
max2,024
mean1,700
median1,960
std699.336
q11,858
q31,987
iqr129.500
skew-3.345
kurtosis12.315
n_outliers170
outlier_rate0.159
zero_rate0.000
Is_Isolate categorical
68.1% null
top value is 97.9% of rows
rows27,037
null18,425 (68.1%)
unique2
top_valueFalse
top_rate0.979
cardinality2
entropy0.148
entropy_ratio0.148
Top values (rank 1–20)
- False — 8,430
- True — 182