saturn

/home/coolhand/html/datavis/data_trove/cache/glottolog_languages.parquet 27,037 rows sample n=27,037 seed 42 2026-05-01T18:05:46+00:00

Overview

Source/home/coolhand/html/datavis/data_trove/cache/glottolog_languages.parquet
Total rows27,037
Profiled sample27,037
Columns15
Generated2026-05-01T18:05:46+00:00

Insights opt-in

Model-generated narrative. These are opinions, not facts — the stats below are what saturn measured. Generated by: anthropic:claude-opus-4-7.

Errors during insight pass (16)
  • dataset:__global__:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacHvHK1ycaZQKNjFCrzk'}
  • column:ID:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacHvHmoGJDHy3YYa1oeh'}
  • column:Name:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacHvJL2rM61Wq1rt9dGy'}
  • column:Macroarea:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacHvJyyVeBwd6VDKPRzn'}
  • column:Latitude:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacHvKXibpCb8w19DrBfA'}
  • column:Longitude:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacHvL1z5LPi7Ug8LTqJV'}
  • column:Glottocode:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacHvLXzJEGW4zPXt6cYP'}
  • column:ISO639P3code:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacHvM4VcgRdnUKLjAdBx'}
  • column:Level:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacHvMajyoCpBx8MC85xp'}
  • column:Countries:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacHvN7FHLEnxGPXDFGvY'}
  • column:Family_ID:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacHvNc1bFTsWjhBtP5y8'}
  • column:Language_ID:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacHvPFxKekZu3Qr62pzX'}
  • column:Closest_ISO369P3code:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacHvPjix2xpJ9npVdmFk'}
  • column:First_Year_Of_Documentation:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacHvQKT758xC1cYSdyg4'}
  • column:Last_Year_Of_Documentation:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacHvQswFtY2thexE486S'}
  • column:Is_Isolate:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacHvRJUpw1YHGqcqtNYE'}

Numeric correlation

ID text

100.0% of rows are unique strings 100.0% rows are a single word 95th-percentile length under 20 chars
rows27,037
null0 (0.0%)
unique27,037
len_min8
len_max8
len_mean8.000
len_median8.000
len_p958.000
word_mean1.000
word_median1.000
n_empty0
n_duplicates0
duplicate_rate0.000
vocab_size20,000
readability_flesch_mean92.033
emoji_rate0.000
url_rate0.000
one_word_rate1.000
allcaps_rate0.000
boilerplate_rate0.000
Sample values (first 10)
  1. east1459
  2. tarp1240
  3. kech1244
  4. kona1243
  5. jehm1239
  6. east2441
  7. apal1256
  8. mala1473
  9. bauk1238
  10. land1262

Name text

100.0% of rows are unique strings 66.7% rows are a single word
rows27,037
null0 (0.0%)
unique27,037
len_min1
len_max109
len_mean10.439
len_median8.000
len_p9523.000
word_mean1.444
word_median1.000
n_empty0
n_duplicates0
duplicate_rate0.000
vocab_size18,126
readability_flesch_mean29.907
emoji_rate0.000
url_rate0.000
one_word_rate0.667
allcaps_rate1.48e-04
boilerplate_rate0.000
Sample values (first 10)
  1. East Bird's Head
  2. Tarpia
  3. Kechi
  4. Konawe
  5. Jeh Mang Ram
  6. East Lagoon
  7. Apali
  8. Mala
  9. Baukan
  10. Land Dayak (Retired)

Macroarea categorical

rows27,037
null224 (0.8%)
unique30
top_valueEurasia
top_rate0.301
cardinality30
entropy2.271
entropy_ratio0.463
Top values (rank 1–20)
  1. Eurasia — 8,060
  2. Africa — 8,020
  3. Papunesia — 6,326
  4. North America — 1,782
  5. South America — 1,524
  6. Australia — 919
  7. Africa;Eurasia — 29
  8. Eurasia;Papunesia — 22
  9. Africa;Eurasia;North America;Papunesia;South America — 18
  10. Africa;Australia;Eurasia;North America;Papunesia;South America — 17
  11. North America;South America — 15
  12. Eurasia;North America — 12
  13. Africa;North America — 12
  14. Eurasia;South America — 11
  15. Eurasia;Papunesia;South America — 8
  16. Africa;Eurasia;Papunesia;South America — 7
  17. Eurasia;North America;South America — 5
  18. Eurasia;North America;Papunesia;South America — 4
  19. Africa;Australia;Eurasia;North America;Papunesia — 3
  20. Papunesia;South America — 3

Latitude numeric

rows27,037
null479 (1.8%)
unique13,231
min-55.275
max73.135
mean11.590
median8.527
std20.570
q1-3.747
q326.000
iqr29.747
skew0.421
kurtosis-0.191
n_outliers48
outlier_rate1.81e-03
zero_rate0.000

Longitude numeric

rows27,037
null479 (1.8%)
unique13,203
min-178.785
max179.431
mean51.824
median44.065
std74.046
q19.225
q3119.394
iqr110.168
skew-0.468
kurtosis-0.452
n_outliers51
outlier_rate1.92e-03
zero_rate0.000

Glottocode text

100.0% of rows are unique strings 100.0% rows are a single word 95th-percentile length under 20 chars
rows27,037
null0 (0.0%)
unique27,037
len_min8
len_max8
len_mean8.000
len_median8.000
len_p958.000
word_mean1.000
word_median1.000
n_empty0
n_duplicates0
duplicate_rate0.000
vocab_size20,000
readability_flesch_mean92.033
emoji_rate0.000
url_rate0.000
one_word_rate1.000
allcaps_rate0.000
boilerplate_rate0.000
Sample values (first 10)
  1. east1459
  2. tarp1240
  3. kech1244
  4. kona1243
  5. jehm1239
  6. east2441
  7. apal1256
  8. mala1473
  9. bauk1238
  10. land1262

ISO639P3code text

100.0% of rows are unique strings 100.0% rows are a single word 69.7% null 95th-percentile length under 20 chars
rows27,037
null18,857 (69.7%)
unique8,180
len_min3
len_max3
len_mean3.000
len_median3.000
len_p953.000
word_mean1.000
word_median1.000
n_empty0
n_duplicates0
duplicate_rate0.000
vocab_size8,180
readability_flesch_mean119.105
emoji_rate0.000
url_rate0.000
one_word_rate1.000
allcaps_rate0.000
boilerplate_rate0.000
Sample values (first 10)
  1. aqp
  2. lak
  3. avd
  4. kak
  5. fam
  6. kri
  7. gwt
  8. nnz
  9. zac
  10. wlg

Level categorical

rows27,037
null0 (0.0%)
unique3
top_valuedialect
top_rate0.503
cardinality3
entropy1.468
entropy_ratio0.927
Top values (rank 1–20)
  1. dialect — 13,593
  2. language — 8,612
  3. family — 4,832

Countries categorical

66.4% null
rows27,037
null17,956 (66.4%)
unique737
top_valuePG
top_rate0.100
cardinality737
entropy6.562
entropy_ratio0.689
Top values (rank 1–20)
  1. PG — 905
  2. ID — 708
  3. NG — 512
  4. AU — 476
  5. IN — 402
  6. MX — 316
  7. CN — 315
  8. BR — 277
  9. US — 255
  10. CM — 205
  11. PH — 188
  12. CD — 162
  13. VU — 129
  14. RU — 104
  15. TZ — 103
  16. PE — 102
  17. MY — 88
  18. TD — 88
  19. NP — 82
  20. CO — 80

Family_ID categorical

rows27,037
null429 (1.6%)
unique297
top_valueatla1278
top_rate0.183
cardinality297
entropy4.938
entropy_ratio0.601
Top values (rank 1–20)
  1. atla1278 — 4,861
  2. aust1307 — 4,108
  3. indo1319 — 3,173
  4. sino1245 — 1,926
  5. afro1255 — 1,458
  6. nucl1709 — 834
  7. pama1250 — 642
  8. aust1305 — 526
  9. otom1299 — 385
  10. book1242 — 382
  11. sign1238 — 343
  12. mand1469 — 322
  13. drav1251 — 281
  14. turk1311 — 273
  15. cent2225 — 267
  16. taik1256 — 261
  17. ural1272 — 236
  18. nilo1247 — 235
  19. nakh1245 — 190
  20. araw1281 — 188

Language_ID text

100.0% rows are a single word 49.7% null 95th-percentile length under 20 chars 77.1% duplicate strings
rows27,037
null13,444 (49.7%)
unique3,110
len_min8
len_max8
len_mean8.000
len_median8.000
len_p958.000
word_mean1.000
word_median1.000
n_empty0
n_duplicates10,483
duplicate_rate0.771
vocab_size3,110
readability_flesch_mean86.534
emoji_rate0.000
url_rate0.000
one_word_rate1.000
allcaps_rate0.000
boilerplate_rate0.000
Sample values (first 10)
  1. kuna1268
  2. mixt1426
  3. mewa1249
  4. meke1243
  5. gban1260
  6. stan1288
  7. ande1247
  8. amah1245
  9. kuan1248
  10. foii1241

Closest_ISO369P3code text

100.0% rows are a single word 21.3% null 95th-percentile length under 20 chars 61.6% duplicate strings
rows27,037
null5,754 (21.3%)
unique8,180
len_min3
len_max3
len_mean3.000
len_median3.000
len_p953.000
word_mean1.000
word_median1.000
n_empty0
n_duplicates13,103
duplicate_rate0.616
vocab_size7,877
readability_flesch_mean117.413
emoji_rate0.000
url_rate0.000
one_word_rate1.000
allcaps_rate0.000
boilerplate_rate0.000
Sample values (first 10)
  1. xbn
  2. ebu
  3. ksl
  4. beb
  5. vam
  6. nso
  7. kzp
  8. bns
  9. arg
  10. yag

First_Year_Of_Documentation numeric

99.2% null
rows27,037
null26,822 (99.2%)
unique114
min-2,100
max1,932
mean673.730
median711.000
std1,055
q1-300.000
q31,710
iqr2,010
skew-0.458
kurtosis-0.921
n_outliers0
outlier_rate0.000
zero_rate0.000

Last_Year_Of_Documentation numeric

96.0% null skew=-3.35 15.9% rows beyond 1.5 IQR
rows27,037
null25,969 (96.0%)
unique269
min-3,100
max2,024
mean1,700
median1,960
std699.336
q11,858
q31,987
iqr129.500
skew-3.345
kurtosis12.315
n_outliers170
outlier_rate0.159
zero_rate0.000

Is_Isolate categorical

68.1% null top value is 97.9% of rows
rows27,037
null18,425 (68.1%)
unique2
top_valueFalse
top_rate0.979
cardinality2
entropy0.148
entropy_ratio0.148
Top values (rank 1–20)
  1. False — 8,430
  2. True — 182