saturn

/home/coolhand/servers/diachronica/data_raw/glottolog_languoid.csv 19,401 rows sample n=19,401 seed 42 2026-05-01T18:05:30+00:00

Overview

Source/home/coolhand/servers/diachronica/data_raw/glottolog_languoid.csv
Total rows19,401
Profiled sample19,401
Columns7
Generated2026-05-01T18:05:30+00:00

Insights opt-in

Model-generated narrative. These are opinions, not facts — the stats below are what saturn measured. Generated by: anthropic:claude-opus-4-7.

Errors during insight pass (8)
  • dataset:__global__:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacHu7kGoUGkpq5kLDW6c'}
  • column:glottocode:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacHu8CZ8T33h1TvPMNjP'}
  • column:name:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacHuCwZMbLh8g3q36Pmi'}
  • column:isocodes:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacHuDPbWjSdC3wgAYzqr'}
  • column:level:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacHuDwLmkTRBobPn1wH2'}
  • column:macroarea:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacHuEeFtkjYPttMRosBH'}
  • column:latitude:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacHuF9Wq9yhJ3ugXL6nD'}
  • column:longitude:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacHuFaKGGRPvQCV8zJpf'}

Numeric correlation

glottocode text

100.0% of rows are unique strings 100.0% rows are a single word 95th-percentile length under 20 chars
rows19,401
null0 (0.0%)
unique19,401
len_min8
len_max8
len_mean8.000
len_median8.000
len_p958.000
word_mean1.000
word_median1.000
n_empty0
n_duplicates0
duplicate_rate0.000
vocab_size19,401
readability_flesch_mean93.302
emoji_rate0.000
url_rate0.000
one_word_rate1.000
allcaps_rate0.000
boilerplate_rate0.000
Sample values (first 10)
  1. abaz1241
  2. sand1275
  3. texm1235
  4. stan1298
  5. kuik1246
  6. yait1239
  7. kyon1245
  8. tuyu1244
  9. subi1246
  10. apah1238

name text

100.0% of rows are unique strings 71.7% rows are a single word
rows19,401
null0 (0.0%)
unique19,401
len_min1
len_max58
len_mean9.211
len_median7.000
len_p9520.000
word_mean1.369
word_median1.000
n_empty0
n_duplicates0
duplicate_rate0.000
vocab_size17,861
readability_flesch_mean60.531
emoji_rate0.000
url_rate0.000
one_word_rate0.717
allcaps_rate0.000
boilerplate_rate0.000
Sample values (first 10)
  1. Abaza
  2. Sandiwar
  3. Texmelucan Zapotec
  4. Standard Braj of Mathura
  5. Kuikúro-Kalapálo
  6. Yaitepec Chatino
  7. Kyon
  8. Tuyuca
  9. Subiya
  10. Apahapsili

isocodes text

100.0% of rows are unique strings 100.0% rows are a single word 59.2% null 95th-percentile length under 20 chars
rows19,401
null11,479 (59.2%)
unique7,922
len_min3
len_max3
len_mean3.000
len_median3.000
len_p953.000
word_mean1.000
word_median1.000
n_empty0
n_duplicates0
duplicate_rate0.000
vocab_size7,922
readability_flesch_mean118.682
emoji_rate0.000
url_rate0.000
one_word_rate1.000
allcaps_rate0.000
boilerplate_rate0.000
Sample values (first 10)
  1. aba
  2. sgr
  3. thd
  4. nbl
  5. gvc
  6. yad
  7. krh
  8. tuk
  9. sgz
  10. aoa

level categorical

rows19,401
null0 (0.0%)
unique2
top_valuedialect
top_rate0.563
cardinality2
entropy0.989
entropy_ratio0.989
Top values (rank 1–20)
  1. dialect — 10,920
  2. language — 8,481

macroarea categorical

rows19,401
null839 (4.3%)
unique6
top_valueAfrica
top_rate0.321
cardinality6
entropy2.176
entropy_ratio0.842
Top values (rank 1–20)
  1. Africa — 5,955
  2. Eurasia — 5,028
  3. Papunesia — 4,847
  4. South America — 1,095
  5. North America — 1,035
  6. Australia — 602

latitude numeric

59.1% null
rows19,401
null11,472 (59.1%)
unique7,786
min-55.275
max73.135
mean8.164
median6.292
std18.955
q1-5.139
q319.273
iqr24.412
skew0.543
kurtosis0.305
n_outliers135
outlier_rate0.017
zero_rate0.000

longitude numeric

59.1% null
rows19,401
null11,472 (59.1%)
unique7,745
min-178.785
max179.306
mean51.217
median47.565
std81.149
q17.180
q3124.144
iqr116.964
skew-0.481
kurtosis-0.776
n_outliers13
outlier_rate1.64e-03
zero_rate0.000