saturn
/home/coolhand/servers/diachronica/data_raw/glottolog_languoid.csv 19,401 rows sample n=19,401 seed 42 2026-05-01T18:05:30+00:00
Overview
| Source | /home/coolhand/servers/diachronica/data_raw/glottolog_languoid.csv |
| Total rows | 19,401 |
| Profiled sample | 19,401 |
| Columns | 7 |
| Generated | 2026-05-01T18:05:30+00:00 |
Insights opt-in
Model-generated narrative. These are opinions, not facts — the stats below are what saturn measured. Generated by: anthropic:claude-opus-4-7.
Errors during insight pass (8)
dataset:__global__:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacHu7kGoUGkpq5kLDW6c'}column:glottocode:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacHu8CZ8T33h1TvPMNjP'}column:name:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacHuCwZMbLh8g3q36Pmi'}column:isocodes:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacHuDPbWjSdC3wgAYzqr'}column:level:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacHuDwLmkTRBobPn1wH2'}column:macroarea:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacHuEeFtkjYPttMRosBH'}column:latitude:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacHuF9Wq9yhJ3ugXL6nD'}column:longitude:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacHuFaKGGRPvQCV8zJpf'}
Numeric correlation
glottocode text
100.0% of rows are unique strings
100.0% rows are a single word
95th-percentile length under 20 chars
rows19,401
null0 (0.0%)
unique19,401
len_min8
len_max8
len_mean8.000
len_median8.000
len_p958.000
word_mean1.000
word_median1.000
n_empty0
n_duplicates0
duplicate_rate0.000
vocab_size19,401
readability_flesch_mean93.302
emoji_rate0.000
url_rate0.000
one_word_rate1.000
allcaps_rate0.000
boilerplate_rate0.000
Sample values (first 10)
- abaz1241
- sand1275
- texm1235
- stan1298
- kuik1246
- yait1239
- kyon1245
- tuyu1244
- subi1246
- apah1238
name text
100.0% of rows are unique strings
71.7% rows are a single word
rows19,401
null0 (0.0%)
unique19,401
len_min1
len_max58
len_mean9.211
len_median7.000
len_p9520.000
word_mean1.369
word_median1.000
n_empty0
n_duplicates0
duplicate_rate0.000
vocab_size17,861
readability_flesch_mean60.531
emoji_rate0.000
url_rate0.000
one_word_rate0.717
allcaps_rate0.000
boilerplate_rate0.000
Sample values (first 10)
- Abaza
- Sandiwar
- Texmelucan Zapotec
- Standard Braj of Mathura
- Kuikúro-Kalapálo
- Yaitepec Chatino
- Kyon
- Tuyuca
- Subiya
- Apahapsili
isocodes text
100.0% of rows are unique strings
100.0% rows are a single word
59.2% null
95th-percentile length under 20 chars
rows19,401
null11,479 (59.2%)
unique7,922
len_min3
len_max3
len_mean3.000
len_median3.000
len_p953.000
word_mean1.000
word_median1.000
n_empty0
n_duplicates0
duplicate_rate0.000
vocab_size7,922
readability_flesch_mean118.682
emoji_rate0.000
url_rate0.000
one_word_rate1.000
allcaps_rate0.000
boilerplate_rate0.000
Sample values (first 10)
- aba
- sgr
- thd
- nbl
- gvc
- yad
- krh
- tuk
- sgz
- aoa
level categorical
rows19,401
null0 (0.0%)
unique2
top_valuedialect
top_rate0.563
cardinality2
entropy0.989
entropy_ratio0.989
Top values (rank 1–20)
- dialect — 10,920
- language — 8,481
macroarea categorical
rows19,401
null839 (4.3%)
unique6
top_valueAfrica
top_rate0.321
cardinality6
entropy2.176
entropy_ratio0.842
Top values (rank 1–20)
- Africa — 5,955
- Eurasia — 5,028
- Papunesia — 4,847
- South America — 1,095
- North America — 1,035
- Australia — 602
latitude numeric
59.1% null
rows19,401
null11,472 (59.1%)
unique7,786
min-55.275
max73.135
mean8.164
median6.292
std18.955
q1-5.139
q319.273
iqr24.412
skew0.543
kurtosis0.305
n_outliers135
outlier_rate0.017
zero_rate0.000
longitude numeric
59.1% null
rows19,401
null11,472 (59.1%)
unique7,745
min-178.785
max179.306
mean51.217
median47.565
std81.149
q17.180
q3124.144
iqr116.964
skew-0.481
kurtosis-0.776
n_outliers13
outlier_rate1.64e-03
zero_rate0.000