saturn
/home/coolhand/servers/diachronica/etymology_atlas/parquet/languages.parquet 19,401 rows sample n=19,401 seed 42 2026-05-01T17:52:19+00:00
Overview
| Source | /home/coolhand/servers/diachronica/etymology_atlas/parquet/languages.parquet |
| Total rows | 19,401 |
| Profiled sample | 19,401 |
| Columns | 11 |
| Generated | 2026-05-01T17:52:19+00:00 |
Insights opt-in
Model-generated narrative. These are opinions, not facts — the stats below are what saturn measured. Generated by: anthropic:claude-opus-4-7.
Errors during insight pass (12)
dataset:__global__:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacGtoTKFaRpXbwfdgyLm'}column:glottocode:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacGtoxaemshRZWUqz6EJ'}column:iso_639_3:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacGtpZJ8XpNgBKzybnYc'}column:name:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacGtq9mHyESTKBMmX7U6'}column:family_name:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacGtqmif11KcLA8FXbLW'}column:family_glottocode:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacGtrJU7fLd2qGf4P7q9'}column:macroarea:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacGtrnzBAtepcta1FDuV'}column:latitude:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacGtsHkVwW62oFyp6UhP'}column:longitude:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacGtsnGf91pNn6V3ck4R'}column:status:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacGttCpQFWVMHaVz3aiY'}column:speakers_count:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacGttfba2CoEiUDHYwMA'}column:phoneme_count:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacGtu88fW2B5BFhexWUg'}
Numeric correlation
glottocode text
100.0% of rows are unique strings
100.0% rows are a single word
95th-percentile length under 20 chars
rows19,401
null0 (0.0%)
unique19,401
len_min8
len_max8
len_mean8.000
len_median8.000
len_p958.000
word_mean1.000
word_median1.000
n_empty0
n_duplicates0
duplicate_rate0.000
vocab_size19,401
readability_flesch_mean93.302
emoji_rate0.000
url_rate0.000
one_word_rate1.000
allcaps_rate0.000
boilerplate_rate0.000
Sample values (first 10)
- abaz1241
- sand1275
- texm1235
- stan1298
- kuik1246
- yait1239
- kyon1245
- tuyu1244
- subi1246
- apah1238
iso_639_3 unknown
no profiler for kind=unknown
rows19,401
null0 (0.0%)
name text
100.0% of rows are unique strings
71.7% rows are a single word
rows19,401
null0 (0.0%)
unique19,401
len_min1
len_max58
len_mean9.211
len_median7.000
len_p9520.000
word_mean1.369
word_median1.000
n_empty0
n_duplicates0
duplicate_rate0.000
vocab_size17,861
readability_flesch_mean60.531
emoji_rate0.000
url_rate0.000
one_word_rate0.717
allcaps_rate0.000
boilerplate_rate0.000
Sample values (first 10)
- Abaza
- Sandiwar
- Texmelucan Zapotec
- Standard Braj of Mathura
- Kuikúro-Kalapálo
- Yaitepec Chatino
- Kyon
- Tuyuca
- Subiya
- Apahapsili
family_name unknown
no profiler for kind=unknown
rows19,401
null0 (0.0%)
family_glottocode unknown
no profiler for kind=unknown
rows19,401
null0 (0.0%)
macroarea categorical
rows19,401
null839 (4.3%)
unique6
top_valueAfrica
top_rate0.321
cardinality6
entropy2.176
entropy_ratio0.842
Top values (rank 1–20)
- Africa — 5,955
- Eurasia — 5,028
- Papunesia — 4,847
- South America — 1,095
- North America — 1,035
- Australia — 602
latitude numeric
59.1% null
rows19,401
null11,472 (59.1%)
unique7,786
min-55.275
max73.135
mean8.164
median6.292
std18.955
q1-5.139
q319.273
iqr24.412
skew0.543
kurtosis0.305
n_outliers135
outlier_rate0.017
zero_rate0.000
longitude numeric
59.1% null
rows19,401
null11,472 (59.1%)
unique7,745
min-178.785
max179.306
mean51.217
median47.565
std81.149
q17.180
q3124.144
iqr116.964
skew-0.481
kurtosis-0.776
n_outliers13
outlier_rate1.64e-03
zero_rate0.000
status categorical
top value is 100.0% of rows
rows19,401
null0 (0.0%)
unique1
top_valueliving
top_rate1.000
cardinality1
entropy-0.000
entropy_ratio0.000
Top values (rank 1–20)
- living — 19,401
speakers_count unknown
no profiler for kind=unknown
rows19,401
null0 (0.0%)
phoneme_count numeric
88.8% null
skew=+2.32
rows19,401
null17,228 (88.8%)
unique100
min11.000
max231.000
mean38.197
median34.000
std17.780
q126.000
q346.000
iqr20.000
skew2.325
kurtosis11.537
n_outliers79
outlier_rate0.036
zero_rate0.000