saturn

/home/coolhand/servers/diachronica/etymology_atlas/parquet/languages.parquet 19,401 rows sample n=19,401 seed 42 2026-05-01T17:52:19+00:00

Overview

Source/home/coolhand/servers/diachronica/etymology_atlas/parquet/languages.parquet
Total rows19,401
Profiled sample19,401
Columns11
Generated2026-05-01T17:52:19+00:00

Insights opt-in

Model-generated narrative. These are opinions, not facts — the stats below are what saturn measured. Generated by: anthropic:claude-opus-4-7.

Errors during insight pass (12)
  • dataset:__global__:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacGtoTKFaRpXbwfdgyLm'}
  • column:glottocode:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacGtoxaemshRZWUqz6EJ'}
  • column:iso_639_3:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacGtpZJ8XpNgBKzybnYc'}
  • column:name:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacGtq9mHyESTKBMmX7U6'}
  • column:family_name:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacGtqmif11KcLA8FXbLW'}
  • column:family_glottocode:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacGtrJU7fLd2qGf4P7q9'}
  • column:macroarea:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacGtrnzBAtepcta1FDuV'}
  • column:latitude:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacGtsHkVwW62oFyp6UhP'}
  • column:longitude:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacGtsnGf91pNn6V3ck4R'}
  • column:status:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacGttCpQFWVMHaVz3aiY'}
  • column:speakers_count:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacGttfba2CoEiUDHYwMA'}
  • column:phoneme_count:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacGtu88fW2B5BFhexWUg'}

Numeric correlation

glottocode text

100.0% of rows are unique strings 100.0% rows are a single word 95th-percentile length under 20 chars
rows19,401
null0 (0.0%)
unique19,401
len_min8
len_max8
len_mean8.000
len_median8.000
len_p958.000
word_mean1.000
word_median1.000
n_empty0
n_duplicates0
duplicate_rate0.000
vocab_size19,401
readability_flesch_mean93.302
emoji_rate0.000
url_rate0.000
one_word_rate1.000
allcaps_rate0.000
boilerplate_rate0.000
Sample values (first 10)
  1. abaz1241
  2. sand1275
  3. texm1235
  4. stan1298
  5. kuik1246
  6. yait1239
  7. kyon1245
  8. tuyu1244
  9. subi1246
  10. apah1238

iso_639_3 unknown

no profiler for kind=unknown
rows19,401
null0 (0.0%)

name text

100.0% of rows are unique strings 71.7% rows are a single word
rows19,401
null0 (0.0%)
unique19,401
len_min1
len_max58
len_mean9.211
len_median7.000
len_p9520.000
word_mean1.369
word_median1.000
n_empty0
n_duplicates0
duplicate_rate0.000
vocab_size17,861
readability_flesch_mean60.531
emoji_rate0.000
url_rate0.000
one_word_rate0.717
allcaps_rate0.000
boilerplate_rate0.000
Sample values (first 10)
  1. Abaza
  2. Sandiwar
  3. Texmelucan Zapotec
  4. Standard Braj of Mathura
  5. Kuikúro-Kalapálo
  6. Yaitepec Chatino
  7. Kyon
  8. Tuyuca
  9. Subiya
  10. Apahapsili

family_name unknown

no profiler for kind=unknown
rows19,401
null0 (0.0%)

family_glottocode unknown

no profiler for kind=unknown
rows19,401
null0 (0.0%)

macroarea categorical

rows19,401
null839 (4.3%)
unique6
top_valueAfrica
top_rate0.321
cardinality6
entropy2.176
entropy_ratio0.842
Top values (rank 1–20)
  1. Africa — 5,955
  2. Eurasia — 5,028
  3. Papunesia — 4,847
  4. South America — 1,095
  5. North America — 1,035
  6. Australia — 602

latitude numeric

59.1% null
rows19,401
null11,472 (59.1%)
unique7,786
min-55.275
max73.135
mean8.164
median6.292
std18.955
q1-5.139
q319.273
iqr24.412
skew0.543
kurtosis0.305
n_outliers135
outlier_rate0.017
zero_rate0.000

longitude numeric

59.1% null
rows19,401
null11,472 (59.1%)
unique7,745
min-178.785
max179.306
mean51.217
median47.565
std81.149
q17.180
q3124.144
iqr116.964
skew-0.481
kurtosis-0.776
n_outliers13
outlier_rate1.64e-03
zero_rate0.000

status categorical

top value is 100.0% of rows
rows19,401
null0 (0.0%)
unique1
top_valueliving
top_rate1.000
cardinality1
entropy-0.000
entropy_ratio0.000
Top values (rank 1–20)
  1. living — 19,401

speakers_count unknown

no profiler for kind=unknown
rows19,401
null0 (0.0%)

phoneme_count numeric

88.8% null skew=+2.32
rows19,401
null17,228 (88.8%)
unique100
min11.000
max231.000
mean38.197
median34.000
std17.780
q126.000
q346.000
iqr20.000
skew2.325
kurtosis11.537
n_outliers79
outlier_rate0.036
zero_rate0.000