saturn

/home/coolhand/datasets/language-data/glottolog_languoid.csv 23,740 rows sample n=23,740 seed 42 2026-05-01T18:07:06+00:00

Overview

Source/home/coolhand/datasets/language-data/glottolog_languoid.csv
Total rows23,740
Profiled sample23,740
Columns16
Generated2026-05-01T18:07:06+00:00

Insights opt-in

Model-generated narrative. These are opinions, not facts — the stats below are what saturn measured. Generated by: anthropic:claude-opus-4-7.

Errors during insight pass (17)
  • dataset:__global__:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacJ2CEb6ZJ2sk7tx6Dcn'}
  • column:id:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacJ2Ci7PdZwS9oV1mXWn'}
  • column:family_id:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacJ2DHM2cT79nySvQwrC'}
  • column:parent_id:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacJ2DobicsQ982YjRzd7'}
  • column:name:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacJ2Fv6VBxYmrFA4mLch'}
  • column:bookkeeping:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacJ2GcX5NoWqwEe3z1bD'}
  • column:level:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacJ2H72peMwdrKNCpEDx'}
  • column:status:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacJ2Hbo3cNwTAkJeyyNb'}
  • column:latitude:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacJ2J4L79qQByuTLXG7N'}
  • column:longitude:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacJ2JZM9nui2yhiWHJog'}
  • column:iso639P3code:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacJ2K6avna54nJQgJP7X'}
  • column:description:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacJ2Kf5Jyk7eLDb8Wzbz'}
  • column:markup_description:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacJ2LDZP7NeLGCBhfc6i'}
  • column:child_family_count:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacJ2LsFSgo9mMEQbp8Cy'}
  • column:child_language_count:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacJ2MmKgiE3Ynhhwpx9z'}
  • column:child_dialect_count:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacJ2NK4gHxBiVnLywHNQ'}
  • column:country_ids:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacJ2NvmjyEJg54po8hSb'}

Numeric correlation

id text

100.0% of rows are unique strings 100.0% rows are a single word 95th-percentile length under 20 chars
rows23,740
null0 (0.0%)
unique23,740
len_min8
len_max8
len_mean8.000
len_median8.000
len_p958.000
word_mean1.000
word_median1.000
n_empty0
n_duplicates0
duplicate_rate0.000
vocab_size20,000
readability_flesch_mean86.111
emoji_rate0.000
url_rate0.000
one_word_rate1.000
allcaps_rate0.000
boilerplate_rate0.000
Sample values (first 10)
  1. abbe1238
  2. sanm1298
  3. thur1255
  4. suar1238
  5. kukn1238
  6. yagu1244
  7. labu1252
  8. uist1237
  9. suku1258
  10. arak1251

family_id categorical

rows23,740
null429 (1.8%)
unique287
top_valueatla1278
top_rate0.200
cardinality287
entropy4.886
entropy_ratio0.598
Top values (rank 1–20)
  1. atla1278 — 4,663
  2. aust1307 — 3,850
  3. indo1319 — 2,201
  4. sino1245 — 1,666
  5. afro1255 — 1,259
  6. nucl1709 — 762
  7. pama1250 — 598
  8. aust1305 — 503
  9. book1242 — 399
  10. otom1299 — 338
  11. mand1469 — 303
  12. sign1238 — 259
  13. drav1251 — 255
  14. cent2225 — 251
  15. turk1311 — 229
  16. taik1256 — 223
  17. nilo1247 — 201
  18. ural1272 — 185
  19. japo1237 — 179
  20. tupi1275 — 157

parent_id text

100.0% rows are a single word 95th-percentile length under 20 chars 68.5% duplicate strings
rows23,740
null429 (1.8%)
unique7,338
len_min8
len_max8
len_mean8.000
len_median8.000
len_p958.000
word_mean1.000
word_median1.000
n_empty0
n_duplicates15,973
duplicate_rate0.685
vocab_size7,189
readability_flesch_mean91.187
emoji_rate0.000
url_rate0.000
one_word_rate1.000
allcaps_rate0.000
boilerplate_rate0.000
Sample values (first 10)
  1. abee1242
  2. cent2144
  3. yuag1237
  4. mnon1258
  5. pama1253
  6. raic1241
  7. kenh1234
  8. uygh1240
  9. taih1244
  10. book1242

name text

100.0% of rows are unique strings 69.5% rows are a single word
rows23,740
null0 (0.0%)
unique23,740
len_min1
len_max58
len_mean9.950
len_median8.000
len_p9522.000
word_mean1.398
word_median1.000
n_empty0
n_duplicates0
duplicate_rate0.000
vocab_size17,915
readability_flesch_mean42.625
emoji_rate0.000
url_rate0.000
one_word_rate0.695
allcaps_rate1.68e-04
boilerplate_rate0.000
Sample values (first 10)
  1. Abbey-Ve
  2. San Martín Itunyoso Triqui
  3. Thuri
  4. Asabano
  5. Kukna
  6. Yagua
  7. Labuan
  8. Uis Tasae
  9. Sukurase
  10. Araki (Iran)

bookkeeping categorical

top value is 98.3% of rows
rows23,740
null0 (0.0%)
unique2
top_valueFalse
top_rate0.983
cardinality2
entropy0.123
entropy_ratio0.123
Top values (rank 1–20)
  1. False — 23,341
  2. True — 399

level categorical

rows23,740
null0 (0.0%)
unique3
top_valuedialect
top_rate0.460
cardinality3
entropy1.494
entropy_ratio0.943
Top values (rank 1–20)
  1. dialect — 10,920
  2. language — 8,481
  3. family — 4,339

status categorical

rows23,740
null0 (0.0%)
unique6
top_valuesafe
top_rate0.799
cardinality6
entropy1.150
entropy_ratio0.445
Top values (rank 1–20)
  1. safe — 18,965
  2. definitely endangered — 1,814
  3. vulnerable — 1,194
  4. extinct — 889
  5. critically endangered — 465
  6. severely endangered — 413

latitude numeric

66.5% null
rows23,740
null15,797 (66.5%)
unique7,798
min-55.275
max73.135
mean8.170
median6.306
std18.962
q1-5.137
q319.336
iqr24.472
skew0.540
kurtosis0.301
n_outliers129
outlier_rate0.016
zero_rate0.000

longitude numeric

66.5% null
rows23,740
null15,797 (66.5%)
unique7,757
min-178.785
max179.306
mean51.270
median47.724
std81.138
q17.235
q3124.122
iqr116.887
skew-0.483
kurtosis-0.774
n_outliers13
outlier_rate1.64e-03
zero_rate0.000

iso639P3code text

100.0% of rows are unique strings 100.0% rows are a single word 66.4% null 95th-percentile length under 20 chars
rows23,740
null15,772 (66.4%)
unique7,968
len_min3
len_max3
len_mean3.000
len_median3.000
len_p953.000
word_mean1.000
word_median1.000
n_empty0
n_duplicates0
duplicate_rate0.000
vocab_size7,968
readability_flesch_mean119.528
emoji_rate0.000
url_rate0.000
one_word_rate1.000
allcaps_rate0.000
boilerplate_rate0.000
Sample values (first 10)
  1. aau
  2. mat
  3. twx
  4. sui
  5. ktr
  6. ygw
  7. key
  8. tui
  9. ssk
  10. agg

description unknown

no profiler for kind=unknown
rows23,740
null0 (0.0%)

markup_description unknown

no profiler for kind=unknown
rows23,740
null0 (0.0%)

child_family_count numeric

skew=+44.40 9.2% rows beyond 1.5 IQR
rows23,740
null0 (0.0%)
unique88
min0.000
max859.000
mean0.879
median0.000
std13.204
q10.000
q30.000
iqr0.000
skew44.398
kurtosis2,353
n_outliers2,179
outlier_rate0.092
zero_rate0.908

child_language_count numeric

skew=+41.86 18.3% rows beyond 1.5 IQR
rows23,740
null0 (0.0%)
unique126
min0.000
max1,435
mean1.996
median0.000
std23.408
q10.000
q30.000
iqr0.000
skew41.859
kurtosis2,115
n_outliers4,339
outlier_rate0.183
zero_rate0.817

child_dialect_count numeric

skew=+42.22 18.0% rows beyond 1.5 IQR
rows23,740
null0 (0.0%)
unique164
min0.000
max2,369
mean3.389
median0.000
std36.799
q10.000
q31.000
iqr1.000
skew42.219
kurtosis2,159
n_outliers4,272
outlier_rate0.180
zero_rate0.744

country_ids categorical

64.2% null
rows23,740
null15,250 (64.2%)
unique680
top_valuePG
top_rate0.103
cardinality680
entropy6.493
entropy_ratio0.690
Top values (rank 1–20)
  1. PG — 874
  2. ID — 695
  3. NG — 480
  4. AU — 432
  5. IN — 356
  6. MX — 297
  7. CN — 271
  8. BR — 263
  9. US — 247
  10. CM — 196
  11. PH — 177
  12. CD — 156
  13. VU — 118
  14. SD — 99
  15. PE — 97
  16. TZ — 93
  17. MY — 90
  18. TD — 88
  19. RU — 83
  20. CO — 82