saturn
/home/coolhand/datasets/language-data/glottolog_languoid.csv 23,740 rows sample n=23,740 seed 42 2026-05-01T18:07:06+00:00
Overview
| Source | /home/coolhand/datasets/language-data/glottolog_languoid.csv |
| Total rows | 23,740 |
| Profiled sample | 23,740 |
| Columns | 16 |
| Generated | 2026-05-01T18:07:06+00:00 |
Insights opt-in
Model-generated narrative. These are opinions, not facts — the stats below are what saturn measured. Generated by: anthropic:claude-opus-4-7.
Errors during insight pass (17)
dataset:__global__:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacJ2CEb6ZJ2sk7tx6Dcn'}column:id:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacJ2Ci7PdZwS9oV1mXWn'}column:family_id:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacJ2DHM2cT79nySvQwrC'}column:parent_id:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacJ2DobicsQ982YjRzd7'}column:name:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacJ2Fv6VBxYmrFA4mLch'}column:bookkeeping:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacJ2GcX5NoWqwEe3z1bD'}column:level:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacJ2H72peMwdrKNCpEDx'}column:status:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacJ2Hbo3cNwTAkJeyyNb'}column:latitude:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacJ2J4L79qQByuTLXG7N'}column:longitude:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacJ2JZM9nui2yhiWHJog'}column:iso639P3code:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacJ2K6avna54nJQgJP7X'}column:description:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacJ2Kf5Jyk7eLDb8Wzbz'}column:markup_description:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacJ2LDZP7NeLGCBhfc6i'}column:child_family_count:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacJ2LsFSgo9mMEQbp8Cy'}column:child_language_count:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacJ2MmKgiE3Ynhhwpx9z'}column:child_dialect_count:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacJ2NK4gHxBiVnLywHNQ'}column:country_ids:anthropic:claude-opus-4-7: BadRequestError — Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}, 'request_id': 'req_011CacJ2NvmjyEJg54po8hSb'}
Numeric correlation
id text
100.0% of rows are unique strings
100.0% rows are a single word
95th-percentile length under 20 chars
rows23,740
null0 (0.0%)
unique23,740
len_min8
len_max8
len_mean8.000
len_median8.000
len_p958.000
word_mean1.000
word_median1.000
n_empty0
n_duplicates0
duplicate_rate0.000
vocab_size20,000
readability_flesch_mean86.111
emoji_rate0.000
url_rate0.000
one_word_rate1.000
allcaps_rate0.000
boilerplate_rate0.000
Sample values (first 10)
- abbe1238
- sanm1298
- thur1255
- suar1238
- kukn1238
- yagu1244
- labu1252
- uist1237
- suku1258
- arak1251
family_id categorical
rows23,740
null429 (1.8%)
unique287
top_valueatla1278
top_rate0.200
cardinality287
entropy4.886
entropy_ratio0.598
Top values (rank 1–20)
- atla1278 — 4,663
- aust1307 — 3,850
- indo1319 — 2,201
- sino1245 — 1,666
- afro1255 — 1,259
- nucl1709 — 762
- pama1250 — 598
- aust1305 — 503
- book1242 — 399
- otom1299 — 338
- mand1469 — 303
- sign1238 — 259
- drav1251 — 255
- cent2225 — 251
- turk1311 — 229
- taik1256 — 223
- nilo1247 — 201
- ural1272 — 185
- japo1237 — 179
- tupi1275 — 157
parent_id text
100.0% rows are a single word
95th-percentile length under 20 chars
68.5% duplicate strings
rows23,740
null429 (1.8%)
unique7,338
len_min8
len_max8
len_mean8.000
len_median8.000
len_p958.000
word_mean1.000
word_median1.000
n_empty0
n_duplicates15,973
duplicate_rate0.685
vocab_size7,189
readability_flesch_mean91.187
emoji_rate0.000
url_rate0.000
one_word_rate1.000
allcaps_rate0.000
boilerplate_rate0.000
Sample values (first 10)
- abee1242
- cent2144
- yuag1237
- mnon1258
- pama1253
- raic1241
- kenh1234
- uygh1240
- taih1244
- book1242
name text
100.0% of rows are unique strings
69.5% rows are a single word
rows23,740
null0 (0.0%)
unique23,740
len_min1
len_max58
len_mean9.950
len_median8.000
len_p9522.000
word_mean1.398
word_median1.000
n_empty0
n_duplicates0
duplicate_rate0.000
vocab_size17,915
readability_flesch_mean42.625
emoji_rate0.000
url_rate0.000
one_word_rate0.695
allcaps_rate1.68e-04
boilerplate_rate0.000
Sample values (first 10)
- Abbey-Ve
- San Martín Itunyoso Triqui
- Thuri
- Asabano
- Kukna
- Yagua
- Labuan
- Uis Tasae
- Sukurase
- Araki (Iran)
bookkeeping categorical
top value is 98.3% of rows
rows23,740
null0 (0.0%)
unique2
top_valueFalse
top_rate0.983
cardinality2
entropy0.123
entropy_ratio0.123
Top values (rank 1–20)
- False — 23,341
- True — 399
level categorical
rows23,740
null0 (0.0%)
unique3
top_valuedialect
top_rate0.460
cardinality3
entropy1.494
entropy_ratio0.943
Top values (rank 1–20)
- dialect — 10,920
- language — 8,481
- family — 4,339
status categorical
rows23,740
null0 (0.0%)
unique6
top_valuesafe
top_rate0.799
cardinality6
entropy1.150
entropy_ratio0.445
Top values (rank 1–20)
- safe — 18,965
- definitely endangered — 1,814
- vulnerable — 1,194
- extinct — 889
- critically endangered — 465
- severely endangered — 413
latitude numeric
66.5% null
rows23,740
null15,797 (66.5%)
unique7,798
min-55.275
max73.135
mean8.170
median6.306
std18.962
q1-5.137
q319.336
iqr24.472
skew0.540
kurtosis0.301
n_outliers129
outlier_rate0.016
zero_rate0.000
longitude numeric
66.5% null
rows23,740
null15,797 (66.5%)
unique7,757
min-178.785
max179.306
mean51.270
median47.724
std81.138
q17.235
q3124.122
iqr116.887
skew-0.483
kurtosis-0.774
n_outliers13
outlier_rate1.64e-03
zero_rate0.000
iso639P3code text
100.0% of rows are unique strings
100.0% rows are a single word
66.4% null
95th-percentile length under 20 chars
rows23,740
null15,772 (66.4%)
unique7,968
len_min3
len_max3
len_mean3.000
len_median3.000
len_p953.000
word_mean1.000
word_median1.000
n_empty0
n_duplicates0
duplicate_rate0.000
vocab_size7,968
readability_flesch_mean119.528
emoji_rate0.000
url_rate0.000
one_word_rate1.000
allcaps_rate0.000
boilerplate_rate0.000
Sample values (first 10)
- aau
- mat
- twx
- sui
- ktr
- ygw
- key
- tui
- ssk
- agg
description unknown
no profiler for kind=unknown
rows23,740
null0 (0.0%)
markup_description unknown
no profiler for kind=unknown
rows23,740
null0 (0.0%)
child_family_count numeric
skew=+44.40
9.2% rows beyond 1.5 IQR
rows23,740
null0 (0.0%)
unique88
min0.000
max859.000
mean0.879
median0.000
std13.204
q10.000
q30.000
iqr0.000
skew44.398
kurtosis2,353
n_outliers2,179
outlier_rate0.092
zero_rate0.908
child_language_count numeric
skew=+41.86
18.3% rows beyond 1.5 IQR
rows23,740
null0 (0.0%)
unique126
min0.000
max1,435
mean1.996
median0.000
std23.408
q10.000
q30.000
iqr0.000
skew41.859
kurtosis2,115
n_outliers4,339
outlier_rate0.183
zero_rate0.817
child_dialect_count numeric
skew=+42.22
18.0% rows beyond 1.5 IQR
rows23,740
null0 (0.0%)
unique164
min0.000
max2,369
mean3.389
median0.000
std36.799
q10.000
q31.000
iqr1.000
skew42.219
kurtosis2,159
n_outliers4,272
outlier_rate0.180
zero_rate0.744
country_ids categorical
64.2% null
rows23,740
null15,250 (64.2%)
unique680
top_valuePG
top_rate0.103
cardinality680
entropy6.493
entropy_ratio0.690
Top values (rank 1–20)
- PG — 874
- ID — 695
- NG — 480
- AU — 432
- IN — 356
- MX — 297
- CN — 271
- BR — 263
- US — 247
- CM — 196
- PH — 177
- CD — 156
- VU — 118
- SD — 99
- PE — 97
- TZ — 93
- MY — 90
- TD — 88
- RU — 83
- CO — 82