saturn·

data trove iso 639 3 language codes

saturn notebook · generated 2026-06-22 Report Notebook

Overview

Source: /home/coolhand/html/datavis/data_trove/data/linguistic/language-families/iso-639-3-aliases.json

Saturn profiled 1 rows across 2 columns. The stats below are deterministic and machine-readable; the prose is a language-model interpretation of those stats (opt-in, added after the fact, never sees raw rows).

[2]:
!pip install saturn-dissect
import subprocess
subprocess.run([
    "saturn", "analyze", "/home/coolhand/html/datavis/data_trove/data/linguistic/language-families/iso-639-3-aliases.json",
    "--findings", "data-trove-iso-639-3-language-codes.json",
    "--llm", "anthropic:default",
])

Summary confidence: low

This dataset is a single-record JSON file related to ISO 639-3 language aliases, likely containing structured linguistic metadata about language family classifications. With only 1 row and 2 columns — both flagged as 'unknown' kind and skipped during profiling — there is essentially no statistical signal available for analysis. The file likely contains deeply nested or complex JSON structures (aliases and metadata) that a standard profiler cannot flatten automatically. The most valuable next step is to manually inspect the raw JSON structure to understand nesting depth and extract usable fields before any meaningful analysis can proceed.

citing: row_count · column_count · columns[0].alerts · columns[1].alerts · columns[0].null_rate · columns[1].null_rate

Out[4]:

saturn.schema() · 2 columns

column kind n null% unique alerts
metadata unknown 1 0.0% skipped
aliases unknown 1 0.0% skipped
Fig 1.
Per-column null rate across the corpus. Columns are ordered by input position.
Show data table
Per-column null rate across the corpus.
columnkindnull %
metadataunknown0.0%
aliasesunknown0.0%

metadata unknown metadata

This column contains only a single row and was skipped by the profiler, yielding no distributional statistics. With n=1 and no unique-count or type information available, nothing meaningful can be inferred about its content or role beyond its name ('metadata'), which suggests ancillary descriptive data. No surprises can be flagged due to the absence of evidence.

Treatment: Inspect raw value manually before deciding whether to parse, drop, or expand into structured fields.

anthropic:default · confidence low
Out[7]:

saturn.columns["metadata"].stats

statvalue
n1
nulls0 (0.0%)
unique
alert: skippedno profiler for kind=unknown

aliases unknown other

The column 'aliases' contains only 1 row and was marked 'skipped' by the profiler, yielding no distributional statistics. With n=1 and no uniqueness or type information available, nothing meaningful can be inferred about its content or role beyond the column name suggesting it may store alternative names or identifiers. The near-total absence of profiling data is itself the primary signal here.

Treatment: Manually inspect raw values; re-profile with a larger sample before assigning a role or applying any transformation.

anthropic:default · confidence low
Out[9]:

saturn.columns["aliases"].stats

statvalue
n1
nulls0 (0.0%)
unique
alert: skippedno profiler for kind=unknown

How to cite

click to copy

BibTeX
@misc{saturn-data-trove-iso-639-3-language-codes-2026,
  author       = {Steuber, Luke},
  title        = {Saturn reading: data trove iso 639 3 language codes},
  year         ={2026},
  howpublished = {\url{https://dr.eamer.dev/saturn/view/data-trove-iso-639-3-language-codes}},
  note         = {Profiled with saturn-dissect v0.2.0, prompt saturn-insight-v2, model anthropic:default},
}
APA
Steuber, L. (2026). Saturn reading: data trove iso 639 3 language codes. Source: /home/coolhand/html/datavis/data_trove/data/linguistic/language-families/iso-639-3-aliases.json. Profiled with saturn-dissect v0.2.0 (saturn-insight-v2, anthropic:default). Retrieved from https://dr.eamer.dev/saturn/view/data-trove-iso-639-3-language-codes