saturn·

data trove wikipedia pageviews

saturn notebook · generated 2026-06-21 Report Notebook

Overview

Source: /home/coolhand/html/datavis/data_trove/data/attention/wikipedia_pageviews.json

Saturn profiled 1 rows across 2 columns. The stats below are deterministic and machine-readable; the prose is a language-model interpretation of those stats (opt-in, added after the fact, never sees raw rows).

[2]:
!pip install saturn-dissect
import subprocess
subprocess.run([
    "saturn", "analyze", "/home/coolhand/html/datavis/data_trove/data/attention/wikipedia_pageviews.json",
    "--findings", "data-trove-wikipedia-pageviews.json",
    "--llm", "anthropic:default",
])

Summary confidence: low

This dataset is a single-row JSON file containing Wikipedia pageview data, with two columns — 'countries' and 'metadata' — both of which were skipped during profiling and returned no usable statistics. With only 1 row and no resolved data types or value distributions, there is virtually nothing to analyze at this stage. The file likely contains nested or complex JSON structures that require unpacking before any meaningful analysis can begin. The immediate priority should be to inspect the raw file contents and flatten or parse the nested fields.

citing: row_count · column_count · columns[0].alerts · columns[1].alerts · columns[0].null_rate · columns[1].null_rate

Out[4]:

saturn.schema() · 2 columns

column kind n null% unique alerts
metadata unknown 1 0.0% skipped
countries unknown 1 0.0% skipped
Fig 1.
Per-column null rate across the corpus. Columns are ordered by input position.
Show data table
Per-column null rate across the corpus.
columnkindnull %
metadataunknown0.0%
countriesunknown0.0%

metadata unknown metadata

This column contains a single row and was skipped by the profiler, yielding no distributional statistics. With n=1 and no unique-count or type information available, no meaningful characterisation of its content or role is possible beyond its name ('metadata'), which suggests ancillary descriptive data. The complete absence of profiling output is itself a flag—likely caused by an unsupported or complex type (e.g., JSON blob, struct, or binary).

Treatment: Inspect raw value manually to determine type; parse or flatten structured content before any downstream use.

anthropic:default · confidence low
Out[7]:

saturn.columns["metadata"].stats

statvalue
n1
nulls0 (0.0%)
unique
alert: skippedno profiler for kind=unknown

countries unknown other

This column contains only a single row with no additional statistics computed, and was marked 'skipped' during profiling — likely because the dataset itself has n=1. No distributional signals are available. The column name suggests it holds country values, but nothing can be inferred about content, cardinality, or format from the evidence.

Treatment: Inspect raw value manually; re-profile on a larger dataset before assigning a role or treatment.

anthropic:default · confidence low
Out[9]:

saturn.columns["countries"].stats

statvalue
n1
nulls0 (0.0%)
unique
alert: skippedno profiler for kind=unknown

How to cite

click to copy

BibTeX
@misc{saturn-data-trove-wikipedia-pageviews-2026,
  author       = {Steuber, Luke},
  title        = {Saturn reading: data trove wikipedia pageviews},
  year         ={2026},
  howpublished = {\url{https://dr.eamer.dev/saturn/view/data-trove-wikipedia-pageviews}},
  note         = {Profiled with saturn-dissect v0.2.0, prompt saturn-insight-v2, model anthropic:default},
}
APA
Steuber, L. (2026). Saturn reading: data trove wikipedia pageviews. Source: /home/coolhand/html/datavis/data_trove/data/attention/wikipedia_pageviews.json. Profiled with saturn-dissect v0.2.0 (saturn-insight-v2, anthropic:default). Retrieved from https://dr.eamer.dev/saturn/view/data-trove-wikipedia-pageviews