saturn

/home/coolhand/datasets/us-attention-data/wikipedia_event_articles.json 10 rows sample n=10 seed 42 2026-05-01T17:17:54+00:00

Overview

Source: /home/coolhand/datasets/us-attention-data/wikipedia_event_articles.json
Total rows: 10
Profiled sample: 10
Columns: 5
Generated: 2026-05-01T17:17:54+00:00

Insights opt-in

Model-generated narrative. These are opinions, not facts — the stats below are what saturn measured. Generated by: anthropic:claude-opus-4-7.

Dataset high anthropic:claude-opus-4-7

This is a small dataset of 10 Wikipedia articles tracking US public attention, with view metrics (peak_views, avg_daily_views, total_views) plus an article name and a timeline field. The view metrics are heavily right-skewed — peak_views has a skew of 2.61 and a max of 739,258 against a median of just 22,111, suggesting one or two articles dominate attention. Each numeric column flags one outlier (10% outlier rate), so it's worth identifying which article is pulling the distribution. The article column has 10 unique values for 10 rows, so it functions as an identifier rather than a category to aggregate on.

article high anthropic:claude-opus-4-7

This column holds Wikipedia-style article titles (e.g., Donald_Trump, COVID-19_pandemic, Taylor_Swift) using underscore-separated naming. Every one of the 10 rows is unique (n_unique=10, entropy_ratio=1.0), so it functions as a row identifier rather than a categorical feature. The mix spans people, countries, and topics, with no nulls and no repeated value (top_rate=0.1).

avg_daily_views medium anthropic:claude-opus-4-7

Numeric column capturing average daily views per article, with all 10 rows unique and no nulls or zeros. The distribution is right-skewed (skew 1.57): the mean of 20,484 sits well above the median of 13,139, and the max of 66,878 is flagged as the lone outlier (10% outlier rate), against a min of 2,199. The standard deviation (19,006) is nearly as large as the mean, signalling high dispersion in a tiny sample.
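The dispersion claim above can be checked with a couple of ratios. This is a minimal sketch that uses only the avg_daily_views figures reported in this document; the variable names are illustrative and are not part of saturn's output.

```python
# Values copied from the avg_daily_views statistics reported in this document.
mean = 20_484
median = 13_139
std = 19_006

# Coefficient of variation: standard deviation relative to the mean.
cv = std / mean

# A mean well above the median is consistent with a right-skewed distribution.
mean_over_median = mean / median

print(f"cv={cv:.3f} mean/median={mean_over_median:.3f}")
```

A coefficient of variation close to 1 means the typical deviation is about as large as the mean itself, which is the situation the narrative describes.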

peak_views high anthropic:claude-opus-4-7

This appears to be a peak view-count metric per article, with all 10 rows unique and no nulls. The distribution is heavily right-skewed (skew 2.61, kurtosis 4.93): the median is 22,111 while the mean is 101,490.2 and the max reaches 739,258, roughly 12.7x the Q3 of 58,074.25. One outlier (10% of rows) is dragging the standard deviation (225,418) far above the IQR (43,543.25).
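The "beyond 1.5 IQR" alert corresponds to the usual Tukey fence rule. The sketch below recomputes the upper fence for peak_views from the quartiles quoted in this document. It assumes saturn applies the standard Q3 + 1.5 * IQR threshold, which the report's wording suggests but does not state outright; `tukey_upper_fence` is a hypothetical helper, not a saturn function.

```python
def tukey_upper_fence(q1: float, q3: float, k: float = 1.5) -> float:
    """Return the upper outlier threshold Q3 + k * IQR."""
    return q3 + k * (q3 - q1)


# Values copied from the peak_views statistics reported in this document.
q1 = 14_531.0
q3 = 58_074.25
max_value = 739_258

fence = tukey_upper_fence(q1, q3)
is_outlier = max_value > fence
times_over_fence = max_value / fence

print(f"fence={fence:,.3f} outlier={is_outlier} max/fence={times_over_fence:.2f}")
```

Under this assumption the maximum sits several multiples above the fence, so it would be flagged under any reasonable choice of k.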

total_views medium anthropic:claude-opus-4-7

Numeric view-count metric across just 10 rows, all distinct and non-null. Distribution is right-skewed (skew 1.57) with values ranging from 200,122 to 6,085,895 against a median of 1,195,666.5, and one row (10%) flagged as an outlier pulling the mean up to 1,864,031.7. With n=10 the shape estimates are fragile.

timeline low anthropic:claude-opus-4-7

This column, named 'timeline', was skipped by the profiler and has no computed statistics beyond a row count of 10 and a null rate of 0.0. Its kind is reported as 'unknown' and the unique count is missing, so nothing can be said about cardinality, types, or value distribution. The 'skipped' alert is the only substantive signal present.
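Since the profiler reports nothing about this column, a first manual step could be to tally the value types it contains. The sketch below is illustrative only: `type_tally` is a hypothetical helper, and the rows are made-up placeholders, because the real timeline values and the JSON layout of the source file are not shown in this report.

```python
from collections import Counter


def type_tally(rows: list[dict], column: str) -> Counter:
    """Count the Python type names of the values stored under `column`."""
    return Counter(type(row.get(column)).__name__ for row in rows)


# Hypothetical rows for illustration; these are NOT values from the dataset.
rows = [
    {"article": "Example_A", "timeline": [{"date": "2024-01-01", "views": 10}]},
    {"article": "Example_B", "timeline": []},
    {"article": "Example_C", "timeline": None},
]

tally = type_tally(rows, "timeline")
print(tally)
```

If the real column turned out to hold nested lists or objects, that would be one plausible reason for a profiler to classify it as kind=unknown.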

Numeric correlation

article categorical

10 singleton categories
rows: 10
null: 0 (0.0%)
unique: 10
top_value: Donald_Trump
top_rate: 0.100
cardinality: 10
entropy: 3.322
entropy_ratio: 1.000
Top values (rank 1–20)
  1. Donald_Trump — 1
  2. Joe_Biden — 1
  3. Climate_change — 1
  4. COVID-19_pandemic — 1
  5. Artificial_intelligence — 1
  6. Russia — 1
  7. Israel — 1
  8. Taylor_Swift — 1
  9. Elon_Musk — 1
  10. United_States — 1
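The entropy figures for this column can be reproduced from the value counts listed above. This is a minimal sketch that assumes saturn reports Shannon entropy in bits and normalizes it by log2 of the cardinality; the reported values (3.322 and 1.000) are consistent with that assumption, but the report does not confirm the formula. `shannon_entropy_bits` is a hypothetical helper.

```python
import math
from collections import Counter


def shannon_entropy_bits(values: list[str]) -> float:
    """Shannon entropy, in bits, of the empirical distribution of `values`."""
    counts = Counter(values)
    n = sum(counts.values())
    return -sum((c / n) * math.log2(c / n) for c in counts.values())


# The ten article titles listed in this report, each appearing once.
articles = [
    "Donald_Trump", "Joe_Biden", "Climate_change", "COVID-19_pandemic",
    "Artificial_intelligence", "Russia", "Israel", "Taylor_Swift",
    "Elon_Musk", "United_States",
]

entropy = shannon_entropy_bits(articles)
max_entropy = math.log2(len(set(articles)))  # entropy of a uniform distribution
entropy_ratio = entropy / max_entropy

print(f"entropy={entropy:.3f} entropy_ratio={entropy_ratio:.3f}")
```

A ratio of 1 is the maximum possible value and occurs exactly when every category is equally frequent, as happens here with ten singletons.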

avg_daily_views numeric

10.0% rows beyond 1.5 IQR
rows: 10
null: 0 (0.0%)
unique: 10
min: 2,199
max: 66,878
mean: 20,484
median: 13,139
std: 19,006
q1: 10,870
q3: 23,820
iqr: 12,950
skew: 1.573
kurtosis: 1.608
n_outliers: 1
outlier_rate: 0.100
zero_rate: 0.000

peak_views numeric

skew=+2.61; 10.0% rows beyond 1.5 IQR
rows: 10
null: 0 (0.0%)
unique: 10
min: 3,613
max: 739,258
mean: 101,490
median: 22,111
std: 225,418
q1: 14,531
q3: 58,074
iqr: 43,543
skew: 2.609
kurtosis: 4.928
n_outliers: 1
outlier_rate: 0.100
zero_rate: 0.000

total_views numeric

10.0% rows beyond 1.5 IQR
rows: 10
null: 0 (0.0%)
unique: 10
min: 200,122
max: 6,085,895
mean: 1,864,032
median: 1,195,666
std: 1,729,565
q1: 989,119
q3: 2,167,589
iqr: 1,178,470
skew: 1.573
kurtosis: 1.608
n_outliers: 1
outlier_rate: 0.100
zero_rate: 0.000
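The skew and kurtosis reported for total_views are identical to those of avg_daily_views, and both statistics are unchanged when a column is multiplied by a positive constant. The sketch below checks whether the other reported statistics share a common ratio, using only figures from this document. A constant ratio would be consistent with total_views being avg_daily_views times a fixed number of days, but summary statistics alone cannot prove a row-by-row relationship; that reading is an assumption.

```python
# Summary statistics copied from the two stat blocks in this report.
avg_daily = {
    "min": 2_199, "max": 66_878, "mean": 20_484, "median": 13_139,
    "std": 19_006, "q1": 10_870, "q3": 23_820, "iqr": 12_950,
}
total = {
    "min": 200_122, "max": 6_085_895, "mean": 1_864_032, "median": 1_195_666,
    "std": 1_729_565, "q1": 989_119, "q3": 2_167_589, "iqr": 1_178_470,
}

# Ratio of each total_views statistic to the matching avg_daily_views statistic.
ratios = {name: total[name] / avg_daily[name] for name in avg_daily}
spread = max(ratios.values()) - min(ratios.values())

for name, ratio in ratios.items():
    print(f"{name:>6}: {ratio:.3f}")
print(f"spread: {spread:.4f}")
```

Every ratio lands within about 0.01 of 91, with the small differences plausibly due to rounding in the displayed values. If the relationship holds row by row, the two columns carry the same information and one of them is redundant for modelling purposes.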

timeline unknown

no profiler for kind=unknown
rows: 10
null: 0 (0.0%)