saturn

/home/coolhand/html/datavis/data_trove/data/cultural/olympics/olympic_medals_data.json 1,433 rows sample n=1,433 seed 42 2026-05-01T17:22:25+00:00

Overview

Source/home/coolhand/html/datavis/data_trove/data/cultural/olympics/olympic_medals_data.json
Total rows1,433
Profiled sample1,433
Columns8
Generated2026-05-01T17:22:25+00:00

Insights opt-in

Model-generated narrative. These are opinions, not facts — the stats below are what saturn measured. Generated by: anthropic:claude-opus-4-7.

Dataset high anthropic:claude-opus-4-7

This dataset contains 1,433 rows of Olympic medal counts by country and year, spanning 1896 to 2024 across 165 countries. Medal columns (gold, silver, bronze, total) are heavily right-skewed with high kurtosis and many outliers — a small number of dominant nations pull the means well above the medians (e.g. total has a median of 5 but a max of 234). Zero-rates are notable too: 33.9% of rows have zero gold medals and 25.3% zero silver, reflecting how often countries leave a Games empty-handed in a category. Country participation is fairly even at the top, with France and Great Britain tied as most-frequent entries (30 appearances each). Start by examining the shape of `total` and `gold` distributions and the `year` coverage to understand era effects.

year high anthropic:claude-opus-4-7

Four-digit calendar years spanning 1896 to 2024 with 30 distinct values across 1,433 rows and no nulls. The distribution is left-skewed (skew -0.76) toward recent decades, with a median of 1992 and IQR from 1960 to 2008, suggesting coverage is sparser in the early 20th century. No outliers were flagged.

country high anthropic:claude-opus-4-7

Three-letter country codes (e.g., FRA, GBR, USA, DEN, SUI) covering 159 distinct nations across 1433 rows with no nulls. The distribution is remarkably flat — the top value FRA accounts for only 2.1% of rows and entropy ratio is 0.92, so no country dominates. Top counts cluster tightly between 28 and 30, suggesting a near-uniform sampling design rather than organic population weights.

country_name high anthropic:claude-opus-4-7

Categorical country labels with 165 distinct values across 1433 rows and no nulls. Distribution is remarkably flat — the top value 'France' covers only 2.09% of rows, and the top ten countries each appear 28–30 times, giving an entropy ratio of 0.91 (near-uniform). This looks like a panel where each country contributes a similar number of observations rather than a skewed real-world sample.

gold high anthropic:claude-opus-4-7

Numeric count-style feature 'gold' ranging from 0 to 83 with median 1 and mean 4.06, so most rows sit near zero (zero_rate 0.339) while a long tail pulls the average up. Distribution is severely right-skewed (skew 4.26, kurtosis 23.14) with 134 outliers (9.35% of rows) above the q3 of 4. Only 52 unique values across 1433 rows suggests a discrete tally rather than a continuous measurement.

silver high anthropic:claude-opus-4-7

A non-negative integer-like count of silver medals or items, with 45 distinct values ranging 0 to 79 and a median of 2. The distribution is heavily right-skewed (skew 4.03, kurtosis 23.2) with 25.3% zeros and 9.8% flagged as outliers, so a small set of large counts dominates the mean (4.04) versus the median.

bronze high anthropic:claude-opus-4-7

This is a count of bronze medals (or similar bronze-tier tally) per record, with 1433 rows, 44 distinct integer values from 0 to 78, and no nulls. The distribution is heavily right-skewed (skew 3.37, kurtosis 16.94): the median is 2 and Q3 is 5, yet the max reaches 78, producing 150 outliers (10.5%). Roughly 19.8% of rows are zero, so a sizeable share of entities have never won bronze.

total high anthropic:claude-opus-4-7

This appears to be a count-style numeric feature (total), heavily right-skewed: the median is 5 while the mean is 12.5 and the max reaches 234. Skew of 3.92 and kurtosis of 20.8 confirm a long tail, with 151 values (10.5%) flagged as outliers. No nulls or zeros, and only 97 unique values across 1,433 rows, suggesting a discrete count with a small repeating vocabulary.

rank_total high anthropic:claude-opus-4-7

Integer-valued ranking field spanning 1 to 93 with 93 unique values across 1433 rows, suggesting a complete rank table repeated many times (e.g., per period or per group). Distribution is right-skewed (skew 0.74) with median 26 below mean 31.06, so lower ranks dominate while a tail extends toward 93. No nulls, no zeros, and no outliers flagged given the bounded range.

Numeric correlation

year numeric

rows1,433
null0 (0.0%)
unique30
min1,896
max2,024
mean1,982
median1,992
std33.948
q11,960
q32,008
iqr48.000
skew-0.757
kurtosis-0.408
n_outliers0
outlier_rate0.000
zero_rate0.000

country categorical

rows1,433
null0 (0.0%)
unique159
top_valueFRA
top_rate0.021
cardinality159
entropy6.695
entropy_ratio0.916
Top values (rank 1–20)
  1. FRA — 30
  2. GBR — 30
  3. USA — 29
  4. DEN — 29
  5. SUI — 29
  6. HUN — 28
  7. AUS — 28
  8. BEL — 28
  9. ITA — 28
  10. SWE — 28
  11. AUT — 27
  12. NED — 27
  13. CAN — 27
  14. NOR — 26
  15. FIN — 26
  16. JPN — 23
  17. NZL — 23
  18. POL — 23
  19. MEX — 22
  20. GRE — 21

country_name categorical

rows1,433
null0 (0.0%)
unique165
top_valueFrance
top_rate0.021
cardinality165
entropy6.715
entropy_ratio0.912
Top values (rank 1–20)
  1. France — 30
  2. Great Britain — 30
  3. United States — 29
  4. Denmark — 29
  5. Switzerland — 29
  6. Hungary — 28
  7. Australia — 28
  8. Belgium — 28
  9. Italy — 28
  10. Sweden — 28
  11. Austria — 27
  12. Netherlands — 27
  13. Canada — 27
  14. Norway — 26
  15. Finland — 26
  16. Japan — 23
  17. New Zealand — 23
  18. Poland — 23
  19. Mexico — 22
  20. Greece — 21

gold numeric

skew=+4.26 9.4% rows beyond 1.5 IQR
rows1,433
null0 (0.0%)
unique52
min0.000
max83.000
mean4.059
median1.000
std8.419
q10.000
q34.000
iqr4.000
skew4.259
kurtosis23.139
n_outliers134
outlier_rate0.094
zero_rate0.339

silver numeric

skew=+4.03 9.8% rows beyond 1.5 IQR
rows1,433
null0 (0.0%)
unique45
min0.000
max79.000
mean4.038
median2.000
std7.121
q10.000
q34.000
iqr4.000
skew4.026
kurtosis23.209
n_outliers140
outlier_rate0.098
zero_rate0.253

bronze numeric

skew=+3.37 10.5% rows beyond 1.5 IQR
rows1,433
null0 (0.0%)
unique44
min0.000
max78.000
mean4.398
median2.000
std6.853
q11.000
q35.000
iqr4.000
skew3.370
kurtosis16.944
n_outliers150
outlier_rate0.105
zero_rate0.198

total numeric

skew=+3.92 10.5% rows beyond 1.5 IQR
rows1,433
null0 (0.0%)
unique97
min1.000
max234.000
mean12.495
median5.000
std21.660
q12.000
q313.000
iqr11.000
skew3.922
kurtosis20.799
n_outliers151
outlier_rate0.105
zero_rate0.000

rank_total numeric

rows1,433
null0 (0.0%)
unique93
min1.000
max93.000
mean31.055
median26.000
std22.704
q113.000
q345.000
iqr32.000
skew0.739
kurtosis-0.368
n_outliers0
outlier_rate0.000
zero_rate0.000