saturn

/home/coolhand/html/datavis/data_trove/data/quirky/wine_by_country.json 62 rows sample n=62 seed 42 2026-06-21T23:45:55+00:00

Overview

Source/home/coolhand/html/datavis/data_trove/data/quirky/wine_by_country.json
Total rows62
Profiled sample62
Columns2
Generated2026-06-21T23:45:55+00:00
Show data table
Per-column null rate across the corpus.
columnkindnull %
namecategorical0.0%
countnumeric0.0%

Insights opt-in

Model-generated narrative. These are opinions, not facts — the stats below are what saturn measured. Generated by: anthropic:default.

Dataset medium anthropic:default

This dataset lists wine production (or a related wine metric) aggregated by country, covering 62 countries each with an associated count. The count distribution is extremely skewed: the median is just 2, yet the mean is nearly 19 and the maximum reaches 476, with 10 flagged outliers — suggesting a small handful of countries dominate the wine landscape entirely. France tops the list and is worth examining alongside the other high-count outliers to understand which countries drive the bulk of the totals.

count high anthropic:default

This column appears to be a frequency or occurrence count, likely representing how many times an event or item appears in some grouping. The distribution is extremely right-skewed (skew=6.26, kurtosis=41.80): the median is just 2.0 while the mean is 18.94 and the maximum reaches 476.0, indicating a handful of dominant entries vastly outnumber the rest. With only 21 unique values across 62 rows and 10 outliers (16.1% of records), a small number of high-count observations are pulling the distribution heavily. The IQR of 7.25 (Q1=1.0, Q3=8.25) confirms that 75% of values sit at or below 8, making the max of 476 a stark anomaly.

name high anthropic:default

This column contains country names, with 62 unique values across 62 rows and a null rate of 0.0 — every row holds a distinct country. The entropy ratio of ~1.0 and top_rate of 0.016 confirm perfect cardinality: each country appears exactly once, making this effectively a unique identifier rather than a grouping variable. The 'long_tail' alert is technically triggered but is a non-issue here since all values are equally frequent.

name categorical

62 singleton categories
rows62
null0 (0.0%)
unique62
top_valueFrance
top_rate0.016
cardinality62
entropy5.954
entropy_ratio1.000
Show data table
Top values for name (20 unique shown, of 62 total).
valuecountshare
France11.6%
United Kingdom11.6%
Germany11.6%
Spain11.6%
Belgium11.6%
Italy11.6%
United States11.6%
Switzerland11.6%
Australia11.6%
Bolivia11.6%
Austria11.6%
Croatia11.6%
Canada11.6%
Portugal11.6%
Poland11.6%
Netherlands11.6%
Ireland11.6%
Romania11.6%
Argentina11.6%
Unknown11.6%
Top values (rank 1–20)
  1. France — 1
  2. United Kingdom — 1
  3. Germany — 1
  4. Spain — 1
  5. Belgium — 1
  6. Italy — 1
  7. United States — 1
  8. Switzerland — 1
  9. Australia — 1
  10. Bolivia — 1
  11. Austria — 1
  12. Croatia — 1
  13. Canada — 1
  14. Portugal — 1
  15. Poland — 1
  16. Netherlands — 1
  17. Ireland — 1
  18. Romania — 1
  19. Argentina — 1
  20. Unknown — 1

count numeric

skew=+6.26 16.1% rows beyond 1.5 IQR
rows62
null0 (0.0%)
unique21
min1.000
max476.000
mean18.935
median2.000
std63.532
q11.000
q38.250
iqr7.250
skew6.255
kurtosis41.802
n_outliers10
outlier_rate0.161
zero_rate0.000
Show data table
Histogram bins for count (median: 2.0).
bincount
1 – 68.8659
68.86 – 136.72
136.7 – 204.60
204.6 – 272.40
272.4 – 340.30
340.3 – 408.10
408.1 – 4761