saturn

/home/coolhand/html/datavis/data_trove/data/quirky/wine_by_country.json 62 rows sample n=62 seed 42 2026-06-21T23:45:55+00:00

Overview

Source	/home/coolhand/html/datavis/data_trove/data/quirky/wine_by_country.json
Total rows	62
Profiled sample	62
Columns	2
Generated	2026-06-21T23:45:55+00:00

Show data table

Per-column null rate across the corpus.
column	kind	null %
name	categorical	0.0%
count	numeric	0.0%

Insights opt-in

Model-generated narrative. These are opinions, not facts — the stats below are what saturn measured. Generated by: anthropic:default.

Dataset medium anthropic:default

This dataset lists wine production (or a related wine metric) aggregated by country, covering 62 countries each with an associated count. The count distribution is extremely skewed: the median is just 2, yet the mean is nearly 19 and the maximum reaches 476, with 10 flagged outliers — suggesting a small handful of countries dominate the wine landscape entirely. France tops the list and is worth examining alongside the other high-count outliers to understand which countries drive the bulk of the totals.

count high anthropic:default

This column appears to be a frequency or occurrence count, likely representing how many times an event or item appears in some grouping. The distribution is extremely right-skewed (skew=6.26, kurtosis=41.80): the median is just 2.0 while the mean is 18.94 and the maximum reaches 476.0, indicating a handful of dominant entries vastly outnumber the rest. With only 21 unique values across 62 rows and 10 outliers (16.1% of records), a small number of high-count observations are pulling the distribution heavily. The IQR of 7.25 (Q1=1.0, Q3=8.25) confirms that 75% of values sit at or below 8, making the max of 476 a stark anomaly.

name high anthropic:default

This column contains country names, with 62 unique values across 62 rows and a null rate of 0.0 — every row holds a distinct country. The entropy ratio of ~1.0 and top_rate of 0.016 confirm perfect cardinality: each country appears exactly once, making this effectively a unique identifier rather than a grouping variable. The 'long_tail' alert is technically triggered but is a non-issue here since all values are equally frequent.

name categorical

62 singleton categories

rows62

null0 (0.0%)

unique62

top_valueFrance

top_rate0.016

cardinality62

entropy5.954

entropy_ratio1.000

Show data table

Top values for name (20 unique shown, of 62 total).
value	count	share
France	1	1.6%
United Kingdom	1	1.6%
Germany	1	1.6%
Spain	1	1.6%
Belgium	1	1.6%
Italy	1	1.6%
United States	1	1.6%
Switzerland	1	1.6%
Australia	1	1.6%
Bolivia	1	1.6%
Austria	1	1.6%
Croatia	1	1.6%
Canada	1	1.6%
Portugal	1	1.6%
Poland	1	1.6%
Netherlands	1	1.6%
Ireland	1	1.6%
Romania	1	1.6%
Argentina	1	1.6%
Unknown	1	1.6%

Top values (rank 1–20)

France — 1
United Kingdom — 1
Germany — 1
Spain — 1
Belgium — 1
Italy — 1
United States — 1
Switzerland — 1
Australia — 1
Bolivia — 1
Austria — 1
Croatia — 1
Canada — 1
Portugal — 1
Poland — 1
Netherlands — 1
Ireland — 1
Romania — 1
Argentina — 1
Unknown — 1

count numeric

skew=+6.26 16.1% rows beyond 1.5 IQR

rows62

null0 (0.0%)

unique21

min1.000

max476.000

mean18.935

median2.000

std63.532

q11.000

q38.250

iqr7.250

skew6.255

kurtosis41.802

n_outliers10

outlier_rate0.161

zero_rate0.000

Show data table

Histogram bins for count (median: 2.0).
bin	count
1 – 68.86	59
68.86 – 136.7	2
136.7 – 204.6	0
204.6 – 272.4	0
272.4 – 340.3	0
340.3 – 408.1	0
408.1 – 476	1