saturn

/home/coolhand/html/datavis/data_trove/data/quirky/peppers.json 175 rows sample n=175 seed 42 2026-05-01T16:51:40+00:00

Overview

Source/home/coolhand/html/datavis/data_trove/data/quirky/peppers.json
Total rows175
Profiled sample175
Columns11
Generated2026-05-01T16:51:40+00:00

Insights opt-in

Model-generated narrative. These are opinions, not facts — the stats below are what saturn measured. Generated by: anthropic:claude-opus-4-7.

Dataset high anthropic:claude-opus-4-7

This dataset catalogs 175 pepper varieties with 11 fields covering name, origin, flavor, heat category, biological type, intended use, and Scoville heat measurements (min, median, max, plus a jalapeño-relative score). The Scoville and jalRP numeric columns are extremely right-skewed (skew ~9-10, kurtosis >100) with max scoville_max reaching 16,000,000 versus a median of just 30,000 — a handful of super-hot peppers dominate the tail and 24% of rows flag as outliers. On the categorical side, 'Medium' heat accounts for 40% of peppers and 'Culinary' use covers 80%, while origin leans heavily toward the United States (26%) and Mexico (15%). Worth a closer look first: the Scoville distribution (consider a log scale) and the type column, which has casing inconsistencies ('annuum' vs 'Annuum', 'chinense' vs 'Chinense') that should be cleaned before any grouping.

name high anthropic:claude-opus-4-7

The `name` column holds 175 unique strings across 175 rows (cardinality 175, entropy_ratio ~1.0), making it a perfect per-row identifier. Sample values like "Bell Pepper", "Gypsy Pepper", and "Peperone di Senise" suggest this is a catalog of pepper varieties rather than a categorical feature. With every value occurring exactly once (top_rate 0.0057), there is no useful frequency signal to model on.

heat high anthropic:claude-opus-4-7

This is a categorical heat/spice level rating with 5 ordinal tiers and no nulls across 175 rows. Medium dominates at 40% (70 rows), followed by Mild (45); the upper tiers Hot (17) and Extra Hot (13) are the rarest, while Super Hot (30) is oddly more common than Hot, breaking the expected monotonic decline up the heat scale.

scoville_min high anthropic:claude-opus-4-7

Numeric heat ratings (Scoville minimum) for 175 entries spanning 0 to 15,000,000 with a median of 15,000 — classic chili pepper data. Distribution is brutally right-skewed (skew 10.31, kurtosis 120.13) with mean 289,208 dwarfing the median, and 29 outliers (16.6% rate) plus 9.7% zeros. The std of 1,218,458 against an IQR of just 74,000 confirms a long, heavy tail.

scoville_max high anthropic:claude-opus-4-7

Maximum Scoville heat ratings for 175 peppers, ranging from 0 to 16,000,000 with a median of 30,000 but a mean of 384,835. Distribution is extremely right-skewed (skew 9.45, kurtosis 106) with 24.6% of values flagged as outliers and 5.7% zeros. The IQR (2,750-100,000) is dwarfed by the max, consistent with a few extreme superhot varieties dominating the tail.

scoville_median high anthropic:claude-opus-4-7

Numeric column capturing the median Scoville heat rating across 175 entries with no nulls and 80 unique values. The distribution is extremely right-skewed (skew 9.79, kurtosis 111.5): the median is 22,500 while the mean is 339,805 and the max reaches 15,500,000, with 41 outliers (23.4%) and 5.7% zeros. The IQR (2,000 to 90,000) is tiny relative to the std of 1,278,965, confirming a heavy upper tail.

jalRP high anthropic:claude-opus-4-7

Numeric feature 'jalRP' is extremely right-skewed: the median is 4.29 with Q3 at 17.14, yet the max reaches 2952.38 and the mean (64.72) sits far above the median. Skew of 9.79 and kurtosis of 111.48 confirm a heavy tail, and 23.4% of values flag as outliers with 5.7% exact zeros. Only 81 unique values across 175 rows suggests repeated discrete magnitudes rather than a smooth continuum.

type high anthropic:claude-opus-4-7

This column records the Capsicum species (type), dominated by 'annuum' at 59.4% of 175 rows with 'chinense' second at 46. Watch out for case-inconsistent duplicates ('Annuum' 4, 'Chinense' 2 alongside their lowercase forms) and a literal 'N/A' string that isn't being counted as null (null_rate 0.0).

origin high anthropic:claude-opus-4-7

This is a categorical origin/country field with 34 distinct values across 175 rows and no nulls. Distribution is moderately concentrated: United States leads at 26.3% (46 rows), followed by Mexico (26) and South America (11), with entropy ratio 0.78 indicating fairly broad spread across the long tail. Notable quirks include a mix of country-level (United States, Italy, India) and region-level (South America, Caribbean) labels, plus 7 explicit 'Unknown' entries.

use high anthropic:claude-opus-4-7

This is a low-cardinality categorical describing the use of an item, with 4 distinct values across 175 rows and no nulls. The distribution is heavily skewed: 'Culinary' accounts for 80.6% (141 rows), 'Ornamental' for 31, plus 2 rows with a combined 'Culinary, Ornamental' label and 1 empty string that should be treated as missing. Entropy ratio of 0.40 confirms the imbalance.

flavor high anthropic:claude-opus-4-7

This is a categorical flavor descriptor field, with values that look like comma-separated tag combinations (e.g. 'Sweet, Fruity, Earthy, Smoky') rather than single labels. Cardinality is high — 73 unique values across only 175 rows — and entropy_ratio of 0.845 confirms a long tail; the top value 'Sweet' covers just 14.3% of rows. The compound labels suggest the underlying data is multi-label flavor notes that have been collapsed into one string.

url high anthropic:claude-opus-4-7

This is a URL column serving as a per-row identifier, with all 175 values unique and zero nulls. Every entry is a pepperscale.com pepper page (e.g., bell-pepper, gypsy-pepper, habanada-pepper), so the column is effectively a primary key for pepper varieties. Entropy ratio of ~1.0 confirms no repetition.

Numeric correlation

name categorical

175 singleton categories
rows175
null0 (0.0%)
unique175
top_valueBell Pepper
top_rate5.71e-03
cardinality175
entropy7.451
entropy_ratio1.000
Top values (rank 1–20)
  1. Bell Pepper — 1
  2. Gypsy Pepper — 1
  3. Purple Beauty Pepper — 1
  4. Melrose Pepper — 1
  5. Carmen Pepper — 1
  6. California Wonder Pepper — 1
  7. Peperone di Senise — 1
  8. Fushimi Pepper — 1
  9. Elephant Ears Pepper — 1
  10. Habanada Pepper — 1
  11. Tangerine Dream Pepper — 1
  12. Chilly Chili — 1
  13. Shishito Pepper — 1
  14. Trinidad Perfume — 1
  15. Banana Pepper — 1
  16. Pepperoncini — 1
  17. Pimento Pepper — 1
  18. Jimmy Nardello Pepper — 1
  19. Mariachi Pepper — 1
  20. Santa Fe Grande Pepper — 1

heat categorical

rows175
null0 (0.0%)
unique5
top_valueMedium
top_rate0.400
cardinality5
entropy2.074
entropy_ratio0.893
Top values (rank 1–20)
  1. Medium — 70
  2. Mild — 45
  3. Super Hot — 30
  4. Hot — 17
  5. Extra Hot — 13

scoville_min numeric

skew=+10.31 16.6% rows beyond 1.5 IQR
rows175
null0 (0.0%)
unique44
min0.000
max15,000,000
mean289,209
median15,000
std1,218,458
q11,000
q375,000
iqr74,000
skew10.313
kurtosis120.132
n_outliers29
outlier_rate0.166
zero_rate0.097

scoville_max numeric

skew=+9.45 24.6% rows beyond 1.5 IQR
rows175
null0 (0.0%)
unique59
min0.000
max16,000,000
mean384,835
median30,000
std1,333,100
q12,750
q3100,000
iqr97,250
skew9.450
kurtosis106.108
n_outliers43
outlier_rate0.246
zero_rate0.057

scoville_median numeric

skew=+9.79 23.4% rows beyond 1.5 IQR
rows175
null0 (0.0%)
unique80
min0.000
max15,500,000
mean339,805
median22,500
std1,278,966
q12,000
q390,000
iqr88,000
skew9.794
kurtosis111.468
n_outliers41
outlier_rate0.234
zero_rate0.057

jalRP numeric

skew=+9.79 23.4% rows beyond 1.5 IQR
rows175
null0 (0.0%)
unique81
min0.000
max2,952
mean64.721
median4.290
std243.607
q10.380
q317.140
iqr16.760
skew9.795
kurtosis111.478
n_outliers41
outlier_rate0.234
zero_rate0.057

type categorical

rows175
null0 (0.0%)
unique8
top_valueannuum
top_rate0.594
cardinality8
entropy1.657
entropy_ratio0.552
Top values (rank 1–20)
  1. annuum — 104
  2. chinense — 46
  3. baccatum — 12
  4. Annuum — 4
  5. frutescens — 4
  6. pubescens — 2
  7. Chinense — 2
  8. N/A — 1

origin categorical

rows175
null0 (0.0%)
unique34
top_valueUnited States
top_rate0.263
cardinality34
entropy3.980
entropy_ratio0.782
Top values (rank 1–20)
  1. United States — 46
  2. Mexico — 26
  3. South America — 11
  4. Peru — 11
  5. Italy — 8
  6. Unknown — 7
  7. United Kingdom — 7
  8. Trinidad — 7
  9. Caribbean — 6
  10. India — 6
  11. Brazil — 5
  12. Spain — 4
  13. Hungary — 4
  14. Japan — 3
  15. Africa — 3
  16. China — 2
  17. Thailand — 2
  18. Balkan Peninsula — 1
  19. France — 1
  20. Chile — 1

use categorical

rows175
null0 (0.0%)
unique4
top_valueCulinary
top_rate0.806
cardinality4
entropy0.810
entropy_ratio0.405
Top values (rank 1–20)
  1. Culinary — 141
  2. Ornamental — 31
  3. Culinary, Ornamental — 2
  4. — 1

flavor categorical

49 singleton categories
rows175
null0 (0.0%)
unique73
top_valueSweet
top_rate0.143
cardinality73
entropy5.232
entropy_ratio0.845
Top values (rank 1–20)
  1. Sweet — 25
  2. Sweet, Fruity — 21
  3. Neutral — 19
  4. Fruity, Sweet — 6
  5. Bright, Sweet — 4
  6. Sweet, Tangy — 4
  7. Sweet, Fruity, Smoky — 4
  8. Sweet, Fruity, Citrusy — 4
  9. Sweet, Fruity, Earthy, Smoky — 4
  10. Sweet, Fruity, Floral — 3
  11. Sweet, Fruity, Citrusy, Floral — 3
  12. Sweet, Fruity, Earthy — 3
  13. Sweet, Tropical — 3
  14. Bright, Grassy — 3
  15. Sweet, Floral — 2
  16. Sweet, Smoky — 2
  17. Earthy — 2
  18. Smoky, Sweet, Earthy — 2
  19. Smoky, Earthy — 2
  20. Sweet, Citrusy — 2

url categorical

175 singleton categories
rows175
null0 (0.0%)
unique175
top_valuehttps://www.pepperscale.com/bell-pepper/
top_rate5.71e-03
cardinality175
entropy7.451
entropy_ratio1.000
Top values (rank 1–20)
  1. https://www.pepperscale.com/bell-pepper/ — 1
  2. https://www.pepperscale.com/gypsy-pepper/ — 1
  3. https://www.pepperscale.com/purple-beauty-pepper/ — 1
  4. https://www.pepperscale.com/melrose-pepper/ — 1
  5. https://www.pepperscale.com/carmen-pepper/ — 1
  6. https://www.pepperscale.com/california-wonder-pepper/ — 1
  7. https://www.pepperscale.com/peperone-di-senise/ — 1
  8. https://www.pepperscale.com/fushimi-pepper/ — 1
  9. https://www.pepperscale.com/elephant-ears-pepper/ — 1
  10. https://www.pepperscale.com/habanada-pepper/ — 1
  11. https://www.pepperscale.com/tangerine-dream-pepper/ — 1
  12. https://www.pepperscale.com/chilly-chili/ — 1
  13. https://www.pepperscale.com/shishito-pepper/ — 1
  14. https://www.pepperscale.com/trinidad-perfume/ — 1
  15. https://www.pepperscale.com/banana-pepper/ — 1
  16. https://www.pepperscale.com/pepperoncini/ — 1
  17. https://www.pepperscale.com/pimento-pepper/ — 1
  18. https://pepperscale.com/jimmy-nardello-pepper/ — 1
  19. https://www.pepperscale.com/mariachi-pepper/ — 1
  20. https://www.pepperscale.com/santa-fe-grande-pepper/ — 1