This dataset catalogs 175 pepper varieties with 11 fields covering name, origin, flavor, heat category, biological type, intended use, and Scoville heat measurements (min, median, max, plus a jalapeño-relative score). The Scoville and jalRP numeric columns are extremely right-skewed (skew ~9-10, kurtosis >100) with max scoville_max reaching 16,000,000 versus a median of just 30,000 — a handful of super-hot peppers dominate the tail and 24% of rows flag as outliers. On the categorical side, 'Medium' heat accounts for 40% of peppers and 'Culinary' use covers 80%, while origin leans heavily toward the United States (26%) and Mexico (15%). Worth a closer look first: the Scoville distribution (consider a log scale) and the type column, which has casing inconsistencies ('annuum' vs 'Annuum', 'chinense' vs 'Chinense') that should be cleaned before any grouping.
saturn
/home/coolhand/html/datavis/data_trove/data/quirky/peppers.json 175 rows sample n=175 seed 42 2026-05-01T16:51:40+00:00
Overview
| Source | /home/coolhand/html/datavis/data_trove/data/quirky/peppers.json |
| Total rows | 175 |
| Profiled sample | 175 |
| Columns | 11 |
| Generated | 2026-05-01T16:51:40+00:00 |
Insights opt-in
Model-generated narrative. These are opinions, not facts — the stats below are what saturn measured. Generated by: anthropic:claude-opus-4-7.
The `name` column holds 175 unique strings across 175 rows (cardinality 175, entropy_ratio ~1.0), making it a perfect per-row identifier. Sample values like "Bell Pepper", "Gypsy Pepper", and "Peperone di Senise" suggest this is a catalog of pepper varieties rather than a categorical feature. With every value occurring exactly once (top_rate 0.0057), there is no useful frequency signal to model on.
This is a categorical heat/spice level rating with 5 ordinal tiers and no nulls across 175 rows. Medium dominates at 40% (70 rows), followed by Mild (45); the upper tiers Hot (17) and Extra Hot (13) are the rarest, while Super Hot (30) is oddly more common than Hot, breaking the expected monotonic decline up the heat scale.
Numeric heat ratings (Scoville minimum) for 175 entries spanning 0 to 15,000,000 with a median of 15,000 — classic chili pepper data. Distribution is brutally right-skewed (skew 10.31, kurtosis 120.13) with mean 289,208 dwarfing the median, and 29 outliers (16.6% rate) plus 9.7% zeros. The std of 1,218,458 against an IQR of just 74,000 confirms a long, heavy tail.
Maximum Scoville heat ratings for 175 peppers, ranging from 0 to 16,000,000 with a median of 30,000 but a mean of 384,835. Distribution is extremely right-skewed (skew 9.45, kurtosis 106) with 24.6% of values flagged as outliers and 5.7% zeros. The IQR (2,750-100,000) is dwarfed by the max, consistent with a few extreme superhot varieties dominating the tail.
Numeric column capturing the median Scoville heat rating across 175 entries with no nulls and 80 unique values. The distribution is extremely right-skewed (skew 9.79, kurtosis 111.5): the median is 22,500 while the mean is 339,805 and the max reaches 15,500,000, with 41 outliers (23.4%) and 5.7% zeros. The IQR (2,000 to 90,000) is tiny relative to the std of 1,278,965, confirming a heavy upper tail.
Numeric feature 'jalRP' is extremely right-skewed: the median is 4.29 with Q3 at 17.14, yet the max reaches 2952.38 and the mean (64.72) sits far above the median. Skew of 9.79 and kurtosis of 111.48 confirm a heavy tail, and 23.4% of values flag as outliers with 5.7% exact zeros. Only 81 unique values across 175 rows suggests repeated discrete magnitudes rather than a smooth continuum.
This column records the Capsicum species (type), dominated by 'annuum' at 59.4% of 175 rows with 'chinense' second at 46. Watch out for case-inconsistent duplicates ('Annuum' 4, 'Chinense' 2 alongside their lowercase forms) and a literal 'N/A' string that isn't being counted as null (null_rate 0.0).
This is a categorical origin/country field with 34 distinct values across 175 rows and no nulls. Distribution is moderately concentrated: United States leads at 26.3% (46 rows), followed by Mexico (26) and South America (11), with entropy ratio 0.78 indicating fairly broad spread across the long tail. Notable quirks include a mix of country-level (United States, Italy, India) and region-level (South America, Caribbean) labels, plus 7 explicit 'Unknown' entries.
This is a low-cardinality categorical describing the use of an item, with 4 distinct values across 175 rows and no nulls. The distribution is heavily skewed: 'Culinary' accounts for 80.6% (141 rows), 'Ornamental' for 31, plus 2 rows with a combined 'Culinary, Ornamental' label and 1 empty string that should be treated as missing. Entropy ratio of 0.40 confirms the imbalance.
This is a categorical flavor descriptor field, with values that look like comma-separated tag combinations (e.g. 'Sweet, Fruity, Earthy, Smoky') rather than single labels. Cardinality is high — 73 unique values across only 175 rows — and entropy_ratio of 0.845 confirms a long tail; the top value 'Sweet' covers just 14.3% of rows. The compound labels suggest the underlying data is multi-label flavor notes that have been collapsed into one string.
This is a URL column serving as a per-row identifier, with all 175 values unique and zero nulls. Every entry is a pepperscale.com pepper page (e.g., bell-pepper, gypsy-pepper, habanada-pepper), so the column is effectively a primary key for pepper varieties. Entropy ratio of ~1.0 confirms no repetition.
Numeric correlation
name categorical
Top values (rank 1–20)
- Bell Pepper — 1
- Gypsy Pepper — 1
- Purple Beauty Pepper — 1
- Melrose Pepper — 1
- Carmen Pepper — 1
- California Wonder Pepper — 1
- Peperone di Senise — 1
- Fushimi Pepper — 1
- Elephant Ears Pepper — 1
- Habanada Pepper — 1
- Tangerine Dream Pepper — 1
- Chilly Chili — 1
- Shishito Pepper — 1
- Trinidad Perfume — 1
- Banana Pepper — 1
- Pepperoncini — 1
- Pimento Pepper — 1
- Jimmy Nardello Pepper — 1
- Mariachi Pepper — 1
- Santa Fe Grande Pepper — 1
heat categorical
Top values (rank 1–20)
- Medium — 70
- Mild — 45
- Super Hot — 30
- Hot — 17
- Extra Hot — 13
scoville_min numeric
scoville_max numeric
scoville_median numeric
jalRP numeric
type categorical
Top values (rank 1–20)
- annuum — 104
- chinense — 46
- baccatum — 12
- Annuum — 4
- frutescens — 4
- pubescens — 2
- Chinense — 2
- N/A — 1
origin categorical
Top values (rank 1–20)
- United States — 46
- Mexico — 26
- South America — 11
- Peru — 11
- Italy — 8
- Unknown — 7
- United Kingdom — 7
- Trinidad — 7
- Caribbean — 6
- India — 6
- Brazil — 5
- Spain — 4
- Hungary — 4
- Japan — 3
- Africa — 3
- China — 2
- Thailand — 2
- Balkan Peninsula — 1
- France — 1
- Chile — 1
use categorical
Top values (rank 1–20)
- Culinary — 141
- Ornamental — 31
- Culinary, Ornamental — 2
- — 1
flavor categorical
Top values (rank 1–20)
- Sweet — 25
- Sweet, Fruity — 21
- Neutral — 19
- Fruity, Sweet — 6
- Bright, Sweet — 4
- Sweet, Tangy — 4
- Sweet, Fruity, Smoky — 4
- Sweet, Fruity, Citrusy — 4
- Sweet, Fruity, Earthy, Smoky — 4
- Sweet, Fruity, Floral — 3
- Sweet, Fruity, Citrusy, Floral — 3
- Sweet, Fruity, Earthy — 3
- Sweet, Tropical — 3
- Bright, Grassy — 3
- Sweet, Floral — 2
- Sweet, Smoky — 2
- Earthy — 2
- Smoky, Sweet, Earthy — 2
- Smoky, Earthy — 2
- Sweet, Citrusy — 2
url categorical
Top values (rank 1–20)
- https://www.pepperscale.com/bell-pepper/ — 1
- https://www.pepperscale.com/gypsy-pepper/ — 1
- https://www.pepperscale.com/purple-beauty-pepper/ — 1
- https://www.pepperscale.com/melrose-pepper/ — 1
- https://www.pepperscale.com/carmen-pepper/ — 1
- https://www.pepperscale.com/california-wonder-pepper/ — 1
- https://www.pepperscale.com/peperone-di-senise/ — 1
- https://www.pepperscale.com/fushimi-pepper/ — 1
- https://www.pepperscale.com/elephant-ears-pepper/ — 1
- https://www.pepperscale.com/habanada-pepper/ — 1
- https://www.pepperscale.com/tangerine-dream-pepper/ — 1
- https://www.pepperscale.com/chilly-chili/ — 1
- https://www.pepperscale.com/shishito-pepper/ — 1
- https://www.pepperscale.com/trinidad-perfume/ — 1
- https://www.pepperscale.com/banana-pepper/ — 1
- https://www.pepperscale.com/pepperoncini/ — 1
- https://www.pepperscale.com/pimento-pepper/ — 1
- https://pepperscale.com/jimmy-nardello-pepper/ — 1
- https://www.pepperscale.com/mariachi-pepper/ — 1
- https://www.pepperscale.com/santa-fe-grande-pepper/ — 1