saturn

/home/coolhand/html/datavis/data_trove/data/quirky/hot_sauces.json 258 rows sample n=258 seed 42 2026-05-01T17:04:18+00:00

Overview

Source/home/coolhand/html/datavis/data_trove/data/quirky/hot_sauces.json
Total rows258
Profiled sample258
Columns9
Generated2026-05-01T17:04:18+00:00

Insights opt-in

Model-generated narrative. These are opinions, not facts — the stats below are what saturn measured. Generated by: anthropic:claude-opus-4-7.

Dataset high anthropic:claude-opus-4-7

This dataset catalogs 258 hot sauce products sourced entirely from OpenFoodFacts, with 9 categorical columns covering brand, category, country, ingredients, labels, name, and URL. Brands are highly fragmented across 158 unique values, with Tabasco (12) and McIlhenny Company, Tabasco (11) leading but no dominant player — and 37 records have a blank brand worth investigating. Geographically, the United States (54) and France (28) account for the largest shares of the 123 country values, though inconsistent encoding (e.g., 'en:us' vs 'United States') suggests a data-cleaning task. The labels column is sparse: 145 of 258 rows are blank, so dietary tags like 'No gluten' or 'Non GMO project' apply to only a small minority. Note that source and type are constant (OpenFoodFacts / hot_sauce_product) and carry no analytical signal.

name high anthropic:claude-opus-4-7

This is a product name field for hot sauces, with 221 unique values across 258 rows and near-maximal entropy ratio of 0.984. The top value 'Carolina Reaper Hot Sauce' only covers 2.3% of rows, and casing/spelling variants ('Carolina Reaper Hot Sauce' vs 'Carolina reaper hot sauce', 'Sriracha Hot Chilli Sauce' vs 'Sriracha Hot Chili Sauce') plus a French entry and 3 empty strings indicate inconsistent normalization despite a 0.0 null rate.

brand high anthropic:claude-opus-4-7

Categorical brand label for what appears to be a hot sauce catalogue, with 158 distinct brands across 258 rows and very high entropy ratio (0.894) indicating a long tail. The most common value is the empty string at 37 occurrences (14.3% top rate), meaning missing-as-blank dominates over real brands like Tabasco (12) and McIlhenny Company, Tabasco (11). Note also that 'Tabasco' and 'McIlhenny Company, Tabasco' likely refer to the same maker but appear as separate categories, suggesting inconsistent normalisation.

countries high anthropic:claude-opus-4-7

This is a country-of-origin or sale label for 258 records, with 123 distinct values and no nulls. The encoding is inconsistent: plain names ('United States', 54) coexist with Open Food Facts-style tag prefixes ('en:us', 10; 'en:United States', 4) and multi-country strings ('United States, World'), so the same country appears under several spellings. High entropy ratio (0.82) and a long tail confirm the values are fragmented well beyond the 20.9% top rate.

categories high anthropic:claude-opus-4-7

Comma-delimited product category tags, dominated by condiment/sauce/hot-sauce hierarchies. Cardinality is high (106 unique across 258 rows, entropy ratio 0.82) and the most common value is the empty string at 13.6% (35 rows), indicating missing labels encoded as blanks rather than nulls. Near-duplicate variants differ only by spacing, casing, or 'en:' prefixes (e.g., 'Condiments,Sauces' vs 'Condiments, Sauces, Groceries'), so raw cardinality overstates the true taxonomy.

ingredients high anthropic:claude-opus-4-7

Free-text ingredient lists for what appears to be hot-sauce or chili products, with 207 distinct strings across 258 rows and entropy ratio 0.90 indicating near-unique values. The dominant 'value' is an empty string at 49 rows (19% top_rate), so roughly a fifth of records have no ingredients recorded. The remaining entries mix multiple languages (English, French, Norwegian, German) and formatting conventions, so direct categorical use is not viable.

labels high anthropic:claude-opus-4-7

Free-form product label tags (dietary, certification, packaging) with 77 distinct values across 258 rows. Over half the rows (56.2%) carry an empty string rather than a true null, so null_rate=0 is misleading. Values mix languages (English 'No gluten' vs French 'Sans gluten') and formats (raw text vs Open Food Facts taxonomy codes like 'en:vegan'), and many cells concatenate multiple labels with commas.

url high anthropic:claude-opus-4-7

This column holds Open Food Facts product URLs, one per row, with the trailing path segment being the product barcode. Every one of the 258 values is unique (entropy_ratio 1.0, top_rate 0.0039), so it functions as a row identifier rather than a feature.

source high anthropic:claude-opus-4-7

This column records the data provenance, with every one of the 258 rows tagged 'OpenFoodFacts'. Cardinality is 1 and entropy is 0, so it carries no information for modelling and simply documents that the entire slice came from a single source.

type high anthropic:claude-opus-4-7

This column is a constant categorical tag identifying every row as 'hot_sauce_product', appearing in all 258 records with no nulls. Cardinality is 1 and entropy is 0, so it carries no discriminative information. It likely served as a type marker from an ingestion pipeline rather than a usable feature.

name categorical

199 singleton categories
rows258
null0 (0.0%)
unique221
top_valueCarolina Reaper Hot Sauce
top_rate0.023
cardinality221
entropy7.666
entropy_ratio0.984
Top values (rank 1–20)
  1. Carolina Reaper Hot Sauce — 6
  2. Tabasco — 5
  3. Sriracha Hot Chilli Sauce — 3
  4. Sriracha Hot Chili Sauce — 3
  5. Sauce de piment sriracha — 3
  6. — 3
  7. Ghost pepper hot sauce — 3
  8. Carolina Reaper Sauce — 3
  9. Carolina reaper hot sauce — 3
  10. Carolina Reaper — 3
  11. Salsa Picante — 2
  12. Sriracha Sauce — 2
  13. Sriracha — 2
  14. Sauce sriracha — 2
  15. Sauce de Piment Sriracha — 2
  16. Tabasco Green Pepper Sauce — 2
  17. Tabasco® brand pepper sauce — 2
  18. Habanero Hot Sauce — 2
  19. Hot Sauce Chile Habanero — 2
  20. Ghost Pepper — 2

brand categorical

132 singleton categories
rows258
null0 (0.0%)
unique158
top_value
top_rate0.143
cardinality158
entropy6.530
entropy_ratio0.894
Top values (rank 1–20)
  1. — 37
  2. Tabasco — 12
  3. McIlhenny Company, Tabasco — 11
  4. Flying Goose Brand — 6
  5. Melinda's — 5
  6. Lola's Fine Hot Sauce — 5
  7. Cholula — 4
  8. Encona — 4
  9. El Yucateco — 4
  10. Mrs. Renfro's — 4
  11. Huy Fong Foods, Inc. — 3
  12. Sauce Shop — 3
  13. Go-Tan — 2
  14. Vitasia — 2
  15. Valentina — 2
  16. Heinz — 2
  17. sauce shop — 2
  18. CHOLULA — 2
  19. TABASCO — 2
  20. Serpis — 2

countries categorical

99 singleton categories
rows258
null0 (0.0%)
unique123
top_valueUnited States
top_rate0.209
cardinality123
entropy5.676
entropy_ratio0.818
Top values (rank 1–20)
  1. United States — 54
  2. France — 28
  3. en:us — 10
  4. en:gb — 8
  5. en:fr — 8
  6. en:france — 4
  7. en:germany — 4
  8. United States, World — 4
  9. en:United States — 4
  10. United Kingdom — 3
  11. en:United Kingdom — 3
  12. France, United States — 3
  13. en:Canada — 3
  14. World — 3
  15. France, en:morocco — 2
  16. en:ma — 2
  17. France,Royaume-Uni — 2
  18. en:Germany — 2
  19. Belgique,France — 2
  20. Canada — 2

categories categorical

85 singleton categories
rows258
null0 (0.0%)
unique106
top_value
top_rate0.136
cardinality106
entropy5.506
entropy_ratio0.818
Top values (rank 1–20)
  1. — 35
  2. Condiments, Sauces, Hot sauces, Groceries — 32
  3. Condiments, Sauces, Groceries — 23
  4. Condiments, Sauces, Dips, Groceries — 13
  5. Condiments, Sauces, Sauces chili, en:groceries — 9
  6. Condiments,Sauces — 8
  7. Condiments, Sauces, Hot sauces — 7
  8. Hot sauces — 5
  9. Condiments,Sauces,Hot sauces — 5
  10. Condiments,Sauces,Hot sauces,Groceries — 5
  11. Condiments, Sauces, en:hot-sauces — 4
  12. Condiments,Sauces,Sauces chili — 4
  13. Sauces chili — 3
  14. Condiments, Sauces, Sauces chili, Sauces sriracha, en:groceries — 3
  15. Condiments, Sauces, Barbecue sauces, Groceries — 3
  16. Condiments, Sauces — 3
  17. undefined — 3
  18. Condimentos,Salsas,Salsas de chiles,en:groceries — 2
  19. en:hot-sauces — 2
  20. Condiments, Sauces, Hot sauces, Sriracha sauces — 2

ingredients categorical

203 singleton categories
rows258
null0 (0.0%)
unique207
top_value
top_rate0.190
cardinality207
entropy6.922
entropy_ratio0.900
Top values (rank 1–20)
  1. — 49
  2. Distilled Vinegar, Red Pepper (19%), Salt. — 2
  3. Vinaigre d'alcool, piment rouge (19%), sel. — 2
  4. Distilled vinegar, red pepper, salt. — 2
  5. Rød chillipepper 54%, sukker, hvitløk, salt, vann, syre (eddiksyre, sitronsyre), smaksforsterker (mononatriumglutamat), konserveringsmiddel (natriumbenzoat). — 1
  6. Wasser, 30% Zucker, 8% Chilischoten*, Paprika, modifizierte Stärke, Speisesalz Säuerungsmittel: Essigsäure; Knoblauch, Zwiebeln, Verdickungsmittel: Xanthan; Konservierungsstoff: Kaliumsorbat. — 1
  7. soybean oil [45%], chilli [25%], onion [15%], fermented soybeans [soybeans, water], flavour enhancer [e621], salt, sugar, sichuan pepper powder, — 1
  8. Water, Chili Pepper, Vinegar, Salt, Spice, Sodium Benzoate (Preservative). — 1
  9. Fermented Red Cayenne Peppers (35%), Spirit Vinegar, Water, Salt, Garlic Powder. — 1
  10. Eau, piments (5%), sel, acidifiant (acide acétique), stabilisant (gomme xanthane), farine de riz, épices, vinaigre de cidre, arômes naturels. — 1
  11. Vineger, Louisiana type Red Chili Pepper, Salt, Thickener(Xanthan Gum), Green Pepper Natural Identical Flavor, Natural Color(E120), Antioxidant (Ascorbic Acid). May Contain Celery. — 1
  12. chilli 61%, sugar, water, salt, garlic, flavour enhancer: monosodium glutamate, stabiliser: xanthan gum, acidity regulator: acetic acid, citric acid, preservative: potassium sorbate — 1
  13. pickled red chilli 64% [chili, salt, acidity regulator (acetic acid)), sugar, water, garlic, salt, thickener (modified starch, xanthan gum), acidity regulator (acetic acid, citric acid), flavour enhancer (yeast extract), preservative (potassium sorbate), colour (paprika oleoresin). — 1
  14. WATER, DRIED CHILI PEPPERS (5.0%) (ARBOL & PIQUIN), SALT, VINEGAR BLEND (SPIRIT VINEGAR CIDER VINEGAR), SPICES, STABILISER (XANTHAN GUM) — 1
  15. Red hot pepper (87%), Garlic, Coriander, Salt, Caraway, Acidifying : E330, — 1
  16. 45% raapzaadolie, water, 20% sriracha saus (rode pepers, suiker, knoflook, zout, water, voedingszuur (azijnzuur, citroenzuur),smaakversterker (mononatriumglutamaat), conserveermiddel (natriumbenzoaat)), suiker, azijn, mosterd (water, azijn, MOSTERDZAAD,suiker, zout), zout, gemodificeerde zetmelen, voedingszuur (melkzuur), HEEL EIPOEDER, conserveermiddelen (kaliumsorbaat,natriumbenzoaat), verdikkingsmiddel (xanthaangom), antioxidant (calcium-dinatrium-EDTA). — 1
  17. Chilis, Zucker, Knoblauch, Salz, Essigsäure E260, Konservierungsmittel Kaliumsorbat E202, Konservierungsmittel Natriumbisulfit E222, Xanthan E415. — 1
  18. water, 32% piri-piri pepper, salt, acidity regulators: acetic acid, lactic acid, citric acid, wine vinegar (contains sulphites), spices, thickener: xanthan gum, paprika extract, preservatives: sodium benzoate, potassium sorbate, — 1
  19. Chili (83.23%), Sugar, Salt, Garlic (3.60%), Acetic Acid, Potassium Sorbate and Sodium Bisulfite as preservatives, Xanthan Gum. CONTAINS SULPHITE (SODIUM BISULFITE) INGR — 1
  20. Chili 70%, Zucker, Wasser, Salz, Sauerungsmittel: Essigsäure, Citronensäure; Verdickungsmittel: Xanthan; Geschmacksverstärker Mononatriumglutamat; Konservierungsstoff Kaliumsorbat — 1

labels categorical

62 singleton categories
rows258
null0 (0.0%)
unique77
top_value
top_rate0.562
cardinality77
entropy3.557
entropy_ratio0.568
Top values (rank 1–20)
  1. — 145
  2. No gluten — 9
  3. No GMOs, Non GMO project — 9
  4. Sans gluten — 5
  5. Halal — 4
  6. en:vegan — 4
  7. No GMOs, Non GMO project, en:no-gluten — 3
  8. Point Vert — 3
  9. Vegetarian, Vegan, Green Dot — 2
  10. Triman — 2
  11. No gluten, en:vegan — 2
  12. Punto Verde — 2
  13. Sans OGM,en:Non GMO project — 2
  14. en:halal — 2
  15. en:no-gluten — 2
  16. Sin gluten,Punto Verde — 1
  17. Vegetarian, Vegan, European Vegetarian Union, European Vegetarian Union Vegan, Nutriscore, Rainforest Alliance, en:green-dot — 1
  18. Vegetarian — 1
  19. Thai quality label, Halal, Natural colorings, Thailand Diversity & Refinement, The Central Islamic Committee of Thailand — 1
  20. No gluten, No added MSG — 1

url categorical

258 singleton categories
rows258
null0 (0.0%)
unique258
top_valuehttps://world.openfoodfacts.org/product/8710605030051
top_rate3.88e-03
cardinality258
entropy8.011
entropy_ratio1.000
Top values (rank 1–20)
  1. https://world.openfoodfacts.org/product/8710605030051 — 1
  2. https://world.openfoodfacts.org/product/20170196 — 1
  3. https://world.openfoodfacts.org/product/6921804700269 — 1
  4. https://world.openfoodfacts.org/product/0097339000054 — 1
  5. https://world.openfoodfacts.org/product/0041500888125 — 1
  6. https://world.openfoodfacts.org/product/3166296552214 — 1
  7. https://world.openfoodfacts.org/product/6221033171107 — 1
  8. https://world.openfoodfacts.org/product/8853662056029 — 1
  9. https://world.openfoodfacts.org/product/5020580016999 — 1
  10. https://world.openfoodfacts.org/product/0049733000215 — 1
  11. https://world.openfoodfacts.org/product/6194049100044 — 1
  12. https://world.openfoodfacts.org/product/8710605030044 — 1
  13. https://world.openfoodfacts.org/product/0024463061095 — 1
  14. https://world.openfoodfacts.org/product/20026752 — 1
  15. https://world.openfoodfacts.org/product/0024463061163 — 1
  16. https://world.openfoodfacts.org/product/8853662056067 — 1
  17. https://world.openfoodfacts.org/product/0702382999100 — 1
  18. https://world.openfoodfacts.org/product/9556041131063 — 1
  19. https://world.openfoodfacts.org/product/0016229912437 — 1
  20. https://world.openfoodfacts.org/product/0633148100624 — 1

source categorical

top value is 100.0% of rows
rows258
null0 (0.0%)
unique1
top_valueOpenFoodFacts
top_rate1.000
cardinality1
entropy-0.000
entropy_ratio0.000
Top values (rank 1–20)
  1. OpenFoodFacts — 258

type categorical

top value is 100.0% of rows
rows258
null0 (0.0%)
unique1
top_valuehot_sauce_product
top_rate1.000
cardinality1
entropy-0.000
entropy_ratio0.000
Top values (rank 1–20)
  1. hot_sauce_product — 258