{"columns":[{"alerts":[{"code":"long_tail","level":"info","message":"7 singleton categories"}],"column":"name","extras":{"singletons":7,"top_values":[["Whole Insects",1],["Flour/Powder",1],["Protein Products",1],["Snack Bars",1],["Confectionery",1],["Whole Snacks",1],["Crackers",1]]},"kind":"categorical","n":7,"n_null":0,"n_unique":7,"null_rate":0.0,"stats":{"cardinality":7,"entropy":2.807354922057604,"entropy_ratio":1.0,"top_rate":0.14285714285714285,"top_value":"Whole Insects"}},{"alerts":[{"code":"outliers","level":"warn","message":"14.3% rows beyond 1.5 IQR"}],"column":"count","extras":{"histogram":{"counts":[6,0,0,0,1],"edges":[1.0,12.2,23.4,34.599999999999994,45.8,57.0]},"sample":[57.0,9.0,3.0,3.0,1.0,1.0,1.0]},"kind":"numeric","n":7,"n_null":0,"n_unique":4,"null_rate":0.0,"stats":{"iqr":5.0,"kurtosis":1.9873414014577229,"max":57.0,"mean":10.714285714285714,"median":3.0,"min":1.0,"n_outliers":1,"outlier_rate":0.14285714285714285,"q1":1.0,"q3":6.0,"skew":1.9649129223845474,"std":20.60513112240319,"zero_rate":0.0}}],"insights":{"errors":[],"insights":[{"confidence":"medium","critiques":[],"evidence_keys":["mean","median","max","std","n_outliers","outlier_rate","skew","top_value","n_unique"],"featured_charts":[{"caption":"Look for the single extreme outlier at 57 that dwarfs all other form-type counts.","column":"count","kind":"histogram"},{"caption":"Compare counts across all 7 product forms to see which category dominates the edible-insect market.","column":"name","kind":"bar"},{"caption":"The donut shows how lopsided the share of one form type is relative to the rest combined.","column":"name","kind":"donut"}],"model":"anthropic:default","narrative":"This tiny dataset categorises insect-based food products into 7 form types and records how many products fall into each category. With only 7 rows, the big story is the extreme skew in counts: most form types have just 1\u20136 products, but one outlier category reaches 57, pulling the mean (10.7) far above the median (3.0). That dominant category is worth identifying immediately, as it likely represents the most commercially developed segment of the edible-insect market. The high standard deviation (20.6) confirms the distribution is anything but uniform.","scope":"dataset","target":"__global__"},{"confidence":"medium","critiques":[],"evidence_keys":["n","n_unique","stats.max","stats.median","stats.mean","stats.skew","stats.std","stats.n_outliers","stats.outlier_rate","alerts"],"model":"anthropic:default","narrative":"This column appears to be a frequency or occurrence count, likely recording how many times something was observed across 7 rows with only 4 distinct values. The distribution is severely right-skewed (skew = 1.96) with one outlier: the max of 57.0 sits far above the median of 3.0 and mean of 10.71, pulling the standard deviation to 20.61 \u2014 an unusually high spread for such a small dataset. With only 7 rows total, this column provides very limited statistical signal.","role":"feature","scope":"column","target":"count","treatment":"Investigate the outlier row (value 57.0) for data quality issues; if retained, log-transform before modelling to reduce skew."},{"confidence":"high","critiques":[],"evidence_keys":["n","n_unique","entropy_ratio","top_rate","top_values","alerts"],"model":"anthropic:default","narrative":"This column contains product category names for what appears to be an insect-based food product taxonomy, with 7 distinct categories such as 'Whole Insects', 'Flour/Powder', 'Snack Bars', and 'Crackers'. Every category appears exactly once (top_rate = 0.143, equal to 1/7), yielding a perfectly uniform distribution and maximum entropy ratio of 1.0. The 'long_tail' alert is a statistical artefact of this uniformity rather than a genuine skew signal. With only 7 rows and 7 unique values, this is likely a small reference/lookup table rather than a transactional dataset.","role":"label","scope":"column","target":"name","treatment":"Use as a categorical label or join key against a larger fact table; no encoding needed at this cardinality."}],"providers":["anthropic:default"],"total_usage":{"completion_tokens":855,"prompt_tokens":2027,"total_tokens":2882}},"language_counts":{},"meta":{"generated_at":"2026-06-21T23:46:15+00:00","mode":"full","row_count":7,"sampled_rows":7,"seed":42,"source":"/home/coolhand/html/datavis/data_trove/data/quirky/insects_by_form.json"},"notes":[],"saturn_version":"0.2.0","schema":{"count":"numeric","name":"categorical"}}
