saturn

/home/coolhand/html/datavis/data_trove/data/quirky/ufo_by_state.json 58 rows sample n=58 seed 42 2026-06-22T00:49:50+00:00

Overview

Source	/home/coolhand/html/datavis/data_trove/data/quirky/ufo_by_state.json
Total rows	58
Profiled sample	58
Columns	2
Generated	2026-06-22T00:49:50+00:00

Show data table

Per-column null rate across the corpus.
column	kind	null %
state	categorical	0.0%
count	numeric	0.0%

Insights opt-in

Model-generated narrative. These are opinions, not facts — the stats below are what saturn measured. Generated by: anthropic:default.

Dataset high anthropic:default

This dataset contains UFO sighting counts aggregated by U.S. state, covering all 58 rows with no missing values. The count distribution is heavily right-skewed (skew ~2.93) with high kurtosis and 4 outlier states that far exceed the norm — the max of 16,197 sightings dwarfs the median of 1,510, suggesting a handful of states dominate UFO reports. The state column has one entry per state, so the interesting story is entirely in how unevenly sightings are distributed across states. Look closely at the top states to see which ones are driving the bulk of reported sightings.

count medium anthropic:default

This column appears to be an event or item count, likely representing frequency or volume of some activity across 58 records. The distribution is severely right-skewed (skew = 2.93, kurtosis = 11.75) with a min of 1 and a max of 16,197 against a median of only 1,510.5, indicating a handful of dominant observations pulling the mean (2,274.7) well above the median. Four outliers (≈6.9% of rows) are driving the extreme tail, and the standard deviation (2,642.8) exceeds the mean, confirming high dispersion.

state high anthropic:default

This column contains US state abbreviations, with exactly 58 unique values across 58 rows — meaning every row has a distinct state code and the dataset contains one record per state (plus potentially DC and a territory or two beyond the standard 50). Entropy ratio of 1.0 and a top_rate of 0.0172 (1/58) confirm perfectly uniform distribution with zero repetition, making this effectively a lookup key rather than a grouping variable. The long_tail alert is technically correct but misleading — there is no tail, just perfect cardinality.

state categorical

58 singleton categories

rows58

null0 (0.0%)

unique58

top_valueCA

top_rate0.017

cardinality58

entropy5.858

entropy_ratio1.000

Show data table

Top values for state (20 unique shown, of 58 total).
value	count	share
CA	1	1.7%
FL	1	1.7%
WA	1	1.7%
TX	1	1.7%
NY	1	1.7%
PA	1	1.7%
AZ	1	1.7%
OH	1	1.7%
IL	1	1.7%
NC	1	1.7%
MI	1	1.7%
OR	1	1.7%
CO	1	1.7%
NJ	1	1.7%
MO	1	1.7%
GA	1	1.7%
IN	1	1.7%
MA	1	1.7%
VA	1	1.7%
WI	1	1.7%

Top values (rank 1–20)

CA — 1
FL — 1
WA — 1
TX — 1
NY — 1
PA — 1
AZ — 1
OH — 1
IL — 1
NC — 1
MI — 1
OR — 1
CO — 1
NJ — 1
MO — 1
GA — 1
IN — 1
MA — 1
VA — 1
WI — 1

count numeric

skew=+2.93 6.9% rows beyond 1.5 IQR

rows58

null0 (0.0%)

unique55

min1.000

max16,197

mean2,275

median1,510

std2,643

q1648.750

q32,789

iqr2,140

skew2.925

kurtosis11.754

n_outliers4

outlier_rate0.069

zero_rate0.000

Show data table

Histogram bins for count (median: 1510.5).
bin	count
1 – 2315	38
2315 – 4628	13
4628 – 6942	4
6942 – 9256	2
9256 – 1.157e+04	0
1.157e+04 – 1.388e+04	0
1.388e+04 – 1.62e+04	1