This dataset aggregates UFO sightings by shape, with 28 rows and 5 columns covering shape categories, sighting counts, average durations, and nested sightings/yearly trend data. The numeric fields are highly skewed: avgDuration ranges from 30 to 37,800 with a mean of about 3,749 and skew near 3.95, while count ranges from 1 to 12,877 with a median of just 993.5. Both fields have flagged outliers worth inspecting, most likely a few dominant shape categories stretching the upper tail of each distribution. The shape column has 28 unique values (one row per shape), so it functions as an identifier rather than a grouping variable. Start by looking at which shapes drive the count and duration extremes.
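As a starting point for that inspection, here is a minimal sketch that ranks rows by a numeric field. The field names (`shape`, `count`, `avgDuration`) come from the profile; the helper `top_by`, the shape labels, and most of the values are placeholders for illustration, not rows from the file.

```python
# Placeholder records: only the field names and the min/max extremes
# (1, 12877, 30, 37800) come from the profile. Everything else is invented.
records = [
    {"shape": "shape_a", "count": 12877, "avgDuration": 1500.0},
    {"shape": "shape_b", "count": 990, "avgDuration": 37800.0},
    {"shape": "shape_c", "count": 1, "avgDuration": 30.0},
    {"shape": "shape_d", "count": 997, "avgDuration": 1900.0},
]


def top_by(rows, field, n=3):
    """Return the n rows with the largest value of `field`, largest first."""
    return sorted(rows, key=lambda r: r[field], reverse=True)[:n]


top_count = top_by(records, "count", n=2)
top_duration = top_by(records, "avgDuration", n=2)

for row in top_count:
    print(row["shape"], row["count"])
```

With the real file, `records` would presumably come from `json.load`, assuming the top level is a list of objects. The profile does not confirm that layout.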
saturn
/home/coolhand/html/datavis/data_trove/data/quirky/ufo_shapes_aggregated.json · 28 rows · sample n=28 · seed 42 · 2026-05-01T17:08:23+00:00
Overview
| Field | Value |
|---|---|
| Source | /home/coolhand/html/datavis/data_trove/data/quirky/ufo_shapes_aggregated.json |
| Total rows | 28 |
| Profiled sample | 28 |
| Columns | 5 |
| Generated | 2026-05-01T17:08:23+00:00 |
Insights opt-in
Model-generated narrative. These are opinions, not facts — the stats below are what saturn measured. Generated by: anthropic:claude-opus-4-7.
The column 'shape' enumerates UFO shape descriptors (light, triangle, circle, fireball, sphere, disk, oval, formation, etc.). Every one of the 28 rows holds a distinct value, giving cardinality 28 and entropy_ratio 1.0, so the column behaves as a unique key rather than a categorical feature. The presence of bucket terms like 'unknown' and 'other' alongside specific shapes suggests this is a reference/lookup list of shape categories, not individual observations.
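To see why an all-distinct column yields an entropy ratio of 1.0, here is a small sketch. It assumes the ratio is Shannon entropy of the value frequencies divided by the log of the row count; saturn's actual definition is not stated in this report, and the function name `entropy_ratio` is reused here only for illustration.

```python
import math
from collections import Counter


def entropy_ratio(values):
    """Shannon entropy of value frequencies divided by log(row count).

    Assumed definition: 1.0 when every row is distinct,
    0.0 when a single value repeats in every row.
    """
    n = len(values)
    if n <= 1:
        return 0.0
    counts = Counter(values)
    entropy = -sum((c / n) * math.log(c / n) for c in counts.values())
    return entropy / math.log(n)


# 28 distinct placeholder labels, mirroring the one-row-per-shape layout.
shapes = [f"shape_{i}" for i in range(28)]
ratio_unique = entropy_ratio(shapes)

# A column holding one repeated value sits at the other extreme.
ratio_constant = entropy_ratio(["light"] * 28)
```

Under this definition, any column whose ratio sits at 1.0 carries no grouping information, which matches the report's reading of the shape column as a key.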
The column 'count' is a numeric tally with 28 rows, 24 unique values, and no nulls or zeros, ranging from 1 to 12877 with a median of 993.5 and a mean of 2163.93. The distribution is heavily right-skewed (skew 2.06, kurtosis 4.84), with one outlier flagged at the high end and an IQR of 3786 against a standard deviation of 2876.24.
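The report does not say which rule saturn uses to flag outliers. One common choice is Tukey's fences, which flag values more than 1.5 × IQR beyond the quartiles. The sketch below applies that rule to placeholder counts; only the minimum (1) and maximum (12877) are taken from the profile, and `tukey_outliers` is a hypothetical helper.

```python
import statistics


def tukey_outliers(values, k=1.5):
    """Flag values outside [Q1 - k*IQR, Q3 + k*IQR] (Tukey's fences)."""
    q1, _, q3 = statistics.quantiles(values, n=4)
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    flagged = [v for v in values if v < low or v > high]
    return flagged, (low, high)


# Placeholder tallies, not the real column values.
counts = [1, 40, 120, 300, 650, 990, 997, 1500, 2400, 3900, 4200, 12877]
outliers, (low_fence, high_fence) = tukey_outliers(counts)
print(outliers, high_fence)
```

Different quartile conventions shift the fences slightly, so a profiler using another method could flag a different number of rows on the same data.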
The column 'sightings' was skipped by the profiler, so its kind is unknown and no descriptive statistics were computed. The only confirmed facts are 28 rows and a null rate of 0.0; cardinality and value distribution are unavailable.
The column 'yearlyTrend' was skipped by the profiler, so its kind is unknown and no statistics were computed beyond a row count of 28 and a null rate of 0.0. With no uniqueness, type, or value signals available, its content and role cannot be inferred from this evidence.
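Since the summary describes these two columns as nested data, a quick way to see what a profiler is being asked to handle is to tally the Python type of each field across rows. This is a sketch under assumptions: `column_kinds` is a hypothetical helper, and the inner structure of the sample row is invented, since the report gives no detail on what 'sightings' or 'yearlyTrend' contain.

```python
def column_kinds(rows):
    """Map each field name to the set of Python type names seen in it."""
    kinds = {}
    for row in rows:
        for key, value in row.items():
            kinds.setdefault(key, set()).add(type(value).__name__)
    return kinds


# Placeholder row: field names match the profile, contents are invented.
sample_rows = [
    {
        "shape": "shape_a",
        "count": 3,
        "avgDuration": 120.0,
        "sightings": [{"placeholder": 1}],
        "yearlyTrend": {"placeholder": 2},
    }
]

kinds = column_kinds(sample_rows)
nested = sorted(k for k, v in kinds.items() if v & {"list", "dict"})
print(nested)
```

Fields that show up as `list` or `dict` would need to be flattened or summarized separately before scalar statistics such as cardinality can be computed.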
The column 'avgDuration' is likely a per-group average duration metric (probably seconds), summarised across 28 unique entities with no nulls. The distribution is heavily right-skewed (skew 3.95, kurtosis 15.42): the median is 1906.65 but the max is 37800, roughly 20x the median, and 2 outliers (7.14%) pull the mean up to 3748.62. The standard deviation (7305.74) exceeds the mean, which is consistent with a long right tail.
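The figures quoted for this column can be checked with a few lines of arithmetic, and the skew and kurtosis values can be understood through a moment-based sketch. The estimator below uses population moments and reports excess kurtosis; whether saturn uses the same convention is not stated, so treat `shape_stats` as an illustrative helper rather than a reproduction of the profiler.

```python
def shape_stats(values):
    """Population-moment mean, std, skewness, and excess kurtosis."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n
    m3 = sum((v - mean) ** 3 for v in values) / n
    m4 = sum((v - mean) ** 4 for v in values) / n
    return {
        "mean": mean,
        "std": m2 ** 0.5,
        "skew": m3 / m2 ** 1.5,
        "excess_kurtosis": m4 / m2 ** 2 - 3,
    }


# Placeholder samples: a symmetric one and one with a single large value.
symmetric = shape_stats([1, 2, 3, 4, 5])
long_tail = shape_stats([30, 60, 90, 120, 150, 5000])

# Arithmetic on the figures reported for this column.
max_to_median = round(37800 / 1906.65, 1)   # max divided by median
outlier_share = round(2 / 28 * 100, 2)      # 2 flagged rows out of 28, in percent
std_to_mean = 7305.74 / 3748.62             # coefficient of variation

print(max_to_median, outlier_share, round(std_to_mean, 2))
```

A single extreme value is enough to push both skewness and the standard deviation up sharply in a 28-row column, which is why the median is the safer summary here.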
Numeric correlation
shape categorical
Top values (rank 1–20)
- light — 1
- triangle — 1
- circle — 1
- fireball — 1
- unknown — 1
- other — 1
- sphere — 1
- disk — 1
- oval — 1
- formation — 1
- cigar — 1
- changing — 1
- flash — 1
- rectangle — 1
- cylinder — 1
- diamond — 1
- chevron — 1
- teardrop — 1
- egg — 1
- cone — 1