
quirky ufo shapes aggregated

saturn notebook · generated 2026-05-01

Overview

Source: /home/coolhand/html/datavis/data_trove/data/quirky/ufo_shapes_aggregated.json

Saturn profiled 28 rows across 5 columns. The stats below are deterministic and machine-readable; the prose is a language-model interpretation of those stats (opt-in and added after the fact; the model never sees raw rows).

[2]:
!pip install saturn-dissect
import subprocess
subprocess.run([
    "saturn", "analyze", "/home/coolhand/html/datavis/data_trove/data/quirky/ufo_shapes_aggregated.json",
    "--findings", "quirky-ufo_shapes_aggregated.json",
    "--llm", "anthropic:claude-opus-4-7",
])

Summary confidence: high

This dataset aggregates UFO sightings by shape, with 28 rows and 5 columns covering shape categories, sighting counts, average durations, and nested sightings/yearly trend data. The numeric fields are highly skewed: avgDuration ranges from 30 to 37,800 with a mean of about 3,749 and skew near 3.95, while count ranges from 1 to 12,877 with a median of just 993.5. Both fields contain outliers worth inspecting (1 in count, 2 in avgDuration), likely a few dominant shape categories pulling the distribution. The shape column has 28 unique values (one row per shape), so it functions as an identifier rather than a grouping variable. Start by looking at which shapes drive the count and duration extremes.

citing: row_count · column_count · columns[avgDuration].stats · columns[count].stats · columns[shape].stats · columns[shape].top_values

Out[4]:

saturn.schema() · 5 columns

column       kind         n   null%  unique  alerts
shape        categorical  28  0.0%   28      long_tail
count        numeric      28  0.0%   24      high_skew
sightings    unknown      28  0.0%   -       skipped
yearlyTrend  unknown      28  0.0%   -       skipped
avgDuration  numeric      28  0.0%   28      high_skew, outliers
Fig 1.
count · Check the long right tail — most shapes have low counts but a few exceed several thousand sightings.
Histogram bins for count (median: 993.5).
bin                    count
1 – 2576               19
2576 – 5151            6
5151 – 7727            2
7727 – 1.03e+04        0
1.03e+04 – 1.288e+04   1
Fig 2.
avgDuration · Look for the extreme outliers stretching up to 37,800 against a median near 1,907.
Histogram bins for avgDuration (median: 1906.6487044449127).
bin                     count
30 – 7584               26
7584 – 1.514e+04        0
1.514e+04 – 2.269e+04   1
2.269e+04 – 3.025e+04   0
3.025e+04 – 3.78e+04    1
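
The bin edges in Fig 1 and Fig 2 are consistent with five equal-width bins spanning each column's min and max. The sketch below reproduces that arithmetic. It assumes equal-width binning, which the report does not state explicitly, and `equal_width_edges` is a hypothetical helper name rather than part of Saturn.

```python
def equal_width_edges(lo, hi, n_bins=5):
    """Return n_bins + 1 evenly spaced edges from lo to hi (assumed binning rule)."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]

# min and max are taken from the report's stats for each column
count_edges = equal_width_edges(1, 12877)
duration_edges = equal_width_edges(30, 37800)

# Rounded edges, for comparison against the bin labels in Fig 1 and Fig 2
print([round(e) for e in count_edges])
print([round(e) for e in duration_edges])
```

With bins this wide relative to the medians, nearly all rows fall in the first bin, which is why the histograms show the tail but little of the shape of the bulk.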
Fig 3.
shape · Each shape appears once, so use this as a lookup of the 28 categories rather than a frequency view.
Top values for shape (20 unique shown, of 28 total).
value      count  share
light      1      3.6%
triangle   1      3.6%
circle     1      3.6%
fireball   1      3.6%
unknown    1      3.6%
other      1      3.6%
sphere     1      3.6%
disk       1      3.6%
oval       1      3.6%
formation  1      3.6%
cigar      1      3.6%
changing   1      3.6%
flash      1      3.6%
rectangle  1      3.6%
cylinder   1      3.6%
diamond    1      3.6%
chevron    1      3.6%
teardrop   1      3.6%
egg        1      3.6%
cone       1      3.6%
Fig 4.
Per-column null rate across the corpus. Columns are ordered by input position.
column       kind         null %
shape        categorical  0.0%
count        numeric      0.0%
sightings    unknown      0.0%
yearlyTrend  unknown      0.0%
avgDuration  numeric      0.0%
Fig 5.
Pearson correlation across numeric columns (sampled, bounded).
Pearson correlation across 2 numeric columns (values clipped to 2 decimals).
             count  avgDuration
count        +1.00  +0.18
avgDuration  +0.18  +1.00
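
For reference, a Pearson coefficient like the one in Fig 5 is the covariance of two columns divided by the product of their standard deviations. The sketch below shows that calculation on hypothetical values. These are not the report's rows, so the result will not equal +0.18.

```python
import math

def pearson(xs, ys):
    """Pearson correlation: covariance over the product of standard deviations."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)

# Hypothetical values for illustration only, NOT the profiled data
count = [10, 200, 3000, 45, 12000]
avg_duration = [600, 1500, 900, 40000, 2000]

r = pearson(count, avg_duration)
print(round(r, 2))
```

With columns as skewed as these, a handful of extreme rows can dominate the coefficient, so a weak value is not strong evidence that the two fields are unrelated.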

shape categorical identifier

This column enumerates UFO shape descriptors (light, triangle, circle, fireball, sphere, disk, oval, formation, etc.). Every one of the 28 rows holds a distinct value, giving cardinality 28 and entropy_ratio 1.0 — the column behaves as a unique key rather than a categorical feature. The presence of bucket terms like 'unknown' and 'other' alongside specific shapes suggests this is a reference/lookup list of shape categories, not observations.

Treatment: Treat as a lookup dimension; left-join on this key rather than using it as a model feature.

anthropic:claude-opus-4-7 · confidence high
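
The left-join treatment can be sketched in plain Python as follows. Both tables are hypothetical stand-ins: the per-shape lookup values and the `sighting_id` field are invented for illustration and do not come from the profiled file.

```python
# Hypothetical per-shape lookup, keyed by the unique shape value
shape_lookup = {
    "light": {"count": 100},
    "triangle": {"count": 50},
    "circle": {"count": 25},
}

# Hypothetical fact rows that reference a shape
sightings = [
    {"sighting_id": 1, "shape": "light"},
    {"sighting_id": 2, "shape": "circle"},
    {"sighting_id": 3, "shape": "egg"},  # no match in the lookup
]

# Left join: keep every fact row; unmatched keys get count=None
joined = [
    {**row, **shape_lookup.get(row["shape"], {"count": None})}
    for row in sightings
]

for row in joined:
    print(row)
```

Because every shape appears exactly once in the lookup, the join cannot duplicate fact rows, which is the property that makes the column usable as a key.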
Out[11]:

saturn.columns["shape"].stats

stat           value
n              28
nulls          0 (0.0%)
unique         28
top_value      light
top_rate       0.03571
cardinality    28
entropy        4.807
entropy_ratio  1
alert: long_tail · 28 singleton categories
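
The entropy figures above are what you would expect from 28 equally frequent values if entropy is measured in bits. The sketch below checks this, assuming Saturn uses base-2 Shannon entropy and defines entropy_ratio as entropy divided by log2(cardinality). The reported numbers are consistent with both assumptions, but neither is confirmed by the report.

```python
import math
from collections import Counter

def shannon_entropy_bits(values):
    """Shannon entropy (base 2) of the empirical distribution of values."""
    counts = Counter(values)
    total = sum(counts.values())
    return -sum((c / total) * math.log2(c / total) for c in counts.values())

# 28 distinct placeholder labels stand in for the 28 shape names
labels = [f"shape_{i}" for i in range(28)]

entropy = shannon_entropy_bits(labels)
max_entropy = math.log2(len(set(labels)))  # entropy of a uniform distribution
entropy_ratio = entropy / max_entropy
top_rate = 1 / len(labels)                 # share of any single value

print(round(entropy, 3), round(entropy_ratio, 3), round(top_rate, 5))
```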
Fig 6.
Top values for shape.
Top values for shape (20 unique shown, of 28 total).
value      count  share
light      1      3.6%
triangle   1      3.6%
circle     1      3.6%
fireball   1      3.6%
unknown    1      3.6%
other      1      3.6%
sphere     1      3.6%
disk       1      3.6%
oval       1      3.6%
formation  1      3.6%
cigar      1      3.6%
changing   1      3.6%
flash      1      3.6%
rectangle  1      3.6%
cylinder   1      3.6%
diamond    1      3.6%
chevron    1      3.6%
teardrop   1      3.6%
egg        1      3.6%
cone       1      3.6%

count numeric feature

Numeric tally column with 28 rows, 24 unique values, no nulls or zeros, ranging from 1 to 12877 with a median of 993.5 and mean of 2163.93. The distribution is heavily right-skewed (skew 2.06, kurtosis 4.84) with one outlier flagged at the high end and an IQR of 3786 against a std of 2876.24.

Treatment: Log-transform before modelling to tame the right skew.

anthropic:claude-opus-4-7 · confidence high
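
To illustrate the effect of the suggested log transform, the sketch below compares skewness before and after applying `log1p` to a small set of hypothetical right-skewed tallies. The values are invented (only the min and max mirror the report), and the population skewness formula used here may differ from the estimator Saturn reports.

```python
import math

def skewness(xs):
    """Population skewness: the third standardized moment."""
    n = len(xs)
    mean = sum(xs) / n
    m2 = sum((x - mean) ** 2 for x in xs) / n
    m3 = sum((x - mean) ** 3 for x in xs) / n
    return m3 / m2 ** 1.5

# Hypothetical right-skewed tallies, NOT the profiled rows
counts = [1, 5, 40, 130, 900, 1100, 3900, 12877]

# log1p(x) = log(1 + x); safe even if a zero count ever appears
logged = [math.log1p(c) for c in counts]

print(round(skewness(counts), 2), round(skewness(logged), 2))
```

Since the column has a zero_rate of 0, a plain `log` would also work here; `log1p` is simply the more defensive choice.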
Out[14]:

saturn.columns["count"].stats

stat          value
n             28
nulls         0 (0.0%)
unique        24
min           1
max           12,877
mean          2164
median        993.5
std           2876
q1            134.2
q3            3920
iqr           3,786
skew          2.06
kurtosis      4.845
n_outliers    1
outlier_rate  0.03571
zero_rate     0
alert: high_skew · skew=+2.06
Fig 7.
Distribution of count. Vertical dash marks the median.
Histogram bins for count (median: 993.5).
bin                    count
1 – 2576               19
2576 – 5151            6
5151 – 7727            2
7727 – 1.03e+04        0
1.03e+04 – 1.288e+04   1

sightings unknown other

The column 'sightings' was skipped by the profiler, so its kind is unknown and no descriptive statistics were computed. The only confirmed facts are 28 rows and a null rate of 0.0; cardinality and value distribution are unavailable.

Treatment: Re-profile or inspect manually to determine type before any downstream use.

anthropic:claude-opus-4-7 · confidence low
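
One way to carry out the manual inspection is to tally the Python types found in the column. The sketch below does this on a hypothetical stand-in for the source file. It assumes the JSON is a top-level array of records, and the nested layout shown for `sightings` is invented, since the report does not reveal the real structure.

```python
import json
from collections import Counter

def value_types(records, column):
    """Tally the Python type names of one column's values across records."""
    return Counter(type(rec.get(column)).__name__ for rec in records)

# Hypothetical sample; the real file's nested layout is unknown
sample = json.loads("""
[
  {"shape": "light", "count": 2, "sightings": [{"city": "A"}, {"city": "B"}]},
  {"shape": "egg", "count": 1, "sightings": [{"city": "C"}]}
]
""")

print(value_types(sample, "sightings"))
```

To run it against the real data, replace `sample` with the result of `json.load` on the source path. A tally dominated by `list` or `dict` would explain why a scalar profiler skipped the column.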
Out[17]:

saturn.columns["sightings"].stats

stat    value
n       28
nulls   0 (0.0%)
unique  -
alert: skipped · no profiler for kind=unknown

yearlyTrend unknown other

The column 'yearlyTrend' was skipped by the profiler, so its kind is unknown and no statistics were computed beyond a row count of 28 and a null rate of 0.0. With no uniqueness, type, or value signals available, its content and role cannot be inferred from this evidence.

Treatment: Re-profile this column with parsing enabled before deciding on any downstream handling.

anthropic:claude-opus-4-7 · confidence low
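
If the column turns out to hold nested data, flattening it into one row per shape and year would let the numeric profiler run on it. The sketch below assumes a purely hypothetical layout, a `{year: value}` mapping per shape. The report gives no evidence for this structure, and `flatten_trend` is an illustrative helper, not a Saturn function.

```python
def flatten_trend(records, key="yearlyTrend"):
    """Expand a nested per-row mapping into flat (shape, year, value) rows."""
    rows = []
    for rec in records:
        nested = rec.get(key) or {}  # tolerate a missing or empty mapping
        for year, value in sorted(nested.items()):
            rows.append({"shape": rec["shape"], "year": int(year), "value": value})
    return rows

# Hypothetical layout, not confirmed by the report
records = [
    {"shape": "light", "yearlyTrend": {"2001": 3, "2002": 5}},
    {"shape": "egg", "yearlyTrend": {"2002": 1}},
]

flat = flatten_trend(records)
for row in flat:
    print(row)
```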
Out[19]:

saturn.columns["yearlyTrend"].stats

stat    value
n       28
nulls   0 (0.0%)
unique  -
alert: skipped · no profiler for kind=unknown

avgDuration numeric feature

Likely a per-group average duration metric (probably seconds) summarised across 28 unique entities with no nulls. The distribution is heavily right-skewed (skew 3.95, kurtosis 15.42) with a median of 1906.65 but a max of 37800 — roughly 20x the median — and 2 outliers (7.14%) pulling the mean up to 3748.62. Standard deviation (7305.74) exceeds the mean, confirming a long tail.

Treatment: Log-transform before modelling to tame the right tail and outliers.

anthropic:claude-opus-4-7 · confidence high
Out[21]:

saturn.columns["avgDuration"].stats

stat          value
n             28
nulls         0 (0.0%)
unique        28
min           30
max           37,800
mean          3749
median        1907
std           7306
q1            926.6
q3            3130
iqr           2203
skew          3.948
kurtosis      15.42
n_outliers    2
outlier_rate  0.07143
zero_rate     0
alert: high_skew · skew=+3.95
alert: outliers · 7.1% rows beyond 1.5 IQR
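
The outlier alert can be checked against the reported quartiles. The sketch below assumes the usual Tukey rule, with fences at 1.5 × IQR beyond q1 and q3, which matches the alert's wording. Because the quartiles in the report are rounded, the fences are approximate.

```python
def tukey_fences(q1, q3, k=1.5):
    """Return (lower, upper) fences at k * IQR beyond the quartiles."""
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr

# Quartiles and maxima as reported in the stats tables
lo_duration, hi_duration = tukey_fences(926.6, 3130)  # avgDuration
lo_count, hi_count = tukey_fences(134.2, 3920)        # count

# The maximum of each column lies above its upper fence
print(round(hi_duration, 1), 37800 > hi_duration)
print(round(hi_count, 1), 12877 > hi_count)
```

Both lower fences are negative, so with minimums of 30 and 1 all flagged rows must sit in the upper tail.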
Fig 8.
Distribution of avgDuration. Vertical dash marks the median.
Histogram bins for avgDuration (median: 1906.6487044449127).
bin                     count
30 – 7584               26
7584 – 1.514e+04        0
1.514e+04 – 2.269e+04   1
2.269e+04 – 3.025e+04   0
3.025e+04 – 3.78e+04    1

How to cite

BibTeX
@misc{saturn-quirky-ufo-shapes-aggregated-2026,
  author       = {Steuber, Luke},
  title        = {Saturn reading: quirky ufo shapes aggregated},
  year         = {2026},
  howpublished = {\url{https://dr.eamer.dev/saturn/view/quirky-ufo_shapes_aggregated}},
  note         = {Profiled with saturn-dissect v0.2.0, prompt saturn-insight-v2, model anthropic:claude-opus-4-7},
}
APA
Steuber, L. (2026). Saturn reading: quirky ufo shapes aggregated. Source: /home/coolhand/html/datavis/data_trove/data/quirky/ufo_shapes_aggregated.json. Profiled with saturn-dissect v0.2.0 (saturn-insight-v2, anthropic:claude-opus-4-7). Retrieved from https://dr.eamer.dev/saturn/view/quirky-ufo_shapes_aggregated