saturn·

quirky aurora

saturn notebook · generated 2026-05-01 Report Notebook

Overview

Source: /home/coolhand/html/datavis/data_trove/data/quirky/aurora.json

Saturn profiled 300 rows across 8 columns. The stats below are deterministic and machine-readable; the prose is a language-model interpretation of those stats (opt-in, added after the fact, never sees raw rows).

[2]:
!pip install saturn-dissect
import subprocess
subprocess.run([
    "saturn", "analyze", "/home/coolhand/html/datavis/data_trove/data/quirky/aurora.json",
    "--findings", "quirky-aurora.json",
    "--llm", "anthropic:claude-opus-4-7",
])

Summary confidence: high

This dataset captures 300 minute-by-minute aurora and solar wind observations starting 2026-01-20, with 8 columns covering geomagnetic indices (kp_index, estimated_kp, intensity), solar wind conditions (speed, density), and a categorical activity label. The activity field is heavily skewed toward 'Moderate Storm' (172 of 300, ~57%), with only 8 'Quiet' readings — worth a closer look since this dominates the storyline. The kp_index and intensity columns are left-skewed and pile up at their max values (median equals max), with ~15% flagged as low-side outliers, suggesting the sample is a sustained storm period rather than a balanced range. Solar wind speed is also unusually elevated (min 881, max 1051 km/s), reinforcing that this is a storm-window snapshot rather than typical conditions.

citing: activity.top_values · activity.top_rate · kp_index.stats · estimated_kp.stats · intensity.stats · solar_wind_speed.stats · solar_wind_density.stats · row_count

Fig 1.
activity · Shows how 'Moderate Storm' dominates the period, with 'Quiet' barely represented.
Show data table
Top values for activity (5 unique shown, of 5 total).
valuecountshare
Moderate Storm17257.3%
Minor Storm6521.7%
Unsettled3812.7%
Active175.7%
Quiet82.7%
Fig 2.
kp_index · Reveals the strong left skew with most values pinned at kp=6 and a tail of low-end outliers.
Show data table
Histogram bins for kp_index (median: 6.0).
bincount
0 – 0.35294
0.3529 – 0.70590
0.7059 – 1.0594
1.059 – 1.4120
1.412 – 1.7650
1.765 – 2.11816
2.118 – 2.4710
2.471 – 2.8240
2.824 – 3.17622
3.176 – 3.5290
3.529 – 3.8820
3.882 – 4.23517
4.235 – 4.5880
4.588 – 4.9410
4.941 – 5.29465
5.294 – 5.6470
5.647 – 6172
Fig 3.
intensity · Mirrors kp_index — clustered near 0.667 with ~15% outliers on the low side.
Show data table
Histogram bins for intensity (median: 0.667).
bincount
0 – 0.039244
0.03924 – 0.078470
0.07847 – 0.11774
0.1177 – 0.15690
0.1569 – 0.19620
0.1962 – 0.235416
0.2354 – 0.27460
0.2746 – 0.31390
0.3139 – 0.353122
0.3531 – 0.39240
0.3924 – 0.43160
0.4316 – 0.470817
0.4708 – 0.51010
0.5101 – 0.54930
0.5493 – 0.588565
0.5885 – 0.62780
0.6278 – 0.667172
Fig 4.
solar_wind_speed · Confirms elevated wind speeds (881–1051 km/s) consistent with an active storm window.
Show data table
Histogram bins for solar_wind_speed (median: 940.15).
bincount
881.6 – 891.66
891.6 – 901.624
901.6 – 911.568
911.5 – 921.522
921.5 – 931.517
931.5 – 941.514
941.5 – 951.510
951.5 – 961.519
961.5 – 971.414
971.4 – 981.49
981.4 – 991.46
991.4 – 10017
1001 – 101111
1011 – 102114
1021 – 103118
1031 – 104131
1041 – 105110
Fig 5.
solar_wind_density · The most well-behaved numeric variable — roughly symmetric around 3.1 with few outliers.
Show data table
Histogram bins for solar_wind_density (median: 3.1399999999999997).
bincount
0.23 – 0.70596
0.7059 – 1.18215
1.182 – 1.65828
1.658 – 2.13429
2.134 – 2.60937
2.609 – 3.08533
3.085 – 3.56134
3.561 – 4.03738
4.037 – 4.51334
4.513 – 4.98931
4.989 – 5.4656
5.465 – 5.9414
5.941 – 6.4161
6.416 – 6.8922
6.892 – 7.3681
7.368 – 7.8440
7.844 – 8.321
Fig 6.
Per-column null rate across the corpus. Columns are ordered by input position.
Show data table
Per-column null rate across the corpus.
columnkindnull %
time_tagcategorical0.0%
kp_indexnumeric0.0%
estimated_kpnumeric0.0%
activitycategorical0.0%
intensitynumeric0.0%
solar_wind_timecategorical0.0%
solar_wind_speednumeric0.0%
solar_wind_densitynumeric0.0%
Fig 7.
Pearson correlation across numeric columns (sampled, bounded).
Show data table
Pearson correlation across 5 numeric columns (values clipped to 2 decimals).
kp_indexestimated_kpintensitysolar_wind_speedsolar_wind_density
kp_index+1.00+0.98+1.00+0.60+0.44
estimated_kp+0.98+1.00+0.98+0.65+0.50
intensity+1.00+0.98+1.00+0.60+0.44
solar_wind_speed+0.60+0.65+0.60+1.00+0.53
solar_wind_density+0.44+0.50+0.44+0.53+1.00

time_tag categorical timestamp

This is an ISO-8601 timestamp column at minute resolution, with all 300 values unique (entropy_ratio 1.0) and zero nulls. The visible top values form a contiguous one-minute sequence starting 2026-01-20T00:06:00, consistent with a regular time index rather than event timestamps. Cardinality equals row count, so it acts as a row key over time.

Treatment: Parse to datetime and use as the time index; do not one-hot encode.

anthropic:claude-opus-4-7 · confidence high
Out[13]:

saturn.columns["time_tag"].stats

statvalue
n300
nulls0 (0.0%)
unique300
top_value 2026-01-20T00:06:00
top_rate 0.003333
cardinality 300
entropy 8.229
entropy_ratio 1
alert: long_tail300 singleton categories
Fig 8.
Top values for time_tag.
Show data table
Top values for time_tag (20 unique shown, of 300 total).
valuecountshare
2026-01-20T00:06:0010.3%
2026-01-20T00:07:0010.3%
2026-01-20T00:08:0010.3%
2026-01-20T00:09:0010.3%
2026-01-20T00:10:0010.3%
2026-01-20T00:11:0010.3%
2026-01-20T00:12:0010.3%
2026-01-20T00:13:0010.3%
2026-01-20T00:14:0010.3%
2026-01-20T00:15:0010.3%
2026-01-20T00:16:0010.3%
2026-01-20T00:17:0010.3%
2026-01-20T00:18:0010.3%
2026-01-20T00:19:0010.3%
2026-01-20T00:20:0010.3%
2026-01-20T00:21:0010.3%
2026-01-20T00:22:0010.3%
2026-01-20T00:23:0010.3%
2026-01-20T00:24:0010.3%
2026-01-20T00:25:0010.3%

kp_index numeric feature

Numeric column on a small 0-6 integer scale with only 7 distinct values, consistent with a Kp geomagnetic index reading. The distribution is heavily concentrated at the high end (median and Q3 both 6.0, mean 5.09) and strongly left-skewed (skew -1.68), with 46 low-side outliers flagged (15.3%). No nulls, and zeros are rare (1.3%).

Treatment: Treat as ordinal categorical (7 levels) rather than continuous; outliers are real low-Kp readings, do not clip.

anthropic:claude-opus-4-7 · confidence high
Out[16]:

saturn.columns["kp_index"].stats

statvalue
n300
nulls0 (0.0%)
unique7
min 0
max 6
mean 5.09
median 6
std 1.396
q1 5
q3 6
iqr 1
skew -1.683
kurtosis 2.149
n_outliers 46
outlier_rate 0.1533
zero_rate 0.01333
alert: outliers15.3% rows beyond 1.5 IQR
Fig 9.
Distribution of kp_index. Vertical dash marks the median.
Show data table
Histogram bins for kp_index (median: 6.0).
bincount
0 – 0.35294
0.3529 – 0.70590
0.7059 – 1.0594
1.059 – 1.4120
1.412 – 1.7650
1.765 – 2.11816
2.118 – 2.4710
2.471 – 2.8240
2.824 – 3.17622
3.176 – 3.5290
3.529 – 3.8820
3.882 – 4.23517
4.235 – 4.5880
4.588 – 4.9410
4.941 – 5.29465
5.294 – 5.6470
5.647 – 6172

estimated_kp numeric feature

This looks like an estimated Kp geomagnetic index value, bounded between 0.0 and 6.33 across 300 rows with only 18 unique values, consistent with the discrete third-step Kp scale. The distribution is heavily left-skewed (skew -1.30) with the median pinned at the maximum 6.33 and Q3 also at 6.33, meaning at least half the rows sit at the ceiling. 18 low-side outliers (6% rate) and a near-zero zero_rate (0.67%) confirm a long thin tail toward quiet conditions.

Treatment: Treat as ordinal/discrete and consider binning or a rank transform before modelling given the ceiling-heavy left skew.

anthropic:claude-opus-4-7 · confidence high
Out[19]:

saturn.columns["estimated_kp"].stats

statvalue
n300
nulls0 (0.0%)
unique18
min 0
max 6.33
mean 5.183
median 6.33
std 1.538
q1 4.67
q3 6.33
iqr 1.66
skew -1.295
kurtosis 0.9485
n_outliers 18
outlier_rate 0.06
zero_rate 0.006667
alert: outliers6.0% rows beyond 1.5 IQR
Fig 10.
Distribution of estimated_kp. Vertical dash marks the median.
Show data table
Histogram bins for estimated_kp (median: 6.33).
bincount
0 – 0.37244
0.3724 – 0.74472
0.7447 – 1.1172
1.117 – 1.4890
1.489 – 1.8626
1.862 – 2.2344
2.234 – 2.6066
2.606 – 2.9796
2.979 – 3.35116
3.351 – 3.72410
3.724 – 4.0964
4.096 – 4.4683
4.468 – 4.84163
4.841 – 5.2132
5.213 – 5.5850
5.585 – 5.9584
5.958 – 6.33168

activity categorical label

Categorical descriptor of geomagnetic or space-weather activity level, with 5 ordered classes from 'Quiet' to 'Moderate Storm'. The distribution is heavily skewed toward storm conditions: 'Moderate Storm' alone accounts for 57.3% (172/300) of rows, while 'Quiet' appears just 8 times, suggesting the dataset is filtered to disturbed periods rather than representing typical activity. Entropy ratio of 0.73 confirms moderate concentration on the top class.

Treatment: Treat as ordinal target; consider class-weighting or stratified sampling given the imbalance toward 'Moderate Storm'.

anthropic:claude-opus-4-7 · confidence high
Out[22]:

saturn.columns["activity"].stats

statvalue
n300
nulls0 (0.0%)
unique5
top_value Moderate Storm
top_rate 0.5733
cardinality 5
entropy 1.69
entropy_ratio 0.7278
Fig 11.
Top values for activity.
Show data table
Top values for activity (5 unique shown, of 5 total).
valuecountshare
Moderate Storm17257.3%
Minor Storm6521.7%
Unsettled3812.7%
Active175.7%
Quiet82.7%

intensity numeric feature

Numeric 'intensity' column bounded between 0.0 and 0.667 with only 7 unique values across 300 rows, suggesting a discretised or quantised measurement rather than a continuous reading. Distribution is heavily left-skewed (skew -1.68) with median equal to the max (0.667), and 46 rows (15.3%) flagged as outliers on the low end. The handful of zeros (1.3%) and the ceiling at 0.667 hint at a capped or normalised score.

Treatment: Treat as ordinal/categorical given only 7 unique values, or bin explicitly before modelling.

anthropic:claude-opus-4-7 · confidence high
Out[25]:

saturn.columns["intensity"].stats

statvalue
n300
nulls0 (0.0%)
unique7
min 0
max 0.667
mean 0.5658
median 0.667
std 0.1553
q1 0.556
q3 0.667
iqr 0.111
skew -1.682
kurtosis 2.139
n_outliers 46
outlier_rate 0.1533
zero_rate 0.01333
alert: outliers15.3% rows beyond 1.5 IQR
Fig 12.
Distribution of intensity. Vertical dash marks the median.
Show data table
Histogram bins for intensity (median: 0.667).
bincount
0 – 0.039244
0.03924 – 0.078470
0.07847 – 0.11774
0.1177 – 0.15690
0.1569 – 0.19620
0.1962 – 0.235416
0.2354 – 0.27460
0.2746 – 0.31390
0.3139 – 0.353122
0.3531 – 0.39240
0.3924 – 0.43160
0.4316 – 0.470817
0.4708 – 0.51010
0.5101 – 0.54930
0.5493 – 0.588565
0.5885 – 0.62780
0.6278 – 0.667172

solar_wind_time categorical timestamp

This column is a minute-resolution timestamp of solar wind observations, all falling on 2026-01-20 with values like '2026-01-20 03:58:00.000'. With 299 unique values across 300 rows and entropy ratio 0.9997, it is effectively a per-row time index; only '2026-01-20 03:58:00.000' repeats (twice). No nulls, but the long_tail alert reflects this near-unique structure rather than meaningful categories.

Treatment: Parse to datetime and use as the time axis; do not treat as a categorical feature.

anthropic:claude-opus-4-7 · confidence high
Out[28]:

saturn.columns["solar_wind_time"].stats

statvalue
n300
nulls0 (0.0%)
unique299
top_value 2026-01-20 03:58:00.000
top_rate 0.006667
cardinality 299
entropy 8.222
entropy_ratio 0.9998
alert: long_tail298 singleton categories
Fig 13.
Top values for solar_wind_time.
Show data table
Top values for solar_wind_time (20 unique shown, of 299 total).
valuecountshare
2026-01-20 03:58:00.00020.7%
2026-01-20 00:06:00.00010.3%
2026-01-20 00:07:00.00010.3%
2026-01-20 00:08:00.00010.3%
2026-01-20 00:09:00.00010.3%
2026-01-20 00:10:00.00010.3%
2026-01-20 00:11:00.00010.3%
2026-01-20 00:12:00.00010.3%
2026-01-20 00:13:00.00010.3%
2026-01-20 00:14:00.00010.3%
2026-01-20 00:15:00.00010.3%
2026-01-20 00:16:00.00010.3%
2026-01-20 00:17:00.00010.3%
2026-01-20 00:18:00.00010.3%
2026-01-20 00:19:00.00010.3%
2026-01-20 00:20:00.00010.3%
2026-01-20 00:21:00.00010.3%
2026-01-20 00:22:00.00010.3%
2026-01-20 00:23:00.00010.3%
2026-01-20 00:24:00.00010.3%

solar_wind_speed numeric feature

Numeric measurements of solar wind speed across 300 records, all populated and tightly clustered between 881.6 and 1051.3 with a mean of 955.79 and median of 940.15. The distribution is mildly right-skewed (0.45) and platykurtic (kurtosis -1.32), suggesting a flatter-than-normal spread with no outliers flagged. With 266 unique values out of 300, the column behaves as a continuous physical feature rather than a categorical one.

Treatment: Use as-is for modelling; optional standardization given the narrow range and mild skew.

anthropic:claude-opus-4-7 · confidence high
Out[31]:

saturn.columns["solar_wind_speed"].stats

statvalue
n300
nulls0 (0.0%)
unique266
min 881.6
max 1051
mean 955.8
median 940.1
std 51.98
q1 908.5
q3 1008
iqr 99.8
skew 0.4506
kurtosis -1.323
n_outliers 0
outlier_rate 0
zero_rate 0
Fig 14.
Distribution of solar_wind_speed. Vertical dash marks the median.
Show data table
Histogram bins for solar_wind_speed (median: 940.15).
bincount
881.6 – 891.66
891.6 – 901.624
901.6 – 911.568
911.5 – 921.522
921.5 – 931.517
931.5 – 941.514
941.5 – 951.510
951.5 – 961.519
961.5 – 971.414
971.4 – 981.49
981.4 – 991.46
991.4 – 10017
1001 – 101111
1011 – 102114
1021 – 103118
1031 – 104131
1041 – 105110

solar_wind_density numeric feature

This is a numeric feature capturing solar wind density, fully populated across 300 rows with 215 distinct values. The distribution is fairly symmetric (skew 0.26, kurtosis 0.12) with mean 3.10 and median 3.14, ranging from 0.23 to 8.32 and an IQR of 2.02. Only 2 outliers (0.67%) appear, so the column looks clean and well-behaved.

Treatment: Use as-is in modelling; standard scaling is sufficient given the near-symmetric distribution.

anthropic:claude-opus-4-7 · confidence high
Out[34]:

saturn.columns["solar_wind_density"].stats

statvalue
n300
nulls0 (0.0%)
unique215
min 0.23
max 8.32
mean 3.103
median 3.14
std 1.344
q1 2.09
q3 4.11
iqr 2.02
skew 0.2555
kurtosis 0.1168
n_outliers 2
outlier_rate 0.006667
zero_rate 0
Fig 15.
Distribution of solar_wind_density. Vertical dash marks the median.
Show data table
Histogram bins for solar_wind_density (median: 3.1399999999999997).
bincount
0.23 – 0.70596
0.7059 – 1.18215
1.182 – 1.65828
1.658 – 2.13429
2.134 – 2.60937
2.609 – 3.08533
3.085 – 3.56134
3.561 – 4.03738
4.037 – 4.51334
4.513 – 4.98931
4.989 – 5.4656
5.465 – 5.9414
5.941 – 6.4161
6.416 – 6.8922
6.892 – 7.3681
7.368 – 7.8440
7.844 – 8.321

How to cite

click to copy

BibTeX
@misc{saturn-quirky-aurora-2026,
  author       = {Steuber, Luke},
  title        = {Saturn reading: quirky aurora},
  year         ={2026},
  howpublished = {\url{https://dr.eamer.dev/saturn/view/quirky-aurora}},
  note         = {Profiled with saturn-dissect v0.2.0, prompt saturn-insight-v2, model anthropic:claude-opus-4-7},
}
APA
Steuber, L. (2026). Saturn reading: quirky aurora. Source: /home/coolhand/html/datavis/data_trove/data/quirky/aurora.json. Profiled with saturn-dissect v0.2.0 (saturn-insight-v2, anthropic:claude-opus-4-7). Retrieved from https://dr.eamer.dev/saturn/view/quirky-aurora