Data trove: NOAA OVATION aurora forecast
Reading
This dataset captures 71 five-minute snapshots of auroral activity on January 20, 2026, covering just under six hours. Each row records a timestamp, an activity classification, an intensity value, and power readings for the northern and southern hemispheres. The most striking feature is that 'Storm' conditions account for 55% of the observations (39 of 71), with 'Active' (28%) and 'Quiet' (17%) making up the remainder, which suggests sustained geomagnetic disturbance over the sampled window. Both north and south power readings are widely spread (IQR about 110 GW) rather than clustered around a central value: more than a third of the readings fall in the lowest histogram bin, and the rest are spread fairly evenly up to about 190 GW. Their medians sit well below their means, hinting that storm periods drive the upper range of power values. Intensity has near-zero skew, but its histogram is concentrated at both ends of the 0.11–1 range rather than evenly spread, so the relationship between activity class and intensity is worth exploring closely.
citing: row_count · column_count · activity.top_values · activity.top_rate · north_power.median · north_power.mean · north_power.iqr · south_power.median · south_power.mean · south_power.iqr · intensity.min · intensity.max · intensity.skew
Charts the summary said to look at first
activity: value counts
| value | count | share |
|---|---|---|
| Storm | 39 | 54.9% |
| Active | 20 | 28.2% |
| Quiet | 12 | 16.9% |
intensity: histogram (8 bins)
| bin | count |
|---|---|
| 0.11 – 0.2213 | 20 |
| 0.2213 – 0.3325 | 6 |
| 0.3325 – 0.4437 | 4 |
| 0.4437 – 0.555 | 4 |
| 0.555 – 0.6663 | 4 |
| 0.6663 – 0.7775 | 1 |
| 0.7775 – 0.8888 | 4 |
| 0.8888 – 1 | 28 |
north_power: histogram (8 bins)
| bin | count |
|---|---|
| 11 – 33.12 | 26 |
| 33.12 – 55.25 | 8 |
| 55.25 – 77.38 | 5 |
| 77.38 – 99.5 | 6 |
| 99.5 – 121.6 | 5 |
| 121.6 – 143.8 | 6 |
| 143.8 – 165.9 | 8 |
| 165.9 – 188 | 7 |
south_power: histogram (8 bins)
| bin | count |
|---|---|
| 11 – 33.38 | 28 |
| 33.38 – 55.75 | 9 |
| 55.75 – 78.12 | 3 |
| 78.12 – 100.5 | 6 |
| 100.5 – 122.9 | 5 |
| 122.9 – 145.2 | 6 |
| 145.2 – 167.6 | 7 |
| 167.6 – 190 | 7 |
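The bin boundaries in the three histogram tables are consistent with eight equal-width bins between each column's minimum and maximum. The sketch below reproduces that arithmetic. It assumes equal-width binning (the profiler's actual method is not stated), and `equal_width_edges` is a hypothetical helper name.

```python
def equal_width_edges(lo, hi, n_bins=8):
    """Return n_bins + 1 evenly spaced edges from lo to hi (hypothetical helper)."""
    step = (hi - lo) / n_bins
    return [lo + i * step for i in range(n_bins + 1)]

# Min/max values taken from the column statistics in this report.
intensity_edges = equal_width_edges(0.11, 1.0)
north_edges = equal_width_edges(11, 188)
south_edges = equal_width_edges(11, 190)

# Bin counts copied from the tables; each histogram should cover all 71 rows.
intensity_counts = [20, 6, 4, 4, 4, 1, 4, 28]
north_counts = [26, 8, 5, 6, 5, 6, 8, 7]
south_counts = [28, 9, 3, 6, 5, 6, 7, 7]
totals = [sum(c) for c in (intensity_counts, north_counts, south_counts)]

print([round(e, 4) for e in north_edges])
print(totals)
```

The tables appear to display these edges rounded to four significant digits, so an edge such as 33.125 is shown as 33.12.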
Schema
5 columns
| column | type | nulls | unique | alerts |
|---|---|---|---|---|
| time_tag | categorical | 0.0% | 71 | long_tail |
| north_power | numeric | 0.0% | 60 | |
| south_power | numeric | 0.0% | 61 | |
| activity | categorical | 0.0% | 3 | |
| intensity | numeric | 0.0% | 35 | |
time_tag
categorical · timestamp · long_tail
This column contains datetime strings at regular 5-minute intervals on 2026-01-20, making it a time-series index. All 71 values are unique (cardinality 71, null_rate 0.0, top_rate 0.014) and entropy_ratio is effectively 1.0, confirming that every row maps to a distinct timestamp. The 'long_tail' alert is misleading here: the distribution is perfectly uniform (each value appears exactly once), not skewed.
Treatment: Parse to datetime and set as time index; do not encode as categorical.
- n: 71
- nulls: 0 (0.0%)
- unique: 71
- top_value: 2026-01-20 00:00:00
- top_rate: 0.01408
- cardinality: 71
- entropy: 6.15
- entropy_ratio: 1
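The treatment for time_tag (parse to datetime, then confirm a regular time index) can be sketched with the standard library. The string format `%Y-%m-%d %H:%M:%S` is inferred from the reported top_value. The `raw` list is a stand-in generated here and is not the actual column, and its 00:00:00 start is an assumption.

```python
from datetime import datetime, timedelta

FMT = "%Y-%m-%d %H:%M:%S"  # inferred from the top_value "2026-01-20 00:00:00"

# Stand-in for the time_tag column: 71 strings, 5 minutes apart (assumed start).
start = datetime(2026, 1, 20, 0, 0, 0)
raw = [(start + timedelta(minutes=5 * i)).strftime(FMT) for i in range(71)]

# Parse the strings into datetime objects.
parsed = [datetime.strptime(s, FMT) for s in raw]

# Every timestamp should be distinct, and every gap should be exactly 5 minutes.
is_unique = len(set(parsed)) == len(parsed)
steps = {later - earlier for earlier, later in zip(parsed, parsed[1:])}
is_regular = steps == {timedelta(minutes=5)}

print(parsed[0], parsed[-1], is_unique, is_regular)
```

On real data, a `steps` set with more than one element would point to gaps or irregular sampling that should be handled before treating the column as a time index.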
north_power
numeric · feature
This column holds the northern-hemisphere power reading for each of the 71 observations; the summary reports it in GW. The distribution is platykurtic (kurtosis -1.28): values span 11–188 with no heavy tails and no outliers detected, but they are not evenly spread, since 26 of the 71 readings fall in the lowest histogram bin (11–33.12). The IQR of 110.5 covers about 62% of the full range of 177, and the median (60.0) sits well below the mean (77.27), consistent with a modest right skew (0.45) from the broad spread of higher values. With 60 unique values across 71 rows and zero nulls or zeros, the column is fully populated; because every time_tag is unique, the repeated values are readings that recur at different timestamps, not duplicated rows.
Treatment: Use as-is, or apply a mild log-transform to reduce right skew before regression or distance-based modelling.
- n: 71
- nulls: 0 (0.0%)
- unique: 60
- min: 11
- max: 188
- mean: 77.27
- median: 60
- std: 58.74
- q1: 21
- q3: 131.5
- iqr: 110.5
- skew: 0.4523
- kurtosis: -1.28
- n_outliers: 0
- outlier_rate: 0
- zero_rate: 0
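The zero outlier count for north_power can be checked against the reported quartiles. The sketch below assumes the common Tukey rule (flag values more than 1.5 × IQR beyond the quartiles). The report does not say which rule the profiler uses, and `tukey_fences` is a hypothetical helper name.

```python
def tukey_fences(q1, q3, k=1.5):
    """Return (lower, upper) fences at k * IQR beyond the quartiles."""
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr

# Reported north_power statistics.
q1, q3 = 21, 131.5
col_min, col_max = 11, 188

lower, upper = tukey_fences(q1, q3)

# If both extremes sit inside the fences, no value can be flagged.
no_outliers_possible = lower <= col_min and col_max <= upper

# Share of the full range covered by the IQR.
iqr_share = (q3 - q1) / (col_max - col_min)

print(lower, upper, no_outliers_possible, round(iqr_share, 3))
```

Under this assumption the fences are -144.75 and 297.25, well outside the observed 11–188 range, which agrees with n_outliers = 0.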
south_power
numeric · feature
This column holds the southern-hemisphere power reading, the counterpart to north_power; the summary reports it in GW. With a range of 11–190 and a mean of 75.77, the distribution is widely spread: the IQR of 113.0 covers about 63% of the full range of 179, and the kurtosis of -1.29 is platykurtic, so values do not cluster around a central tendency. The spread is not even, however, since 28 of the 71 readings fall in the lowest histogram bin (11–33.38). The median (51.0) sits well below the mean (75.77), consistent with modest right skew (0.49), and no outliers are flagged, suggesting this spread is genuine variability rather than contamination. 61 unique values across 71 rows means some readings repeat; because every time_tag is unique, these cannot be fully duplicated rows, only the same value recurring at different timestamps.
Treatment: Inspect the repeated values (61 unique in 71 records) for runs of identical readings; apply a mild log-transform or scaling before regression given the right skew and wide spread.
- n: 71
- nulls: 0 (0.0%)
- unique: 61
- min: 11
- max: 190
- mean: 75.77
- median: 51
- std: 60.32
- q1: 19
- q3: 132
- iqr: 113
- skew: 0.4943
- kurtosis: -1.29
- n_outliers: 0
- outlier_rate: 0
- zero_rate: 0
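The difference between a repeated reading and a duplicated row can be made concrete. The sketch below uses three made-up rows purely for illustration (they are not taken from the dataset), and `repeated_values` and `duplicate_rows` are hypothetical helper names.

```python
from collections import Counter

# Hypothetical (time_tag, north_power, south_power) rows, for illustration only.
rows = [
    ("2026-01-20 00:00:00", 21, 19),
    ("2026-01-20 00:05:00", 21, 22),
    ("2026-01-20 00:10:00", 25, 22),
]

def repeated_values(rows, col):
    """Values in column index `col` that occur more than once, with their counts."""
    counts = Counter(row[col] for row in rows)
    return {value: n for value, n in counts.items() if n > 1}

def duplicate_rows(rows):
    """Rows that occur more than once when all columns are compared."""
    counts = Counter(rows)
    return [row for row, n in counts.items() if n > 1]

print(repeated_values(rows, 2))  # south_power values seen more than once
print(duplicate_rows(rows))      # whole rows seen more than once
```

In this toy input the south_power value 22 repeats, yet no whole row is duplicated because each timestamp is distinct. The same reasoning applies to any table whose timestamp column is fully unique.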
activity
categorical · label
This column is a categorical activity-level classification, likely describing geomagnetic conditions, with exactly three levels: 'Storm', 'Active', and 'Quiet'. The dominant class is 'Storm' at 54.9% (39 of 71 rows), which is mildly surprising given that 'Storm' might be expected to be the rarer, extreme condition. The distribution is moderately imbalanced ('Quiet' accounts for only 12 observations), which may affect model performance on minority classes. An entropy ratio of 0.898 indicates the distribution is reasonably spread but not uniform.
Treatment: Ordinal-encode (Quiet < Active < Storm) or one-hot encode; monitor class imbalance if used as a target.
- n: 71
- nulls: 0 (0.0%)
- unique: 3
- top_value: Storm
- top_rate: 0.5493
- cardinality: 3
- entropy: 1.423
- entropy_ratio: 0.8979
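The entropy figures for activity can be reproduced from the class counts. The sketch below assumes Shannon entropy in bits (base 2) and an entropy ratio defined as entropy divided by log2 of the cardinality. The report does not state these definitions, but they match the reported values.

```python
import math

# Class counts from the activity value-count table.
counts = {"Storm": 39, "Active": 20, "Quiet": 12}

n = sum(counts.values())
shares = {label: c / n for label, c in counts.items()}

# Shannon entropy in bits (assumed base 2).
entropy_bits = -sum(p * math.log2(p) for p in shares.values())

# Ratio against the maximum possible entropy for 3 equally likely classes.
entropy_ratio = entropy_bits / math.log2(len(counts))

print(round(shares["Storm"], 4), round(entropy_bits, 3), round(entropy_ratio, 4))
```

A ratio of 1 would mean all three classes are equally common, as with time_tag, where every value occurs once.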
intensity
numeric · feature
This column represents a normalized intensity measure, bounded between 0.11 and 1.0, consistent with a scaled or clipped continuous score. The distribution is strongly platykurtic (kurtosis -1.73), which here reflects concentration at both ends of the range rather than an even spread: 20 values fall in the lowest histogram bin (0.11–0.2213) and 28 in the highest (0.8888–1). Because q3 equals the maximum of 1, at least a quarter of the readings sit at the ceiling. Despite 71 rows, there are only 35 unique values, suggesting the data originates from a discrete or quantized source (e.g., rounded measurements or a fixed rating scale), with clipping at 1 adding to the repeats. Skew is negligible (-0.07), and the IQR of 0.79 covers most of the 0.89 range, with no outliers.
Treatment: Use as-is; if quantile binning is applied, note that q3 coincides with the maximum, so the upper quartile bins share an edge. No log-transform is needed given the symmetry.
- n: 71
- nulls: 0 (0.0%)
- unique: 35
- min: 0.11
- max: 1
- mean: 0.6024
- median: 0.6
- std: 0.3647
- q1: 0.21
- q3: 1
- iqr: 0.79
- skew: -0.07261
- kurtosis: -1.731
- n_outliers: 0
- outlier_rate: 0
- zero_rate: 0
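Quantile binning of intensity needs some care because the reported q3 equals the maximum, so quartile edges contain a repeated value. The sketch below builds quartile edges from the reported summary statistics, drops the repeated edge, and assigns values to right-closed bins. The helper name `bin_index` and the choice of right-closed bins are assumptions for illustration.

```python
from bisect import bisect_left

# Reported intensity statistics: min, q1, median, q3, max.
quartile_edges = [0.11, 0.21, 0.60, 1.00, 1.00]

# q3 equals max, so the last two edges coincide; keep each edge once.
unique_edges = sorted(set(quartile_edges))
n_bins = len(unique_edges) - 1

def bin_index(x, edges):
    """0-based bin for x with right-closed bins; the first bin also includes edges[0].

    Values outside [edges[0], edges[-1]] are not handled in this sketch.
    """
    return max(bisect_left(edges, x) - 1, 0)

examples = {x: bin_index(x, unique_edges) for x in (0.11, 0.21, 0.5, 1.0)}

print(unique_edges, n_bins, examples)
```

Four requested quartile bins collapse to three here, and the top bin holds every reading above the median, so the resulting bins are not equally populated.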