saturn·

lightning monthly heatmap

source /home/coolhand/html/datavis/data_trove/data/natural_hazards/lightning/monthly_heatmap.json 59,070 rows 4 columns profiled 2026-05-01 raw JSON static .html .ipynb Report Notebook

Reading

dataset summary · high confidence anthropic:claude-opus-4-7

This dataset contains 59,070 rows of monthly lightning strike observations, each tagged with latitude, longitude, month, and strike count. Geographically the points sit between roughly 25.35°N–35.46°N and -96.74°W–-79.03°W, suggesting coverage of the southeastern United States. The strikes column is highly skewed (skew ≈ 2.02, max 531 vs. median 34) with a long right tail worth investigating to identify hotspots. Latitude also flags an outlier cluster (about 9.5% of rows), so it is worth checking whether those represent edge regions or data quality issues. Month is evenly bounded 1–12 with a mean near 6.9, hinting at a mild summer concentration in the records.

citing: row_count · columns.lat.stats · columns.lon.stats · columns.strikes.stats · columns.month.stats · columns.lat.alerts · columns.strikes.alerts

Schema

4 columns
Per-column summary. Click column name to jump to its detail.
Alerts
lat numeric 0.0% 868
outliers
lon numeric 0.0% 1,231
month numeric 0.0% 12
strikes numeric 0.0% 381
high_skew

lat

numeric feature outliers
This is a latitude coordinate column, with all 59070 rows populated across 868 unique values bounded between 25.35 and 35.46 — a span consistent with a regional rather than global footprint. The distribution is right-skewed (skew 1.15) with median 28.84 just below mean 29.23, and roughly 9.5% of points (5626) flag as outliers, suggesting a secondary cluster of locations sitting well north of the main mass around Q1–Q3 (27.84–29.94). Treatment: Pair with longitude for geospatial joins or binning rather than treating as a standalone scalar feature. high · anthropic:claude-opus-4-7
n
59,070
nulls
0 (0.0%)
unique
868
min
25.35
max
35.46
mean
29.23
median
28.84
std
1.899
q1
27.84
q3
29.94
iqr
2.1
skew
1.147
kurtosis
0.7033
n_outliers
5,626
outlier_rate
0.09524
zero_rate
0

lon

numeric feature
Longitude coordinates, all negative and tightly bounded between -96.74 and -79.03, placing every record in the eastern half of North America. The distribution is left-skewed (skew -0.85) with a median of -82.06, suggesting a concentration toward the eastern edge of that range and a longer tail westward. With only 1,231 unique values across 59,070 rows, coordinates are heavily repeated, hinting at a fixed set of locations rather than continuous GPS readings. Treatment: Pair with lat as a geospatial feature; consider binning or joining to a place identifier given the low cardinality. high · anthropic:claude-opus-4-7
n
59,070
nulls
0 (0.0%)
unique
1,231
min
-96.74
max
-79.03
mean
-85.27
median
-82.06
std
5.252
q1
-89.94
q3
-81.22
iqr
8.72
skew
-0.8513
kurtosis
-0.8532
n_outliers
0
outlier_rate
0
zero_rate
0

month

numeric feature
This is a calendar month encoded as an integer 1–12, with all 12 values present across 59,070 rows and no nulls. The distribution leans slightly toward the second half of the year (mean 6.90, median 7) but is essentially flat (kurtosis -0.21, skew -0.15) with no outliers. Treatment: Treat as a cyclical categorical (e.g., sine/cosine encode) rather than a raw numeric. high · anthropic:claude-opus-4-7
n
59,070
nulls
0 (0.0%)
unique
12
min
1
max
12
mean
6.899
median
7
std
2.335
q1
5
q3
8
iqr
3
skew
-0.155
kurtosis
-0.2056
n_outliers
0
outlier_rate
0
zero_rate
0

strikes

numeric feature high_skew
A non-negative integer count called 'strikes' with 381 distinct values across 59,070 rows and no nulls or zeros, ranging from 1 to 531 with a median of 34 and mean of 49.4. The distribution is heavily right-skewed (skew 2.02, kurtosis 6.09) and the standard deviation (49.8) roughly equals the mean, with 2,917 outliers (4.9%) extending the upper tail well past the Q3 of 68. Treatment: Apply a log or sqrt transform before modelling to tame the right skew. high · anthropic:claude-opus-4-7
n
59,070
nulls
0 (0.0%)
unique
381
min
1
max
531
mean
49.43
median
34
std
49.83
q1
14
q3
68
iqr
54
skew
2.023
kurtosis
6.094
n_outliers
2,917
outlier_rate
0.04938
zero_rate
0