saturn

/home/coolhand/html/datavis/data_trove/data/wild/weather/noaa_significant_storms.json 14,770 rows sample n=14,770 seed 42 2026-06-21T23:29:41+00:00

Overview

Source/home/coolhand/html/datavis/data_trove/data/wild/weather/noaa_significant_storms.json
Total rows14,770
Profiled sample14,770
Columns14
Generated2026-06-21T23:29:41+00:00
Show data table
Per-column null rate across the corpus.
columnkindnull %
latitudenumeric0.0%
longitudenumeric0.0%
nametext0.0%
descriptiontext0.0%
categorycategorical0.0%
datetext0.0%
countrycategorical0.0%
event_typecategorical0.0%
statecategorical0.0%
magnitudecategorical51.8%
injuriescategorical0.0%
fatalitiescategorical0.0%
damage_propertytext0.0%
sourcecategorical0.0%

Insights opt-in

Model-generated narrative. These are opinions, not facts — the stats below are what saturn measured. Generated by: anthropic:default.

Dataset high anthropic:default

This dataset contains 14,770 records of significant US storms sourced from the NOAA Storm Events Database, covering events across all 50+ states with dates, locations, event types, casualties, and property damage estimates. The most striking pattern is the dominance of tornadoes (6,334 events, 43% of all records), far outnumbering the next categories of Flash Flood and Thunderstorm Wind. Two dates worth flagging immediately are 1974-04-03 (126 events, the Super Outbreak) and 2011-04-27 (105 events, the 2011 Super Outbreak), suggesting this dataset captures landmark multi-tornado outbreaks disproportionately. Property damage skews heavily toward million-dollar figures, with '2.5M' being the single most common damage value (2,278 occurrences), hinting at possible rounding or a threshold-based inclusion criterion. Texas leads all states with 1,450 events, nearly double the next state (Missouri at 648), reflecting both its geographic size and exposure to severe weather corridors.

damage_property high anthropic:default

This column represents property damage amounts stored as formatted currency strings (e.g., '2.5M', '250K', '0.00K'), typical of NOAA storm event or similar disaster/insurance datasets. With only 1,014 unique values across 14,770 rows, the duplicate rate is extremely high at 93.1%, reflecting heavy rounding/bucketing of damage estimates rather than precise measurements. All values are single tokens (one_word_rate: 1.0) and 87.2% are uppercase, consistent with a coded categorical-style encoding of numeric magnitudes. There are 368 empty strings (null_rate reported as 0.0 but n_empty=368), which should be treated as missing values.

date high anthropic:default

This column contains ISO-8601 date strings (YYYY-MM-DD format), stored as text rather than a native date type — all 14,770 values are exactly 10 characters with zero nulls. The duplicate rate of 65.75% (9,712 duplicates across only 5,058 unique dates) is notable and suggests this is a grouping/event date used as a foreign-key-style attribute rather than a unique record timestamp. The top date, 1974-04-03, appears 126 times, and several 2011 dates cluster heavily, which may reflect significant event concentrations worth investigating.

description high anthropic:default

This column contains structured event descriptions summarising disaster or incident outcomes — specifically property damage amounts, injury counts, fatalities, and seismic magnitudes (e.g., 'Magnitude 0; $2.5M property damage'). The duplicate rate is strikingly high at 60.76%, with 8,974 duplicates across 14,770 rows and only 5,796 unique values, indicating these are templated strings generated from a small set of outcome combinations rather than free-form text. The Flesch readability mean of 29.86 reflects the dense, numeric, shorthand nature of the content. A small multilingual signal exists (10 Norwegian, 5 French, 1 Japanese entries) which may indicate data sourced from multiple regional systems and warrants review.

name high anthropic:default

This column contains structured event description labels of the form '[Weather Event Type] in [STATE, COUNTY]', effectively serving as a composite label combining event type and geographic location. The duplicate rate is strikingly high at 54.9%, with 8,110 duplicates across 14,770 rows and only 6,660 unique values, indicating that the same event type/location combinations recur frequently — consistent with repeated weather incidents in the same areas. The multilingual alert is almost certainly a false positive from language detection mis-classifying US place names and weather terminology as non-English; dominant language is English (4,796 of sampled values) and top values are entirely English-structured strings. Vocabulary size of 1,980 across ~14k rows and a mean of ~4.6 words per entry confirm the formulaic, low-variety nature of the text.

category high anthropic:default

This column is a dataset category tag, holding a single constant value 'significant_us_storms' across all 14,770 rows with no nulls. It carries zero information entropy (entropy = 0.0) and a top_rate of 1.0, meaning it is entirely invariant. This is a metadata label describing the dataset itself, not a feature with predictive or analytical value.

country high anthropic:default

This column represents the country of origin or scope for all records in the dataset, and every single one of the 14,770 rows contains the value 'USA' — making it a zero-entropy constant. The column carries no discriminative information whatsoever and will contribute nothing to any model or analysis. Its uniformity may also indicate the dataset is intentionally scoped to a single country, which is worth confirming before joining with broader datasets.

source high anthropic:default

This column identifies the data source, and every single one of the 14,770 rows carries the identical value 'NOAA Storm Events Database' — cardinality of 1 with top_rate of 1.0 and entropy of 0.0. It is a constant metadata field, almost certainly a provenance tag added during ingestion. It carries zero predictive or analytical signal.

magnitude medium anthropic:default

This column appears to represent a magnitude measure (likely seismic, stellar, or similar scientific scale) stored as a categorical type despite containing numeric-looking values spanning a wide range (e.g., 1.75, 2.75, 50.00, 70.00). Two surprises stand out: first, 51.78% of rows are null, triggering an alert; second, the dominant value '0' accounts for 54.24% of non-null rows (3,863 of ~7,124 non-null records), suggesting zero may encode 'none', 'unknown', or a sentinel rather than a true zero magnitude. The presence of both small decimal values (1.75, 2.00, 2.50) and large round integers (50.00, 61.00, 65.00, 70.00) hints at a possible mixed-scale or mixed-source column.

latitude high anthropic:default

This column contains geographic latitude values, spanning from -14.3236 to 70.1269 degrees, consistent with worldwide location data. The distribution is tightly clustered between Q1=33.63 and Q3=41.13 (IQR ~7.5), suggesting the bulk of records concentrate around mid-latitude Northern Hemisphere locations (roughly US/Europe range), with the mean (37.28) and median (37.12) nearly identical indicating only mild skew (-0.18). The leptokurtic shape (kurtosis 3.34) and 159 outliers (~1.1%) reflect a small tail of equatorial or high-latitude records that an analyst should verify are not geocoding errors.

longitude high anthropic:default

This column represents geographic longitude, with values spanning from -170.7316 to 171.4689 degrees. The bulk of observations cluster around the Americas (mean -90.94, IQR roughly -96.4 to -84.23, consistent with the central/eastern US or Caribbean), but the extreme kurtosis of 55.6 and 623 outliers (4.2%) indicate a heavy-tailed distribution with a notable minority of records far outside this core region — including values near +171, suggesting Pacific or Asian locations. The positive skew (1.29) and tight IQR relative to the full range confirm most records concentrate in a narrow band while a long right tail pulls toward positive (eastern hemisphere) longitudes.

event_type high anthropic:default

This column contains categorical labels for natural weather/disaster event types across 14,770 records, with 17 distinct categories and no nulls. The dominant class is 'Tornado' at 42.9% (6,334 occurrences), creating notable class imbalance — the top 5 categories ('Tornado', 'Flash Flood', 'Thunderstorm Wind', 'Flood', 'Hail') account for the vast majority of records, while tail categories like 'Marine Thunderstorm Wind' (25) and 'Debris Flow' (43) are sparsely represented. The entropy ratio of 0.572 confirms moderate but uneven spread across classes, which will challenge classifiers without resampling or class-weight adjustment.

fatalities high anthropic:default

This column records fatality counts per incident, stored as strings but representing non-negative integers ranging from 0 to at least 10 across 49 distinct values. The dominant value is '0' at 69.1% of rows (10,209 of 14,770), indicating most incidents involve no fatalities. The distribution is heavily right-skewed, with counts dropping sharply: 1 fatality appears 3,208 times, 2 appears 649 times, and values thin out rapidly beyond that — yet 49 unique values suggests some high-count outliers exist beyond the top 10 shown. Low entropy (1.42, ratio 0.25) confirms the extreme concentration on zero.

injuries high anthropic:default

This column represents a count of injuries per record, stored as a categorical type despite being fundamentally numeric. The dominant value is '0' appearing in 68.1% of rows (10,064 of 14,770), indicating most records involve no injuries. With 178 unique values and top counts following a steep drop-off consistent with a zero-inflated count distribution, the categorical encoding is likely a data-type artifact — the values are clearly ordinal integers and should be treated as numeric.

state high anthropic:default

This column represents the US state associated with each record, stored as full uppercase state names. With 65 unique values against the expected 50 US states, there are likely extra entries such as territories (e.g., Puerto Rico, Guam), non-standard labels, or minor data quality issues worth auditing. Texas dominates at 9.8% of records (1,450), and the top-10 states are heavily weighted toward the South and Midwest. The high entropy ratio of 0.86 indicates a relatively even spread across categories, though Texas is a clear outlier compared to the rest.

Numeric correlation

Show data table
Pearson correlation across 2 numeric columns (values clipped to 2 decimals).
latitudelongitude
latitude+1.00-0.31
longitude-0.31+1.00

Languages detected

Per-string language detection across text columns (sampled).

Show data table
Per-language counts (total 10,000 detected strings).
langcountshare
en978097.8%
es1341.3%
de250.2%
ja230.2%
no100.1%
id60.1%
fr60.1%
it50.1%
pt40.0%
sr20.0%
ru20.0%
eu20.0%
zh10.0%

latitude numeric

rows14,770
null0 (0.0%)
unique7,810
min-14.324
max70.127
mean37.278
median37.120
std5.247
q133.630
q341.129
iqr7.499
skew-0.179
kurtosis3.341
n_outliers159
outlier_rate0.011
zero_rate0.000
Show data table
Histogram bins for latitude (median: 37.12).
bincount
-14.32 – -12.213
-12.21 – -10.10
-10.1 – -7.990
-7.99 – -5.8790
-5.879 – -3.7670
-3.767 – -1.6560
-1.656 – 0.45520
0.4552 – 2.5660
2.566 – 4.6780
4.678 – 6.7890
6.789 – 8.92
8.9 – 11.010
11.01 – 13.120
13.12 – 15.232
15.23 – 17.350
17.35 – 19.4675
19.46 – 21.5719
21.57 – 23.6810
23.68 – 25.7922
25.79 – 27.9270
27.9 – 30.01522
30.01 – 32.121240
32.12 – 34.242165
34.24 – 36.352333
36.35 – 38.461803
38.46 – 40.571901
40.57 – 42.682226
42.68 – 44.791382
44.79 – 46.9515
46.9 – 49.01232
49.01 – 51.130
51.13 – 53.240
53.24 – 55.350
55.35 – 57.465
57.46 – 59.576
59.57 – 61.6815
61.68 – 63.7911
63.79 – 65.98
65.9 – 68.022
68.02 – 70.131

longitude numeric

rows14,770
null0 (0.0%)
unique8,828
min-170.732
max171.469
mean-90.940
median-90.220
std11.696
q1-96.400
q3-84.230
iqr12.170
skew1.286
kurtosis55.608
n_outliers623
outlier_rate0.042
zero_rate0.000
Show data table
Histogram bins for longitude (median: -90.22).
bincount
-170.7 – -162.24
-162.2 – -153.633
-153.6 – -145.128
-145.1 – -136.54
-136.5 – -12811
-128 – -119.4305
-119.4 – -110.8466
-110.8 – -102.3544
-102.3 – -93.743693
-93.74 – -85.185377
-85.18 – -76.633281
-76.63 – -68.07941
-68.07 – -59.5279
-59.52 – -50.960
-50.96 – -42.410
-42.41 – -33.850
-33.85 – -25.30
-25.3 – -16.740
-16.74 – -8.1860
-8.186 – 0.36870
0.3687 – 8.9240
8.924 – 17.480
17.48 – 26.030
26.03 – 34.590
34.59 – 43.140
43.14 – 51.70
51.7 – 60.250
60.25 – 68.810
68.81 – 77.360
77.36 – 85.920
85.92 – 94.470
94.47 – 1030
103 – 111.60
111.6 – 120.10
120.1 – 128.70
128.7 – 137.20
137.2 – 145.82
145.8 – 154.41
154.4 – 162.90
162.9 – 171.51

name text

13 languages detected in sample 54.9% duplicate strings
rows14,770
null0 (0.0%)
unique6,660
len_min17
len_max134
len_mean30.219
len_median29.000
len_p9541.000
word_mean4.588
word_median4.000
n_empty0
n_duplicates8,110
duplicate_rate0.549
vocab_size1,980
readability_flesch_mean31.163
emoji_rate0.000
url_rate0.000
one_word_rate0.000
allcaps_rate0.000
boilerplate_rate0.000
Show data table
Character-length distribution for name (mean: 30.219160460392686).
charscount
17 – 2050
20 – 23793
23 – 262165
26 – 293842
29 – 322969
32 – 351813
35 – 371442
37 – 40915
40 – 43493
43 – 46153
46 – 4945
49 – 524
52 – 553
55 – 584
58 – 616
61 – 649
64 – 675
67 – 707
70 – 733
73 – 7610
76 – 787
78 – 815
81 – 846
84 – 873
87 – 903
90 – 933
93 – 961
96 – 995
99 – 1023
102 – 1050
105 – 1080
108 – 1110
111 – 1140
114 – 1160
116 – 1191
119 – 1220
122 – 1250
125 – 1280
128 – 1311
131 – 1341
Sample values (first 10)
  1. Tornado in LOUISIANA, CADDO
  2. Flood in LOUISIANA, BEAUREGARD
  3. Tornado in NORTH CAROLINA, ANSON
  4. Thunderstorm Wind in GEORGIA, NEWTON
  5. Hail in WISCONSIN, DODGE
  6. Flash Flood in MISSOURI, DENT
  7. Tornado in PENNSYLVANIA, LUZERNE
  8. Flash Flood in KENTUCKY, FRANKLIN
  9. Lightning in FLORIDA, PALM BEACH
  10. Tornado in SOUTH CAROLINA, DARLINGTON

description text

5 languages detected in sample 60.8% duplicate strings
rows14,770
null0 (0.0%)
unique5,796
len_min3
len_max259
len_mean50.085
len_median36.000
len_p95166.000
word_mean7.393
word_median5.000
n_empty0
n_duplicates8,974
duplicate_rate0.608
vocab_size4,289
readability_flesch_mean29.858
emoji_rate0.000
url_rate0.000
one_word_rate2.71e-04
allcaps_rate2.71e-04
boilerplate_rate0.000
Show data table
Character-length distribution for description (mean: 50.085240352065).
charscount
3 – 94
9 – 1613
16 – 223101
22 – 291689
29 – 352230
35 – 411186
41 – 48572
48 – 541589
54 – 612168
61 – 67902
67 – 737
73 – 806
80 – 8614
86 – 9310
93 – 9915
99 – 10525
105 – 11230
112 – 11830
118 – 12534
125 – 13147
131 – 13771
137 – 14464
144 – 15052
150 – 15777
157 – 16360
163 – 16974
169 – 17666
176 – 18265
182 – 18954
189 – 19569
195 – 20182
201 – 20870
208 – 21472
214 – 22178
221 – 22764
227 – 23329
233 – 24020
240 – 24611
246 – 25312
253 – 2598
Sample values (first 10)
  1. Magnitude 0; 40 injuries, 9 fatalities; $250K property damage
  2. $40.00M property damage
  3. $3.00M property damage
  4. EG55.00; 1 injuries, 1 fatalities; $20.00K property damage; A weak upper-level wave moving across the state resulted in widespread strong to severe thunderstorms across north and central Georgia.
  5. Magnitude 1.75; $3.17M property damage
  6. $15.00M property damage
  7. 5 injuries, 0 fatalities; $1.00M property damage
  8. 0 injuries, 1 fatalities
  9. 1 injuries, 1 fatalities
  10. Magnitude 0; 0 injuries, 1 fatalities; $25K property damage

category categorical

top value is 100.0% of rows
rows14,770
null0 (0.0%)
unique1
top_valuesignificant_us_storms
top_rate1.000
cardinality1
entropy-0.000
entropy_ratio0.000
Show data table
Top values for category (1 unique shown, of 1 total).
valuecountshare
significant_us_storms14770100.0%
Top values (rank 1–20)
  1. significant_us_storms — 14,770

date text

100.0% rows are a single word 100.0% rows are all-caps 95th-percentile length under 20 chars 65.8% duplicate strings
rows14,770
null0 (0.0%)
unique5,058
len_min10
len_max10
len_mean10.000
len_median10.000
len_p9510.000
word_mean1.000
word_median1.000
n_empty0
n_duplicates9,712
duplicate_rate0.658
vocab_size5,058
readability_flesch_mean121.220
emoji_rate0.000
url_rate0.000
one_word_rate1.000
allcaps_rate1.000
boilerplate_rate0.000
Show data table
Character-length distribution for date (mean: 10.0).
charscount
10 – 100
10 – 100
10 – 100
10 – 100
10 – 100
10 – 100
10 – 100
10 – 100
10 – 100
10 – 100
10 – 100
10 – 100
10 – 100
10 – 100
10 – 100
10 – 100
10 – 100
10 – 100
10 – 100
10 – 100
10 – 1014770
10 – 100
10 – 100
10 – 100
10 – 100
10 – 100
10 – 100
10 – 100
10 – 100
10 – 100
10 – 100
10 – 100
10 – 100
10 – 100
10 – 100
10 – 100
10 – 100
10 – 100
10 – 100
10 – 100
Sample values (first 10)
  1. 1950-02-12
  2. 2016-03-10
  3. 2020-04-13
  4. 2018-06-25
  5. 2006-04-13
  6. 2024-11-05
  7. 2006-12-01
  8. 2021-12-11
  9. 2018-07-10
  10. 1961-04-12

country categorical

top value is 100.0% of rows
rows14,770
null0 (0.0%)
unique1
top_valueUSA
top_rate1.000
cardinality1
entropy-0.000
entropy_ratio0.000
Show data table
Top values for country (1 unique shown, of 1 total).
valuecountshare
USA14770100.0%
Top values (rank 1–20)
  1. USA — 14,770

event_type categorical

rows14,770
null0 (0.0%)
unique17
top_valueTornado
top_rate0.429
cardinality17
entropy2.336
entropy_ratio0.572
Show data table
Top values for event_type (17 unique shown, of 17 total).
valuecountshare
Tornado633442.9%
Flash Flood235816.0%
Thunderstorm Wind225715.3%
Flood177712.0%
Hail12468.4%
Lightning5743.9%
Heavy Rain990.7%
Marine Strong Wind430.3%
Debris Flow430.3%
Marine Thunderstorm Wind250.2%
Marine High Wind50.0%
Dust Devil30.0%
Waterspout20.0%
Tropical Storm10.0%
High Wind10.0%
Heat10.0%
Marine Lightning10.0%
Top values (rank 1–20)
  1. Tornado — 6,334
  2. Flash Flood — 2,358
  3. Thunderstorm Wind — 2,257
  4. Flood — 1,777
  5. Hail — 1,246
  6. Lightning — 574
  7. Heavy Rain — 99
  8. Marine Strong Wind — 43
  9. Debris Flow — 43
  10. Marine Thunderstorm Wind — 25
  11. Marine High Wind — 5
  12. Dust Devil — 3
  13. Waterspout — 2
  14. Tropical Storm — 1
  15. High Wind — 1
  16. Heat — 1
  17. Marine Lightning — 1

state categorical

rows14,770
null0 (0.0%)
unique65
top_valueTEXAS
top_rate0.098
cardinality65
entropy5.182
entropy_ratio0.861
Show data table
Top values for state (20 unique shown, of 65 total).
valuecountshare
TEXAS14509.8%
MISSOURI6484.4%
ARKANSAS6024.1%
MISSISSIPPI5703.9%
GEORGIA5623.8%
ILLINOIS5603.8%
IOWA5273.6%
LOUISIANA5073.4%
TENNESSEE4993.4%
FLORIDA4983.4%
OKLAHOMA4903.3%
NEBRASKA4863.3%
ALABAMA4693.2%
WISCONSIN4633.1%
OHIO4413.0%
MICHIGAN4262.9%
NORTH CAROLINA4222.9%
KANSAS4182.8%
INDIANA4082.8%
KENTUCKY3832.6%
Top values (rank 1–20)
  1. TEXAS — 1,450
  2. MISSOURI — 648
  3. ARKANSAS — 602
  4. MISSISSIPPI — 570
  5. GEORGIA — 562
  6. ILLINOIS — 560
  7. IOWA — 527
  8. LOUISIANA — 507
  9. TENNESSEE — 499
  10. FLORIDA — 498
  11. OKLAHOMA — 490
  12. NEBRASKA — 486
  13. ALABAMA — 469
  14. WISCONSIN — 463
  15. OHIO — 441
  16. MICHIGAN — 426
  17. NORTH CAROLINA — 422
  18. KANSAS — 418
  19. INDIANA — 408
  20. KENTUCKY — 383

magnitude categorical

51.8% null
rows14,770
null7,648 (51.8%)
unique170
top_value0
top_rate0.542
cardinality170
entropy3.586
entropy_ratio0.484
Show data table
Top values for magnitude (20 unique shown, of 170 total).
valuecountshare
0386326.2%
1.753832.6%
2.752201.5%
70.001621.1%
50.001511.0%
2.001501.0%
2.501230.8%
61.001220.8%
65.001040.7%
52.00950.6%
78.00800.5%
70790.5%
3.00770.5%
56.00760.5%
87.00650.4%
60.00630.4%
50590.4%
60540.4%
1.50500.3%
61470.3%
Top values (rank 1–20)
  1. 0 — 3,863
  2. 1.75 — 383
  3. 2.75 — 220
  4. 70.00 — 162
  5. 50.00 — 151
  6. 2.00 — 150
  7. 2.50 — 123
  8. 61.00 — 122
  9. 65.00 — 104
  10. 52.00 — 95
  11. 78.00 — 80
  12. 70 — 79
  13. 3.00 — 77
  14. 56.00 — 76
  15. 87.00 — 65
  16. 60.00 — 63
  17. 50 — 59
  18. 60 — 54
  19. 1.50 — 50
  20. 61 — 47

injuries categorical

rows14,770
null0 (0.0%)
unique178
top_value0
top_rate0.681
cardinality178
entropy2.468
entropy_ratio0.330
Show data table
Top values for injuries (20 unique shown, of 178 total).
valuecountshare
01006468.1%
18936.0%
25523.7%
33432.3%
42361.6%
52341.6%
102191.5%
61961.3%
121581.1%
71340.9%
81210.8%
201140.8%
151110.8%
11900.6%
9850.6%
13700.5%
14690.5%
30680.5%
25560.4%
16480.3%
Top values (rank 1–20)
  1. 0 — 10,064
  2. 1 — 893
  3. 2 — 552
  4. 3 — 343
  5. 4 — 236
  6. 5 — 234
  7. 10 — 219
  8. 6 — 196
  9. 12 — 158
  10. 7 — 134
  11. 8 — 121
  12. 20 — 114
  13. 15 — 111
  14. 11 — 90
  15. 9 — 85
  16. 13 — 70
  17. 14 — 69
  18. 30 — 68
  19. 25 — 56
  20. 16 — 48

fatalities categorical

rows14,770
null0 (0.0%)
unique49
top_value0
top_rate0.691
cardinality49
entropy1.423
entropy_ratio0.254
Show data table
Top values for fatalities (20 unique shown, of 49 total).
valuecountshare
01020969.1%
1320821.7%
26494.4%
32221.5%
41120.8%
5740.5%
6660.4%
7380.3%
9250.2%
10240.2%
8210.1%
11200.1%
13110.1%
16100.1%
1290.1%
1480.1%
1760.0%
2060.0%
2540.0%
2330.0%
Top values (rank 1–20)
  1. 0 — 10,209
  2. 1 — 3,208
  3. 2 — 649
  4. 3 — 222
  5. 4 — 112
  6. 5 — 74
  7. 6 — 66
  8. 7 — 38
  9. 9 — 25
  10. 10 — 24
  11. 8 — 21
  12. 11 — 20
  13. 13 — 11
  14. 16 — 10
  15. 12 — 9
  16. 14 — 8
  17. 17 — 6
  18. 20 — 6
  19. 25 — 4
  20. 23 — 3

damage_property text

100.0% rows are a single word 87.2% rows are all-caps 95th-percentile length under 20 chars 93.1% duplicate strings
rows14,770
null0 (0.0%)
unique1,014
len_min0
len_max8
len_mean4.381
len_median5.000
len_p957.000
word_mean1.000
word_median1.000
n_empty368
n_duplicates13,756
duplicate_rate0.931
vocab_size1,013
readability_flesch_mean116.977
emoji_rate0.000
url_rate0.000
one_word_rate1.000
allcaps_rate0.872
boilerplate_rate0.000
Show data table
Character-length distribution for damage_property (mean: 4.380568720379147).
charscount
0 – 0368
0 – 00
0 – 10
1 – 10
1 – 10
1 – 1264
1 – 10
1 – 20
2 – 20
2 – 20
2 – 21252
2 – 20
2 – 30
3 – 30
3 – 30
3 – 31172
3 – 30
3 – 40
4 – 40
4 – 40
4 – 43414
4 – 40
4 – 50
5 – 50
5 – 50
5 – 56075
5 – 50
5 – 60
6 – 60
6 – 60
6 – 61450
6 – 60
6 – 70
7 – 70
7 – 70
7 – 7514
7 – 70
7 – 80
8 – 80
8 – 8261
Sample values (first 10)
  1. 250K
  2. 40.00M
  3. 3.00M
  4. 20.00K
  5. 3.17M
  6. 15.00M
  7. 1.00M
  8. 0.00K
  9. 0.00K
  10. 25K

source categorical

top value is 100.0% of rows
rows14,770
null0 (0.0%)
unique1
top_valueNOAA Storm Events Database
top_rate1.000
cardinality1
entropy-0.000
entropy_ratio0.000
Show data table
Top values for source (1 unique shown, of 1 total).
valuecountshare
NOAA Storm Events Database14770100.0%
Top values (rank 1–20)
  1. NOAA Storm Events Database — 14,770