This dataset contains 5,411 Bigfoot sighting reports sourced from the Bigfoot Field Researchers Organization (BFRO), covering sightings across 53 U.S. states and territories with attributes including location, date, classification, and a short description. Washington state dominates with 631 reports (about 12% of all sightings), followed by California and Ohio, suggesting strong geographic clustering worth examining. The temporal distribution is skewed toward more recent decades — median year is 2001 with records stretching back to 1870 — raising questions about whether sightings are truly increasing or simply better reported. Sighting classifications split almost evenly between Class A (direct sightings, 2,655) and Class B (indirect evidence, 2,722), with Class C being rare at just 34 reports.
saturn
/home/coolhand/html/datavis/data_trove/data/wild/bigfoot_sightings.json 5,411 rows sample n=5,411 seed 42 2026-06-22T00:50:12+00:00
Overview
| Source | /home/coolhand/html/datavis/data_trove/data/wild/bigfoot_sightings.json |
| Total rows | 5,411 |
| Profiled sample | 5,411 |
| Columns | 9 |
| Generated | 2026-06-22T00:50:12+00:00 |
Show data table
| column | kind | null % |
|---|---|---|
| id | numeric | 0.0% |
| state | categorical | 0.0% |
| state_code | categorical | 0.0% |
| county | text | 0.0% |
| url | text | 0.0% |
| month | categorical | 3.0% |
| year | numeric | 1.1% |
| classification | categorical | 0.0% |
| description | text | 0.0% |
Insights opt-in
Model-generated narrative. These are opinions, not facts — the stats below are what saturn measured. Generated by: anthropic:default.
This column contains US county names, functioning as a categorical geographic label with 1,022 unique values across 5,411 rows. The duplicate rate is high at 81.1%, which is expected for a county field where many records share the same geography. Notably, 338 rows (6.2%) have empty strings rather than nulls, masking true missingness since null_rate reports 0.0. The top counties — Pierce, Jefferson, Lewis, Washington, Snohomish — suggest a Pacific Northwest-heavy dataset, but the presence of Humboldt and Polk hints at multi-state coverage.
This column contains unique URLs pointing to individual report pages on bfro.net (the Bigfoot Field Researchers Organization database), all following the pattern `https://www.bfro.net/gdb/show_report.asp?id=
This column contains short free-text descriptions of reported sightings — most likely UFO or wildlife sighting reports, inferred from the high-frequency terms 'sighting', 'near', and 'possible'. With 5,407 unique values out of 5,411 rows and zero nulls, entries are nearly all distinct; the 4 duplicates (duplicate_rate 0.00074) are negligible. Mean length of ~67 characters and ~10 words per entry suggests structured-but-natural one-line summaries rather than long narratives. Flesch readability of 55.7 indicates plain, accessible prose with a vocabulary of 7,169 unique tokens across the corpus.
This column is a numeric row identifier: all 5,411 values are unique, there are no nulls, and the zero rate is 0.0, consistent with a primary key or surrogate ID. The IDs are not sequential (range 60–79,711 with a mean of ~23,288 and median of ~16,598), suggesting they originate from a larger parent table or were assigned non-contiguously. Mild positive skew (0.91) indicates more records cluster in lower ID ranges, but the near-zero kurtosis (−0.15) and absence of outliers confirm a broadly spread, roughly uniform-ish distribution rather than a tightly clustered one.
This column represents calendar years for records in the dataset, spanning 1870 to 2025 with 99 distinct values. The distribution is left-skewed (skew = -0.974) with a mean of ~1998 and IQR of 22 years (1987–2009), meaning the bulk of records cluster in the late 20th to early 21st century while a thin tail extends back to 1870. The 49 outliers (0.9%) likely correspond to those historically distant records, and analysts should verify whether pre-20th-century entries are genuine or data quality issues.
This column is a three-level ordinal or nominal classification label applied to all 5,411 rows with no nulls. The distribution is nearly balanced between 'Class B' (2,722; 50.3%) and 'Class A' (2,655; 49.1%), but 'Class C' is severely underrepresented with only 34 instances (~0.6%), which would surprise any analyst expecting a balanced multi-class target and will require oversampling or class-weight adjustments if used as a target variable.
This column contains US state names, with all 50 states likely represented plus Washington D.C. and possibly territories (53 unique values, 0 nulls across 5,411 rows). Washington dominates at 11.7% (631 records), roughly 1.5× California's share (431), suggesting a dataset with geographic bias toward the Pacific Northwest. Entropy ratio of 0.877 indicates reasonably broad distribution across states, though concentration in a handful of large/coastal states is apparent.
This column contains US state abbreviations (plus possibly DC and territories, explaining the 53 distinct values vs. 50 states). Washington ('wa') is notably over-represented at 11.7% of 5,411 rows — roughly 1.5× California ('ca') and nearly 2× Ohio ('oh') — suggesting a geographic skew toward the Pacific Northwest rather than a nationally representative sample. Entropy ratio of 0.877 indicates reasonably broad distribution across states, but the top-heavy concentration in 'wa' is worth flagging.
This column represents calendar month names, but with a cardinality of 32 instead of the expected 12, there are clearly duplicate or variant entries beyond the standard month labels — likely encoding errors, alternate spellings, or appended year/year-month combinations. The distribution is notably skewed toward summer and early-autumn months (August 634, October 632, July 618), with winter months dramatically underrepresented (December 233, January 228), suggesting seasonal bias in data collection. The entropy_ratio of 0.761 across 32 unique values rather than 12 is a strong flag that this field is dirty and needs normalisation before use.
Numeric correlation
Show data table
| id | year | |
|---|---|---|
| id | +1.00 | +0.12 |
| year | +0.12 | +1.00 |
id numeric
Show data table
| bin | count |
|---|---|
| 60 – 2051 | 743 |
| 2051 – 4043 | 469 |
| 4043 – 6034 | 305 |
| 6034 – 8025 | 306 |
| 8025 – 1.002e+04 | 268 |
| 1.002e+04 – 1.201e+04 | 202 |
| 1.201e+04 – 1.4e+04 | 198 |
| 1.4e+04 – 1.599e+04 | 176 |
| 1.599e+04 – 1.798e+04 | 119 |
| 1.798e+04 – 1.997e+04 | 81 |
| 1.997e+04 – 2.196e+04 | 89 |
| 2.196e+04 – 2.396e+04 | 146 |
| 2.396e+04 – 2.595e+04 | 254 |
| 2.595e+04 – 2.794e+04 | 215 |
| 2.794e+04 – 2.993e+04 | 191 |
| 2.993e+04 – 3.192e+04 | 105 |
| 3.192e+04 – 3.391e+04 | 77 |
| 3.391e+04 – 3.59e+04 | 85 |
| 3.59e+04 – 3.789e+04 | 98 |
| 3.789e+04 – 3.989e+04 | 91 |
| 3.989e+04 – 4.188e+04 | 113 |
| 4.188e+04 – 4.387e+04 | 90 |
| 4.387e+04 – 4.586e+04 | 90 |
| 4.586e+04 – 4.785e+04 | 84 |
| 4.785e+04 – 4.984e+04 | 71 |
| 4.984e+04 – 5.183e+04 | 80 |
| 5.183e+04 – 5.382e+04 | 10 |
| 5.382e+04 – 5.582e+04 | 33 |
| 5.582e+04 – 5.781e+04 | 70 |
| 5.781e+04 – 5.98e+04 | 92 |
| 5.98e+04 – 6.179e+04 | 18 |
| 6.179e+04 – 6.378e+04 | 78 |
| 6.378e+04 – 6.577e+04 | 47 |
| 6.577e+04 – 6.776e+04 | 65 |
| 6.776e+04 – 6.975e+04 | 42 |
| 6.975e+04 – 7.175e+04 | 8 |
| 7.175e+04 – 7.374e+04 | 33 |
| 7.374e+04 – 7.573e+04 | 45 |
| 7.573e+04 – 7.772e+04 | 50 |
| 7.772e+04 – 7.971e+04 | 74 |
state categorical
Show data table
| value | count | share |
|---|---|---|
| Washington | 631 | 11.7% |
| California | 431 | 8.0% |
| Ohio | 317 | 5.9% |
| Florida | 314 | 5.8% |
| Oregon | 253 | 4.7% |
| Illinois | 239 | 4.4% |
| Texas | 238 | 4.4% |
| Michigan | 217 | 4.0% |
| Missouri | 161 | 3.0% |
| Georgia | 135 | 2.5% |
| Colorado | 128 | 2.4% |
| Pennsylvania | 125 | 2.3% |
| British Columbia | 122 | 2.3% |
| New York | 116 | 2.1% |
| Kentucky | 115 | 2.1% |
| Arkansas | 104 | 1.9% |
| Tennessee | 104 | 1.9% |
| West Virginia | 104 | 1.9% |
| Oklahoma | 101 | 1.9% |
| Idaho | 99 | 1.8% |
Top values (rank 1–20)
- Washington — 631
- California — 431
- Ohio — 317
- Florida — 314
- Oregon — 253
- Illinois — 239
- Texas — 238
- Michigan — 217
- Missouri — 161
- Georgia — 135
- Colorado — 128
- Pennsylvania — 125
- British Columbia — 122
- New York — 116
- Kentucky — 115
- Arkansas — 104
- Tennessee — 104
- West Virginia — 104
- Oklahoma — 101
- Idaho — 99
state_code categorical
Show data table
| value | count | share |
|---|---|---|
| wa | 631 | 11.7% |
| ca | 431 | 8.0% |
| oh | 317 | 5.9% |
| fl | 314 | 5.8% |
| or | 253 | 4.7% |
| il | 239 | 4.4% |
| tx | 238 | 4.4% |
| mi | 217 | 4.0% |
| mo | 161 | 3.0% |
| ga | 135 | 2.5% |
| co | 128 | 2.4% |
| pa | 125 | 2.3% |
| ca-bc | 122 | 2.3% |
| ny | 116 | 2.1% |
| ky | 115 | 2.1% |
| ar | 104 | 1.9% |
| tn | 104 | 1.9% |
| wv | 104 | 1.9% |
| ok | 101 | 1.9% |
| id | 99 | 1.8% |
Top values (rank 1–20)
- wa — 631
- ca — 431
- oh — 317
- fl — 314
- or — 253
- il — 239
- tx — 238
- mi — 217
- mo — 161
- ga — 135
- co — 128
- pa — 125
- ca-bc — 122
- ny — 116
- ky — 115
- ar — 104
- tn — 104
- wv — 104
- ok — 101
- id — 99
county text
Show data table
| chars | count |
|---|---|
| 0 – 1 | 338 |
| 1 – 1 | 0 |
| 1 – 2 | 0 |
| 2 – 2 | 0 |
| 2 – 3 | 0 |
| 3 – 3 | 28 |
| 3 – 4 | 457 |
| 4 – 5 | 0 |
| 5 – 5 | 640 |
| 5 – 6 | 0 |
| 6 – 6 | 1110 |
| 6 – 7 | 0 |
| 7 – 7 | 802 |
| 7 – 8 | 916 |
| 8 – 9 | 0 |
| 9 – 9 | 608 |
| 9 – 10 | 0 |
| 10 – 10 | 301 |
| 10 – 11 | 0 |
| 11 – 12 | 62 |
| 12 – 12 | 94 |
| 12 – 13 | 0 |
| 13 – 13 | 5 |
| 13 – 14 | 0 |
| 14 – 14 | 24 |
| 14 – 15 | 0 |
| 15 – 16 | 16 |
| 16 – 16 | 3 |
| 16 – 17 | 0 |
| 17 – 17 | 3 |
| 17 – 18 | 0 |
| 18 – 18 | 0 |
| 18 – 19 | 0 |
| 19 – 20 | 3 |
| 20 – 20 | 0 |
| 20 – 21 | 0 |
| 21 – 21 | 0 |
| 21 – 22 | 0 |
| 22 – 22 | 0 |
| 22 – 23 | 1 |
Sample values (first 10)
- Bibb
- Houston
- Lewis
- Cowlitz
- Montmorency
- Cass
- Skamania
- Ferry
- Navajo
url text
Show data table
| chars | count |
|---|---|
| 46 – 46 | 11 |
| 46 – 46 | 0 |
| 46 – 46 | 0 |
| 46 – 46 | 0 |
| 46 – 46 | 0 |
| 46 – 46 | 0 |
| 46 – 47 | 0 |
| 47 – 47 | 0 |
| 47 – 47 | 0 |
| 47 – 47 | 0 |
| 47 – 47 | 0 |
| 47 – 47 | 0 |
| 47 – 47 | 0 |
| 47 – 47 | 288 |
| 47 – 47 | 0 |
| 47 – 47 | 0 |
| 47 – 47 | 0 |
| 47 – 47 | 0 |
| 47 – 47 | 0 |
| 47 – 48 | 0 |
| 48 – 48 | 0 |
| 48 – 48 | 0 |
| 48 – 48 | 0 |
| 48 – 48 | 0 |
| 48 – 48 | 0 |
| 48 – 48 | 0 |
| 48 – 48 | 1789 |
| 48 – 48 | 0 |
| 48 – 48 | 0 |
| 48 – 48 | 0 |
| 48 – 48 | 0 |
| 48 – 48 | 0 |
| 48 – 48 | 0 |
| 48 – 49 | 0 |
| 49 – 49 | 0 |
| 49 – 49 | 0 |
| 49 – 49 | 0 |
| 49 – 49 | 0 |
| 49 – 49 | 0 |
| 49 – 49 | 3323 |
Sample values (first 10)
- https://www.bfro.net/GDB/show_report.asp?id=21714
- https://www.bfro.net/GDB/show_report.asp?id=7604
- https://www.bfro.net/GDB/show_report.asp?id=12803
- https://www.bfro.net/GDB/show_report.asp?id=55658
- https://www.bfro.net/GDB/show_report.asp?id=27009
- https://www.bfro.net/GDB/show_report.asp?id=43994
- https://www.bfro.net/GDB/show_report.asp?id=14942
- https://www.bfro.net/GDB/show_report.asp?id=21473
- https://www.bfro.net/GDB/show_report.asp?id=25722
- https://www.bfro.net/GDB/show_report.asp?id=29358
month categorical
Show data table
| value | count | share |
|---|---|---|
| August | 634 | 11.7% |
| October | 632 | 11.7% |
| July | 618 | 11.4% |
| September | 515 | 9.5% |
| June | 468 | 8.6% |
| November | 458 | 8.5% |
| May | 303 | 5.6% |
| April | 259 | 4.8% |
| December | 233 | 4.3% |
| January | 228 | 4.2% |
| Summer | 217 | 4.0% |
| March | 201 | 3.7% |
| February | 163 | 3.0% |
| Fall | 129 | 2.4% |
| Spring | 96 | 1.8% |
| Winter | 57 | 1.1% |
| Late | 6 | 0.1% |
| about | 6 | 0.1% |
| mid | 5 | 0.1% |
| or | 5 | 0.1% |
Top values (rank 1–20)
- August — 634
- October — 632
- July — 618
- September — 515
- June — 468
- November — 458
- May — 303
- April — 259
- December — 233
- January — 228
- Summer — 217
- March — 201
- February — 163
- Fall — 129
- Spring — 96
- Winter — 57
- Late — 6
- about — 6
- mid — 5
- or — 5
year numeric
Show data table
| bin | count |
|---|---|
| 1870 – 1874 | 1 |
| 1874 – 1878 | 0 |
| 1878 – 1882 | 0 |
| 1882 – 1886 | 0 |
| 1886 – 1889 | 0 |
| 1889 – 1893 | 1 |
| 1893 – 1897 | 0 |
| 1897 – 1901 | 0 |
| 1901 – 1905 | 0 |
| 1905 – 1909 | 1 |
| 1909 – 1913 | 1 |
| 1913 – 1916 | 0 |
| 1916 – 1920 | 2 |
| 1920 – 1924 | 2 |
| 1924 – 1928 | 2 |
| 1928 – 1932 | 2 |
| 1932 – 1936 | 4 |
| 1936 – 1940 | 2 |
| 1940 – 1944 | 5 |
| 1944 – 1948 | 4 |
| 1948 – 1951 | 15 |
| 1951 – 1955 | 13 |
| 1955 – 1959 | 18 |
| 1959 – 1963 | 24 |
| 1963 – 1967 | 53 |
| 1967 – 1971 | 120 |
| 1971 – 1975 | 158 |
| 1975 – 1978 | 331 |
| 1978 – 1982 | 307 |
| 1982 – 1986 | 257 |
| 1986 – 1990 | 224 |
| 1990 – 1994 | 195 |
| 1994 – 1998 | 380 |
| 1998 – 2002 | 610 |
| 2002 – 2006 | 679 |
| 2006 – 2010 | 622 |
| 2010 – 2013 | 616 |
| 2013 – 2017 | 355 |
| 2017 – 2021 | 220 |
| 2021 – 2025 | 130 |
classification categorical
Show data table
| value | count | share |
|---|---|---|
| Class B | 2722 | 50.3% |
| Class A | 2655 | 49.1% |
| Class C | 34 | 0.6% |
Top values (rank 1–20)
- Class B — 2,722
- Class A — 2,655
- Class C — 34
description text
Show data table
| chars | count |
|---|---|
| 10 – 15 | 2 |
| 15 – 21 | 4 |
| 21 – 26 | 21 |
| 26 – 31 | 56 |
| 31 – 36 | 108 |
| 36 – 42 | 185 |
| 42 – 47 | 376 |
| 47 – 52 | 551 |
| 52 – 57 | 525 |
| 57 – 63 | 568 |
| 63 – 68 | 692 |
| 68 – 73 | 495 |
| 73 – 79 | 486 |
| 79 – 84 | 369 |
| 84 – 89 | 330 |
| 89 – 94 | 196 |
| 94 – 100 | 135 |
| 100 – 105 | 99 |
| 105 – 110 | 74 |
| 110 – 116 | 42 |
| 116 – 121 | 26 |
| 121 – 126 | 23 |
| 126 – 131 | 10 |
| 131 – 137 | 9 |
| 137 – 142 | 6 |
| 142 – 147 | 4 |
| 147 – 152 | 6 |
| 152 – 158 | 2 |
| 158 – 163 | 1 |
| 163 – 168 | 2 |
| 168 – 174 | 3 |
| 174 – 179 | 0 |
| 179 – 184 | 0 |
| 184 – 189 | 2 |
| 189 – 195 | 1 |
| 195 – 200 | 0 |
| 200 – 205 | 0 |
| 205 – 210 | 0 |
| 210 – 216 | 0 |
| 216 – 221 | 2 |
Sample values (first 10)
- Rescue workers describes possible stalking on the Cahaba River outside Montevallo
- Possible bigfoot activity near Walker County line.
- Hikers off Lewis River Trail find large footprint east of Cougar
- Hunters on bikes have close encounter with a sasquatch near Randle
- Woman recalls daylight sighting while driving and a possible incident at a home east of Gaylord
- Possible sighting near Bowser on Vancouver Island
- County workers find possible footprints near Bena
- Man and girlfriend, camping, hear loud footsteps and tree knocking near Cispus, Washington
- Possible reoccurring activity at a hunting spot near Republic
- Brief daylight sighting within the city limits of Show Low