Luke's Data Hoard

Centralized index of datasets spanning paranormal phenomena, natural disasters, linguistics, economics, culture, health, geography, and accessibility research

12 Categories

Paranormal & Anomalous

6 datasets

Ghost sightings, UFO reports, Bigfoot encounters, cryptids, crop circles, and roller coaster data.

Strange Places v5.2 Public
354,770 records · 14 categories · Global coverage

354,770 georeferenced records of mysterious phenomena drawn from NASA, NOAA, USGS, BFRO, NUFORC, OpenStreetMap, the Megalithic Portal, and the Shadowlands Haunted Places Index. The 14 categories are tornadoes, caves, UFOs, megaliths, meteorites, ghost towns, storms, haunted places, thermal springs, bigfoot sightings, earthquakes, shipwrecks, fireballs, and volcanoes.

Paranormal Earth Science UFO Geospatial
Ghost Sightings Database Locally Cached
9,717 reports · US locations · Geolocated

Ghost sighting reports from The Shadowlands archive with location data, descriptions, and regional analysis.

Ghosts Haunted Regional
UFO Sightings Analysis Locally Cached
60k+ sightings · By state, shape, year · Time series

Detailed breakdowns of UFO sightings from NUFORC data with aggregations by state, shape classification, region, year, and cross-tabulations.

UFO Time Series Geographic
BFRO Bigfoot Sightings (Full Scrape) Locally Cached
5,400+ reports · US + Canada · 1950s–2025

Every report from the Bigfoot Field Researchers Organization, scraped county by county across all 50 US states and 7 Canadian provinces. Includes Class A/B/C classification, date, county, and description for each sighting. Phase 2 enrichment adds GPS coordinates, season, and nearest town from individual report pages.

Cryptids Bigfoot BFRO Georeferenced
Roller Coaster Database Locally Cached
5,000+ coasters · Height, speed, length · Worldwide

Roller coaster specifications from RCDB including height, speed, length, inversions, manufacturer, and operating status.

Roller Coasters Theme Parks Engineering
Witch Trials Locally Cached
10,940 records

European witch trial records with year, city, country, and the number of people tried and killed.

witch-trials history

Nature & Earth Science

22 datasets

Meteorites, earthquakes, exoplanets, solar flares, bird migration, coral reefs, waterfalls, volcanoes, planets, moons, and aurora forecasts.

Large Meteorites (10kg+) Public
1,000+ falls · 10kg+ minimum · Geolocated

NASA's catalog of significant meteorite falls and finds over 10kg with coordinates, mass, classification, and discovery year.

Meteorites NASA Space
USGS Significant Earthquakes Public
3,742 events · Magnitude 4.5+ · 1900-present

Significant earthquake events from USGS with magnitude, depth, location, and impact data.

Earthquakes USGS Seismic
NASA Exoplanet Archive Locally Cached
5,000+ planets · Confirmed · Orbital data

Confirmed exoplanets from NASA with orbital parameters, mass, radius, discovery method, and host star properties.

Exoplanets NASA Astronomy
Solar Flares & Space Weather Locally Cached
Major events · X-class flares · Recent years

NOAA space weather data including solar flares, coronal mass ejections, and geomagnetic storm forecasts.

Solar Flares NOAA Space Weather
Bird Migration Patterns Locally Cached
1M+ observations · Geolocated · Seasonal

eBird observation data showing bird species migration patterns, stopover locations, and seasonal timing across North America.

Birds Migration eBird
Coral Reef Monitoring Locally Cached
500+ sites · Temperature · Bleaching data

Global coral reef health monitoring data including bleaching events, temperature anomalies, and biodiversity assessments.

Coral Reefs Ocean Climate
Waterfalls Worldwide Public
80,678 waterfalls · Height data · Global

Notable waterfalls worldwide with height, flow rate, location, and accessibility information.

Waterfalls Nature Tourism
Global Volcanism Program Locally Cached
1,500+ volcanoes · Eruption history · Geolocated

Smithsonian Institution's database of active and potentially active volcanoes with eruption histories and hazard assessments.

Volcanoes Smithsonian Geology
Solar System Planets Locally Cached
8 planets · Physical properties · Orbital data

Solar system planet data: mass, diameter, orbital period, distance from sun, and atmospheric composition.

Planets Solar System Astronomy
Natural Satellites (Moons) Locally Cached
200+ moons · Planetary systems · Size & orbit data

Natural satellites in our solar system with parent planet, orbital characteristics, physical properties, and discovery information.

Moons Satellites Astronomy
NOAA Ovation Aurora Forecast Locally Cached
Aurora predictions · Geomagnetic data · NOAA SWPC

NOAA Space Weather Prediction Center's Ovation aurora forecast model with predicted viewing probabilities by location and time.

Aurora Space Weather NOAA
Lighthouses Worldwide Locally Cached
14,585 records

Lighthouse locations from OpenStreetMap with height, year built, and operator.

lighthouses navigation osm
Caves Worldwide Locally Cached
69,716 records

Cave locations from OpenStreetMap with access info, tourism status, and descriptions.

caves geology osm
Shipwrecks Locally Cached
5,569 records

Shipwreck locations from OpenStreetMap with year sunk, type, and Wikipedia links.

shipwrecks maritime osm
Megaliths Locally Cached
15,464 records

Megalithic site locations from OpenStreetMap, including dolmens, menhirs, and stone circles.

megaliths archaeology osm
Ancient Ruins Locally Cached
96,876 records

Archaeological and historic ruin locations from OpenStreetMap with site type, period, and Wikipedia links.

ruins archaeology osm
Fossils (PBDB) Locally Cached
22,043 records

Fossil occurrence records from the Paleobiology Database with taxonomic classification, age, and coordinates.

fossils paleontology pbdb
Deep Sea Specimens Locally Cached
200,000 records

Deep sea organism occurrences (deeper than 1,000 m) from OBIS with full taxonomic hierarchy.

deep-sea marine obis
Bioluminescence Locally Cached
43,060 records

Bioluminescent organism occurrences from OBIS with taxonomic classification and coordinates.

bioluminescence marine obis
Tornadoes (NOAA SPC) Locally Cached
70,022 records

US tornado records from the NOAA Storm Prediction Center with magnitude, injuries, fatalities, and path data.

tornadoes weather noaa

Disasters & Extremes

10 datasets

Airplane crashes, shark attacks, lightning strikes, storms, temperature anomalies, wildfires, and disaster mashups.

Airplane Crashes & Fatalities (1908-2009) Local Data
5,268 incidents · 1908-2009 · CSV 1.5 MB

Historical record of airplane crashes and fatalities worldwide from the first recorded airplane fatality (1908) through 2009. Includes operator, aircraft type, route, and casualty details.

Aviation Crashes Fatalities
Global Shark Attack File (GSAF) Local Data
6,463 incidents · 1845-2020 · CSV 3.5 MB

Shark attack records from the Shark Research Institute covering provoked, unprovoked, and boat attacks with species, activity, and outcome details.

Sharks Ocean Animal Attacks
NOAA Lightning Strikes (2018) Local Data
3.4M strike records · 2018 · US continental

Daily lightning strike counts with geographic center points from NOAA's National Lightning Detection Network. Preprocessed data available for heatmap, monthly, and state-level analysis.

Lightning NOAA Weather
US Disasters Mashup Public
54,575 events · NOAA + USGS + NTSB + EPA · Geolocated

Combined disaster dataset merging NOAA storms, USGS earthquakes, NTSB aviation accidents, and EPA Superfund sites across the US.

Disasters NOAA USGS Aviation Superfund
NOAA Significant Storms Public
14,770 events · Tornadoes, hurricanes, floods · Historical

Curated significant US storm events from NOAA with geographic coordinates, event types, and descriptions.

Storms NOAA Weather
Temperature Anomalies (NASA GISS) Public
1880-2015 · Global anomalies · Gridded data

NASA GISS Surface Temperature Analysis showing global temperature anomalies relative to 1951-1980 baseline with monthly and annual data.

Climate NASA Temperature
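
A minimal sketch of what "anomalies relative to 1951-1980 baseline" means in the Temperature Anomalies entry above: subtract the mean of the baseline years from every value. The helper name `anomalies` and the temperatures are made up for illustration. This is not the GISS processing pipeline, which works on gridded station data.

```python
from statistics import mean


def anomalies(temps_by_year, base_start=1951, base_end=1980):
    """Return {year: temperature - baseline_mean}.

    The baseline mean is taken over the years in temps_by_year that fall
    inside [base_start, base_end]. Hypothetical helper, not part of any
    NASA tooling.
    """
    base = [t for y, t in temps_by_year.items() if base_start <= y <= base_end]
    if not base:
        raise ValueError("no years fall inside the baseline period")
    baseline = mean(base)
    return {y: round(t - baseline, 3) for y, t in temps_by_year.items()}


# Made-up annual means in degrees Celsius, for illustration only.
sample = {1951: 14.0, 1965: 14.0, 1980: 14.3, 2015: 14.9}
result = anomalies(sample)
```

A positive value means the year was warmer than the baseline average, a negative value cooler.
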
Wildfire Incidents Locally Cached
20+ years · Acreage burned · Perimeters

National Interagency Fire Center data on wildfire incidents including location, size, containment status, and fire perimeters.

Wildfires NIFC Fire
FEMA Flood Insurance Claims Locally Cached
5M+ claims · Payout amounts · 1970-present

National Flood Insurance Program claims data showing flood event frequency, severity, and financial impacts by county.

Floods FEMA Insurance
US Drought Monitor Locally Cached
Weekly updates · Severity levels · County-level

US Drought Monitor weekly drought severity classifications showing spatial extent and intensity of drought conditions.

Drought NOAA Climate
Extreme Heat Events Locally Cached
Heat index 105°F+ · Mortality data · Urban focus

Extreme heat event tracking with heat index calculations, duration, affected populations, and heat-related mortality statistics.

Heat Waves Public Health Urban

Civic & Urban

7 datasets

NYC 311, GDELT events, parking tickets, restaurant inspections, bridges, Wikipedia activity, Internet Archive, and Bluesky social.

NYC 311 Service Requests Locally Cached
30M+ requests · 2010-present · Geolocated

Complete NYC 311 service request data including noise complaints, street conditions, housing issues, and response times by neighborhood.

NYC 311 Urban
US Attention Data (2020-2025) Public
3 sources · Weekly cadence · Global attention

Cross-platform attention metrics tracking global focus on the United States. Combines Wikipedia pageviews, GDELT global event mentions, and Google Trends search interest on a weekly cadence from 2020-2025.

Attention Wikipedia GDELT Google Trends
GDELT Global Events Locally Cached
500M+ events · News-derived · Real-time

GDELT Project's global event database tracking political, social, and conflict events worldwide from news media coverage.

GDELT News Events
Wikipedia Pageviews Locally Cached
Top articles · Daily/monthly · By language

Wikipedia pageview statistics showing trending articles, search patterns, and attention flows across topics and languages.

Wikipedia Pageviews Attention
Bluesky Social Posts Public
101K posts · Sentiment analyzed · Anonymized

Anonymized Bluesky posts with VADER sentiment analysis. DIDs and URIs are SHA-256 hashed. Covers 44K unique authors and posts from Dec 2-25, 2025. Released under a CC-BY-4.0 license.

Social Media Bluesky NLP
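
The Bluesky entry above mentions SHA-256 hashed DIDs/URIs. A minimal sketch of that kind of hashing with Python's standard `hashlib` follows. The function name `hash_identifier` and the sample DID are hypothetical, and the source does not say whether the dataset salts or otherwise preprocesses identifiers before hashing.

```python
import hashlib


def hash_identifier(identifier: str) -> str:
    """Return the hex SHA-256 digest of an identifier string.

    Hypothetical helper: an unsalted hash, which is an assumption here.
    """
    return hashlib.sha256(identifier.encode("utf-8")).hexdigest()


# Hypothetical DID, not taken from the dataset.
did = "did:plc:example123"
digest = hash_identifier(did)
```

The same input always yields the same 64-character digest, so posts by one author can still be grouped without exposing the original DID.
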
Parking Tickets (NYC) Locally Cached
10M+ tickets · Fine amounts · Locations

NYC parking violation data with ticket types, fine amounts, vehicle types, and issuance locations for spatial and temporal analysis.

Parking NYC Fines
Internet Archive Lending Locally Cached
Lending data · Circulation stats · Historical

Internet Archive's digital lending library statistics showing book circulation patterns, popular titles, and access trends.

Books Internet Archive Lending
National Bridge Inventory Locally Cached
600K+ bridges · Condition ratings · Annual inspections

Federal Highway Administration bridge inventory with structural condition ratings, age, traffic volume, and deficiency status.

Bridges Infrastructure NBI

Language & Linguistics

7 datasets

WALS, PHOIBLE, Glottolog, ISO 639-3, World Languages, CLMET, Historical Lemma, Language Families, Brown Corpus, COCA API, Historical Corpora, Joshua Project languages, and etymology atlas.

World Languages Integrated Public
7,900+ languages · ISO 639-3 · Speaker counts

Language database integrating ISO 639-3, Glottolog, native names, speaker populations, endangerment status, and geographic distribution.

Languages ISO 639-3 Linguistics
WALS (World Atlas of Language Structures) Locally Cached
192 features · 2,679 languages · 76K+ datapoints

Typological feature database covering phonology, morphology, syntax, and lexicon across languages worldwide.

WALS Typology Features
PHOIBLE (Phonetics Database) Locally Cached
105K+ phonemes · IPA notation · 3,000+ languages

Phoneme inventory database with IPA symbols, features (consonant/vowel/tone), and cross-linguistic phonological patterns.

Phonology PHOIBLE IPA
Glottolog Languoids Locally Cached
23,700+ entries · Coordinates · Family trees

Language classification with family hierarchies, geographic coordinates, and historical relationships.

Glottolog Families Classification
ISO 639-3 Language Codes Locally Cached
7,900+ codes · 3-letter standard · ISO 639-1/2 mappings

Official ISO 639-3 three-letter language codes with mappings to ISO 639-1 (2-letter) and ISO 639-2 (bibliographic) codes.

ISO 639-3 Standards Codes
Etymology Atlas Public
4.2M relationships · Cognate sets · Word forms

Etymological database from Lexibank CLDF data with cognate relationships, word forms, and language evolution patterns.

Etymology Historical Cognates
Joshua Project Global Peoples Public
7,134 languages · 16,382 people groups · 238 countries

Demographic, linguistic, and religious data for 16,382 people groups worldwide from the Joshua Project API. Includes Bible translation status, evangelical percentages, and enriched Parquet files for fast analysis.

Languages Joshua Project People Groups

Economics & Commerce

6 datasets

Billionaires, market caps, World Bank GDP, patents, retail locations, and NYC housing.

Forbes Billionaires Locally Cached
2,600+ billionaires · Net worth · Global

Forbes billionaire rankings with net worth, source of wealth, age, country, and industry classification.

Billionaires Forbes Wealth
World Bank GDP Data Locally Cached
200+ countries · Time series · USD values

Historical GDP data from World Bank with total GDP, per capita GDP, and growth rates for countries worldwide.

GDP World Bank Economics
Corporate Market Capitalizations Locally Cached
Major corporations · Market cap · Dow Jones

Market capitalization data for major corporations including Dow Jones Industrial Average components and Fortune 500 companies.

Market Cap Corporations Dow Jones
USPTO Patent Data Locally Cached
Millions of patents · Classifications · Historical

US Patent and Trademark Office data with patent classifications, filing dates, inventors, and citation networks.

Patents USPTO Innovation
Retail Store Locations Locally Cached
100K+ locations · Geolocated · Major chains

Retail store location data for major chains including Walmart, Target, Kroger, CVS, Walgreens, and regional chains.

Retail Stores Locations
NYC Housing Analysis Local Data
2,168 census tracts · Rent, income, tenure · 5 boroughs

Census tract-level housing data for all five NYC boroughs: median rent, household income, rent burden, and tenure (owner vs renter). Includes borough and neighborhood GeoJSON boundaries with data joined.

NYC Housing Rent Burden Census

Policy & Governance

5 datasets

Executive orders, tariff policies, federal workforce changes, market sector performance, and geopolitical events.

Executive Orders Database Stub
225 executive orders · Federal Register · Status tracking

Executive orders issued by U.S. presidents with metadata including issue date, Federal Register number, title, status, and policy areas.

Executive Orders Policy Federal Register
Tariff Timeline Stub
13.5% avg rate · Tax Foundation/Treasury · Trade policy

Historical and current tariff rates by product category, country, and time period. Includes weighted average rates and trade impact analysis.

Tariffs Trade Policy
DOGE Workforce Cuts Stub
317K personnel cuts · OPM/AllSides · By agency

Department of Government Efficiency proposed workforce reductions by federal agency, with personnel counts and policy rationale.

Federal Workforce DOGE Personnel
Market Sector Performance Stub
12 sectors · S&P/Goldman Sachs · Performance tracking

Economic sector performance indices tracking technology, finance, healthcare, energy, and other major market sectors over time.

Markets Sectors Economics
Geopolitical Actions Timeline Stub
Military/diplomatic events · Crisis Group sources · Timeline data

Timeline of significant geopolitical events including military actions, diplomatic incidents, and international policy changes.

Geopolitics Diplomacy International

Food & Culture

12 datasets

Cheese, wine, edible insects, OpenFoodFacts, baby names, emoji, street art, social norms, food access, SNAP, and grocery density.

World of Cheese Locally Cached
1,800+ varieties · 70+ countries · Milk types

Cheese database with varieties, countries of origin, milk types, aging processes, and flavor profiles.

Cheese Food Taxonomy
Wine Varieties & Regions Locally Cached
2,000+ varieties · Regions · Grape types

Wine varieties with regional classifications, grape composition, flavor profiles, and production statistics.

Wine Viticulture Food
Edible Insects Database Locally Cached
2,100+ species · Nutrition data · Regional use

Edible insect species with nutritional profiles, regional consumption patterns, and sustainability metrics.

Insects Entomophagy Nutrition
USDA Food Access Atlas Locally Cached
Census tracts · Supermarket access · Low-income data

USDA atlas measuring food access by census tract with low-income/low-access designations and SNAP usage statistics.

Food Access USDA Food Deserts
Food Desert States Summary Locally Cached
State-level data · Aggregated metrics · Food access

State-level aggregation of USDA Food Access Atlas data showing percentage of population in food deserts and low-access areas.

Food Deserts States USDA
SNAP Participation & Benefits Locally Cached
County-level · Participation rates · Benefit amounts

Supplemental Nutrition Assistance Program data showing participation rates, benefit amounts, and coverage gaps by county.

SNAP FNS Food Security
Grocery Store Density Locally Cached
Store counts · Per capita · County-level

Grocery store density measurements by county showing stores per capita and accessibility metrics for fresh food.

Grocery Density Food Access
Emoji Data Locally Cached
3,600+ emoji · Categories · Timeline

Complete Unicode emoji dataset with categories, release dates, names, and historical evolution of emoji standards.

Emoji Unicode Culture
Baby Names (SSA) Locally Cached
1880-2023 · Popularity trends · By gender

Social Security Administration baby name data showing popularity trends, regional variations, and cultural naming patterns.

Names SSA Demographics
OpenFoodFacts Database Locally Cached
2M+ products · Nutrition facts · Global

Open database of food products worldwide with nutrition facts, ingredients, allergens, labels, and Nutri-Score ratings.

Food Nutrition OpenFoodFacts
Chocolate Origins Locally Cached
2,530 records

Chocolate bar ratings with company, bean origin, cocoa percentage, and tasting notes.

chocolate food ratings
Hot Sauces & Peppers Locally Cached
101 records

Hot sauces and peppers with Scoville ratings, origins, and species.

hot-sauce peppers scoville
Social Norms Graph Locally Cached
Network graph

Network graph of social actions and moral judgments from the Social Chemistry 101 dataset.

social-norms morality network

Health, Housing & Veterans

13 datasets

Housing affordability, HUD PIT counts, eviction, CMS hospitals, Medicaid, hospital closures, food access, and veterans data (5 military datasets).

US Inequality Atlas Public
3,200+ counties · 5,421 hospitals · FIPS-keyed

County-level inequality data for all ~3,200 US counties, keyed on 5-digit FIPS codes. Covers food deserts, healthcare access, housing affordability, hospital infrastructure, and veteran demographics. All from Census ACS, CMS, USDA, and HRSA.

Inequality Food Deserts Healthcare Housing
US Housing Affordability Crisis Public
3,200+ counties · Rent burden · Income data

County-level housing affordability analysis with median rents, household incomes, cost burden percentages, and homelessness data.

Housing Affordability Rent Burden
HUD Point-in-Time Counts Locally Cached
Homelessness counts · Sheltered/unsheltered · Annual surveys

HUD Point-in-Time homeless population counts by CoC (Continuum of Care) with sheltered/unsheltered breakdowns and demographics.

Homelessness HUD PIT Counts
Eviction Data Locally Cached
County-level · Filing rates · Trends

Eviction Lab data showing eviction filing rates, judgment rates, and temporal trends across US counties.

Eviction Housing Displacement
US Military Veteran Analysis Public
County-level · Age, gender · 5 datasets merged

Veteran demographics merged from 5 sources: population, employment, homelessness, suicide rates, and firearm ownership correlations.

Veterans Military Demographics
Veteran Population by County Locally Cached
All US counties · Age cohorts · By gender

VA and Census veteran population estimates by county with age, gender, and period-of-service breakdowns.

Veterans Population Census
Veteran Employment Statistics Locally Cached
Labor force data · Unemployment rates · By state

Bureau of Labor Statistics veteran employment data with labor force participation, unemployment, and industry breakdowns.

Veterans Employment BLS
Veteran Homelessness Locally Cached
PIT counts · Veteran-specific · Trends

HUD Point-in-Time counts for veteran homelessness with sheltered/unsheltered status and CoC-level data.

Veterans Homelessness HUD
Veteran Suicide Rates Locally Cached
Annual rates · By state · Mental health

VA data on veteran suicide rates by state with comparisons to civilian populations and mental health service utilization.

Veterans Suicide Mental Health
CMS Hospital Database Locally Cached
6,000+ hospitals · Quality ratings · Locations

Centers for Medicare & Medicaid Services hospital data with quality ratings, ownership, emergency services, and patient experience scores.

Hospitals CMS Healthcare
Medicaid Enrollment Locally Cached
By state · Enrollment · Monthly

State-level Medicaid enrollment data with monthly trends, eligibility categories, and managed care penetration.

Medicaid Healthcare Enrollment
Healthcare Deserts Locally Cached
County-level · Provider ratios · Distance metrics

Healthcare desert identification using provider-to-population ratios, travel distances, and HRSA shortage area designations.

Healthcare Access Deserts
HRSA Shortage Areas Locally Cached
HPSA designations · Shortage scores · Geographic/population

Health Resources and Services Administration shortage area designations (HPSA) for primary care, dental, and mental health.

HRSA Shortage Areas HPSA

Geography & Demographics

12 datasets

County analysis, SCARS, country centroids, election results, geology, county boundaries, air quality, ocean plastic, light pollution, infrastructure (submarine cables, satellites, internet topology), and endangered languages.

US County Reference (FIPS) Locally Cached
3,200+ counties · 5-digit FIPS · Centroids

Standardized FIPS county codes with names, state associations, land area, population, and geographic centroids for spatial joins.

FIPS Counties Census
SCARS (Standardized County Analysis & Research System) Locally Cached
Multi-indicator · All US counties · Merge-ready

Unified county-level dataset integrating economic, demographic, health, housing, and infrastructure indicators for comparative analysis.

SCARS Counties Multi-indicator
Country Centroids Locally Cached
200+ countries · Lat/lon · ISO codes

Geographic centroids for all countries with ISO 3166-1 alpha-2/alpha-3 codes, names, and population-weighted centers.

Countries Centroids ISO 3166
US Presidential Election Results by County Local Data
~3,100 counties per election · 2016 + 2020 · Vote counts + margins

County-level presidential election results with total votes, party breakdown, and vote margins. Mergeable with other FIPS-keyed datasets.

Elections Voting FIPS
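
Several entries in this index are described as FIPS-keyed and mergeable. A minimal sketch of such a merge follows, assuming each table is a plain dictionary keyed by county FIPS code. The helper names `normalize_fips` and `join_on_fips`, the field names, and all values are hypothetical. The zero-padding step is there because spreadsheet tools commonly read codes such as 01001 as the integer 1001.

```python
def normalize_fips(code) -> str:
    """Zero-pad a county FIPS code to 5 digits (2 state + 3 county)."""
    return str(code).strip().zfill(5)


def join_on_fips(left, right):
    """Inner-join two {fips: record} tables on the normalized FIPS key."""
    left_n = {normalize_fips(k): v for k, v in left.items()}
    right_n = {normalize_fips(k): v for k, v in right.items()}
    shared = left_n.keys() & right_n.keys()
    return {fips: {**left_n[fips], **right_n[fips]} for fips in shared}


# Made-up values; the key 1001 has lost its leading zero on purpose.
votes = {1001: {"total_votes": 100}, "06037": {"total_votes": 250}}
housing = {"01001": {"median_rent": 900}}
merged = join_on_fips(votes, housing)
```

Only counties present in both tables survive an inner join, so comparing the size of `merged` with the inputs is a quick check for key mismatches.
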
US Geological Features Local Data
4 GeoJSON layers · Continental US · USGS sourced

Geological regions, mineral deposits, Cretaceous-era arc boundaries, and Deep South historical timeline features. Part of the SCARS project mapping geology against socioeconomic data.

Geology Minerals GeoJSON
US County & State Boundaries (GeoJSON) Local Data
3,200+ counties · Simplified + full · GeoJSON + CSV

County and state boundary polygons in GeoJSON format, including simplified versions for web rendering and full-resolution versions with data attributes joined.

Boundaries GeoJSON Counties
EPA Air Quality Index Locally Cached
Daily AQI · Monitoring stations · Pollutant levels

EPA Air Quality Index data with daily measurements for PM2.5, PM10, ozone, and other criteria pollutants by monitoring station.

Air Quality EPA AQI
Ocean Plastic Pollution Locally Cached
Global coverage · Density estimates · Gyres

Ocean plastic pollution density estimates with gyres mapping, microplastic concentrations, and source attribution.

Ocean Plastic Pollution
Light Pollution Atlas Locally Cached
Night sky brightness · Global coverage · Bortle scale

Light pollution measurements showing night sky brightness levels, Bortle scale classifications, and dark sky preserve locations.

Light Pollution Astronomy Night Sky
Submarine Cable Map Locally Cached
500+ cables · Routes · Capacity

Global submarine telecommunications cable routes with landing points, capacity, owners, and ready-for-service dates.

Submarine Cables Infrastructure Internet
Satellite TLE Data Locally Cached
Active satellites · TLE format · Orbital elements

Two-Line Element (TLE) data for active satellites from CelesTrak with orbital parameters and classification.

Satellites TLE CelesTrak
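
For readers unfamiliar with the TLE format named in the Satellite TLE Data entry above, here is a minimal sketch that pulls the orbital elements out of the second line of an element set. It assumes the commonly documented fixed-column layout. The function name `parse_tle_line2` is hypothetical, checksum validation is omitted, and the sample line is a widely reproduced historical ISS element set used only as test input, not a record from this dataset.

```python
def parse_tle_line2(line: str) -> dict:
    """Extract orbital elements from line 2 of a Two-Line Element set.

    Slices use 0-based Python indexing over the fixed-width columns.
    """
    if len(line) < 63 or line[0] != "2":
        raise ValueError("not a TLE line 2")
    return {
        "catalog_number": line[2:7].strip(),
        "inclination_deg": float(line[8:16]),
        "raan_deg": float(line[17:25]),
        # Eccentricity is stored with an implied leading decimal point.
        "eccentricity": float("0." + line[26:33].strip()),
        "arg_perigee_deg": float(line[34:42]),
        "mean_anomaly_deg": float(line[43:51]),
        "mean_motion_rev_per_day": float(line[52:63]),
    }


LINE2 = "2 25544  51.6416 247.4627 0006703 130.5360 325.0288 15.72125391563537"
elements = parse_tle_line2(LINE2)
```

For real orbit propagation a dedicated library is the safer choice; this sketch only shows how the fixed-width fields map to named values.
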
Endangered Languages Locally Cached
3,000+ languages · Threat levels · Locations

UNESCO Atlas of Endangered Languages with vitality assessments, speaker populations, and documentation status.

Languages Endangered UNESCO

Accessibility & Assistive Tech

5 datasets

SAP (Augmentative Communication), WLASL (Sign Language), ACS disability data, WebAIM web compliance, WHO global health, and accessibility audit tools.

Accessibility Atlas Public
13,000+ records across 10 datasets · Census + WHO + WebAIM · 2021-2025

Disability demographics (3,222 US counties, 185 WHO countries), web accessibility compliance (WebAIM Million 2025, screen reader survey), ADA lawsuits, Section 508 federal compliance, VizWiz blind user VQA, and WLASL sign language index.

Disability WCAG Screen Readers WHO Census
Census Communication Disability Data Locally Cached
50 states + DC · ACS 5-year · State-level

Census ACS data on communication disabilities by state, covering hearing, vision, cognitive, and self-care difficulty rates.

Disability Communication Census
WLASL (Word-Level American Sign Language) Locally Cached
2,000 signs · Video glosses · Annotations

Word-level American Sign Language video dataset with annotations, glosses, and metadata for sign language recognition research.

ASL Sign Language WLASL
ACS Disability Statistics Locally Cached
County-level · 6 disability types · Annual surveys

American Community Survey disability data with prevalence rates for hearing, vision, cognitive, ambulatory, self-care, and independent living disabilities.

Disability ACS Census
Accessibility Audit Tools Locally Cached
WCAG 2.1 AA · Automated checks · Reports

Accessibility audit results and tools for WCAG 2.1 AA compliance checking including axe-core and Pa11y scan outputs.

Accessibility WCAG Audit

Entertainment & Pop Culture

6 datasets

Gaming, film, music, and satire datasets.

Steam Games Catalog Locally Cached
50,872 games

Steam game catalog with titles, release dates, pricing, and platform availability.

gaming steam catalog
Steam User Reviews Locally Cached
41.1M reviews · 1.9 GB (gitignored)

41 million Steam user game recommendations. File is 1.9 GB and gitignored.

gaming steam reviews
Steam Users Locally Cached
14.3M users · 185 MB (gitignored)

14 million Steam user profiles. File is 185 MB and gitignored.

gaming steam users
Bond Girls Locally Cached
71 records

Bond franchise companion characters with actress and Bond actor ages, plus film financials in actual and inflation-adjusted USD.

film bond pop-culture
Boy Bands Locally Cached
15 bands, 55 members

Boy band discography with members, active years, and chart performance.

music boy-bands pop-culture
Onion Headlines Locally Cached
2,104 headlines

The Onion headline index with article IDs and image URLs. No header row.

satire headlines text
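
Because the Onion Headlines file has no header row, column names have to be supplied when reading it. A minimal sketch with Python's standard `csv` module follows. The column names and their order in `FIELDNAMES` are assumptions based on the description above (article IDs, headlines, image URLs), and the sample row is invented; check the actual file before relying on them.

```python
import csv
import io

# Assumed column order; the source only says the file has no header row.
FIELDNAMES = ["article_id", "headline", "image_url"]


def read_headerless(fileobj, fieldnames=FIELDNAMES):
    """Read a headerless CSV into a list of dicts using supplied names."""
    return list(csv.DictReader(fileobj, fieldnames=fieldnames))


# Invented sample row standing in for the real file.
sample = io.StringIO(
    '1,"Example headline, with a comma",https://example.com/a.jpg\n'
)
rows = read_headerless(sample)
```

Passing `fieldnames` explicitly stops `csv.DictReader` from treating the first data row as a header, which would otherwise silently drop one headline.
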