Summary confidence: high
This dataset contains 200,000 deep-sea biodiversity occurrence records spanning taxonomic classification, geographic coordinates, ocean depth, and collection year. The most striking feature is the dominance of blank values across taxonomy columns — 55% of genus, 40% of family, and 73% of species entries are empty strings, suggesting many records are identified only at higher taxonomic levels. Proteobacteria, Cnidaria, and Chordata are the best-represented phyla, while Australia accounts for the vast majority of records with a named country (~79k of ~96k non-blank entries). Depth ranges from 1,000 to 11,000 metres with a mean around 2,400 m, and the year column is heavily left-skewed with over 12,000 outlier records dating back as far as 1875, versus a median of 2016.
citing: phylum.top_values · species.stats.top_rate · genus.stats.top_rate · family.stats.top_rate · country.top_values · depth.stats.mean · depth.stats.max · depth.stats.min · year.stats.median · year.stats.min · year.stats.n_outliers · year.alerts