Summary confidence: high
This dataset is a global catalogue of 80,678 waterfalls sourced entirely from OpenStreetMap, covering geographic coordinates and basic descriptive attributes. The most striking finding is how thin the attribute data is: 89.9% of records carry only the generic description 'Waterfall' and have no height recorded, and 59.7% of entries are named 'Unnamed Waterfall', so the dataset is geographically broad but informationally sparse. Height data is worth a closer look: where it does exist, values cluster at small measurements (2–10 metres), hinting at a possible recording bias toward easily measured falls. The geographic spread is genuinely global (latitude ranges from -77.7 to 78.7), but the country field is empty for 99.97% of records, so spatial analysis should rely on the raw coordinates rather than the country column.
citing: row_count · description.top_rate · description.top_value · height.top_rate · height.top_values · name.top_values · name.n_duplicates · name.duplicate_rate · latitude.min · latitude.max · country.top_rate · source.top_value
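
The recommendation to work from raw coordinates rather than the country column can be sketched as a simple bounding-box filter. This is a minimal illustration, not the dataset's own tooling: the column names `latitude` and `longitude`, the helper `in_bbox`, the sample rows, and the box around Iceland are all assumptions made for the example.

```python
# Hypothetical column names; the real file may label these differently.
LAT, LON = "latitude", "longitude"


def in_bbox(rows, south, west, north, east):
    """Return the rows whose coordinates fall inside the box (edges inclusive)."""
    selected = []
    for row in rows:
        try:
            lat, lon = float(row[LAT]), float(row[LON])
        except (KeyError, TypeError, ValueError):
            continue  # skip rows with missing or malformed coordinates
        if south <= lat <= north and west <= lon <= east:
            selected.append(row)
    return selected


# Illustrative rows only, not taken from the dataset.
sample = [
    {"name": "Unnamed Waterfall", "latitude": "64.3", "longitude": "-20.1", "country": ""},
    {"name": "Unnamed Waterfall", "latitude": "-33.7", "longitude": "150.3", "country": ""},
    {"name": "Unnamed Waterfall", "latitude": "", "longitude": "10.0", "country": ""},
]

# Rough box around Iceland (approximate values chosen for illustration).
iceland = in_bbox(sample, south=63.0, west=-25.0, north=67.0, east=-13.0)
print(len(iceland))
```

The filter never reads the country field, so the empty values there do not affect the result. A plain box like this does not handle regions that cross the antimeridian, and a point-in-polygon test against real boundaries would be needed for accurate country assignment.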