geographic country centroids
Reading
This dataset contains 7,124 records of country centroid points sourced from Natural Earth 1:10m Admin 0 Label Points, with 10 columns covering geographic identifiers and coordinates. In practice, only the latitude and longitude columns carry usable signal — all eight categorical fields (continent, iso_a2, iso_a3, name, name_long, region_un, source, subregion) are effectively empty or constant, with a single value covering 100% of rows. Start by examining the spatial distribution: longitude spans the full globe (-179.97 to 179.99) while latitude is skewed toward the northern hemisphere (mean 22.9, median 25.2, skew -0.60). The 35 latitude outliers (~0.5%) likely correspond to extreme polar points worth a quick sanity check.
citing: row_count · column_count · columns.continent.stats.top_rate · columns.source.stats.top_value · columns.latitude.stats · columns.longitude.stats
Charts the summary said to look at first
Show data table
| bin | count |
|---|---|
| -83.05 – -78.9 | 19 |
| -78.9 – -74.74 | 26 |
| -74.74 – -70.58 | 38 |
| -70.58 – -66.43 | 42 |
| -66.43 – -62.27 | 51 |
| -62.27 – -58.11 | 10 |
| -58.11 – -53.95 | 45 |
| -53.95 – -49.8 | 77 |
| -49.8 – -45.64 | 72 |
| -45.64 – -41.48 | 62 |
| -41.48 – -37.32 | 30 |
| -37.32 – -33.17 | 19 |
| -33.17 – -29.01 | 10 |
| -29.01 – -24.85 | 23 |
| -24.85 – -20.7 | 85 |
| -20.7 – -16.54 | 140 |
| -16.54 – -12.38 | 169 |
| -12.38 – -8.224 | 238 |
| -8.224 – -4.067 | 246 |
| -4.067 – 0.09051 | 250 |
| 0.09051 – 4.248 | 293 |
| 4.248 – 8.405 | 321 |
| 8.405 – 12.56 | 381 |
| 12.56 – 16.72 | 302 |
| 16.72 – 20.88 | 179 |
| 20.88 – 25.03 | 400 |
| 25.03 – 29.19 | 489 |
| 29.19 – 33.35 | 214 |
| 33.35 – 37.51 | 388 |
| 37.51 – 41.66 | 263 |
| 41.66 – 45.82 | 163 |
| 45.82 – 49.98 | 142 |
| 49.98 – 54.13 | 225 |
| 54.13 – 58.29 | 275 |
| 58.29 – 62.45 | 502 |
| 62.45 – 66.61 | 441 |
| 66.61 – 70.76 | 222 |
| 70.76 – 74.92 | 88 |
| 74.92 – 79.08 | 105 |
| 79.08 – 83.24 | 79 |
Show data table
| bin | count |
|---|---|
| -180 – -171 | 112 |
| -171 – -162 | 105 |
| -162 – -153 | 87 |
| -153 – -144 | 109 |
| -144 – -135 | 55 |
| -135 – -126 | 113 |
| -126 – -117 | 95 |
| -117 – -108 | 91 |
| -108 – -98.98 | 36 |
| -98.98 – -89.98 | 82 |
| -89.98 – -80.98 | 350 |
| -80.98 – -71.98 | 554 |
| -71.98 – -62.98 | 253 |
| -62.98 – -53.98 | 202 |
| -53.98 – -44.99 | 83 |
| -44.99 – -35.99 | 49 |
| -35.99 – -26.99 | 30 |
| -26.99 – -17.99 | 66 |
| -17.99 – -8.989 | 75 |
| -8.989 – 0.01045 | 126 |
| 0.01045 – 9.01 | 130 |
| 9.01 – 18.01 | 299 |
| 18.01 – 27.01 | 632 |
| 27.01 – 36.01 | 131 |
| 36.01 – 45.01 | 88 |
| 45.01 – 54.01 | 90 |
| 54.01 – 63 | 306 |
| 63 – 72 | 48 |
| 72 – 81 | 226 |
| 81 – 90 | 28 |
| 90 – 99 | 168 |
| 99 – 108 | 256 |
| 108 – 117 | 190 |
| 117 – 126 | 604 |
| 126 – 135 | 535 |
| 135 – 144 | 170 |
| 144 – 153 | 185 |
| 153 – 162 | 115 |
| 162 – 171 | 149 |
| 171 – 180 | 101 |
Show data table
| value | count | share |
|---|---|---|
| Natural Earth 1:10m Admin 0 Label Points | 7124 | 100.0% |
Show data table
| value | count | share |
|---|---|---|
| 7124 | 100.0% |
Schema
10 columns| Alerts | ||||
|---|---|---|---|---|
| iso_a2 | categorical | 0.0% | 1 |
imbalance
|
| iso_a3 | categorical | 0.0% | 1 |
imbalance
|
| name | categorical | 0.0% | 1 |
imbalance
|
| name_long | categorical | 0.0% | 1 |
imbalance
|
| continent | categorical | 0.0% | 1 |
imbalance
|
| region_un | categorical | 0.0% | 1 |
imbalance
|
| subregion | categorical | 0.0% | 1 |
imbalance
|
| longitude | numeric | 0.0% | 7,124 |
|
| latitude | numeric | 0.0% | 7,124 |
|
| source | categorical | 0.0% | 1 |
imbalance
|
iso_a2
categorical metadata imbalanceThis appears to be an ISO 3166-1 alpha-2 country code field, but every one of the 7124 rows holds the empty string, giving a cardinality of 1 and entropy of 0. The column carries no information whatsoever despite having a 0.0 null_rate (blanks are not counted as nulls here). Treatment: Drop; the column is a constant empty string with no signal.
- n
- 7,124
- nulls
- 0 (0.0%)
- unique
- 1
- top_value
- top_rate
- 1
- cardinality
- 1
- entropy
- 0
- entropy_ratio
- 0
iso_a3
categorical metadata imbalanceThis appears to be an ISO 3166-1 alpha-3 country code field, but every one of the 7124 rows holds the empty string. Cardinality is 1, entropy is 0, and there are no nulls — the column is present but entirely uninformative. Treatment: Drop; the column is a constant empty string with zero variance.
- n
- 7,124
- nulls
- 0 (0.0%)
- unique
- 1
- top_value
- top_rate
- 1
- cardinality
- 1
- entropy
- 0
- entropy_ratio
- 0
name
categorical metadata imbalanceThis 'name' column is constant: every one of the 7124 rows holds the empty string, giving cardinality 1 and entropy 0. There is no signal here — likely a field that was scraped or exported but never populated. Treatment: Drop; the column is a single constant value with zero information.
- n
- 7,124
- nulls
- 0 (0.0%)
- unique
- 1
- top_value
- top_rate
- 1
- cardinality
- 1
- entropy
- 0
- entropy_ratio
- 0
name_long
categorical other imbalanceThis column is a constant — every one of the 7124 rows holds the empty string, giving a cardinality of 1 and entropy of 0. Although technically non-null, it carries no information and the top_rate of 1.0 confirms there is nothing to distinguish records here. Treatment: Drop; the column is constant and contributes no signal.
- n
- 7,124
- nulls
- 0 (0.0%)
- unique
- 1
- top_value
- top_rate
- 1
- cardinality
- 1
- entropy
- 0
- entropy_ratio
- 0
continent
categorical metadata imbalanceThis column appears intended to record continent metadata, but every one of the 7124 rows holds an empty string. With cardinality of 1, entropy of 0, and a top_rate of 1.0 on the blank value, it carries no information whatsoever. Treatment: Drop the column; it is constant-empty and contributes nothing.
- n
- 7,124
- nulls
- 0 (0.0%)
- unique
- 1
- top_value
- top_rate
- 1
- cardinality
- 1
- entropy
- 0
- entropy_ratio
- 0
region_un
categorical metadata imbalanceThis column appears to be a UN region metadata field, but every one of the 7124 rows holds an empty string, giving a cardinality of 1 and entropy of 0. There is no signal here whatsoever — the field carries no information despite a 0.0 null_rate (blanks are not counted as nulls). Treatment: Drop; the column is constant (empty string) across all rows.
- n
- 7,124
- nulls
- 0 (0.0%)
- unique
- 1
- top_value
- top_rate
- 1
- cardinality
- 1
- entropy
- 0
- entropy_ratio
- 0
subregion
categorical metadata imbalanceThis column appears to be a geographic subregion field, but it carries zero information: all 7,124 rows hold the same empty-string value, giving cardinality 1 and entropy 0. It is effectively a blank placeholder rather than a populated attribute. Treatment: Drop; the column is constant and contributes no signal.
- n
- 7,124
- nulls
- 0 (0.0%)
- unique
- 1
- top_value
- top_rate
- 1
- cardinality
- 1
- entropy
- 0
- entropy_ratio
- 0
longitude
numeric featureThis column appears to be a geographic longitude in decimal degrees, with values spanning the full -179.97 to 179.99 range. All 7124 entries are unique with no nulls or zeros, and the distribution is broadly flat (kurtosis -1.13) with only mild negative skew (-0.27), suggesting fairly even global coverage rather than clustering on a single region. The IQR of ~191.8 degrees confirms points are scattered across both hemispheres. Treatment: Pair with latitude for geospatial features; consider cyclic encoding given the -180/180 wrap.
- n
- 7,124
- nulls
- 0 (0.0%)
- unique
- 7,124
- min
- -180
- max
- 180
- mean
- 21.9
- median
- 23.48
- std
- 97.72
- q1
- -72.33
- q3
- 119.5
- iqr
- 191.8
- skew
- -0.267
- kurtosis
- -1.131
- n_outliers
- 0
- outlier_rate
- 0
- zero_rate
- 0
latitude
numeric featureThis column is a geographic latitude in decimal degrees, with all 7124 values unique and no nulls. The range spans -83.05 to 83.24, covering nearly the full globe, with a median of 25.20 skewed slightly toward the northern hemisphere (skew -0.60). Only 35 values (0.49%) flag as outliers and the wide IQR of 51.96 reflects genuine geographic spread rather than data error. Treatment: Pair with longitude for spatial features; consider binning or projecting rather than using raw degrees in linear models.
- n
- 7,124
- nulls
- 0 (0.0%)
- unique
- 7,124
- min
- -83.05
- max
- 83.24
- mean
- 22.92
- median
- 25.2
- std
- 34.23
- q1
- 1.149
- q3
- 53.11
- iqr
- 51.96
- skew
- -0.6007
- kurtosis
- 0.1113
- n_outliers
- 35
- outlier_rate
- 0.004913
- zero_rate
- 0
source
categorical metadata imbalanceThis column records the data provenance, with every one of the 7124 rows tagged as 'Natural Earth 1:10m Admin 0 Label Points'. Cardinality is 1 and entropy is 0, so it carries no information for any model or grouping. It's useful only as a dataset-level annotation. Treatment: Drop from modelling; retain in dataset documentation as the source label.
- n
- 7,124
- nulls
- 0 (0.0%)
- unique
- 1
- top_value
- Natural Earth 1:10m Admin 0 Label Points
- top_rate
- 1
- cardinality
- 1
- entropy
- 0
- entropy_ratio
- 0