saturn·

geographic country centroids

source /home/coolhand/html/datavis/data_trove/data/geographic/country_centroids.json · 7,124 rows · 10 columns · profiled 2026-05-01 · raw JSON · static

Reading

dataset summary · high confidence anthropic:claude-opus-4-7

This dataset contains 7,124 records of country centroid points sourced from Natural Earth 1:10m Admin 0 Label Points, with 10 columns covering geographic identifiers and coordinates. In practice, only the latitude and longitude columns carry usable signal: seven of the eight categorical fields (continent, iso_a2, iso_a3, name, name_long, region_un, subregion) hold the empty string in every row, and source holds one constant value, so each has a single value covering 100% of rows. Start by examining the spatial distribution: longitude spans the full globe (-179.97 to 179.99), while latitude is concentrated in the northern hemisphere (mean 22.9, median 25.2) with a longer tail toward the south (skew -0.60). The 35 latitude outliers (~0.5%) likely correspond to extreme polar points and are worth a quick sanity check.

citing: row_count · column_count · columns.continent.stats.top_rate · columns.source.stats.top_value · columns.latitude.stats · columns.longitude.stats
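
The constant-column finding is easy to re-check before dropping anything. A minimal sketch, assuming the JSON file loads as a list of flat record dictionaries (the report does not confirm the file layout); `constant_columns` is a hypothetical helper name and the sample rows are illustrative only:

```python
def constant_columns(records):
    """Map each column that holds a single distinct value to that value.

    Assumes `records` is a list of flat dicts, one per row. That layout is
    an assumption about the file, not something the profile confirms.
    """
    if not records:
        return {}
    constants = {}
    for column in records[0]:
        distinct = {record.get(column) for record in records}
        if len(distinct) == 1:
            constants[column] = next(iter(distinct))
    return constants


# Illustrative rows only, not taken from the dataset.
sample = [
    {"name": "", "source": "example source", "latitude": 10.0},
    {"name": "", "source": "example source", "latitude": -5.5},
]
print(constant_columns(sample))
```

Columns reported with an empty-string value are candidates to drop; a column reported with a real value, such as a provenance label, is better moved into dataset-level documentation.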

Schema

10 columns
Per-column summary (column · type · nulls · unique · alerts).
iso_a2 · categorical · 0.0% · 1 · imbalance
iso_a3 · categorical · 0.0% · 1 · imbalance
name · categorical · 0.0% · 1 · imbalance
name_long · categorical · 0.0% · 1 · imbalance
continent · categorical · 0.0% · 1 · imbalance
region_un · categorical · 0.0% · 1 · imbalance
subregion · categorical · 0.0% · 1 · imbalance
longitude · numeric · 0.0% · 7,124 · no alerts
latitude · numeric · 0.0% · 7,124 · no alerts
source · categorical · 0.0% · 1 · imbalance

iso_a2

categorical metadata imbalance
This appears to be an ISO 3166-1 alpha-2 country code field, but all 7,124 rows hold the empty string, giving a cardinality of 1 and an entropy of 0. The column carries no information even though its null_rate is 0.0, because blanks are not counted as nulls here. Treatment: Drop; the column is a constant empty string with no signal. high · anthropic:claude-opus-4-7
n
7,124
nulls
0 (0.0%)
unique
1
top_value
(empty string)
top_rate
1
cardinality
1
entropy
0
entropy_ratio
0
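
The entropy figures for a constant column can be reproduced by hand. A minimal sketch, assuming entropy is Shannon entropy in bits and that entropy_ratio is entropy divided by log2(cardinality), with 0 for a single-valued column; the report does not state either definition, and `entropy_stats` is a hypothetical name:

```python
import math
from collections import Counter


def entropy_stats(values):
    """Return (entropy_bits, entropy_ratio) for a sequence of category labels.

    entropy_ratio is taken here as entropy / log2(cardinality), and 0 when
    the column has a single value. This normalisation is an assumption
    about how the report defines the field.
    """
    counts = Counter(values)
    total = len(values)
    # Shannon entropy: sum of p * log2(1 / p) over the distinct values.
    entropy = sum((c / total) * math.log2(total / c) for c in counts.values())
    max_entropy = math.log2(len(counts)) if len(counts) > 1 else 0.0
    ratio = entropy / max_entropy if max_entropy > 0 else 0.0
    return entropy, ratio


print(entropy_stats([""] * 7124))      # one value in every row
print(entropy_stats(["a", "b"] * 10))  # two equally likely values
```

With a single value in every row the only probability is 1, so each term is 1 * log2(1) = 0, which matches the entropy of 0 shown above.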

iso_a3

categorical metadata imbalance
This appears to be an ISO 3166-1 alpha-3 country code field, but all 7,124 rows hold the empty string. Cardinality is 1, entropy is 0, and there are no nulls, so the column is present but entirely uninformative. Treatment: Drop; the column is a constant empty string with zero variance. high · anthropic:claude-opus-4-7
n
7,124
nulls
0 (0.0%)
unique
1
top_value
(empty string)
top_rate
1
cardinality
1
entropy
0
entropy_ratio
0

name

categorical metadata imbalance
This 'name' column is constant: all 7,124 rows hold the empty string, giving cardinality 1 and entropy 0. There is no signal here; the field was likely scraped or exported but never populated. Treatment: Drop; the column is a single constant value with zero information. high · anthropic:claude-opus-4-7
n
7,124
nulls
0 (0.0%)
unique
1
top_value
(empty string)
top_rate
1
cardinality
1
entropy
0
entropy_ratio
0

name_long

categorical other imbalance
This column is constant: all 7,124 rows hold the empty string, giving a cardinality of 1 and an entropy of 0. Although technically non-null, it carries no information, and the top_rate of 1.0 confirms that nothing distinguishes one record from another. Treatment: Drop; the column is constant and contributes no signal. high · anthropic:claude-opus-4-7
n
7,124
nulls
0 (0.0%)
unique
1
top_value
(empty string)
top_rate
1
cardinality
1
entropy
0
entropy_ratio
0

continent

categorical metadata imbalance
This column appears intended to record continent metadata, but all 7,124 rows hold an empty string. With a cardinality of 1, an entropy of 0, and a top_rate of 1.0 on the blank value, it carries no information. Treatment: Drop the column; it is constant-empty and contributes nothing. high · anthropic:claude-opus-4-7
n
7,124
nulls
0 (0.0%)
unique
1
top_value
(empty string)
top_rate
1
cardinality
1
entropy
0
entropy_ratio
0

region_un

categorical metadata imbalance
This column appears to be a UN region metadata field, but all 7,124 rows hold an empty string, giving a cardinality of 1 and an entropy of 0. The field carries no information even though its null_rate is 0.0, because blanks are not counted as nulls. Treatment: Drop; the column is constant (empty string) across all rows. high · anthropic:claude-opus-4-7
n
7,124
nulls
0 (0.0%)
unique
1
top_value
(empty string)
top_rate
1
cardinality
1
entropy
0
entropy_ratio
0
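
Because blanks are not counted as nulls, a 0.0% null rate can hide a fully empty column. A minimal sketch of reporting both rates side by side; `missing_rates` is a hypothetical helper, and treating whitespace-only strings as blank is an assumption of this sketch rather than the profiler's rule:

```python
def missing_rates(values):
    """Return (null_rate, blank_rate) for a column.

    null_rate counts only None. blank_rate counts strings that are empty
    or whitespace-only (an assumption made for this sketch).
    """
    total = len(values)
    if total == 0:
        return 0.0, 0.0
    nulls = sum(1 for v in values if v is None)
    blanks = sum(1 for v in values if isinstance(v, str) and v.strip() == "")
    return nulls / total, blanks / total


# A column shaped like the empty categorical fields in this profile.
print(missing_rates([""] * 7124))
```

A null rate of 0 alongside a blank rate of 1 is the pattern described above: the column is fully present and fully empty.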

subregion

categorical metadata imbalance
This column appears to be a geographic subregion field, but it carries zero information: all 7,124 rows hold the same empty-string value, giving cardinality 1 and entropy 0. It is effectively a blank placeholder rather than a populated attribute. Treatment: Drop; the column is constant and contributes no signal. high · anthropic:claude-opus-4-7
n
7,124
nulls
0 (0.0%)
unique
1
top_value
(empty string)
top_rate
1
cardinality
1
entropy
0
entropy_ratio
0

longitude

numeric feature
This column appears to be a geographic longitude in decimal degrees, with values spanning -179.97 to 179.99 (displayed rounded as -180 and 180 in the stats). All 7,124 entries are unique with no nulls or zeros, and the distribution is broadly flat (kurtosis -1.13) with only mild negative skew (-0.27), suggesting fairly even global coverage rather than clustering in a single region. The IQR of ~191.8 degrees shows that points are scattered across both the eastern and western hemispheres. Treatment: Pair with latitude for geospatial features; consider cyclic encoding given the -180/180 wrap. high · anthropic:claude-opus-4-7
n
7,124
nulls
0 (0.0%)
unique
7,124
min
-180
max
180
mean
21.9
median
23.48
std
97.72
q1
-72.33
q3
119.5
iqr
191.8
skew
-0.267
kurtosis
-1.131
n_outliers
0
outlier_rate
0
zero_rate
0
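
The cyclic encoding suggested in the treatment can be sketched with a sine/cosine pair, which places -180 and 180 at the same point on the unit circle. `encode_longitude` is a hypothetical name, and this is one common encoding, not necessarily the one the report has in mind:

```python
import math


def encode_longitude(lon_deg):
    """Map a longitude in degrees to (sin, cos) of its angle, so that
    -180 and 180 land on the same point of the unit circle."""
    angle = math.radians(lon_deg)
    return math.sin(angle), math.cos(angle)


# The two extreme longitudes reported for this column.
west = encode_longitude(-179.97)
east = encode_longitude(179.99)
# Euclidean distance between the encodings; small because the two
# points are only 0.04 degrees apart across the antimeridian.
print(math.dist(west, east))
```

In raw degrees these two values differ by almost 360, whereas the encoded pair is nearly identical, which is the behaviour a distance-based or linear model needs.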

latitude

numeric feature
This column is a geographic latitude in decimal degrees, with all 7,124 values unique and no nulls. The range spans -83.05 to 83.24, covering nearly the full globe. The distribution leans toward the northern hemisphere (mean 22.92, median 25.20), and the negative skew (-0.60) reflects a longer tail toward southern latitudes. Only 35 values (0.49%) flag as outliers, and the wide IQR of 51.96 is consistent with genuine geographic spread rather than data error. Treatment: Pair with longitude for spatial features; consider binning or projecting rather than using raw degrees in linear models. high · anthropic:claude-opus-4-7
n
7,124
nulls
0 (0.0%)
unique
7,124
min
-83.05
max
83.24
mean
22.92
median
25.2
std
34.23
q1
1.149
q3
53.11
iqr
51.96
skew
-0.6007
kurtosis
0.1113
n_outliers
35
outlier_rate
0.004913
zero_rate
0
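
To see which latitudes could be flagged, the outlier fences can be worked out from the reported quartiles. A minimal sketch, assuming the common Tukey rule of 1.5 times the IQR beyond each quartile; the report does not say which outlier rule the profiler uses, and `tukey_fences` is a hypothetical name:

```python
def tukey_fences(q1, q3, k=1.5):
    """Return (lower, upper) outlier fences: q1 - k*IQR and q3 + k*IQR.

    k=1.5 is the conventional Tukey multiplier, assumed here because the
    profile does not document its outlier rule.
    """
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr


# Quartiles as reported for latitude in this profile.
lower, upper = tukey_fences(1.149, 53.11)
print(round(lower, 2), round(upper, 2))
```

Under that assumption the upper fence lies beyond 90 degrees, so no valid latitude can exceed it, and only values below roughly -76.8 could be flagged. That would fit the reading of the 35 outliers as far-southern polar points, given the reported minimum of -83.05.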

source

categorical metadata imbalance
This column records the data provenance, with all 7,124 rows tagged as 'Natural Earth 1:10m Admin 0 Label Points'. Cardinality is 1 and entropy is 0, so it carries no information for any model or grouping and is useful only as a dataset-level annotation. Treatment: Drop from modelling; retain in dataset documentation as the source label. high · anthropic:claude-opus-4-7
n
7,124
nulls
0 (0.0%)
unique
1
top_value
Natural Earth 1:10m Admin 0 Label Points
top_rate
1
cardinality
1
entropy
0
entropy_ratio
0