geographic country centroids

saturn notebook · generated 2026-05-01

Overview

Source: /home/coolhand/html/datavis/data_trove/data/geographic/country_centroids.json

Saturn profiled 7,124 rows across 10 columns. The stats below are deterministic and machine-readable; the prose is a language-model interpretation of those stats (opt-in, added after the fact, never sees raw rows).

[2]:
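# Install the profiler (notebook shell escape), then run the CLI on the source file.
!pip install saturn-dissect
import subprocess

subprocess.run(
    [
        "saturn", "analyze", "/home/coolhand/html/datavis/data_trove/data/geographic/country_centroids.json",
        "--findings", "geographic-country_centroids.json",
        "--llm", "anthropic:claude-opus-4-7",  # model used for the prose interpretation
    ],
    check=True,  # raise CalledProcessError if the CLI exits with a non-zero status
)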

Summary confidence: high

This dataset contains 7,124 records of country centroid points sourced from Natural Earth 1:10m Admin 0 Label Points, with 10 columns covering geographic identifiers and coordinates. In practice, only the latitude and longitude columns carry usable signal: seven of the eight categorical fields (continent, iso_a2, iso_a3, name, name_long, region_un, subregion) hold only the empty string, and the eighth, source, holds a single constant value, so in each case one value covers 100% of rows. Start by examining the spatial distribution: longitude spans the full globe (-179.97 to 179.99), while latitude is concentrated in the northern hemisphere (mean 22.9, median 25.2) with a longer tail toward the south (skew -0.60). The 35 latitude outliers (~0.5%) likely correspond to extreme polar points worth a quick sanity check.

citing: row_count · column_count · columns.continent.stats.top_rate · columns.source.stats.top_value · columns.latitude.stats · columns.longitude.stats
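
A minimal sketch of how the constant columns described in the summary could be detected before dropping them. It assumes the JSON file loads as a list of flat records keyed by the ten column names, which this report does not confirm; `constant_columns` and the sample rows are hypothetical and are not part of saturn.

```python
from collections import defaultdict


def constant_columns(records):
    """Return {column: value} for every column that holds one distinct value."""
    seen = defaultdict(set)
    for row in records:
        for key, value in row.items():
            seen[key].add(value)  # values must be hashable (str, float, None, ...)
    return {key: next(iter(vals)) for key, vals in seen.items() if len(vals) == 1}


# Hypothetical rows shaped like the profiled file; coordinates are illustrative only.
sample = [
    {"name": "", "iso_a2": "", "longitude": 10.5, "latitude": 51.2,
     "source": "Natural Earth 1:10m Admin 0 Label Points"},
    {"name": "", "iso_a2": "", "longitude": -3.7, "latitude": 40.4,
     "source": "Natural Earth 1:10m Admin 0 Label Points"},
]

constant = constant_columns(sample)
print(sorted(constant))  # ['iso_a2', 'name', 'source']
```

Returning the constant value alongside the column name makes it easy to tell an empty-string placeholder (such as name) apart from a meaningful constant (such as source) that is worth keeping in the dataset documentation.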

Out[4]:

saturn.schema() · 10 columns

column     kind         n      null%  unique  alerts
iso_a2     categorical  7,124  0.0%   1       imbalance
iso_a3     categorical  7,124  0.0%   1       imbalance
name       categorical  7,124  0.0%   1       imbalance
name_long  categorical  7,124  0.0%   1       imbalance
continent  categorical  7,124  0.0%   1       imbalance
region_un  categorical  7,124  0.0%   1       imbalance
subregion  categorical  7,124  0.0%   1       imbalance
longitude  numeric      7,124  0.0%   7,124
latitude   numeric      7,124  0.0%   7,124
source     categorical  7,124  0.0%   1       imbalance
Fig 1.
latitude · Check the northern-hemisphere bias (median 25.2, skew -0.60) and the 35 flagged outliers near the poles.
Histogram bins for latitude (median: 25.195900167159152).
bin                 count
-83.05 – -78.9      19
-78.9 – -74.74      26
-74.74 – -70.58     38
-70.58 – -66.43     42
-66.43 – -62.27     51
-62.27 – -58.11     10
-58.11 – -53.95     45
-53.95 – -49.8      77
-49.8 – -45.64      72
-45.64 – -41.48     62
-41.48 – -37.32     30
-37.32 – -33.17     19
-33.17 – -29.01     10
-29.01 – -24.85     23
-24.85 – -20.7      85
-20.7 – -16.54      140
-16.54 – -12.38     169
-12.38 – -8.224     238
-8.224 – -4.067     246
-4.067 – 0.09051    250
0.09051 – 4.248     293
4.248 – 8.405       321
8.405 – 12.56       381
12.56 – 16.72       302
16.72 – 20.88       179
20.88 – 25.03       400
25.03 – 29.19       489
29.19 – 33.35       214
33.35 – 37.51       388
37.51 – 41.66       263
41.66 – 45.82       163
45.82 – 49.98       142
49.98 – 54.13       225
54.13 – 58.29       275
58.29 – 62.45       502
62.45 – 66.61       441
66.61 – 70.76       222
70.76 – 74.92       88
74.92 – 79.08       105
79.08 – 83.24       79
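
One way to check the northern-hemisphere bias directly from the histogram is to sum the bin counts on either side of the bin edge closest to the equator. The sketch below copies the 40 counts from the Fig 1 data table; the split at 0.09051 rather than exactly 0 is a consequence of the bin edges, so the result is an approximation.

```python
# Bin counts copied from the Fig 1 data table, southernmost bin first.
counts = [
    19, 26, 38, 42, 51, 10, 45, 77, 72, 62,
    30, 19, 10, 23, 85, 140, 169, 238, 246, 250,
    293, 321, 381, 302, 179, 400, 489, 214, 388, 263,
    163, 142, 225, 275, 502, 441, 222, 88, 105, 79,
]

# The first 20 bins end at 0.09051, so the last 20 lie north of (roughly) the equator.
south = sum(counts[:20])
north = sum(counts[20:])
total = south + north
north_share = north / total

print(total, north, round(north_share, 3))  # 7124 5472 0.768
```

The counts add up to the reported 7,124 rows, and roughly three quarters of the points fall in the northern bins, which is consistent with the positive mean and median.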
Fig 2.
longitude · Confirm global coverage spanning roughly -180 to 180 with no outliers and a fairly flat distribution.
Histogram bins for longitude (median: 23.477727818184434).
bin                 count
-180 – -171         112
-171 – -162         105
-162 – -153         87
-153 – -144         109
-144 – -135         55
-135 – -126         113
-126 – -117         95
-117 – -108         91
-108 – -98.98       36
-98.98 – -89.98     82
-89.98 – -80.98     350
-80.98 – -71.98     554
-71.98 – -62.98     253
-62.98 – -53.98     202
-53.98 – -44.99     83
-44.99 – -35.99     49
-35.99 – -26.99     30
-26.99 – -17.99     66
-17.99 – -8.989     75
-8.989 – 0.01045    126
0.01045 – 9.01      130
9.01 – 18.01        299
18.01 – 27.01       632
27.01 – 36.01       131
36.01 – 45.01       88
45.01 – 54.01       90
54.01 – 63          306
63 – 72             48
72 – 81             226
81 – 90             28
90 – 99             168
99 – 108            256
108 – 117           190
117 – 126           604
126 – 135           535
135 – 144           170
144 – 153           185
153 – 162           115
162 – 171           149
171 – 180           101
Fig 3.
source · Single-value field confirming every row originates from Natural Earth 1:10m Admin 0 Label Points.
Top values for source (1 unique shown, of 1 total).
value                                     count  share
Natural Earth 1:10m Admin 0 Label Points  7124   100.0%
Fig 4.
name · Illustrates that country name is empty for all 7,124 rows — a likely data-loading issue to investigate.
Top values for name (1 unique shown, of 1 total).
value           count  share
(empty string)  7124   100.0%
Fig 5.
Per-column null rate across the corpus. Columns are ordered by input position.
column     kind         null %
iso_a2     categorical  0.0%
iso_a3     categorical  0.0%
name       categorical  0.0%
name_long  categorical  0.0%
continent  categorical  0.0%
region_un  categorical  0.0%
subregion  categorical  0.0%
longitude  numeric      0.0%
latitude   numeric      0.0%
source     categorical  0.0%
Fig 6.
Pearson correlation across numeric columns (sampled, bounded).
Pearson correlation across 2 numeric columns (values clipped to 2 decimals).
           longitude  latitude
longitude  +1.00      -0.10
latitude   -0.10      +1.00
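
For reference, the textbook Pearson coefficient can be computed as below. This is a plain-Python sketch of the standard formula, not saturn's implementation (the caption notes that saturn samples and bounds its computation); the function name `pearson` is an assumption for illustration.

```python
import math


def pearson(xs, ys):
    """Pearson correlation of two equal-length numeric sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    # Sum of cross-deviations and sums of squared deviations.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)


print(pearson([1, 2, 3, 4], [2, 4, 6, 8]))
```

Pearson only measures linear association, so a value near -0.10 between raw longitude and latitude says little about spatial structure; it mainly confirms that neither coordinate is a linear proxy for the other.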

iso_a2 categorical metadata

This appears to be an ISO 3166-1 alpha-2 country code field, but every one of the 7124 rows holds the empty string, giving a cardinality of 1 and entropy of 0. The column carries no information whatsoever despite having a 0.0 null_rate (blanks are not counted as nulls here).
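
The distinction between nulls and blanks can be made explicit with a small helper. This is a sketch under the assumption that a "blank" means an empty or whitespace-only string and a "null" means a missing value (`None` in Python); `missing_rates` is a hypothetical name, not a saturn function.

```python
def missing_rates(values):
    """Return (null_rate, blank_rate) for a list of column values."""
    n = len(values)
    if n == 0:
        return 0.0, 0.0
    nulls = sum(1 for v in values if v is None)
    # Blank: a string that is empty or contains only whitespace.
    blanks = sum(1 for v in values if isinstance(v, str) and v.strip() == "")
    return nulls / n, blanks / n


# A column of empty strings has no nulls but is entirely blank.
print(missing_rates(["", "", ""]))  # (0.0, 1.0)
```

Reporting both rates side by side would surface columns like this one, where a 0.0 null rate hides the fact that no row carries a value.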

Treatment: Drop; the column is a constant empty string with no signal.

anthropic:claude-opus-4-7 · confidence high
Out[12]:

saturn.columns["iso_a2"].stats

stat              value
n                 7,124
nulls             0 (0.0%)
unique            1
top_value         (empty string)
top_rate          1
cardinality       1
entropy           0
entropy_ratio     0
alert: imbalance  top value is 100.0% of rows
Fig 7.
Top values for iso_a2.
Top values for iso_a2 (1 unique shown, of 1 total).
value           count  share
(empty string)  7124   100.0%

iso_a3 categorical metadata

This appears to be an ISO 3166-1 alpha-3 country code field, but every one of the 7124 rows holds the empty string. Cardinality is 1, entropy is 0, and there are no nulls — the column is present but entirely uninformative.
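
To see why a constant column yields cardinality 1, top_rate 1, and entropy 0, the stats can be reproduced with a short sketch. Two things here are assumptions rather than documented saturn behaviour: entropy is taken in bits (log base 2), and entropy_ratio is taken to be entropy divided by its maximum, log2(cardinality), with 0 used when only one value exists.

```python
import math
from collections import Counter


def entropy_stats(values):
    """Cardinality, top_rate, Shannon entropy (bits) and a normalised entropy ratio."""
    if not values:
        raise ValueError("values must not be empty")
    counts = Counter(values)
    n = len(values)
    probs = [c / n for c in counts.values()]
    entropy = sum(-p * math.log2(p) for p in probs)
    k = len(counts)
    # Assumed definition: entropy relative to the maximum possible for k values.
    ratio = entropy / math.log2(k) if k > 1 else 0.0
    return {"cardinality": k, "top_rate": max(probs), "entropy": entropy, "entropy_ratio": ratio}


print(entropy_stats([""] * 7124))
```

With a single distinct value the only probability is 1, and 1 × log2(1) = 0, so the entropy is exactly zero regardless of how many rows there are.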

Treatment: Drop; the column is a constant empty string with zero variance.

anthropic:claude-opus-4-7 · confidence high
Out[15]:

saturn.columns["iso_a3"].stats

stat              value
n                 7,124
nulls             0 (0.0%)
unique            1
top_value         (empty string)
top_rate          1
cardinality       1
entropy           0
entropy_ratio     0
alert: imbalance  top value is 100.0% of rows
Fig 8.
Top values for iso_a3.
Top values for iso_a3 (1 unique shown, of 1 total).
value           count  share
(empty string)  7124   100.0%

name categorical metadata

This 'name' column is constant: every one of the 7124 rows holds the empty string, giving cardinality 1 and entropy 0. There is no signal here — likely a field that was scraped or exported but never populated.

Treatment: Drop; the column is a single constant value with zero information.

anthropic:claude-opus-4-7 · confidence high
Out[18]:

saturn.columns["name"].stats

stat              value
n                 7,124
nulls             0 (0.0%)
unique            1
top_value         (empty string)
top_rate          1
cardinality       1
entropy           0
entropy_ratio     0
alert: imbalance  top value is 100.0% of rows
Fig 9.
Top values for name.
Top values for name (1 unique shown, of 1 total).
value           count  share
(empty string)  7124   100.0%

name_long categorical other

This column is a constant — every one of the 7124 rows holds the empty string, giving a cardinality of 1 and entropy of 0. Although technically non-null, it carries no information and the top_rate of 1.0 confirms there is nothing to distinguish records here.

Treatment: Drop; the column is constant and contributes no signal.

anthropic:claude-opus-4-7 · confidence high
Out[21]:

saturn.columns["name_long"].stats

stat              value
n                 7,124
nulls             0 (0.0%)
unique            1
top_value         (empty string)
top_rate          1
cardinality       1
entropy           0
entropy_ratio     0
alert: imbalance  top value is 100.0% of rows
Fig 10.
Top values for name_long.
Top values for name_long (1 unique shown, of 1 total).
value           count  share
(empty string)  7124   100.0%

continent categorical metadata

This column appears intended to record continent metadata, but every one of the 7124 rows holds an empty string. With cardinality of 1, entropy of 0, and a top_rate of 1.0 on the blank value, it carries no information whatsoever.

Treatment: Drop the column; it is constant-empty and contributes nothing.

anthropic:claude-opus-4-7 · confidence high
Out[24]:

saturn.columns["continent"].stats

stat              value
n                 7,124
nulls             0 (0.0%)
unique            1
top_value         (empty string)
top_rate          1
cardinality       1
entropy           0
entropy_ratio     0
alert: imbalance  top value is 100.0% of rows
Fig 11.
Top values for continent.
Top values for continent (1 unique shown, of 1 total).
value           count  share
(empty string)  7124   100.0%

region_un categorical metadata

This column appears to be a UN region metadata field, but every one of the 7124 rows holds an empty string, giving a cardinality of 1 and entropy of 0. There is no signal here whatsoever — the field carries no information despite a 0.0 null_rate (blanks are not counted as nulls).

Treatment: Drop; the column is constant (empty string) across all rows.

anthropic:claude-opus-4-7 · confidence high
Out[27]:

saturn.columns["region_un"].stats

stat              value
n                 7,124
nulls             0 (0.0%)
unique            1
top_value         (empty string)
top_rate          1
cardinality       1
entropy           0
entropy_ratio     0
alert: imbalance  top value is 100.0% of rows
Fig 12.
Top values for region_un.
Top values for region_un (1 unique shown, of 1 total).
value           count  share
(empty string)  7124   100.0%

subregion categorical metadata

This column appears to be a geographic subregion field, but it carries zero information: all 7,124 rows hold the same empty-string value, giving cardinality 1 and entropy 0. It is effectively a blank placeholder rather than a populated attribute.

Treatment: Drop; the column is constant and contributes no signal.

anthropic:claude-opus-4-7 · confidence high
Out[30]:

saturn.columns["subregion"].stats

stat              value
n                 7,124
nulls             0 (0.0%)
unique            1
top_value         (empty string)
top_rate          1
cardinality       1
entropy           0
entropy_ratio     0
alert: imbalance  top value is 100.0% of rows
Fig 13.
Top values for subregion.
Top values for subregion (1 unique shown, of 1 total).
value           count  share
(empty string)  7124   100.0%

longitude numeric feature

This column appears to be a geographic longitude in decimal degrees, with values spanning the full -179.97 to 179.99 range. All 7,124 entries are unique with no nulls or zeros. The distribution is platykurtic (kurtosis -1.13) with only mild negative skew (-0.27), so points are not concentrated in a single region, although coverage is uneven: histogram bin counts range from 28 (81 to 90) to 632 (18.01 to 27.01). The IQR of ~191.8 degrees (q1 -72.33, q3 119.5) confirms that points are spread across both the western and eastern hemispheres.

Treatment: Pair with latitude for geospatial features; consider cyclic encoding given the -180/180 wrap.
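
A minimal sketch of the cyclic encoding mentioned in the treatment: mapping longitude to a sine/cosine pair places -180 and 180 at the same point, so a model no longer sees a jump at the antimeridian. The function name `encode_longitude` is an assumption for illustration.

```python
import math


def encode_longitude(lon_deg):
    """Map a longitude in degrees to (sin, cos) so that -180 and 180 coincide."""
    rad = math.radians(lon_deg)
    return math.sin(rad), math.cos(rad)


west = encode_longitude(-179.97)  # reported minimum
east = encode_longitude(179.99)   # reported maximum

# Raw degrees differ by almost 360, but the encoded points are nearly identical.
gap = math.dist(west, east)
print(gap < 0.001)  # True
```

The two features together preserve the angle exactly; using only one of them would make distinct longitudes indistinguishable (for example, sin alone maps 30 and 150 to the same value).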

anthropic:claude-opus-4-7 · confidence high
Out[33]:

saturn.columns["longitude"].stats

stat          value
n             7,124
nulls         0 (0.0%)
unique        7,124
min           -180
max           180
mean          21.9
median        23.48
std           97.72
q1            -72.33
q3            119.5
iqr           191.8
skew          -0.267
kurtosis      -1.131
n_outliers    0
outlier_rate  0
zero_rate     0
Fig 14.
Distribution of longitude. Vertical dash marks the median.
Histogram bins for longitude (median: 23.477727818184434). The 40 bins and counts are identical to the data table under Fig 2.

latitude numeric feature

This column is a geographic latitude in decimal degrees, with all 7124 values unique and no nulls. The range spans -83.05 to 83.24, covering nearly the full globe, with a median of 25.20 skewed slightly toward the northern hemisphere (skew -0.60). Only 35 values (0.49%) flag as outliers and the wide IQR of 51.96 reflects genuine geographic spread rather than data error.
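
The report does not state which rule saturn uses to flag outliers. Assuming the common Tukey rule (values more than 1.5 × IQR beyond the quartiles), the fences can be worked out from the reported q1 and q3; `tukey_fences` is a hypothetical helper.

```python
def tukey_fences(q1, q3, k=1.5):
    """Lower and upper outlier fences under the (assumed) Tukey k * IQR rule."""
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr


# Quartiles and extremes as reported in the latitude stats.
lower, upper = tukey_fences(1.149, 53.11)
lat_min, lat_max = -83.05, 83.24

print(round(lower, 2), round(upper, 2))  # -76.79 131.05
print(lat_min < lower, lat_max > upper)  # True False
```

Under that assumption the upper fence (about 131) lies beyond any valid latitude, so every flagged value would sit at the southern extreme, below roughly -76.8. That is consistent with the histogram, where 19 rows fall below -78.9 and a further 26 fall between -78.9 and -74.74.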

Treatment: Pair with longitude for spatial features; consider binning or projecting rather than using raw degrees in linear models.
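
One possible projection, sketched here as an illustration rather than a recommendation from the profiler, is to convert each latitude/longitude pair to a point on the unit sphere. The three resulting coordinates vary smoothly across the poles and the antimeridian, which raw degrees do not. The function name `to_unit_sphere` is an assumption.

```python
import math


def to_unit_sphere(lat_deg, lon_deg):
    """Convert latitude/longitude in degrees to (x, y, z) on the unit sphere."""
    lat = math.radians(lat_deg)
    lon = math.radians(lon_deg)
    return (
        math.cos(lat) * math.cos(lon),  # x: towards (0 N, 0 E)
        math.cos(lat) * math.sin(lon),  # y: towards (0 N, 90 E)
        math.sin(lat),                  # z: towards the north pole
    )


# The reported latitude and longitude medians, used only as a sample point.
x, y, z = to_unit_sphere(25.2, 23.48)
norm = math.sqrt(x * x + y * y + z * z)
print(abs(norm - 1.0) < 1e-12)  # True
```

Straight-line distance between two such points increases monotonically with great-circle distance, so the encoding suits distance-based or linear models better than treating degrees as ordinary numbers.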

anthropic:claude-opus-4-7 · confidence high
Out[36]:

saturn.columns["latitude"].stats

stat          value
n             7,124
nulls         0 (0.0%)
unique        7,124
min           -83.05
max           83.24
mean          22.92
median        25.2
std           34.23
q1            1.149
q3            53.11
iqr           51.96
skew          -0.6007
kurtosis      0.1113
n_outliers    35
outlier_rate  0.004913
zero_rate     0
Fig 15.
Distribution of latitude. Vertical dash marks the median.
Histogram bins for latitude (median: 25.195900167159152). The 40 bins and counts are identical to the data table under Fig 1.

source categorical metadata

This column records the data provenance, with every one of the 7124 rows tagged as 'Natural Earth 1:10m Admin 0 Label Points'. Cardinality is 1 and entropy is 0, so it carries no information for any model or grouping. It's useful only as a dataset-level annotation.

Treatment: Drop from modelling; retain in dataset documentation as the source label.

anthropic:claude-opus-4-7 · confidence high
Out[39]:

saturn.columns["source"].stats

stat              value
n                 7,124
nulls             0 (0.0%)
unique            1
top_value         Natural Earth 1:10m Admin 0 Label Points
top_rate          1
cardinality       1
entropy           0
entropy_ratio     0
alert: imbalance  top value is 100.0% of rows
Fig 16.
Top values for source.
Top values for source (1 unique shown, of 1 total).
value                                     count  share
Natural Earth 1:10m Admin 0 Label Points  7124   100.0%

How to cite

BibTeX
@misc{saturn-geographic-country-centroids-2026,
  author       = {Steuber, Luke},
  title        = {Saturn reading: geographic country centroids},
  year         = {2026},
  howpublished = {\url{https://dr.eamer.dev/saturn/view/geographic-country_centroids}},
  note         = {Profiled with saturn-dissect v0.2.0, prompt saturn-insight-v2, model anthropic:claude-opus-4-7},
}
APA
Steuber, L. (2026). Saturn reading: geographic country centroids. Source: /home/coolhand/html/datavis/data_trove/data/geographic/country_centroids.json. Profiled with saturn-dissect v0.2.0 (saturn-insight-v2, anthropic:claude-opus-4-7). Retrieved from https://dr.eamer.dev/saturn/view/geographic-country_centroids