saturn

/home/coolhand/html/datavis/data_trove/data/quirky/megaliths.json 15,464 rows sample n=15,464 seed 42 2026-06-22T00:36:07+00:00

Overview

Source: /home/coolhand/html/datavis/data_trove/data/quirky/megaliths.json
Total rows: 15,464
Profiled sample: 15,464
Columns: 14
Generated: 2026-06-22T00:36:07+00:00

Per-column null rate across the corpus.
column             kind         null %
id                 numeric      0.0%
osm_type           categorical  0.0%
name               text         0.0%
lat                numeric      0.0%
lon                numeric      0.0%
type               categorical  0.0%
megalith_type      categorical  0.0%
description        categorical  0.0%
wikipedia          text         0.0%
wikidata           text         0.0%
heritage           categorical  0.0%
heritage_operator  categorical  0.0%
start_date         categorical  0.0%
material           categorical  0.0%

Insights opt-in

Model-generated narrative. These are opinions, not facts — the stats below are what saturn measured. Generated by: anthropic:default.

Dataset high anthropic:default

This dataset catalogues 15,464 megalithic structures (dolmens, menhirs, stone circles, nuraghes, and more) drawn from OpenStreetMap, with geographic coordinates, heritage classification, and typology fields. The most striking pattern is extreme sparsity in descriptive metadata: 95.8% of records have an empty description, 98.4% have no material recorded, and 70% lack a Wikidata link, so the dataset is geographically rich but thin on descriptive attributes. These gaps are stored as empty strings, which is why every column still reports a 0.0% null rate. The megalith_type column is the most informative categorical field, splitting meaningfully across menhirs (5,231), dolmens (4,501), nuraghes (1,080), and stone circles (1,011). Geographically, the bulk of sites cluster in Western Europe (median latitude ~47.6°N, median longitude ~-1.6°), but high skew and outliers in both lat and lon indicate a long tail of sites far outside that core (latitudes down to -51.8°, longitudes from -151.4° to 144.7°), which is worth mapping.

wikidata high anthropic:default

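This column stores Wikidata entity identifiers (Q-codes) linking dataset rows to Wikidata knowledge-base entries. Two signals demand attention. First, 10,819 of 15,464 rows (70.0%) are empty strings rather than true nulls. Second, the duplicate_rate of 0.723 is driven almost entirely by those empty strings: of the 11,175 duplicates, 10,818 are repeated empty strings and only 357 are repeated Q-codes, spread over 4,288 distinct codes in 4,645 populated rows. The top value 'Q106546933' appears 17 times, which suggests that a few entities map to many sites. The allcaps_rate of 0.300 matches the share of populated rows (4,645 of 15,464), since every valid code is an uppercase 'Q' followed by digits.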
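The split between empty-string duplicates and genuine repeats can be checked with a few lines of arithmetic. The sketch below assumes that n_duplicates is computed as rows minus unique, which matches the reported figures (15,464 - 4,289 = 11,175); the function name `split_duplicates` is hypothetical and not part of saturn.

```python
def split_duplicates(rows: int, unique: int, n_empty: int) -> dict:
    """Split a duplicate count into empty-string repeats and repeated values.

    Assumption: n_duplicates = rows - unique, as the reported figures suggest.
    """
    n_duplicates = rows - unique
    # Every empty string after the first one counts as a duplicate.
    from_empty = max(n_empty - 1, 0)
    return {
        "n_duplicates": n_duplicates,
        "from_empty": from_empty,
        "from_values": n_duplicates - from_empty,
    }


# Figures taken from the wikidata column statistics in this report.
wikidata = split_duplicates(rows=15_464, unique=4_289, n_empty=10_819)
print(wikidata)
```

The same call with the name column's figures (unique 9,869, n_empty 4,720) or the wikipedia column's figures (unique 2,058, n_empty 13,060) shows how much of each duplicate rate comes from blanks.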

name high anthropic:default

This column contains the local or common name of prehistoric megalithic monuments (dolmens, menhirs, stone circles, nuraghes, etc.), drawing from a multilingual dataset spanning at least English, French, Russian (Cyrillic), and German. Two signals stand out. First, 30.5% of rows (4,720 of 15,464) are empty strings rather than true nulls and should be treated as missing values. Second, the duplicate rate of 36.2% (5,595 duplicates) is mostly an artifact of those empty strings, which contribute 4,719 of the duplicates; the remaining 876 come from names reused across distinct monuments, chiefly generic type-names such as 'Долмен' (191), 'Dolmen' (51), 'Menhir' (50), and 'Standing Stone' (48). The one-word rate of 37.8% and mean of ~2.5 words are consistent with short monument names.

wikipedia high anthropic:default

This column stores Wikipedia article references in a 'language-code:article-title' format (e.g., 'de:Großsteingräber im Haldensleber Forst'), linking dataset records to corresponding Wikipedia pages across multiple languages including German, French, Catalan, Portuguese, and English. The dominant signal is that 13,060 of 15,464 rows (84.5%) are empty strings, meaning most records have no Wikipedia link at all. These empty strings also account for 13,059 of the 13,406 reported duplicates; only 347 duplicates are repeated article references among the 2,404 populated rows (2,057 distinct articles), consistent with a small number of grouped or list articles that cover several individual sites. The multi-language mix is expected for a multilingual cultural-heritage dataset.
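The 'language-code:article-title' format described above can be split mechanically. The following is a minimal sketch, not part of saturn: the helper names `parse_wikipedia_tag` and `wikipedia_url` are hypothetical, and the URL pattern assumes the usual `https://<lang>.wikipedia.org/wiki/<Title>` layout with underscores in place of spaces.

```python
from typing import Optional, Tuple
from urllib.parse import quote


def parse_wikipedia_tag(value: str) -> Optional[Tuple[str, str]]:
    """Split 'lang:Title' into (lang, title); None for empty or malformed values."""
    if not value or ":" not in value:
        return None
    # Split on the first colon only, since article titles may contain colons.
    lang, title = value.split(":", 1)
    lang, title = lang.strip(), title.strip()
    if not lang or not title:
        return None
    return lang, title


def wikipedia_url(value: str) -> Optional[str]:
    """Build an article URL under the assumed standard Wikipedia URL layout."""
    parsed = parse_wikipedia_tag(value)
    if parsed is None:
        return None
    lang, title = parsed
    # Spaces become underscores; remaining special characters are percent-encoded.
    return f"https://{lang}.wikipedia.org/wiki/{quote(title.replace(' ', '_'))}"


print(parse_wikipedia_tag("de:Menhir von Benzingerode"))
print(wikipedia_url("it:Su Nuraxi"))
```

Returning None for the empty string keeps the 13,060 blank rows out of any downstream link checks.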

description high anthropic:default

This column is a text description field for archaeological or heritage site records, containing short labels or names of megalithic structures (e.g., 'Jættestue', 'Großsteingrab', 'Dolmen', 'Stone circle') in multiple languages including Danish, German, Portuguese, and English. The most striking signal is that 95.8% of the 15,464 rows (14,814) carry an empty string, effectively making the field near-empty at scale. The remaining 586 distinct non-empty values are heavily long-tailed, with the most frequent non-empty value ('Jættestue') appearing only 11 times. The entropy ratio of 0.069 confirms extreme imbalance driven by the dominant empty-string value.

material high anthropic:default

This column captures the material of the monument, with 13 distinct values across 15,464 rows. The dominant 'value' is an empty string, accounting for 98.44% of all records (15,223), so the field is effectively unpopulated despite a null_rate of 0.0. The remaining 241 records are almost all stone types (stone, granite, sandstone, limestone, basalt, andesite), with spelling and language variants that would need normalising ('granit' alongside 'granite', German 'Quarzit', 'quartz_blanc') and one multi-valued entry ('stone;concrete'). The single 'reinforced_concrete' record is unexpected for a megalith and is worth checking. Entropy is extremely low (0.133, entropy ratio 0.036), confirming near-total dominance of the blank value.

start_date high anthropic:default

This column is intended to capture a start date for records, but it is overwhelmingly empty: 15,430 of 15,464 rows (99.78%) contain a blank string, making it nearly useless as a feature. The 34 non-empty values are highly heterogeneous — mixing ISO dates ('2004-07-01'), calendar years ('1999'), approximate historical dates ('~2000 BC'), ranges ('between 3500 and 2800 BCE'), negative year offsets ('-3000 BC'), and even a code-like value ('C-30') — indicating no enforced format or schema. The extreme imbalance (top_rate 0.998) and near-zero entropy (0.032) confirm the column carries almost no information signal.
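To make the format mix concrete, here is a deliberately conservative sketch that extracts a signed year from the simple cases only (ISO dates, plain years, and single years with a '~' prefix, a leading minus, or a BC/BCE suffix) and returns None for ranges, free text, and code-like values. The function name `parse_start_year` is hypothetical, and mapping 'N BC' to -N is a simplifying assumption that ignores the missing year zero.

```python
import re
from typing import Optional

# ISO calendar date, e.g. '2004-07-01'.
_ISO_DATE = re.compile(r"^(\d{4})-\d{2}-\d{2}$")
# A single year with optional '~' prefix, leading minus, and BC/BCE suffix.
_SINGLE_YEAR = re.compile(r"^~?\s*(-?)(\d{1,5})\s*(BCE|BC)?$", re.IGNORECASE)


def parse_start_year(value: str) -> Optional[int]:
    """Return a signed year (negative = BC) or None if the value is not a single year."""
    text = value.strip()
    if not text:
        return None
    iso = _ISO_DATE.match(text)
    if iso:
        return int(iso.group(1))
    single = _SINGLE_YEAR.match(text)
    if single:
        sign, digits, era = single.groups()
        year = int(digits)
        # Either a leading minus or a BC/BCE suffix marks the year as negative.
        return -year if (sign or era) else year
    # Ranges ('2800-2200 BC'), phrases, and codes ('C-30') are left unparsed.
    return None


for raw in ["2004-07-01", "1999", "~2000 BC", "-3000 BC", "2800-2200 BC", "C-30", ""]:
    print(repr(raw), "->", parse_start_year(raw))
```

Leaving ranges and codes unparsed is intentional: with only 34 populated rows, reviewing the leftovers by hand is likely cheaper than guessing at their meaning.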

type high anthropic:default

This column holds a coarse monument classification, with 19 distinct categories across 15,464 records and no nulls. It is severely imbalanced: 'megalith' dominates at 97.73% of all records (15,113), leaving the remaining 18 types (menhir, dolmen, standing_stone, stone_circle, nuraghe, etc.) to share just 351 records. The entropy ratio of 0.049 confirms near-total concentration in one class, so the column is a poor label for multi-class modelling; megalith_type carries the usable typology.

lat high anthropic:default

This column contains geographic latitude in degrees, ranging from -51.81° to 65.17°. The middle half of the values lies between 42.9° and 50.5° (IQR of ~7.6°), which, read together with the lon column, points to a heavy concentration in Western and Central Europe. The negative skew of -3.09 and extreme kurtosis of 26.33 indicate a sharp central peak with a long left tail: a small number of records sit at low or even southern-hemisphere latitudes, and 134 values (~0.87%) are flagged as outliers. Near-uniqueness (15,320 unique out of 15,464 rows) is expected for precise coordinate data.
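The report does not say how outliers are flagged. One common rule, assumed here purely for illustration, is Tukey's fences at 1.5 × IQR beyond the quartiles. The sketch below applies that rule to the quartiles reported for lat and id; the function names are hypothetical and this is not saturn's documented method.

```python
from typing import Iterable, Tuple


def tukey_fences(q1: float, q3: float, k: float = 1.5) -> Tuple[float, float]:
    """Return (lower, upper) fences placed k * IQR beyond the quartiles."""
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr


def count_outliers(values: Iterable[float], q1: float, q3: float, k: float = 1.5) -> int:
    """Count values strictly outside the fences."""
    low, high = tukey_fences(q1, q3, k)
    return sum(1 for v in values if v < low or v > high)


# Quartiles as reported in the lat and id statistics of this report.
lat_low, lat_high = tukey_fences(42.949, 50.518)
id_low, id_high = tukey_fences(2_374_723_034, 6_845_360_977)

print(f"lat fences: {lat_low:.4f} to {lat_high:.4f}")
print(f"id fences:  {id_low:.1f} to {id_high:.1f}")
```

Under this assumption the lat fences fall at roughly 31.6° and 61.9°, and the id fences enclose the whole reported range (24,151,805 to 13,537,320,281), which is at least consistent with the zero outliers reported for id.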

lon high anthropic:default

This column contains geographic longitude in degrees, spanning -151.36 to 144.74, a plausible global range. The distribution is heavily concentrated in a narrow band: the middle half of the values lies between -3.08 and 8.45 (IQR of 11.53), which corresponds to Western Europe. The strong positive skew (3.65) and extreme kurtosis (34.34) come from 676 flagged outliers (4.37%), and the histogram shows that most of them lie to the east of the core rather than the west. The mean (2.62) sits well above the median (-1.62), confirming the asymmetry: a dataset dominated by European locations with a long tail of sites elsewhere.

id high anthropic:default

This column is a numeric identifier: every one of the 15,464 rows carries a distinct value with zero nulls, so it is unique within this file. The values are large integers spanning roughly 24 million to 13.5 billion, which is consistent with OpenStreetMap element IDs rather than a sequential row index. Because OpenStreetMap numbers nodes and ways in separate ID spaces, the pair (osm_type, id) is the safer key if the data is joined back to its source. Mild positive skew (0.89) and a wide IQR (~4.5 billion) show that the IDs are spread unevenly across the range, and no outliers are flagged.

heritage medium anthropic:default

This column represents a heritage classification or designation status for records, likely a regulatory or cultural heritage grading field. The dominant 'value' is an empty string, which accounts for 87.96% of all 15,464 rows (13,602), indicating that most records carry no heritage designation. The remaining values are a heterogeneous mix of numeric grades (1, 2, 3, 4, 6, 7), boolean-style strings ('yes', 'no'), a Portuguese classification phrase ('Em Vias de Classificação'), a single 'Scheduled Monument' entry, and a single code-like value ('M0021'), which indicates free-form entry with no enforced vocabulary.

heritage_operator high anthropic:default

This column identifies the heritage operator or authority responsible for a record, with 31 distinct coded values across 15,464 rows. The dominant 'value' is an empty string, accounting for 89.5% of all rows (13,848), meaning the vast majority of records have no operator assigned; this blank dominance suppresses the entropy ratio to 0.14. Among the 30 non-empty values, 'mhs' (960), 'IE:smr' (229), and 'dgpc' (185) are the most common. Formatting is inconsistent: some codes appear in more than one case ('cadw' and 'Cadw', 'dgpc' and 'DGPC'), some authorities are given as full names rather than codes ('Historic Environment Scotland'), and multi-valued entries use differing separators ('IE:smr;IE:nm' and 'IE:smr:IE:nm').
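A light normalisation pass would merge the case variants and split the semicolon-separated entries. The sketch below is one possible approach under stated assumptions: the function name `normalize_operator` is hypothetical, ';' is assumed to be the intended multi-value separator, and colons are left alone because they also occur inside single codes such as 'IE:smr'.

```python
from typing import List


def normalize_operator(value: str) -> List[str]:
    """Return a list of case-folded operator codes; empty list for blank input."""
    text = value.strip()
    if not text:
        return []
    # Assumption: ';' separates multiple operators within one field.
    return [part.strip().casefold() for part in text.split(";") if part.strip()]


for raw in ["Cadw", "cadw", "DGPC", "IE:smr;IE:nm", "IE:smr:IE:nm", ""]:
    print(repr(raw), "->", normalize_operator(raw))
```

The colon-joined variant 'IE:smr:IE:nm' stays a single token under this rule and would need a manual correction, as would any mapping from full names to codes.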

megalith_type high anthropic:default

This column classifies prehistoric stone monuments into structural types, with 73 distinct categories across 15,464 records and no nulls. The dominant class is 'menhir' (5,231 records, ~33.8%), followed closely by 'dolmen' (4,501), meaning these two types together account for over 60% of all rows — a moderate concentration reflected in an entropy ratio of 0.44. Notably, the third most frequent value is an empty string ('') with 1,714 occurrences (~11.1%), which masquerades as a non-null entry and represents a meaningful data quality issue that null_rate alone does not capture.
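Because null_rate counts only true nulls, a separate measure is needed to see the blanks. A minimal sketch, assuming that whitespace-only strings should count as missing alongside None (the function name `effective_missing_rate` is hypothetical):

```python
from typing import Iterable, Optional


def effective_missing_rate(values: Iterable[Optional[str]]) -> float:
    """Share of values that are None, empty, or whitespace-only."""
    items = list(values)
    if not items:
        return 0.0
    missing = sum(1 for v in items if v is None or v.strip() == "")
    return missing / len(items)


# Synthetic column with the same blank count as megalith_type (1,714 of 15,464).
column = [""] * 1_714 + ["menhir"] * (15_464 - 1_714)
print(f"{effective_missing_rate(column):.3f}")
```

Applied per column, this measure would replace the uniform 0.0% in the null-rate table with the blank shares quoted throughout the insights.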

osm_type high anthropic:default

This column encodes the OpenStreetMap geometry type, distinguishing between point features ('node') and linear/polygon features ('way'). With only 2 distinct values across 15,464 rows and zero nulls, it is a clean binary flag. The distribution is heavily skewed: 'node' dominates at 86.1% (13,311 records) versus 'way' at just 13.9% (2,153 records). The low entropy of 0.582 confirms the imbalance, which may matter if 'way' features behave differently in downstream models.
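The entropy figures in this report are consistent with Shannon entropy in bits, and with an entropy ratio equal to that entropy divided by log2 of the number of categories. That reading is inferred from the reported numbers, not from saturn's documentation. A short sketch that reproduces the osm_type values from its two counts:

```python
import math
from typing import Sequence


def shannon_entropy_bits(counts: Sequence[int]) -> float:
    """Shannon entropy in bits of a categorical distribution given as counts."""
    total = sum(counts)
    if total == 0:
        return 0.0
    probs = [c / total for c in counts if c > 0]
    return -sum(p * math.log2(p) for p in probs)


def entropy_ratio(counts: Sequence[int]) -> float:
    """Entropy divided by its maximum, log2 of the number of non-empty categories."""
    k = sum(1 for c in counts if c > 0)
    if k < 2:
        return 0.0
    return shannon_entropy_bits(counts) / math.log2(k)


osm_type_counts = [13_311, 2_153]  # node, way
print(f"entropy: {shannon_entropy_bits(osm_type_counts):.3f}")
print(f"ratio:   {entropy_ratio(osm_type_counts):.3f}")
```

With two categories the maximum entropy is 1 bit, which is why entropy and entropy_ratio coincide for osm_type but differ for columns with more categories.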

Numeric correlation

Pearson correlation across 3 numeric columns (values clipped to 2 decimals).
       id      lat     lon
id     +1.00   +0.00   -0.05
lat    +0.00   +1.00   -0.11
lon    -0.05   -0.11   +1.00

id numeric

rows: 15,464
null: 0 (0.0%)
unique: 15,464
min: 24,151,805
max: 13,537,320,281
mean: 4,503,184,710
median: 3,411,205,876
std: 3,470,459,882
q1: 2,374,723,034
q3: 6,845,360,977
iqr: 4,470,637,943
skew: 0.891
kurtosis: -0.201
n_outliers: 0
outlier_rate: 0.000
zero_rate: 0.000

Histogram bins for id (median: 3411205875.5).
bin                      count
2.415e+07 – 3.62e+08     682
3.62e+08 – 6.998e+08     889
6.998e+08 – 1.038e+09    623
1.038e+09 – 1.375e+09    736
1.375e+09 – 1.713e+09    297
1.713e+09 – 2.051e+09    352
2.051e+09 – 2.389e+09    305
2.389e+09 – 2.727e+09    2,679
2.727e+09 – 3.065e+09    300
3.065e+09 – 3.402e+09    756
3.402e+09 – 3.74e+09     951
3.74e+09 – 4.078e+09     705
4.078e+09 – 4.416e+09    532
4.416e+09 – 4.754e+09    318
4.754e+09 – 5.092e+09    386
5.092e+09 – 5.429e+09    200
5.429e+09 – 5.767e+09    207
5.767e+09 – 6.105e+09    261
6.105e+09 – 6.443e+09    161
6.443e+09 – 6.781e+09    175
6.781e+09 – 7.119e+09    250
7.119e+09 – 7.456e+09    270
7.456e+09 – 7.794e+09    89
7.794e+09 – 8.132e+09    115
8.132e+09 – 8.47e+09     734
8.47e+09 – 8.808e+09     391
8.808e+09 – 9.146e+09    155
9.146e+09 – 9.483e+09    137
9.483e+09 – 9.821e+09    204
9.821e+09 – 1.016e+10    137
1.016e+10 – 1.05e+10     94
1.05e+10 – 1.083e+10     180
1.083e+10 – 1.117e+10    175
1.117e+10 – 1.151e+10    157
1.151e+10 – 1.185e+10    133
1.185e+10 – 1.219e+10    114
1.219e+10 – 1.252e+10    132
1.252e+10 – 1.286e+10    97
1.286e+10 – 1.32e+10     255
1.32e+10 – 1.354e+10     130

osm_type categorical

rows: 15,464
null: 0 (0.0%)
unique: 2
top_value: node
top_rate: 0.861
cardinality: 2
entropy: 0.582
entropy_ratio: 0.582

Top values for osm_type (2 unique shown, of 2 total).
rank  count   share  value
1     13,311  86.1%  node
2     2,153   13.9%  way

name text

Flags: 37.8% of rows are a single word; 36.2% duplicate strings
rows: 15,464
null: 0 (0.0%)
unique: 9,869
len_min: 0
len_max: 84
len_mean: 13.647
len_median: 15.000
len_p95: 30.000
word_mean: 2.495
word_median: 2.000
n_empty: 4,720
n_duplicates: 5,595
duplicate_rate: 0.362
vocab_size: 9,447
readability_flesch_mean: 46.901
emoji_rate: 0.000
url_rate: 0.000
one_word_rate: 0.378
allcaps_rate: 3.17e-03
boilerplate_rate: 0.000

Character-length distribution for name (mean: 13.64685721676151).
chars     count
0 – 2     4,725
2 – 4     27
4 – 6     252
6 – 8     552
8 – 10    420
10 – 13   493
13 – 15   810
15 – 17   1,037
17 – 19   1,247
19 – 21   1,295
21 – 23   1,726
23 – 25   831
25 – 27   680
27 – 29   447
29 – 32   283
32 – 34   185
34 – 36   134
36 – 38   86
38 – 40   83
40 – 42   37
42 – 44   43
44 – 46   15
46 – 48   13
48 – 50   13
50 – 52   4
52 – 55   9
55 – 57   4
57 – 59   1
59 – 61   5
61 – 63   2
63 – 65   2
65 – 67   1
67 – 69   0
69 – 71   1
71 – 74   0
74 – 76   0
76 – 78   0
78 – 80   0
80 – 82   0
82 – 84   1
Sample values (first 10)
  1. Crow Hill Cairn Circle
  2. Arkaun
  3. Su Nuraxi
  4. Sornaichean Coir' Fhinn
  5. Shaori fortress-1
  6. Anta do Monte da Caeira 2
  7. Tomba dei Giganti
  8. Menhir von Benzingerode

lat numeric

Flags: skew = -3.09
rows: 15,464
null: 0 (0.0%)
unique: 15,320
min: -51.809
max: 65.169
mean: 46.415
median: 47.592
std: 6.810
q1: 42.949
q3: 50.518
iqr: 7.569
skew: -3.087
kurtosis: 26.331
n_outliers: 134
outlier_rate: 8.67e-03
zero_rate: 0.000

Histogram bins for lat (median: 47.59247835).
bin                 count
-51.81 – -48.88     1
-48.88 – -45.96     0
-45.96 – -43.04     0
-43.04 – -40.11     0
-40.11 – -37.19     1
-37.19 – -34.26     0
-34.26 – -31.34     3
-31.34 – -28.41     0
-28.41 – -25.49     2
-25.49 – -22.56     0
-22.56 – -19.64     1
-19.64 – -16.72     1
-16.72 – -13.79     4
-13.79 – -10.87     4
-10.87 – -7.942     8
-7.942 – -5.018     21
-5.018 – -2.093     3
-2.093 – 0.8313     13
0.8313 – 3.756      26
3.756 – 6.68        1
6.68 – 9.605        7
9.605 – 12.53       5
12.53 – 15.45       8
15.45 – 18.38       2
18.38 – 21.3        5
21.3 – 24.23        2
24.23 – 27.15       8
27.15 – 30.08       3
30.08 – 33          9
33 – 35.92          38
35.92 – 38.85       523
38.85 – 41.77       2,211
41.77 – 44.7        3,646
44.7 – 47.62        3,269
47.62 – 50.55       1,808
50.55 – 53.47       1,660
53.47 – 56.4        1,627
56.4 – 59.32        506
59.32 – 62.24       33
62.24 – 65.17       5

lon numeric

Flags: skew = +3.65
rows: 15,464
null: 0 (0.0%)
unique: 15,407
min: -151.358
max: 144.738
mean: 2.618
median: -1.620
std: 14.644
q1: -3.083
q3: 8.447
iqr: 11.531
skew: 3.654
kurtosis: 34.341
n_outliers: 676
outlier_rate: 0.044
zero_rate: 0.000

Histogram bins for lon (median: -1.6201083).
bin                 count
-151.4 – -144       1
-144 – -136.6       0
-136.6 – -129.2     0
-129.2 – -121.7     1
-121.7 – -114.3     0
-114.3 – -106.9     1
-106.9 – -99.54     1
-99.54 – -92.14     2
-92.14 – -84.74     2
-84.74 – -77.33     6
-77.33 – -69.93     34
-69.93 – -62.53     2
-62.53 – -55.13     1
-55.13 – -47.72     5
-47.72 – -40.32     1
-40.32 – -32.92     0
-32.92 – -25.52     0
-25.52 – -18.12     0
-18.12 – -10.71     4
-10.71 – -3.31      3,136
-3.31 – 4.092       7,654
4.092 – 11.49       3,031
11.49 – 18.9        921
18.9 – 26.3         58
26.3 – 33.7         15
33.7 – 41.1         441
41.1 – 48.51        21
48.51 – 55.91       3
55.91 – 63.31       0
63.31 – 70.71       1
70.71 – 78.12       7
78.12 – 85.52       7
85.52 – 92.92       7
92.92 – 100.3       1
100.3 – 107.7       19
107.7 – 115.1       7
115.1 – 122.5       23
122.5 – 129.9       30
129.9 – 137.3       15
137.3 – 144.7       6

type categorical

Flags: top value is 97.7% of rows
rows: 15,464
null: 0 (0.0%)
unique: 19
top_value: megalith
top_rate: 0.977
cardinality: 19
entropy: 0.210
entropy_ratio: 0.049

Top values for type (19 unique shown, of 19 total).
rank  count   share  value
1     15,113  97.7%  megalith
2     156     1.0%   menhir
3     83      0.5%   dolmen
4     59      0.4%   standing_stone
5     16      0.1%   stone_circle
6     8       0.1%   nuraghe
7     6       0.0%   gallery_grave
8     5       0.0%   passage_grave
9     4       0.0%   lech
10    3       0.0%   stone_ship
11    2       0.0%   tholos
12    2       0.0%   chamber
13    1       0.0%   village
14    1       0.0%   plaque
15    1       0.0%   cist
16    1       0.0%   long_barrow
17    1       0.0%   chambered_cairn
18    1       0.0%   grave_field
19    1       0.0%   stone

megalith_type categorical

rows: 15,464
null: 0 (0.0%)
unique: 73
top_value: menhir
top_rate: 0.338
cardinality: 73
entropy: 2.749
entropy_ratio: 0.444

Top values for megalith_type (20 unique shown, of 73 total).
rank  count   share  value
1     5,231   33.8%  menhir
2     4,501   29.1%  dolmen
3     1,714   11.1%  (empty string)
4     1,080   7.0%   nuraghe
5     1,011   6.5%   stone_circle
6     537     3.5%   passage_grave
7     437     2.8%   chamber
8     184     1.2%   long_barrow
9     116     0.8%   alignment
10    107     0.7%   cist
11    85      0.5%   gallery_grave
12    68      0.4%   standing_stone
13    47      0.3%   stone_ship
14    32      0.2%   tholos
15    32      0.2%   court_tomb
16    25      0.2%   round_barrow
17    23      0.1%   well
18    23      0.1%   wedge_tomb
19    20      0.1%   cairn
20    20      0.1%   stone

description categorical

Flags: 558 singleton categories; top value is 95.8% of rows
rows: 15,464
null: 0 (0.0%)
unique: 587
top_value: (empty string)
top_rate: 0.958
cardinality: 587
entropy: 0.633
entropy_ratio: 0.069

Top values for description (20 unique shown, of 587 total).
rank  count   share  value
1     14,814  95.8%  (empty string)
2     11      0.1%   Jættestue
3     8       0.1%   Anta da Herdade da Ordem
4     5       0.0%   Stone circle
5     5       0.0%   Großsteingrab
6     5       0.0%   Rest eines Großsteingrabes
7     4       0.0%   Long Barrow
8     4       0.0%   Dolmen
9     4       0.0%   Langdysse
10    4       0.0%   Four standing and one recumbent standing stone.
11    4       0.0%   pair of two standing stones
12    3       0.0%   Hünengrab
13    3       0.0%   Henge / Círculo lítico
14    2       0.0%   Menhir
15    2       0.0%   Гармония
16    2       0.0%   Runddysse
17    2       0.0%   Allée couverte
18    2       0.0%   Stendysse
19    2       0.0%   Tumulus, dalle de couverture
20    2       0.0%   Table, chevet, orthostates droit et gauche

wikipedia text

Flags: 85.2% of rows are a single word; 86.7% duplicate strings
rows: 15,464
null: 0 (0.0%)
unique: 2,058
len_min: 0
len_max: 75
len_mean: 4.100
len_median: 0.000
len_p95: 29.000
word_mean: 1.351
word_median: 1.000
n_empty: 13,060
n_duplicates: 13,406
duplicate_rate: 0.867
vocab_size: 2,769
readability_flesch_mean: 5.480
emoji_rate: 0.000
url_rate: 0.000
one_word_rate: 0.852
allcaps_rate: 0.000
boilerplate_rate: 0.000

Character-length distribution for wikipedia (mean: 4.100038799793068).
chars     count
0 – 2     13,060
2 – 4     0
4 – 6     0
6 – 8     1
8 – 9     3
9 – 11    21
11 – 13   47
13 – 15   28
15 – 17   159
17 – 19   115
19 – 21   191
21 – 22   236
22 – 24   271
24 – 26   254
26 – 28   203
28 – 30   130
30 – 32   198
32 – 34   133
34 – 36   110
36 – 38   75
38 – 39   49
39 – 41   97
41 – 43   30
43 – 45   6
45 – 47   12
47 – 49   7
49 – 51   4
51 – 52   2
52 – 54   8
54 – 56   3
56 – 58   3
58 – 60   0
60 – 62   0
62 – 64   3
64 – 66   0
66 – 68   1
68 – 69   1
69 – 71   0
71 – 73   1
73 – 75   2
Sample values (first 10)
  1. it:Su Nuraxi
  2. ka:შაორის ციხე
  3. de:Menhir von Benzingerode

wikidata text

Flags: 100.0% of rows are a single word; 30.0% of rows are all-caps; 95th-percentile length under 20 chars; 72.3% duplicate strings
rows: 15,464
null: 0 (0.0%)
unique: 4,289
len_min: 0
len_max: 10
len_mean: 2.667
len_median: 0.000
len_p95: 10.000
word_mean: 1.000
word_median: 1.000
n_empty: 10,819
n_duplicates: 11,175
duplicate_rate: 0.723
vocab_size: 4,288
readability_flesch_mean: 38.790
emoji_rate: 0.000
url_rate: 0.000
one_word_rate: 1.000
allcaps_rate: 0.300
boilerplate_rate: 0.000

Character-length distribution for wikidata (mean: 2.6668391101914124).
The 40 quarter-character bins collapse to the whole-number lengths below; all other bins are empty.
chars  count
0      10,819
6      5
7      94
8      1,167
9      2,574
10     805
Sample values (first 10)
  1. Q253241
  2. Q125484975
  3. Q16377805
  4. Q19834685
  5. Q17325138

heritage categorical

rows: 15,464
null: 0 (0.0%)
unique: 12
top_value: (empty string)
top_rate: 0.880
cardinality: 12
entropy: 0.734
entropy_ratio: 0.205

Top values for heritage (12 unique shown, of 12 total).
rank  count   share  value
1     13,602  88.0%  (empty string)
2     1,264   8.2%   2
3     205     1.3%   3
4     120     0.8%   1
5     109     0.7%   yes
6     69      0.4%   no
7     60      0.4%   Em Vias de Classificação
8     24      0.2%   4
9     8       0.1%   7
10    1       0.0%   Scheduled Monument
11    1       0.0%   6
12    1       0.0%   M0021

heritage_operator categorical

rows: 15,464
null: 0 (0.0%)
unique: 31
top_value: (empty string)
top_rate: 0.895
cardinality: 31
entropy: 0.703
entropy_ratio: 0.142

Top values for heritage_operator (20 unique shown, of 31 total).
rank  count   share  value
1     13,848  89.5%  (empty string)
2     960     6.2%   mhs
3     229     1.5%   IE:smr
4     185     1.2%   dgpc
5     103     0.7%   pc
6     23      0.1%   rce
7     18      0.1%   Historic Environment Scotland
8     14      0.1%   cadw
9     14      0.1%   whc
10    12      0.1%   lda
11    9       0.1%   nld
12    8       0.1%   IE:smr;IE:nm
13    6       0.0%   he
14    5       0.0%   Cadw
15    4       0.0%   mecd
16    3       0.0%   DGPC
17    3       0.0%   IE:smr:IE:nm
18    2       0.0%   alsh
19    2       0.0%   hs
20    2       0.0%   raa

start_date categorical

Flags: 21 singleton categories; top value is 99.8% of rows
rows: 15,464
null: 0 (0.0%)
unique: 26
top_value: (empty string)
top_rate: 0.998
cardinality: 26
entropy: 0.032
entropy_ratio: 6.86e-03

Top values for start_date (20 unique shown, of 26 total).
rank  count   share  value
1     15,430  99.8%  (empty string)
2     5       0.0%   1999
3     4       0.0%   C-30
4     2       0.0%   ~2000 BC
5     2       0.0%   between 3500 and 2800 BCE
6     1       0.0%   2900 BC..2600 BC
7     1       0.0%   -3000 BC
8     1       0.0%   -2000
9     1       0.0%   2004-07-01
10    1       0.0%   before -3250
11    1       0.0%   3720 BC
12    1       0.0%   2800-2200 BC
13    1       0.0%   ~5000 BCE
14    1       0.0%   ~C30 BC
15    1       0.0%   2000 BC
16    1       0.0%   Mittelneolithikum (2350 - 2150 v. u. Z.)
17    1       0.0%   1500 BC
18    1       0.0%   2800 BC..2200 BC
19    1       0.0%   2012-04-30
20    1       0.0%   3100 BC

material categorical

Flags: 7 singleton categories; top value is 98.4% of rows
rows: 15,464
null: 0 (0.0%)
unique: 13
top_value: (empty string)
top_rate: 0.984
cardinality: 13
entropy: 0.133
entropy_ratio: 0.036

Top values for material (13 unique shown, of 13 total).
rank  count   share  value
1     15,223  98.4%  (empty string)
2     196     1.3%   stone
3     29      0.2%   granite
4     5       0.0%   sandstone
5     2       0.0%   limestone
6     2       0.0%   dry_stone
7     1       0.0%   Quarzit
8     1       0.0%   reinforced_concrete
9     1       0.0%   stone;concrete
10    1       0.0%   basalt
11    1       0.0%   quartz_blanc
12    1       0.0%   granit
13    1       0.0%   andesite