saturn

/home/coolhand/html/datavis/data_trove/data/quirky/carnivorous_plants_real.json 610 rows sample n=610 seed 42 2026-06-22T00:47:38+00:00

Overview

Source/home/coolhand/html/datavis/data_trove/data/quirky/carnivorous_plants_real.json
Total rows610
Profiled sample610
Columns14
Generated2026-06-22T00:47:38+00:00
Show data table
Per-column null rate across the corpus.
columnkindnull %
scientificNamecategorical0.0%
speciescategorical0.0%
genuscategorical0.0%
familycategorical0.0%
latitudenumeric0.0%
longitudenumeric0.0%
countrycategorical0.0%
stateProvincecategorical0.0%
localitycategorical0.0%
basisOfRecordcategorical0.0%
yearnumeric0.0%
monthnumeric0.2%
coordinateUncertaintynumeric22.6%
gbifIDcategorical0.0%

Insights opt-in

Model-generated narrative. These are opinions, not facts — the stats below are what saturn measured. Generated by: anthropic:default.

Dataset high anthropic:default

This is a GBIF biodiversity occurrence dataset with 610 records spanning 14 columns, covering observations and preserved specimens of organisms across 35 countries, primarily recorded between 2021 and 2026. Despite the filename suggesting carnivorous plants, the dataset actually mixes three distinct taxonomic families — Hesperiidae (skippers/butterflies), Canellaceae (spice plants), and Araceae — each contributing roughly 300, 300, and 10 records respectively, which is a notable data-quality curiosity worth investigating. The dominant species is Canella winterana with 174 records (28.5%), and the US, Mexico, Brazil, and Guadeloupe together account for nearly half of all country-level records. Coordinate uncertainty is severely skewed and problematic: the median is just 35 metres but the max reaches 766,917 metres, with 91 outliers and a 23% null rate, meaning spatial analyses should treat location precision with caution.

coordinateUncertainty high anthropic:default

This column represents geographic coordinate uncertainty (likely in meters), a standard biodiversity/occurrence record field indicating the spatial precision of a location fix. Two signals are striking: the distribution is extremely right-skewed (skew=17.3, kurtosis=335.7) with a median of just 35m but a max of 766,917m (~767km), and 91 outliers (19.3% of rows) drive a mean of 6,463m versus a median of 35m — suggesting a mix of GPS-precise records and very coarse or placeholder uncertainty values. Additionally, 22.6% of values are null, meaning nearly a quarter of records carry no uncertainty estimate at all.

gbifID high anthropic:default

This column is the GBIF (Global Biodiversity Information Facility) occurrence identifier, a globally unique numeric key assigned to biodiversity occurrence records. With 610 rows, 610 unique values, a null rate of 0.0, and an entropy ratio of essentially 1.0, every record has a distinct ID — perfect identifier behaviour. The top frequency of 0.001639 (1 occurrence each) confirms zero duplication. Values appear to be sequential large integers in the 5937748xxx range, consistent with GBIF's ID scheme.

locality high anthropic:default

This column records detailed collection locality descriptions for what appears to be a biological specimen or herbarium dataset, likely from Madagascar (French-language administrative hierarchy: Région, District, Commune, Fokontany) with at least one Brazilian entry ('Serra da Farinha-seca'). The dominant signal is that 92.3% of the 610 rows (563 records) have an empty string, rendering the column nearly uninformative for most records. Among the 29 distinct non-empty values, each of the populated entries appears only 2–3 times, confirming the long-tail alert and suggesting these are free-text verbatim field notes rather than a controlled vocabulary.

scientificName high anthropic:default

This column contains Linnaean binomial scientific names for biological taxa, including both plant species (e.g., 'Canella winterana (L.) Gaertn.') and insect species (e.g., 'Hylephila phyleus (Drury, 1773)'), suggesting a mixed-taxon dataset. The dominant value 'Canella winterana (L.) Gaertn.' accounts for 28.5% of all 610 rows, which is strikingly disproportionate given 157 unique names, and the long-tail alert confirms the remaining mass is spread thinly across many species. Entropy ratio of 0.756 indicates moderate but uneven spread—far from uniform distribution.

latitude high anthropic:default

This column contains geographic latitude values, ranging from -43.25° (southern hemisphere, around southern South America or South Africa) to 46.70° (northern hemisphere, around central Europe or northern US). The distribution is notably platykurtic (kurtosis -1.28) with a wide IQR of 47.75 degrees, indicating near-uniform spread across a broad swath of the globe rather than clustering around any particular region. The mean (5.20°) and median (17.01°) diverge by ~12 degrees, and the slight negative skew suggests a modest tail toward southern latitudes. With 466 unique values out of 610 rows and zero nulls, this is a real-valued geographic coordinate with moderate but not near-unique cardinality.

longitude high anthropic:default

This column contains geographic longitude coordinates, with values spanning from -115.04 to 153.39 degrees — a range that covers locations across the Americas, Europe/Africa, and into the Pacific/Oceania region. The mean (-32.94) and median (-63.06) diverge substantially, and the IQR of 120.21 degrees is very wide, indicating the dataset captures globally dispersed locations rather than a single region. A skew of 1.18 suggests a tail toward positive (eastern) longitudes, meaning most records cluster in western longitudes but some pull toward Oceania (e.g., the max of 153.39 is consistent with eastern Australia). With 467 unique values out of 610 rows and zero nulls, coordinates appear mostly distinct but not fully unique, suggesting some location reuse.

year high anthropic:default

This column represents a calendar year dimension spanning 2021–2026, with only 6 distinct values across 610 rows — confirming it is a low-cardinality temporal grouping field rather than a continuous numeric. The distribution is left-skewed (skew = –0.80) with both median and max at 2026, meaning the majority of records are concentrated in the most recent year. The presence of 2026 data (a future or current year at time of writing) alongside 2021 suggests multi-year longitudinal coverage, with recent years — especially 2026 — heavily over-represented.

month high anthropic:default

This column represents calendar month, encoded as an integer from 1 to 12 with exactly 12 unique values and near-zero nulls (0.16%). The distribution is notably right-skewed (skew ≈ 1.0) with a median of 1.0 and Q1 also at 1.0, meaning at least 25% of rows are coded as January — suggesting strong concentration in early months rather than a uniform seasonal spread. The mean of 3.75 versus median of 1.0 confirms this heavy front-loading, which would surprise an analyst expecting roughly uniform monthly representation.

basisOfRecord high anthropic:default

This column is a Darwin Core 'basisOfRecord' field classifying how each biodiversity occurrence was recorded, with only two valid categories present. The distribution is heavily skewed: HUMAN_OBSERVATION dominates at 90.2% (550 of 610 records), while PRESERVED_SPECIMEN accounts for just 9.8% (60 records). The extreme imbalance between field observations and museum/herbarium specimens may affect any model trained on this feature, as PRESERVED_SPECIMEN is a near-minority class. No nulls are present, and the low entropy (0.464) confirms the near-uniform dominance of one category.

country high anthropic:default

This column captures country of origin or location for 610 records, spanning 35 distinct countries with no nulls. The United States dominates at 21.3% (130 records), followed by Mexico (73) and Brazil (51), but the presence of Guadeloupe (48) and Madagascar (40) as top-tier entries is surprising for a dataset this size, suggesting a specific thematic or biological focus (e.g., species occurrences, field surveys) rather than a general population sample. Entropy ratio of 0.77 indicates reasonably broad distribution across countries, though the top-heavy US share introduces mild imbalance.

family high anthropic:default

This column contains biological family-level taxonomic names, with exactly 3 distinct values across 610 non-null rows. Notably, two families — Hesperiidae (butterflies) and Canellaceae (flowering plants) — each appear exactly 300 times, suggesting a deliberately balanced dataset pairing two taxonomic groups, while Araceae (another plant family) appears only 10 times, likely as a minor addition or control group. The near-perfect split between an animal family and a plant family in equal counts is unusual and warrants investigation into dataset construction intent.

genus high anthropic:default

This column contains biological genus names, functioning as a taxonomic label for 610 specimens spanning 94 distinct genera. The dominant genus 'Canella' accounts for 28.5% of all rows (174 occurrences), creating notable imbalance — the top four genera alone (Canella, Warburgia, Cinnamosma, Cinnamodendron) appear to be plant genera (order Canellales), while lower-ranked entries like Urbanus, Hylephila, Burnsius, and Pyrgus are butterfly genera (Hesperiidae), suggesting the dataset may mix taxonomic kingdoms or represent a cross-taxon study. Entropy ratio of 0.74 indicates moderate concentration despite 94 unique values.

species high anthropic:default

This column records biological taxonomic names, appearing to be species-level identifiers (binomial nomenclature) though it also contains family-level names such as 'Droseraceae' and 'Sarraceniaceae', indicating inconsistent taxonomic rank across rows. The dominant value 'Canella winterana' accounts for 28.5% of all 610 rows (174 occurrences), which is a striking concentration given 123 unique values and an entropy ratio of 0.74 suggesting moderate-to-high diversity otherwise. The mix of species binomials (plants and insects, e.g. butterflies like 'Hylephila phyleus') alongside family names is a data quality concern that should be flagged before modelling.

stateProvince high anthropic:default

This column captures state or province-level geographic designations, with 108 distinct values across 610 rows and no nulls. The top value is 'Texas' (13.3% of rows), but the mix of US states, Mexican states (Nayarit, Sinaloa), Australian states (Queensland), South African provinces (KwaZulu-Natal), Brazilian states (Santa Catarina), and a French Caribbean municipality (Pointe-à-Pitre) confirms this is an international dataset — not US-only. Notably, 30 rows (≈4.9%) contain an empty string rather than a true null, and 11 rows are coded as the catch-all 'Other', both of which will need handling. The high entropy ratio (0.82) reflects genuine geographic spread rather than a dominated distribution.

Numeric correlation

Show data table
Pearson correlation across 5 numeric columns (values clipped to 2 decimals).
latitudelongitudeyearmonthcoordinateUncertainty
latitude+1.00-0.74+0.14-0.18-0.10
longitude-0.74+1.00+0.00+0.06+0.01
year+0.14+0.00+1.00-0.65-0.17
month-0.18+0.06-0.65+1.00+0.16
coordinateUncertainty-0.10+0.01-0.17+0.16+1.00

scientificName categorical

81 singleton categories
rows610
null0 (0.0%)
unique157
top_valueCanella winterana (L.) Gaertn.
top_rate0.285
cardinality157
entropy5.517
entropy_ratio0.756
Show data table
Top values for scientificName (20 unique shown, of 157 total).
valuecountshare
Canella winterana (L.) Gaertn.17428.5%
Warburgia salutaris (Bertol.fil.) Chiov.355.7%
Cinnamodendron dinisii Schwacke203.3%
Hylephila phyleus (Drury, 1773)183.0%
Cinnamosma Baill.172.8%
Cinnamosma fragrans Baill.142.3%
Ocybadistes walkeri Heron, 1894111.8%
Cinnamodendron occhionianum F.Barros & J.Salazar101.6%
Pinellia fujianensis H.Li & G.H.Zhu101.6%
Urbanus proteus (Linnaeus, 1758)81.3%
Cinnamosma madagascariensis Danguy81.3%
Warburgia ugandensis Sprague81.3%
Lerodea eufala (Edwards, 1869)71.1%
Burnsius albezens Grishin, 202271.1%
Lerema Scudder, 187271.1%
Quasimellana eulogius (Plötz, 1882)61.0%
Spicauda procne (Plötz, 1880)61.0%
Burnsius oileus (Linnaeus, 1767)50.8%
Burnsius orcynoides50.8%
Cephrenes augiades (Felder, 1860)50.8%
Top values (rank 1–20)
  1. Canella winterana (L.) Gaertn. — 174
  2. Warburgia salutaris (Bertol.fil.) Chiov. — 35
  3. Cinnamodendron dinisii Schwacke — 20
  4. Hylephila phyleus (Drury, 1773) — 18
  5. Cinnamosma Baill. — 17
  6. Cinnamosma fragrans Baill. — 14
  7. Ocybadistes walkeri Heron, 1894 — 11
  8. Cinnamodendron occhionianum F.Barros & J.Salazar — 10
  9. Pinellia fujianensis H.Li & G.H.Zhu — 10
  10. Urbanus proteus (Linnaeus, 1758) — 8
  11. Cinnamosma madagascariensis Danguy — 8
  12. Warburgia ugandensis Sprague — 8
  13. Lerodea eufala (Edwards, 1869) — 7
  14. Burnsius albezens Grishin, 2022 — 7
  15. Lerema Scudder, 1872 — 7
  16. Quasimellana eulogius (Plötz, 1882) — 6
  17. Spicauda procne (Plötz, 1880) — 6
  18. Burnsius oileus (Linnaeus, 1767) — 5
  19. Burnsius orcynoides — 5
  20. Cephrenes augiades (Felder, 1860) — 5

species categorical

rows610
null0 (0.0%)
unique123
top_valueCanella winterana
top_rate0.285
cardinality123
entropy5.144
entropy_ratio0.741
Show data table
Top values for species (20 unique shown, of 123 total).
valuecountshare
Canella winterana17428.5%
Droseraceae386.2%
Warburgia salutaris355.7%
Cinnamodendron dinisii203.3%
Hylephila phyleus193.1%
Sarraceniaceae193.1%
Cinnamosma fragrans142.3%
Ocybadistes walkeri111.8%
Urbanus dorantes101.6%
Cinnamodendron occhionianum101.6%
Pinellia fujianensis101.6%
Pyrgus oileus91.5%
Cinnamosma madagascariensis91.5%
Mellana eulogius81.3%
Urbanus proteus81.3%
Warburgia ugandensis81.3%
Urbanus procne71.1%
Lerodea eufala71.1%
Burnsius albezens71.1%
Gorgythion begga61.0%
Top values (rank 1–20)
  1. Canella winterana — 174
  2. Droseraceae — 38
  3. Warburgia salutaris — 35
  4. Cinnamodendron dinisii — 20
  5. Hylephila phyleus — 19
  6. Sarraceniaceae — 19
  7. Cinnamosma fragrans — 14
  8. Ocybadistes walkeri — 11
  9. Urbanus dorantes — 10
  10. Cinnamodendron occhionianum — 10
  11. Pinellia fujianensis — 10
  12. Pyrgus oileus — 9
  13. Cinnamosma madagascariensis — 9
  14. Mellana eulogius — 8
  15. Urbanus proteus — 8
  16. Warburgia ugandensis — 8
  17. Urbanus procne — 7
  18. Lerodea eufala — 7
  19. Burnsius albezens — 7
  20. Gorgythion begga — 6

genus categorical

rows610
null0 (0.0%)
unique94
top_valueCanella
top_rate0.285
cardinality94
entropy4.840
entropy_ratio0.738
Show data table
Top values for genus (20 unique shown, of 94 total).
valuecountshare
Canella17428.5%
Warburgia437.0%
Cinnamosma406.6%
Cinnamodendron345.6%
Urbanus254.1%
Hylephila193.1%
Burnsius162.6%
Pyrgus111.8%
Lerema111.8%
Ocybadistes111.8%
Pinellia101.6%
Mellana81.3%
Trapezites81.3%
Heliopetes71.1%
Lerodea71.1%
Toxidia71.1%
Pleodendron71.1%
Staphylus61.0%
Gorgythion61.0%
Polites61.0%
Top values (rank 1–20)
  1. Canella — 174
  2. Warburgia — 43
  3. Cinnamosma — 40
  4. Cinnamodendron — 34
  5. Urbanus — 25
  6. Hylephila — 19
  7. Burnsius — 16
  8. Pyrgus — 11
  9. Lerema — 11
  10. Ocybadistes — 11
  11. Pinellia — 10
  12. Mellana — 8
  13. Trapezites — 8
  14. Heliopetes — 7
  15. Lerodea — 7
  16. Toxidia — 7
  17. Pleodendron — 7
  18. Staphylus — 6
  19. Gorgythion — 6
  20. Polites — 6

family categorical

rows610
null0 (0.0%)
unique3
top_valueHesperiidae
top_rate0.492
cardinality3
entropy1.104
entropy_ratio0.697
Show data table
Top values for family (3 unique shown, of 3 total).
valuecountshare
Hesperiidae30049.2%
Canellaceae30049.2%
Araceae101.6%
Top values (rank 1–20)
  1. Hesperiidae — 300
  2. Canellaceae — 300
  3. Araceae — 10

latitude numeric

rows610
null0 (0.0%)
unique466
min-43.246
max46.705
mean5.200
median17.008
std22.746
q1-22.922
q324.826
iqr47.748
skew-0.652
kurtosis-1.283
n_outliers0
outlier_rate0.000
zero_rate0.000
Show data table
Histogram bins for latitude (median: 17.008014000000003).
bincount
-43.25 – -39.51
-39.5 – -35.757
-35.75 – -3222
-32 – -28.2538
-28.25 – -24.5165
-24.51 – -20.7632
-20.76 – -17.0112
-17.01 – -13.2612
-13.26 – -9.5146
-9.514 – -5.7662
-5.766 – -2.0191
-2.019 – 1.72918
1.729 – 5.4773
5.477 – 9.22517
9.225 – 12.979
12.97 – 16.7250
16.72 – 20.4798
20.47 – 24.2249
24.22 – 27.97145
27.97 – 31.7120
31.71 – 35.462
35.46 – 39.210
39.21 – 42.960
42.96 – 46.71

longitude numeric

rows610
null0 (0.0%)
unique467
min-115.044
max153.391
mean-32.935
median-63.056
std78.931
q1-89.369
q330.839
iqr120.208
skew1.184
kurtosis0.084
n_outliers0
outlier_rate0.000
zero_rate0.000
Show data table
Histogram bins for longitude (median: -63.056028).
bincount
-115 – -103.961
-103.9 – -92.6791
-92.67 – -81.4916
-81.49 – -70.3184
-70.31 – -59.12128
-59.12 – -47.9444
-47.94 – -36.7512
-36.75 – -25.570
-25.57 – -14.380
-14.38 – -3.1960
-3.196 – 7.9890
7.989 – 19.173
19.17 – 30.3612
30.36 – 41.5438
41.54 – 52.7340
52.73 – 63.910
63.91 – 75.10
75.1 – 86.282
86.28 – 97.470
97.47 – 108.714
108.7 – 119.817
119.8 – 1311
131 – 142.26
142.2 – 153.441

country categorical

rows610
null0 (0.0%)
unique35
top_valueUnited States of America
top_rate0.213
cardinality35
entropy3.961
entropy_ratio0.772
Show data table
Top values for country (20 unique shown, of 35 total).
valuecountshare
United States of America13021.3%
Mexico7312.0%
Brazil518.4%
Guadeloupe487.9%
Australia477.7%
South Africa416.7%
Madagascar406.6%
Puerto Rico376.1%
Dominican Republic162.6%
Panama152.5%
Argentina142.3%
Singapore101.6%
Cayman Islands101.6%
Antigua and Barbuda101.6%
China101.6%
Virgin Islands (U.S.)81.3%
Kenya81.3%
Hong Kong61.0%
Costa Rica50.8%
Sint Maarten (Dutch part)40.7%
Top values (rank 1–20)
  1. United States of America — 130
  2. Mexico — 73
  3. Brazil — 51
  4. Guadeloupe — 48
  5. Australia — 47
  6. South Africa — 41
  7. Madagascar — 40
  8. Puerto Rico — 37
  9. Dominican Republic — 16
  10. Panama — 15
  11. Argentina — 14
  12. Singapore — 10
  13. Cayman Islands — 10
  14. Antigua and Barbuda — 10
  15. China — 10
  16. Virgin Islands (U.S.) — 8
  17. Kenya — 8
  18. Hong Kong — 6
  19. Costa Rica — 5
  20. Sint Maarten (Dutch part) — 4

stateProvince categorical

rows610
null0 (0.0%)
unique108
top_valueTexas
top_rate0.133
cardinality108
entropy5.530
entropy_ratio0.819
Show data table
Top values for stateProvince (20 unique shown, of 108 total).
valuecountshare
Texas8113.3%
Florida467.5%
Pointe-à-Pitre355.7%
Nayarit335.4%
304.9%
Sinaloa264.3%
Queensland243.9%
KwaZulu-Natal172.8%
Santa Catarina162.6%
Other111.8%
Rio Grande do Sul111.8%
Mpumalanga101.6%
Mahajanga101.6%
Fujian101.6%
Cabo Rojo91.5%
Limpopo91.5%
Toliara91.5%
New South Wales81.3%
Fajardo81.3%
Paraná81.3%
Top values (rank 1–20)
  1. Texas — 81
  2. Florida — 46
  3. Pointe-à-Pitre — 35
  4. Nayarit — 33
  5. — 30
  6. Sinaloa — 26
  7. Queensland — 24
  8. KwaZulu-Natal — 17
  9. Santa Catarina — 16
  10. Other — 11
  11. Rio Grande do Sul — 11
  12. Mpumalanga — 10
  13. Mahajanga — 10
  14. Fujian — 10
  15. Cabo Rojo — 9
  16. Limpopo — 9
  17. Toliara — 9
  18. New South Wales — 8
  19. Fajardo — 8
  20. Paraná — 8

locality categorical

17 singleton categories
rows610
null0 (0.0%)
unique29
top_value
top_rate0.923
cardinality29
entropy0.747
entropy_ratio0.154
Show data table
Top values for locality (20 unique shown, of 29 total).
valuecountshare
56392.3%
District de Soanierana Ivongo, Commune de Manompana, Fokontany de Vohijiny, Village d'Ambohitsara. Forêt littorale de Sahavalanina, au Sud-Est d'Ambohitsara.30.5%
District Mahabo, Commune Analamisandy,Fokontany Soazato, Forêt d'Azohy. Collectés avec: Ando, Tefy, Cécile, Jean Michel. Échantillon préservé en l'alcool.30.5%
Région Vatovavy, Kianjavato, Ambodifandramanana, Ankarabo, vestige de forêt au sud du Mt Vatovavy. Echantillons préservés en alcool, récoltés avec équipe polisinala (Auguste, Jean Frédéric).30.5%
Antsiranana, SAVA, District de Vohémar, Commune rurale d'Antsirabe-nord, Fokontany d'Andravinambo, foret d'Antsolatra Marojala Sokitra. Plantes préservées en alcool, récoltées avec Bezanaka Jean Honoré.30.5%
Région Sofia, District de Mandritsara, commune rurale Marotandrano, fokontany Antsiatsiaka. Foret de Bezavona à 2 km à l'Est du village d'Antsiatsiaka, foret humide sempervirente de moyenne altitude sur latérite. Avec Raharimanana Théo, Ranaivoson Ernest, Marojery Réné chef FKT, Traravola, Rabemalaza Justin, Risy guides locaux.30.5%
Distrit Sakaraha Commune Rurale Amboronabo Fokontany Mitia village Belambo Collecté avec Mamomjy, Tariha, Rehary30.5%
District Sakaraha, Commune rurale Amboronabo, Fokotany Mitia-Est. Forêt de Herea, au Nord d'Analavelona, sur sable. Hameau le plus proche Belambo.30.5%
District Vaingaindrano, Commnune Tsianofana, Fokontany Abaronga, localité Andasibe . Forêt humide de la nouvelle aire protégée d' Agnakatrika. Collecté avec Iakily Armand.30.5%
Serra da Farinha-seca, encosta do Morro Sete.20.3%
Serra da Graciosa. Encosta próxima ao Recanto Bela Vista.20.3%
Estância do Meio.20.3%
UTM25_32T_0600_515010.2%
Parque Estadual da Serra da Baitaca, proximidades da Cachoeira do Samambaia.10.2%
Parque Estadual da Serra da Baitaca,10.2%
Ca. 700 m al sur de San Francisco de San Isidro, costado sur (del parqueo sur) de la escuela Golden Valley. Remanentes de bosque muy húmedo, en cafetales, casas, potreros y finca de Hammel y Pérez por el Río Tures.10.2%
Estação de Tratamento de Água Piraí (ETA Piraí)10.2%
Alto Benedito.10.2%
Comfloresta.10.2%
Sítio Barcelos. Área de PRAD. Propriedade de Vilmar de Lima Barcelos.10.2%
Top values (rank 1–20)
  1. — 563
  2. District de Soanierana Ivongo, Commune de Manompana, Fokontany de Vohijiny, Village d'Ambohitsara. Forêt littorale de Sahavalanina, au Sud-Est d'Ambohitsara. — 3
  3. District Mahabo, Commune Analamisandy,Fokontany Soazato, Forêt d'Azohy. Collectés avec: Ando, Tefy, Cécile, Jean Michel. Échantillon préservé en l'alcool. — 3
  4. Région Vatovavy, Kianjavato, Ambodifandramanana, Ankarabo, vestige de forêt au sud du Mt Vatovavy. Echantillons préservés en alcool, récoltés avec équipe polisinala (Auguste, Jean Frédéric). — 3
  5. Antsiranana, SAVA, District de Vohémar, Commune rurale d'Antsirabe-nord, Fokontany d'Andravinambo, foret d'Antsolatra Marojala Sokitra. Plantes préservées en alcool, récoltées avec Bezanaka Jean Honoré. — 3
  6. Région Sofia, District de Mandritsara, commune rurale Marotandrano, fokontany Antsiatsiaka. Foret de Bezavona à 2 km à l'Est du village d'Antsiatsiaka, foret humide sempervirente de moyenne altitude sur latérite. Avec Raharimanana Théo, Ranaivoson Ernest, Marojery Réné chef FKT, Traravola, Rabemalaza Justin, Risy guides locaux. — 3
  7. Distrit Sakaraha Commune Rurale Amboronabo Fokontany Mitia village Belambo Collecté avec Mamomjy, Tariha, Rehary — 3
  8. District Sakaraha, Commune rurale Amboronabo, Fokotany Mitia-Est. Forêt de Herea, au Nord d'Analavelona, sur sable. Hameau le plus proche Belambo. — 3
  9. District Vaingaindrano, Commnune Tsianofana, Fokontany Abaronga, localité Andasibe . Forêt humide de la nouvelle aire protégée d' Agnakatrika. Collecté avec Iakily Armand. — 3
  10. Serra da Farinha-seca, encosta do Morro Sete. — 2
  11. Serra da Graciosa. Encosta próxima ao Recanto Bela Vista. — 2
  12. Estância do Meio. — 2
  13. UTM25_32T_0600_5150 — 1
  14. Parque Estadual da Serra da Baitaca, proximidades da Cachoeira do Samambaia. — 1
  15. Parque Estadual da Serra da Baitaca, — 1
  16. Ca. 700 m al sur de San Francisco de San Isidro, costado sur (del parqueo sur) de la escuela Golden Valley. Remanentes de bosque muy húmedo, en cafetales, casas, potreros y finca de Hammel y Pérez por el Río Tures. — 1
  17. Estação de Tratamento de Água Piraí (ETA Piraí) — 1
  18. Alto Benedito. — 1
  19. Comfloresta. — 1
  20. Sítio Barcelos. Área de PRAD. Propriedade de Vilmar de Lima Barcelos. — 1

basisOfRecord categorical

rows610
null0 (0.0%)
unique2
top_valueHUMAN_OBSERVATION
top_rate0.902
cardinality2
entropy0.464
entropy_ratio0.464
Show data table
Top values for basisOfRecord (2 unique shown, of 2 total).
valuecountshare
HUMAN_OBSERVATION55090.2%
PRESERVED_SPECIMEN609.8%
Top values (rank 1–20)
  1. HUMAN_OBSERVATION — 550
  2. PRESERVED_SPECIMEN — 60

year numeric

rows610
null0 (0.0%)
unique6
min2,021
max2,026
mean2,025
median2,026
std1.503
q12,024
q32,026
iqr2.000
skew-0.797
kurtosis-0.793
n_outliers0
outlier_rate0.000
zero_rate0.000
Show data table
Histogram bins for year (median: 2026.0).
bincount
2021 – 20217
2021 – 20210
2021 – 20220
2022 – 20220
2022 – 202270
2022 – 20220
2022 – 20220
2022 – 20230
2023 – 20230
2023 – 202371
2023 – 20230
2023 – 20240
2024 – 20240
2024 – 20240
2024 – 202479
2024 – 20240
2024 – 20250
2025 – 20250
2025 – 20250
2025 – 202575
2025 – 20250
2025 – 20260
2026 – 20260
2026 – 2026308

month numeric

rows610
null1 (0.2%)
unique12
min1.000
max12.000
mean3.752
median1.000
std3.750
q11.000
q37.000
iqr6.000
skew1.002
kurtosis-0.508
n_outliers0
outlier_rate0.000
zero_rate0.000
Show data table
Histogram bins for month (median: 1.0).
bincount
1 – 1.458342
1.458 – 1.9170
1.917 – 2.37518
2.375 – 2.8330
2.833 – 3.29224
3.292 – 3.750
3.75 – 4.20824
4.208 – 4.6670
4.667 – 5.12521
5.125 – 5.5830
5.583 – 6.04217
6.042 – 6.50
6.5 – 6.9580
6.958 – 7.41742
7.417 – 7.8750
7.875 – 8.33323
8.333 – 8.7920
8.792 – 9.2515
9.25 – 9.7080
9.708 – 10.1724
10.17 – 10.620
10.62 – 11.0829
11.08 – 11.540
11.54 – 1230

coordinateUncertainty numeric

22.6% null skew=+17.30 19.3% rows beyond 1.5 IQR
rows610
null138 (22.6%)
unique151
min1.000
max766,917
mean6,463
median35.000
std38,136
q15.000
q3466.750
iqr461.750
skew17.305
kurtosis335.690
n_outliers91
outlier_rate0.193
zero_rate0.000
Show data table
Histogram bins for coordinateUncertainty (median: 35.0).
bincount
1 – 3.652e+04467
3.652e+04 – 7.304e+042
7.304e+04 – 1.096e+051
1.096e+05 – 1.461e+050
1.461e+05 – 1.826e+050
1.826e+05 – 2.191e+050
2.191e+05 – 2.556e+051
2.556e+05 – 2.922e+050
2.922e+05 – 3.287e+050
3.287e+05 – 3.652e+050
3.652e+05 – 4.017e+050
4.017e+05 – 4.382e+050
4.382e+05 – 4.748e+050
4.748e+05 – 5.113e+050
5.113e+05 – 5.478e+050
5.478e+05 – 5.843e+050
5.843e+05 – 6.208e+050
6.208e+05 – 6.574e+050
6.574e+05 – 6.939e+050
6.939e+05 – 7.304e+050
7.304e+05 – 7.669e+051

gbifID categorical

610 singleton categories
rows610
null0 (0.0%)
unique610
top_value5937748304
top_rate1.64e-03
cardinality610
entropy9.253
entropy_ratio1.000
Show data table
Top values for gbifID (20 unique shown, of 610 total).
valuecountshare
593774830410.2%
593774830810.2%
593774830910.2%
593774831210.2%
593774831610.2%
593774832210.2%
593774832510.2%
593774832710.2%
593774832910.2%
593774833310.2%
593774833510.2%
593774833610.2%
593774833810.2%
593774834210.2%
593774834410.2%
593774835010.2%
593774835210.2%
593774835310.2%
593774836310.2%
593774836910.2%
Top values (rank 1–20)
  1. 5937748304 — 1
  2. 5937748308 — 1
  3. 5937748309 — 1
  4. 5937748312 — 1
  5. 5937748316 — 1
  6. 5937748322 — 1
  7. 5937748325 — 1
  8. 5937748327 — 1
  9. 5937748329 — 1
  10. 5937748333 — 1
  11. 5937748335 — 1
  12. 5937748336 — 1
  13. 5937748338 — 1
  14. 5937748342 — 1
  15. 5937748344 — 1
  16. 5937748350 — 1
  17. 5937748352 — 1
  18. 5937748353 — 1
  19. 5937748363 — 1
  20. 5937748369 — 1