saturn

/home/coolhand/html/datavis/data_trove/entertainment/film/bond_girls.csv 71 rows sample n=71 seed 42 2026-06-21T23:42:45+00:00

Overview

Source/home/coolhand/html/datavis/data_trove/entertainment/film/bond_girls.csv
Total rows71
Profiled sample71
Columns11
Generated2026-06-21T23:42:45+00:00
Show data table
Per-column null rate across the corpus.
columnkindnull %
bond_girl_namecategorical0.0%
actress_agenumeric0.0%
film_titlecategorical0.0%
film_release_yearnumeric0.0%
bond_actorcategorical0.0%
bond_actor_agenumeric0.0%
director_namecategorical0.0%
box_office_actual_$numeric0.0%
box_office_adjusted_2005_$numeric0.0%
budget_actual_$categorical0.0%
budget_adjusted_2005_$categorical0.0%

Insights opt-in

Model-generated narrative. These are opinions, not facts — the stats below are what saturn measured. Generated by: anthropic:default.

Dataset high anthropic:default

This dataset covers 71 records of Bond Girls across 25 James Bond films, tracking each actress alongside financial performance, ages, directors, and Bond actors. The most striking signal is the age disparity: Bond girls average 28.9 years old while Bond actors average 43.1 years — a gap of roughly 14 years that is remarkably consistent across the franchise. Box office figures show strong right skew with outliers (actual revenue ranges from $59.5M to $1,108.6M, with a mean well above the median), suggesting a handful of films massively outperformed the rest. Sean Connery dominates the Bond actor column with 23 appearances, and Guy Hamilton is the most prolific director with 14 entries, both worth examining against box office performance.

box_office_actual_$ high anthropic:default

This column records actual box office revenue (likely in millions of dollars) for 71 films, with only 25 unique values suggesting heavy rounding or binning of financial figures. The distribution is strongly right-skewed (skew = 1.92, kurtosis = 3.55): the median is $156.2M while the mean is pulled to $265.4M, and the max reaches $1,108.6M — nearly 5× the mean — with 4 flagged outliers (5.6% of rows) driving this spread. The IQR of $231.55M against a std of $235.3M confirms that a small number of blockbusters are distorting the central tendency.

box_office_adjusted_2005_$ medium anthropic:default

This column represents inflation-adjusted box office revenue (in millions of 2005 USD) for a small set of films or releases (n=71). Striking is the very low cardinality — only 24 unique values across 71 rows — suggesting heavy value repetition or bucketing, which is unusual for a continuous financial metric. The distribution is moderately right-skewed (skew=0.81) with a high outlier rate of 26.8% (19 of 71 observations), and values range from $250.9M to $943.5M, indicating this dataset likely covers only commercially successful titles.

bond_girl_name high anthropic:default

This column contains the names of Bond girls (female characters/actresses from the James Bond film franchise), serving as a near-unique label per row across 71 records. Cardinality is 70 out of 71 rows, with Léa Seydoux being the only duplicate (appearing twice, likely reflecting her appearances in both Spectre and No Time to Die). Entropy ratio of 0.999 confirms the distribution is almost perfectly uniform — every other name appears exactly once, triggering the long-tail alert despite the dataset being small.

actress_age high anthropic:default

This column records the age of an actress at the time of some event (likely a film role or award), spanning 21 to 51 years across 71 records. With only 20 unique values despite 71 rows, ages are heavily repeated, suggesting a small cast or repeated appearances by the same actresses. The distribution is moderately right-skewed (skew 0.97) with a single outlier at 51, while the bulk of values cluster between 24 and 33 (IQR = 9.0) around a mean of ~28.9 — consistent with known industry bias toward younger actresses.

bond_actor_age high anthropic:default

This column records the age of the actor playing James Bond at the time of each film, spanning 71 observations (likely one per film or scene) across only 21 unique values — consistent with a small roster of actors each appearing in multiple entries. The distribution is remarkably symmetric and platykurtic (kurtosis −1.02), ranging from 30 to 58 with a mean and median both near 43, suggesting Bond actors tend to be cast and retained through their late-30s to late-40s with no strong skew toward younger or older talent.

film_release_year high anthropic:default

This column records the theatrical release year of films, spanning 1962 to 2021 across 71 rows. With only 25 unique values across 71 records, many films share a release year, suggesting clustering around certain production eras. The distribution is mildly right-skewed (skew 0.60) with a median of 1979 and mean of 1983, indicating the dataset leans toward older films, though at least one entry extends to 2021. The IQR of 30 years (1967–1997) confirms the bulk of titles fall in a mid-20th-to-late-20th-century range.

bond_actor high anthropic:default

This column identifies which actor portrayed James Bond in each record, covering all 6 canonical Bond actors across 71 rows. Sean Connery leads with 23 appearances (32.4% of records), followed by Roger Moore (19) and Pierce Brosnan (12); George Lazenby and Timothy Dalton are notably underrepresented with only 3 records each, reflecting their shorter tenures. The entropy ratio of 0.88 indicates a moderately uneven but not extreme distribution. No nulls are present and cardinality is perfectly clean.

budget_actual_$ medium anthropic:default

This column represents actual budget figures in dollars (likely millions), stored as strings rather than a numeric type — a likely ingestion or schema issue. With 23 unique values across 71 rows and an entropy ratio of 0.971, the distribution is nearly uniform, meaning no single budget tier dominates heavily (top value '7.0' appears only 8 times, ~11.3% of rows). Values range from small figures like '2.0' up to '34.0', suggesting a wide spread of project or department budgets. The categorical typing of a clearly numeric column is the primary surprise and will break any quantitative analysis.

budget_adjusted_2005_$ high anthropic:default

This column represents inflation-adjusted budget figures (in millions of 2005 USD), but it has been stored as a categorical type rather than numeric — a likely ingestion or typing error. With 71 rows and 25 unique values, cardinality is moderate, but an entropy ratio of 0.984 indicates near-uniform distribution with no dominant value (top value '41.9' appears only 5 times, a 7% top_rate). The decimal-formatted string values (e.g., '41.9', '91.5') confirm these are numeric quantities masquerading as categories.

director_name high anthropic:default

This column contains the names of film directors, almost certainly for the James Bond franchise — the 12 unique directors across 71 records are an exact match for the known Bond film series. Guy Hamilton leads with 14 films, followed by John Glen (11), Terence Young (10), and Lewis Gilbert (10), accounting for the bulk of the series. The entropy ratio of 0.917 is surprisingly high for only 12 categories, indicating the distribution is relatively flat rather than dominated by one director. No nulls are present.

film_title high anthropic:default

This column contains film titles from the James Bond franchise, covering at least 25 distinct films across 71 rows — indicating each film appears multiple times (averaging ~2.8 rows per title). The distribution is remarkably flat: the top value 'Thunderball' appears only 5 times (7.04% top_rate) and entropy_ratio is 0.984, meaning titles are spread almost uniformly across the dataset, suggesting rows likely represent individual scenes, characters, songs, or some other per-film sub-records rather than one row per film.

Numeric correlation

Show data table
Pearson correlation across 5 numeric columns (values clipped to 2 decimals).
actress_agefilm_release_yearbond_actor_agebox_office_actual_$box_office_adjusted_2005_$
actress_age+1.00+0.31+0.29+0.26-0.08
film_release_year+0.31+1.00+0.50+0.85-0.12
bond_actor_age+0.29+0.50+1.00+0.25-0.39
box_office_actual_$+0.26+0.85+0.25+1.00+0.31
box_office_adjusted_2005_$-0.08-0.12-0.39+0.31+1.00

bond_girl_name categorical

69 singleton categories
rows71
null0 (0.0%)
unique70
top_valueLéa Seydoux
top_rate0.028
cardinality70
entropy6.122
entropy_ratio0.999
Show data table
Top values for bond_girl_name (20 unique shown, of 70 total).
valuecountshare
Léa Seydoux22.8%
Ursula Andress11.4%
Zena Marshall11.4%
Eunice Gayson11.4%
Daniela Bianchi11.4%
Martine Beswick11.4%
Aliza Gur11.4%
Honor Blackman11.4%
Shirley Eaton11.4%
Tania Mallet11.4%
Nadja Regin11.4%
Margaret Nolan11.4%
Claudine Auger11.4%
Luciana Paluzzi11.4%
Molly Peters11.4%
Maryse Guy Mitsouko11.4%
Mie Hama11.4%
Akiko Wakabayashi11.4%
Tsai Chin11.4%
Karin Dor11.4%
Top values (rank 1–20)
  1. Léa Seydoux — 2
  2. Ursula Andress — 1
  3. Zena Marshall — 1
  4. Eunice Gayson — 1
  5. Daniela Bianchi — 1
  6. Martine Beswick — 1
  7. Aliza Gur — 1
  8. Honor Blackman — 1
  9. Shirley Eaton — 1
  10. Tania Mallet — 1
  11. Nadja Regin — 1
  12. Margaret Nolan — 1
  13. Claudine Auger — 1
  14. Luciana Paluzzi — 1
  15. Molly Peters — 1
  16. Maryse Guy Mitsouko — 1
  17. Mie Hama — 1
  18. Akiko Wakabayashi — 1
  19. Tsai Chin — 1
  20. Karin Dor — 1

actress_age numeric

rows71
null0 (0.0%)
unique20
min21.000
max51.000
mean28.915
median28.000
std5.547
q124.000
q333.000
iqr9.000
skew0.970
kurtosis1.794
n_outliers1
outlier_rate0.014
zero_rate0.000
Show data table
Histogram bins for actress_age (median: 28.0).
bincount
21 – 24.7521
24.75 – 28.515
28.5 – 32.2515
32.25 – 3611
36 – 39.758
39.75 – 43.50
43.5 – 47.250
47.25 – 511

film_title categorical

rows71
null0 (0.0%)
unique25
top_valueThunderball
top_rate0.070
cardinality25
entropy4.568
entropy_ratio0.984
Show data table
Top values for film_title (20 unique shown, of 25 total).
valuecountshare
Thunderball57.0%
Goldfinger57.0%
You Only Live Twice45.6%
Diamonds Are Forever45.6%
From Russia with Love34.2%
On Her Majesty's Secret Service34.2%
Live and Let Die34.2%
The Spy Who Loved Me34.2%
Moonraker34.2%
For Your Eyes Only34.2%
A View to a Kill34.2%
GoldenEye34.2%
Tomorrow Never Dies34.2%
The World Is Not Enough34.2%
Die Another Day34.2%
No Time to Die34.2%
Dr. No22.8%
The Man with the Golden Gun22.8%
Octopussy22.8%
Licence to Kill22.8%
Top values (rank 1–20)
  1. Thunderball — 5
  2. Goldfinger — 5
  3. You Only Live Twice — 4
  4. Diamonds Are Forever — 4
  5. From Russia with Love — 3
  6. On Her Majesty's Secret Service — 3
  7. Live and Let Die — 3
  8. The Spy Who Loved Me — 3
  9. Moonraker — 3
  10. For Your Eyes Only — 3
  11. A View to a Kill — 3
  12. GoldenEye — 3
  13. Tomorrow Never Dies — 3
  14. The World Is Not Enough — 3
  15. Die Another Day — 3
  16. No Time to Die — 3
  17. Dr. No — 2
  18. The Man with the Golden Gun — 2
  19. Octopussy — 2
  20. Licence to Kill — 2

film_release_year numeric

rows71
null0 (0.0%)
unique25
min1,962
max2,021
mean1,983
median1,979
std17.592
q11,967
q31,997
iqr30.000
skew0.601
kurtosis-0.841
n_outliers0
outlier_rate0.000
zero_rate0.000
Show data table
Histogram bins for film_release_year (median: 1979.0).
bincount
1962 – 196922
1969 – 19779
1977 – 198411
1984 – 19926
1992 – 19996
1999 – 20068
2006 – 20144
2014 – 20215

bond_actor categorical

rows71
null0 (0.0%)
unique6
top_valueSean Connery
top_rate0.324
cardinality6
entropy2.272
entropy_ratio0.879
Show data table
Top values for bond_actor (6 unique shown, of 6 total).
valuecountshare
Sean Connery2332.4%
Roger Moore1926.8%
Pierce Brosnan1216.9%
Daniel Craig1115.5%
George Lazenby34.2%
Timothy Dalton34.2%
Top values (rank 1–20)
  1. Sean Connery — 23
  2. Roger Moore — 19
  3. Pierce Brosnan — 12
  4. Daniel Craig — 11
  5. George Lazenby — 3
  6. Timothy Dalton — 3

bond_actor_age numeric

rows71
null0 (0.0%)
unique21
min30.000
max58.000
mean43.141
median43.000
std7.842
q136.000
q349.000
iqr13.000
skew0.129
kurtosis-1.018
n_outliers0
outlier_rate0.000
zero_rate0.000
Show data table
Histogram bins for bond_actor_age (median: 43.0).
bincount
30 – 33.58
33.5 – 3710
37 – 40.58
40.5 – 4410
44 – 47.515
47.5 – 516
51 – 54.59
54.5 – 585

director_name categorical

rows71
null0 (0.0%)
unique12
top_valueGuy Hamilton
top_rate0.197
cardinality12
entropy3.288
entropy_ratio0.917
Show data table
Top values for director_name (12 unique shown, of 12 total).
valuecountshare
Guy Hamilton1419.7%
John Glen1115.5%
Terence Young1014.1%
Lewis Gilbert1014.1%
Martin Campbell57.0%
Sam Mendes45.6%
Peter R. Hunt34.2%
Roger Spottiswoode34.2%
Michael Apted34.2%
Lee Tamahori34.2%
Cary Joji Fukunaga34.2%
Marc Forster22.8%
Top values (rank 1–20)
  1. Guy Hamilton — 14
  2. John Glen — 11
  3. Terence Young — 10
  4. Lewis Gilbert — 10
  5. Martin Campbell — 5
  6. Sam Mendes — 4
  7. Peter R. Hunt — 3
  8. Roger Spottiswoode — 3
  9. Michael Apted — 3
  10. Lee Tamahori — 3
  11. Cary Joji Fukunaga — 3
  12. Marc Forster — 2

box_office_actual_$ numeric

5.6% rows beyond 1.5 IQR
rows71
null0 (0.0%)
unique25
min59.500
max1,109
mean265.368
median156.200
std235.306
q1120.450
q3352.000
iqr231.550
skew1.923
kurtosis3.550
n_outliers4
outlier_rate0.056
zero_rate0.000
Show data table
Histogram bins for box_office_actual_$ (median: 156.2).
bincount
59.5 – 190.641
190.6 – 321.87
321.8 – 452.912
452.9 – 5843
584 – 715.24
715.2 – 846.30
846.3 – 977.52
977.5 – 11092

box_office_adjusted_2005_$ numeric

26.8% rows beyond 1.5 IQR
rows71
null0 (0.0%)
unique24
min250.900
max943.500
mean520.532
median465.400
std178.268
q1439.500
q3543.800
iqr104.300
skew0.811
kurtosis-0.107
n_outliers19
outlier_rate0.268
zero_rate0.000
Show data table
Histogram bins for box_office_adjusted_2005_$ (median: 465.4).
bincount
250.9 – 337.511
337.5 – 424.15
424.1 – 510.621
510.6 – 597.220
597.2 – 683.80
683.8 – 770.42
770.4 – 856.910
856.9 – 943.52

budget_actual_$ categorical

rows71
null0 (0.0%)
unique23
top_value7.0
top_rate0.113
cardinality23
entropy4.392
entropy_ratio0.971
Show data table
Top values for budget_actual_$ (20 unique shown, of 23 total).
valuecountshare
7.0811.3%
6.857.0%
3.057.0%
10.345.6%
7.245.6%
2.034.2%
14.034.2%
34.034.2%
28.034.2%
30.034.2%
60.034.2%
110.034.2%
135.034.2%
142.034.2%
250.0–301.034.2%
1.122.8%
27.522.8%
36.022.8%
150.022.8%
200.022.8%
Top values (rank 1–20)
  1. 7.0 — 8
  2. 6.8 — 5
  3. 3.0 — 5
  4. 10.3 — 4
  5. 7.2 — 4
  6. 2.0 — 3
  7. 14.0 — 3
  8. 34.0 — 3
  9. 28.0 — 3
  10. 30.0 — 3
  11. 60.0 — 3
  12. 110.0 — 3
  13. 135.0 — 3
  14. 142.0 — 3
  15. 250.0–301.0 — 3
  16. 1.1 — 2
  17. 27.5 — 2
  18. 36.0 — 2
  19. 150.0 — 2
  20. 200.0 — 2

budget_adjusted_2005_$ categorical

rows71
null0 (0.0%)
unique25
top_value41.9
top_rate0.070
cardinality25
entropy4.568
entropy_ratio0.984
Show data table
Top values for budget_adjusted_2005_$ (20 unique shown, of 25 total).
valuecountshare
41.957.0%
18.657.0%
59.945.6%
34.745.6%
12.634.2%
37.334.2%
30.834.2%
45.134.2%
91.534.2%
60.234.2%
54.534.2%
76.934.2%
133.934.2%
158.334.2%
154.234.2%
188.7–226.434.2%
7.022.8%
27.722.8%
53.922.8%
56.722.8%
Top values (rank 1–20)
  1. 41.9 — 5
  2. 18.6 — 5
  3. 59.9 — 4
  4. 34.7 — 4
  5. 12.6 — 3
  6. 37.3 — 3
  7. 30.8 — 3
  8. 45.1 — 3
  9. 91.5 — 3
  10. 60.2 — 3
  11. 54.5 — 3
  12. 76.9 — 3
  13. 133.9 — 3
  14. 158.3 — 3
  15. 154.2 — 3
  16. 188.7–226.4 — 3
  17. 7.0 — 2
  18. 27.7 — 2
  19. 53.9 — 2
  20. 56.7 — 2