saturn

/home/coolhand/html/datavis/data_trove/entertainment/gaming/recommendations_metadata.json 1 rows sample n=1 seed 42 2026-06-21T23:41:17+00:00

Overview

Source/home/coolhand/html/datavis/data_trove/entertainment/gaming/recommendations_metadata.json
Total rows1
Profiled sample1
Columns6
Generated2026-06-21T23:41:17+00:00
Show data table
Per-column null rate across the corpus.
columnkindnull %
dataset_namecategorical0.0%
last_updatedcategorical0.0%
sourcecategorical0.0%
record_countnumeric0.0%
fieldsunknown0.0%
notescategorical0.0%

Insights opt-in

Model-generated narrative. These are opinions, not facts — the stats below are what saturn measured. Generated by: anthropic:default.

Dataset low anthropic:default

This is a single-row metadata descriptor for the 'Steam Game Recommendations' dataset, last updated 2025-01-20 — it is a catalog entry rather than the underlying data itself. The key takeaway is the scale of what it describes: 41.1 million Steam user reviews stored in a 1.9 GB file, sourced likely via Kaggle or SteamSpy. The metadata notes that the full dataset links to companion files (games.csv and users.csv) via app_id and user_id, and includes playtime and helpfulness metrics — making those join keys the most important fields to validate before any analysis. Analysts should treat this file as a data dictionary and move quickly to the referenced source files for substantive exploration.

dataset_name high anthropic:default

This column is a dataset-level metadata tag identifying the source dataset, with every single row (n=1) carrying the value 'Steam Game Recommendations'. Cardinality is 1 and entropy is 0.0, meaning the column is entirely constant and carries zero information. The 'long_tail' and 'imbalance' alerts are triggered mechanically by the extreme top_rate of 1.0, but are not meaningful here — this is simply a constant label.

last_updated high anthropic:default

This column is a metadata timestamp recording when each record was last updated. With only 1 row in the dataset and a single value of '2025-01-20' holding a top_rate of 1.0, the column is entirely constant — there is zero variance. The alerts for long_tail and imbalance are technically correct but vacuous given the single-row dataset; no meaningful distribution analysis is possible.

notes high anthropic:default

This column is a dataset-level metadata note, not a real data column — its single value is a documentation string describing the broader dataset (41.1 million Steam reviews, file size, join keys, and available metrics). With n=1, cardinality=1, and top_rate=1.0, it carries zero analytical signal and is purely an artifact of how the dataset profile was constructed. The entropy of 0.0 confirms there is no variation whatsoever.

source high anthropic:default

This column records the data provenance/source attribution for the dataset, and contains exactly one unique value across all rows: 'Steam Store user reviews (likely via Kaggle or SteamSpy)'. With cardinality of 1, entropy of 0.0, and a top_rate of 1.0, it is a constant column carrying zero discriminative information. The alerts for long_tail and imbalance are technically triggered but are trivially explained by the single-value nature of the column.

record_count high anthropic:default

This column appears to be a summary or metadata field recording total row count for a dataset or batch, with a single observed value of 41,154,794. The dataset profile contains only 1 row (n=1), meaning this column is a scalar summary rather than a per-record attribute. It is flagged as 'constant' with zero variance, zero IQR, and min/max/mean/median all equal to 41,154,794.0. There is no analytical signal here — it carries no discriminative power and exists purely as a metadata annotation.

fields low anthropic:default

This column ('fields') contains only a single row and was skipped by the profiler, yielding no distributional statistics. With n=1 and no type inference completed, essentially nothing can be determined about its content or role. The absence of nulls is the only positive signal available.

dataset_name categorical

1 singleton categories top value is 100.0% of rows
rows1
null0 (0.0%)
unique1
top_valueSteam Game Recommendations
top_rate1.000
cardinality1
entropy-0.000
entropy_ratio0.000
Show data table
Top values for dataset_name (1 unique shown, of 1 total).
valuecountshare
Steam Game Recommendations1100.0%
Top values (rank 1–20)
  1. Steam Game Recommendations — 1

last_updated categorical

1 singleton categories top value is 100.0% of rows
rows1
null0 (0.0%)
unique1
top_value2025-01-20
top_rate1.000
cardinality1
entropy-0.000
entropy_ratio0.000
Show data table
Top values for last_updated (1 unique shown, of 1 total).
valuecountshare
2025-01-201100.0%
Top values (rank 1–20)
  1. 2025-01-20 — 1

source categorical

1 singleton categories top value is 100.0% of rows
rows1
null0 (0.0%)
unique1
top_valueSteam Store user reviews (likely via Kaggle or SteamSpy)
top_rate1.000
cardinality1
entropy-0.000
entropy_ratio0.000
Show data table
Top values for source (1 unique shown, of 1 total).
valuecountshare
Steam Store user reviews (likely via Kaggle or SteamSpy)1100.0%
Top values (rank 1–20)
  1. Steam Store user reviews (likely via Kaggle or SteamSpy) — 1

record_count numeric

only one distinct value
rows1
null0 (0.0%)
unique1
min41,154,794
max41,154,794
mean41,154,794
median41,154,794
std0.000
q141,154,794
q341,154,794
iqr0.000
skew0.000
kurtosis0.000
n_outliers0
outlier_rate0.000
zero_rate0.000
Show data table
Histogram bins for record_count (median: 41154794.0).
bincount
4.115e+07 – 4.115e+070
4.115e+07 – 4.115e+070
4.115e+07 – 4.115e+071
4.115e+07 – 4.115e+070
4.115e+07 – 4.115e+070

fields unknown

no profiler for kind=unknown
rows1
null0 (0.0%)

notes categorical

1 singleton categories top value is 100.0% of rows
rows1
null0 (0.0%)
unique1
top_value41.1 million Steam user reviews/recommendations. File is 1.9 GB. Links to games.csv via app_id and to users.csv via user_id. Includes playtime and helpfulness metrics.
top_rate1.000
cardinality1
entropy-0.000
entropy_ratio0.000
Show data table
Top values for notes (1 unique shown, of 1 total).
valuecountshare
41.1 million Steam user reviews/recommendations. File is 1.9 GB. Links to games.csv via app_id and to users.csv via user_id. Includes playtime and helpfulness metrics.1100.0%
Top values (rank 1–20)
  1. 41.1 million Steam user reviews/recommendations. File is 1.9 GB. Links to games.csv via app_id and to users.csv via user_id. Includes playtime and helpfulness metrics. — 1