This is a single-row metadata descriptor for the 'Steam Game Recommendations' dataset, last updated 2025-01-20 — it is a catalog entry rather than the underlying data itself. The key takeaway is the scale of what it describes: 41.1 million Steam user reviews stored in a 1.9 GB file, sourced likely via Kaggle or SteamSpy. The metadata notes that the full dataset links to companion files (games.csv and users.csv) via app_id and user_id, and includes playtime and helpfulness metrics — making those join keys the most important fields to validate before any analysis. Analysts should treat this file as a data dictionary and move quickly to the referenced source files for substantive exploration.
saturn
/home/coolhand/html/datavis/data_trove/entertainment/gaming/recommendations_metadata.json 1 rows sample n=1 seed 42 2026-06-21T23:41:17+00:00
Overview
| Source | /home/coolhand/html/datavis/data_trove/entertainment/gaming/recommendations_metadata.json |
| Total rows | 1 |
| Profiled sample | 1 |
| Columns | 6 |
| Generated | 2026-06-21T23:41:17+00:00 |
Show data table
| column | kind | null % |
|---|---|---|
| dataset_name | categorical | 0.0% |
| last_updated | categorical | 0.0% |
| source | categorical | 0.0% |
| record_count | numeric | 0.0% |
| fields | unknown | 0.0% |
| notes | categorical | 0.0% |
Insights opt-in
Model-generated narrative. These are opinions, not facts — the stats below are what saturn measured. Generated by: anthropic:default.
This column is a dataset-level metadata tag identifying the source dataset, with every single row (n=1) carrying the value 'Steam Game Recommendations'. Cardinality is 1 and entropy is 0.0, meaning the column is entirely constant and carries zero information. The 'long_tail' and 'imbalance' alerts are triggered mechanically by the extreme top_rate of 1.0, but are not meaningful here — this is simply a constant label.
This column is a metadata timestamp recording when each record was last updated. With only 1 row in the dataset and a single value of '2025-01-20' holding a top_rate of 1.0, the column is entirely constant — there is zero variance. The alerts for long_tail and imbalance are technically correct but vacuous given the single-row dataset; no meaningful distribution analysis is possible.
This column is a dataset-level metadata note, not a real data column — its single value is a documentation string describing the broader dataset (41.1 million Steam reviews, file size, join keys, and available metrics). With n=1, cardinality=1, and top_rate=1.0, it carries zero analytical signal and is purely an artifact of how the dataset profile was constructed. The entropy of 0.0 confirms there is no variation whatsoever.
This column records the data provenance/source attribution for the dataset, and contains exactly one unique value across all rows: 'Steam Store user reviews (likely via Kaggle or SteamSpy)'. With cardinality of 1, entropy of 0.0, and a top_rate of 1.0, it is a constant column carrying zero discriminative information. The alerts for long_tail and imbalance are technically triggered but are trivially explained by the single-value nature of the column.
This column appears to be a summary or metadata field recording total row count for a dataset or batch, with a single observed value of 41,154,794. The dataset profile contains only 1 row (n=1), meaning this column is a scalar summary rather than a per-record attribute. It is flagged as 'constant' with zero variance, zero IQR, and min/max/mean/median all equal to 41,154,794.0. There is no analytical signal here — it carries no discriminative power and exists purely as a metadata annotation.
This column ('fields') contains only a single row and was skipped by the profiler, yielding no distributional statistics. With n=1 and no type inference completed, essentially nothing can be determined about its content or role. The absence of nulls is the only positive signal available.
dataset_name categorical
Show data table
| value | count | share |
|---|---|---|
| Steam Game Recommendations | 1 | 100.0% |
Top values (rank 1–20)
- Steam Game Recommendations — 1
last_updated categorical
Show data table
| value | count | share |
|---|---|---|
| 2025-01-20 | 1 | 100.0% |
Top values (rank 1–20)
- 2025-01-20 — 1
source categorical
Show data table
| value | count | share |
|---|---|---|
| Steam Store user reviews (likely via Kaggle or SteamSpy) | 1 | 100.0% |
Top values (rank 1–20)
- Steam Store user reviews (likely via Kaggle or SteamSpy) — 1
record_count numeric
Show data table
| bin | count |
|---|---|
| 4.115e+07 – 4.115e+07 | 0 |
| 4.115e+07 – 4.115e+07 | 0 |
| 4.115e+07 – 4.115e+07 | 1 |
| 4.115e+07 – 4.115e+07 | 0 |
| 4.115e+07 – 4.115e+07 | 0 |
fields unknown
notes categorical
Show data table
| value | count | share |
|---|---|---|
| 41.1 million Steam user reviews/recommendations. File is 1.9 GB. Links to games.csv via app_id and to users.csv via user_id. Includes playtime and helpfulness metrics. | 1 | 100.0% |
Top values (rank 1–20)
- 41.1 million Steam user reviews/recommendations. File is 1.9 GB. Links to games.csv via app_id and to users.csv via user_id. Includes playtime and helpfulness metrics. — 1