This is a single-row metadata record describing the Steam Users dataset, a collection of 14,306,064 Steam user profiles sourced from Steam Store data (likely via Kaggle or SteamSpy) and last updated on 2025-01-20. Rather than being an analytical dataset itself, it serves as a data catalogue entry pointing analysts toward the actual user data file (185 MB) which links to a recommendations.csv via user_id. The most important thing to note is the scale: over 14 million user profiles covering library size and review activity represent a substantial analytical resource. Before diving in, analysts should locate and join the referenced recommendations.csv to unlock the full relational value of this dataset.
saturn
/home/coolhand/html/datavis/data_trove/entertainment/gaming/users_metadata.json 1 rows sample n=1 seed 42 2026-06-21T23:42:05+00:00
Overview
| Source | /home/coolhand/html/datavis/data_trove/entertainment/gaming/users_metadata.json |
| Total rows | 1 |
| Profiled sample | 1 |
| Columns | 6 |
| Generated | 2026-06-21T23:42:05+00:00 |
Show data table
| column | kind | null % |
|---|---|---|
| dataset_name | categorical | 0.0% |
| last_updated | categorical | 0.0% |
| source | categorical | 0.0% |
| record_count | numeric | 0.0% |
| fields | unknown | 0.0% |
| notes | categorical | 0.0% |
Insights opt-in
Model-generated narrative. These are opinions, not facts — the stats below are what saturn measured. Generated by: anthropic:default.
This column is a dataset-level identifier or metadata tag indicating the source dataset, with every row labelled 'Steam Users'. With only 1 row and 1 unique value, the column carries zero entropy (0.0) and a top_rate of 1.0 — it is a constant and provides no discriminative information. The long_tail and imbalance alerts are technically correct but trivially explained by the single-row, single-value nature of the data.
This column is a timestamp or date field indicating when a record was last updated, stored as a categorical string. The dataset contains only a single row (n=1), and that row holds the value '2025-01-20', giving a cardinality of 1 and top_rate of 1.0. With only one observation, no distributional insight is possible; the 'long_tail' and 'imbalance' alerts are artefacts of the trivial sample size rather than meaningful signals.
This column is a dataset-level metadata note, not a real data column — it contains a single static string describing the dataset itself (14.3 million Steam user profiles, file size 185 MB, join key user_id). With n=1 and cardinality=1, it appears to be a singleton annotation row or a schema-level descriptor accidentally included in the profiled data. The entropy of 0.0 and top_rate of 1.0 confirm it carries zero analytical signal.
This column records the data provenance or source attribution for the dataset, with every single row carrying the identical value 'Steam Store user data (likely via Kaggle or SteamSpy)'. With n=1, cardinality=1, entropy=0.0, and top_rate=1.0, there is zero variance whatsoever — this is a constant column. It adds no analytical signal and likely exists as a metadata annotation injected during data collection or curation.
This column is a record count field, almost certainly a metadata scalar reporting the total row count of a source dataset — here fixed at 14,306,064 across all rows. With n=1, n_unique=1, and a constant value equal to mean, min, and max, there is zero variance; saturn has flagged it as 'constant'. This is not a feature or target but a summary statistic embedded in the dataset, likely from an ETL or export header row.
This column contains only a single row and was skipped by the profiler, yielding no distributional statistics. With n=1 and no uniqueness or type information available, no meaningful inference about its content or role is possible beyond the fact that it is non-null. The 'unknown' kind designation and empty stats block indicate the profiler could not parse or classify the value.
dataset_name categorical
Show data table
| value | count | share |
|---|---|---|
| Steam Users | 1 | 100.0% |
Top values (rank 1–20)
- Steam Users — 1
last_updated categorical
Show data table
| value | count | share |
|---|---|---|
| 2025-01-20 | 1 | 100.0% |
Top values (rank 1–20)
- 2025-01-20 — 1
source categorical
Show data table
| value | count | share |
|---|---|---|
| Steam Store user data (likely via Kaggle or SteamSpy) | 1 | 100.0% |
Top values (rank 1–20)
- Steam Store user data (likely via Kaggle or SteamSpy) — 1
record_count numeric
Show data table
| bin | count |
|---|---|
| 1.431e+07 – 1.431e+07 | 0 |
| 1.431e+07 – 1.431e+07 | 0 |
| 1.431e+07 – 1.431e+07 | 1 |
| 1.431e+07 – 1.431e+07 | 0 |
| 1.431e+07 – 1.431e+07 | 0 |
fields unknown
notes categorical
Show data table
| value | count | share |
|---|---|---|
| 14.3 million Steam user profiles with library size and review activity. File is 185 MB. Links to recommendations.csv via user_id. | 1 | 100.0% |
Top values (rank 1–20)
- 14.3 million Steam user profiles with library size and review activity. File is 185 MB. Links to recommendations.csv via user_id. — 1