saturn

/home/coolhand/html/datavis/data_trove/entertainment/gaming/enriched/games.csv 122,611 rows sample n=122,611 seed 42 2026-06-21T23:36:51+00:00

Overview

Source/home/coolhand/html/datavis/data_trove/entertainment/gaming/enriched/games.csv
Total rows122,611
Profiled sample122,611
Columns40
Generated2026-06-21T23:36:51+00:00
Show data table
Per-column null rate across the corpus.
columnkindnull %
column00numeric0.0%
column01text0.0%
column02text0.0%
column03categorical0.0%
column04numeric0.0%
column05numeric0.0%
column06numeric0.0%
column07numeric0.0%
column08numeric0.0%
column09text6.9%
column10text0.0%
column11text0.0%
column12text90.2%
column13text0.1%
column14text59.5%
column15text55.8%
column16text18.1%
column17categorical0.0%
column18categorical0.0%
column19categorical0.0%
column20numeric0.0%
column21text96.5%
column22numeric0.0%
column23numeric0.0%
column24numeric0.0%
column25numeric100.0%
column26numeric0.0%
column27numeric0.0%
column28text81.7%
column29numeric0.0%
column30numeric0.0%
column31numeric0.0%
column32numeric0.0%
column33text6.9%
column34text7.2%
column35text7.3%
column36text6.9%
column37text32.0%
column38text4.9%
column39unknown0.0%

Insights opt-in

Model-generated narrative. These are opinions, not facts — the stats below are what saturn measured. Generated by: anthropic:default.

Dataset high anthropic:default

This is a Steam games catalogue with 122,611 rows and 40 columns, covering titles, publishers, developers, genres, pricing, review counts, and associated URLs. The most important thing to examine first is the extreme skew across nearly all numeric engagement columns (column23, column24, column27, column29, column31): medians sit at 0–5 while means run into the hundreds or thousands, meaning a tiny fraction of blockbuster titles account for the vast majority of reviews and activity. A second area worth attention is genre distribution (column36), where just a handful of Casual/Indie/Action combinations account for the bulk of the catalogue, and the estimated owner-count banding (column03) shows over 61% of games have fewer than 20,000 owners — pointing to a long-tail market dominated by low-visibility titles.

column15 high anthropic:default

This column is a support/contact URL field — almost certainly a developer or publisher support link associated with game or software records. 95.6% of non-null values are URLs, and the one-word rate is 99.9%, consistent with bare URL strings. Two surprises stand out: the null rate is very high at 55.8%, meaning more than half of records lack this URL, and the duplicate rate is 34.7% (18,808 duplicate values out of ~54,200 non-null rows), reflecting that many games share the same support domain (e.g., Big Fish Games, EA, Facebook pages).

column14 high anthropic:default

This column contains publisher or developer website URLs, almost certainly scraped from a Steam or similar games catalogue. Virtually every non-null value is a single URL (one_word_rate 0.9999, url_rate 0.9999), pointing to publisher homepages, Facebook pages, or Steam publisher/group pages. Two signals stand out: 59.48% of rows are null, meaning many game records carry no website; and 20.08% of non-null values are duplicates (9,973 repeated URLs), reflecting publishers with large catalogues who share one website across many titles.

column10 high anthropic:default

This column contains serialized Python lists of language names, representing the supported or available languages for each record (likely a software product or game). The dominant value is `['English']` appearing 55,314 times, with `[]` (no languages listed) in 8,380 rows. The duplicate rate is extremely high at 84.4%, which is expected given the limited vocabulary of 217 unique tokens and only 19,113 unique values across 122,611 rows — the data is stored as raw string-serialized lists rather than a normalized structure, which is a notable preprocessing concern.

column11 high anthropic:default

This column contains serialized Python lists of language names, representing the supported or available languages for each record (likely a software product or media item). The dominant value is '[]' (empty list) appearing 72,730 times — nearly 60% of rows — indicating most records have no language metadata populated. Despite 122,611 rows, only 3,710 unique values exist and the duplicate rate is 96.97%, which is expected for a categorical-list field, but the vocabulary is tiny at just 194 words, confirming a closed set of language names.

column13 high anthropic:default

This column contains Steam CDN URLs pointing to game header images hosted on Akamai's steamstatic.com infrastructure — specifically `header.jpg` assets keyed by Steam app ID. With a url_rate of 1.0 and one_word_rate of 1.0, every single value is a single URL. The column is near-unique (122,420 distinct values out of 122,611 rows), with only 110 duplicates, suggesting these map closely to individual game or product records; the small number of repeated URLs (max frequency 5) likely reflects games appearing in multiple dataset rows.

column38 high anthropic:default

This column contains comma-separated lists of Steam screenshot URLs (Akamai CDN), one packed string per row representing all screenshot images for a given Steam game entry. Every value is technically 'one word' (no spaces) because the URLs are concatenated without whitespace, explaining the paradoxical one_word_rate of 1.0 alongside a mean length of ~1319 characters and a max of 29132. With 116,483 unique values out of 122,611 rows and only 110 duplicates, this is near-unique; the small duplicate count likely reflects games with identical screenshot sets.

column36 high anthropic:default

This column contains comma-separated game genre tags (e.g., 'Casual,Indie', 'Action,Adventure,Indie'), consistent with a Steam or similar game catalog dataset. The duplicate rate is extremely high at 97.5%, reflecting the natural cardinality collapse when games share genre combinations — only 2,894 unique tag-sets exist across 122,611 rows. The top words 'to', 'access', and 'play' suggest some rows contain free-text strings like 'Early Access' or 'Free to Play' mixed into the same field, indicating occasional value pollution worth investigating.

column33 high anthropic:default

This column contains game developer or publisher names, evidenced by top values such as 'Choice of Games', 'KOEI TECMO GAMES CO., LTD.', and dominant vocabulary including 'games', 'studio', 'studios', 'interactive', and 'entertainment'. The duplicate rate of 37.98% (43,364 duplicates across 122,611 rows) is expected — publishers release multiple titles — but the 70,816 unique values and a max length of 584 characters suggest occasional free-text entries or combined multi-publisher strings. The one-word rate of 31.8% and mean word count of ~2 words are consistent with company name formats, though the wide length range (1–584 chars) warrants inspection for outliers.

column34 high anthropic:default

This column contains game publisher or developer company names, as evidenced by top values like 'BFG Entertainment', 'Choice of Games', and 'Strategy First', and top words dominated by 'games', 'studio', 'studios', 'entertainment', and corporate suffixes ('llc', 'inc.', 'ltd.'). The duplicate rate is notably high at 44.9% (51,089 duplicates across 122,611 rows), which is expected since many games share the same publisher. The one-word rate of 31.8% reflects single-token studio names, and the 7.2% null rate warrants attention for records with unknown publishers.

column16 high anthropic:default

This column contains email addresses for game developers or publishers, as evidenced by the top values (e.g., 'info@bigfishgames.com', 'support@quanticlab.com'). Nearly all values (99.86%) are single tokens, consistent with email format. The duplicate rate is high at 39.7% (39,849 duplicates out of 122,611 rows), indicating many records share a contact email — expected for a publisher-level field where one entity owns multiple titles. The null rate of 18.14% is notable and should be investigated for systematic missingness.

column02 high anthropic:default

This column contains dates formatted as 'Mon DD, YYYY' (e.g., 'Oct 23, 2025'), stored as text rather than a native date type. The values span at least 2021–2025 based on top word frequencies, with a striking duplicate rate of 95.86% — 117,530 of 122,611 rows share one of only 5,081 distinct dates, meaning many records map to the same calendar day. The near-constant string length (median 12, min 11, max 12) and vocabulary of just 68 tokens confirm this is a tightly formatted date field with no free-text variation.

column37 high anthropic:default

This column contains comma-separated genre/tag lists for software or game products (e.g., 'Adventure,Casual,Hidden Object', 'Action,Indie'), consistent with a Steam-style app catalog. The null rate of 32.02% is notably high and warrants investigation before modelling. A multilingual alert is raised, but the non-English content is negligible (26 records out of 3,376 detected), suggesting near-uniform English data with minor noise. The duplicate rate of 7.4% (6,167 duplicates) is expected given finite genre combinations across a large catalog.

column28 high anthropic:default

This column contains free-text content warnings or age-rating disclosures for video games, with recurring phrases about mature content, nudity, sexual content, and violence. It is massively sparse — 81.68% of rows are null — meaning most games carry no such warning. The duplicate rate of 17.09% (3,839 duplicates across 18,620 unique values) reflects the use of templated boilerplate warning strings, while a small multilingual signal (2 Chinese, 1 Japanese entries) indicates some non-English publisher submissions. Flesch readability of 44.38 and a median length of 124 characters are consistent with dense legal/disclaimer prose.

column12 high anthropic:default

This column contains substantial free-text descriptions or reviews, most likely about games — the word 'game' is the top non-stopword at 7,882 occurrences, average text length is ~340 characters (~57 words), and the vocabulary spans 61,840 unique tokens. The 90.16% null rate is a major alert: only about 12,000 of 122,611 rows carry any content, meaning this field is sparsely populated. An emoji_rate of ~1.6% and a median Flesch readability score of ~57.8 suggest informal, consumer-written prose. The near_unique flag is partially explained by the sparse population — 11,884 unique values among ~12,000 non-null rows confirms almost every entry is distinct.

column04 high anthropic:default

This column is a heavily zero-inflated numeric field — likely a count, transaction amount, or event frequency — where 83.95% of values are exactly zero and the interquartile range is 0.0, meaning the entire middle half of the distribution is flat at zero. The remaining values are extremely right-skewed (skew = 209.95, kurtosis = 51452.44) with a max of 1,013,936 against a mean of only 54.59, indicating a small number of very large outliers; 16.05% of rows (19,676) are flagged as outliers. The 1,110 unique values and zero null rate suggest this may be a sparse activity or volume metric.

column06 medium anthropic:default

This column likely represents a monetary amount, duration, or rate — a continuous positive measure where most values are small. The distribution is extreme: the median is 2.24 and Q3 is only 5.24, yet the max reaches 999.98, producing a skew of 22.4 and a kurtosis of 1,135. Over 7.5% of rows (9,297) are flagged as outliers, and 21.4% of values are exactly zero, suggesting a two-part structure (zero-inflation plus a heavy-tailed positive component) that would violate standard regression assumptions.

column08 high anthropic:default

This column is a sparse count or event-frequency field: 85.5% of its 122,611 rows are exactly zero, the median and IQR are both 0, yet the mean is 0.55 and the max reaches 3,703. The extreme concentration at zero combined with a skew of 171.8 and kurtosis of 38,359 indicates a heavy-tailed distribution driven by rare but very large values; 14.5% of rows (17,771) are flagged as outliers. Only 117 distinct values across 122,611 rows further suggests this is a discrete count, not a continuous measure.

column23 high anthropic:default

This column is a numeric count or magnitude field — likely representing activity volume, transaction amount, or similar accumulation metric — with 122,611 non-null records and only 5,540 distinct values. The distribution is extraordinarily right-skewed (skew=177.84, kurtosis=45,295.94): the median is just 5.0 while the mean is 1,044.99, and the maximum reaches 7,642,084 — a value roughly 272x the standard deviation above the mean. About 34.5% of values are zero and 17.0% are flagged as outliers (20,797 rows), indicating a heavy zero-inflated tail with extreme rare events dominating the mean.

column24 medium anthropic:default

This column is likely a count or frequency measure (e.g., event occurrences, transaction counts, or interaction tallies) given its non-negative integer-like range and high zero rate. The distribution is extraordinarily right-skewed: the median is 1.0 and Q3 is only 10.0, yet the maximum reaches 1,173,003 — a difference of over six orders of magnitude. With 45% zeros, ~16.9% flagged outliers (20,696 rows), a skew of 156.86, and kurtosis exceeding 30,000, the bulk of records cluster near zero while a small number of extreme values dominate the mean (169.20 vs. median 1.0). This is a severe long-tail distribution that will distort any linear model if used as-is.

column26 high anthropic:default

This column is likely a count or frequency metric (e.g., event occurrences, transaction counts, or tenure in days/months), given its non-negative integer values with only 448 distinct values across 122,611 rows. The distribution is severely right-skewed (skew=32.63, kurtosis=1192.15): the median is just 2.0 while the mean is 18.09, Q1 is 0.0, and the maximum reaches 9,821—an extreme outlier relative to the IQR of 19. Nearly half the rows (48.6%) are zero, and 6.9% are flagged as outliers, signaling a heavy zero-inflated tail that will distort any linear model trained on raw values.

column27 high anthropic:default

This column is a sparse, heavily right-skewed numeric count or amount field — likely representing an event frequency, transaction volume, or similar quantity that is zero for the vast majority of records. 82.9% of the 122,611 rows are exactly zero, the median is 0.0, and the IQR is 0.0, yet the mean is 961.8 and the maximum reaches 4,830,455 — indicating a tiny fraction of extreme values driving nearly all the variance. The skew of 113.9 and kurtosis of 20,874.5 are extraordinary, and 17.1% of rows are flagged as outliers, confirming that the non-zero tail is severely extreme relative to the bulk of the distribution.

column29 high anthropic:default

This column is a heavily zero-inflated count or amount field — 78.7% of its 122,611 rows are exactly zero, and the interquartile range is 0.0, meaning the entire middle 50% of the distribution is zero. Despite a median of 0 and mean of only 208, the max reaches 3,429,544, producing extreme skew (262.89) and kurtosis (75,698), with 21.3% of rows flagged as outliers. This pattern is consistent with a sparse event-count, transaction amount, or usage metric where most entities are inactive but a small tail drives enormous values.

column31 high anthropic:default

This column is a sparse count or activity metric where the overwhelming majority of records (78.7%) are zero, producing a median of 0.0 and an IQR of exactly 0.0. The distribution is extraordinarily right-skewed (skew = 263.99, kurtosis = 76112.44), driven by extreme outliers reaching a max of 3,429,544 against a mean of only 173.57 — indicating a tiny fraction of records carry massive values. Roughly 21.3% of rows (26,119) are flagged as outliers, which is an unusually high outlier rate and signals a power-law or heavy-tailed phenomenon rather than a simple data error.

column35 high anthropic:default

This column contains a comma-delimited list of Steam game features/categories (e.g., 'Single-player', 'Steam Achievements', 'Family Sharing', 'Full controller support'), typical of the Steam store's supported features field per game. The extreme duplicate rate (88.3%, 100,367 of 122,611 rows) is expected because many games share identical feature sets, and the tiny vocabulary size of 589 words confirms a finite, enumerated tag system. The 'da' language detection on 12 rows is almost certainly a false positive from short comma-separated tokens, not actual Danish text. With only 13,291 unique combinations out of 122,611 rows, this column is highly suitable for multi-label binarization.

column17 high anthropic:default

This column is a boolean flag stored as string values ('True'/'False'), covering 122,611 rows with no nulls. It is severely imbalanced: 'True' accounts for 99.964% of rows (122,567 occurrences) while 'False' appears only 44 times. The near-zero entropy (0.0046) confirms the column carries almost no information, making it nearly constant.

column01 medium anthropic:default

This column contains short, near-unique text strings averaging ~3 words and 18 characters, consistent with game or software session/product titles. The dominant top words — 'playtest', 'vr', 'simulator' — strongly suggest these are names of VR game playtesting sessions or titles. Surprising signals include 1,156 duplicates (~0.94% duplicate rate) despite the near-unique alert, a small emoji presence (0.26%), and a maximum length of 413 characters which is anomalously long relative to the median of 16.

column09 high anthropic:default

This column contains long-form natural language text, likely user-generated content such as reviews, product descriptions, or messages — with a mean of 1,297 characters and 214 words per entry, and a vocabulary of 105,903 unique terms. The near-unique alert (113,556 unique values out of 122,611 rows) confirms these are essentially free-text narratives rather than categorical labels. Notably, 4.7% of entries contain emojis, suggesting informal or consumer-facing content, and the max length of 89,665 characters indicates some extreme outliers well beyond the 95th-percentile length of 2,966 characters. Flesch readability mean of 58.7 places the text in a 'fairly easy' register, consistent with consumer writing.

column05 high anthropic:default

This column is a low-cardinality integer count (only 15 distinct values, range 0–21) where 98.96% of rows are exactly zero, making it an extreme sparse count feature — likely recording rare events or occurrences per record. The distribution is severely right-skewed (skew 9.88, kurtosis 96.52) with only 1,272 outlier rows (1.04%) carrying any non-zero signal; the IQR is zero because all three quartiles collapse to 0.

column20 high anthropic:default

This column is a sparse numeric count or score with only 73 distinct values across 122,611 rows, almost certainly representing an event count, frequency, or discrete rating. The distribution is extraordinarily concentrated at zero — 96.5% of values are exactly 0 — with IQR of 0.0 and a median of 0.0, yet the max reaches 97.0, producing extreme positive skew (5.23) and kurtosis (25.75). The 4,256 outlier rows (3.47%) carrying non-zero values likely represent a small active or engaged sub-population, which is the analytically interesting segment.

column22 high anthropic:default

This column is almost certainly a sparse indicator or rare-event count: 99.97% of its 122,611 values are exactly zero, with only 40 flagged outliers and a maximum of 100.0. The 31 unique values and an IQR of 0.0 confirm that the vast majority of rows carry no signal at all. The extreme skew (59.25) and kurtosis (3,627.8) are a direct consequence of this near-total zero mass, making standard continuous modelling inappropriate without transformation or binarisation.

column30 high anthropic:default

This column is a heavily zero-inflated count or amount field: 96.8% of its 122,611 rows are exactly zero, driving a median of 0.0 and an IQR of 0.0. The remaining values are extremely skewed (skew = 51.68, kurtosis = 3252.96), with a mean of 13.79 pulled far right by a maximum of 20,088 — likely representing rare but large events such as transaction amounts, error counts, or penalty values. The 3,898 outliers (3.2% of rows) account for virtually all non-zero variance, which is the defining surprise here.

column32 high anthropic:default

This column is almost certainly a sparse count or occurrence field — likely an event frequency, error count, or similar rare-event tally. The zero_rate of 96.8% means the vast majority of rows have no event, while the remaining ~3.2% drive an extreme right tail (skew=48.9, kurtosis=2848.5) reaching a maximum of 20,088 against a median of 0 and mean of 14.7. The IQR of 0.0 confirms the middle 50% of the distribution is entirely flat at zero, with 3,898 flagged outliers carrying virtually all the variance.

column39 low anthropic:default

This column was skipped by the profiler, so its content and type are entirely unknown. With 122,611 rows, zero nulls, and no computed statistics or uniqueness information, no data-driven characterisation is possible. The 'skipped' alert is the only signal available.

column00 high anthropic:default

This column contains 122,611 numeric values that are all unique, null-free, and span from 10 to 4,264,350 — strongly suggesting it is a unique numeric identifier (e.g., a record ID or transaction number). The distribution is remarkably flat and near-uniform: kurtosis of -1.05, negligible skew of 0.18, and zero detected outliers, which is highly unusual for a natural measurement or feature and is consistent with a sequentially or pseudo-randomly assigned integer key. The IQR of 1,806,385 is close to half the full range, further supporting a uniform spread across the ID space.

column07 high anthropic:default

This column is a bounded numeric score or percentage, ranging from 0 to 100 with only 88 distinct values, suggesting a discretized or rounded measurement (e.g., a completion rate, satisfaction score, or grade). The most striking feature is that 66.8% of values are exactly zero, making the distribution heavily zero-inflated; the median is 0.0 while the mean is 18.35 and Q3 is only 40.0, confirming the mass is concentrated at the floor. Despite the zero inflation, kurtosis is near zero (−0.05), meaning the non-zero portion is roughly flat or uniform across the 0–100 range. Analysts should treat this as a zero-inflated bounded variable rather than a standard continuous feature.

column03 high anthropic:default

This column encodes a numeric quantity as binned range labels — almost certainly an income, revenue, or financial amount bracket given the scale (0 to 10,000,000+) and logarithmically spaced bin edges. The distribution is heavily right-skewed: 61.5% of rows fall in the '0 - 20000' bucket alone, and a notable 21,641 rows sit in '0 - 0', suggesting a zero-value spike that may warrant separate treatment. With only 14 distinct values and zero nulls across 122,611 rows, the encoding is clean but lossy.

column18 high anthropic:default

This column is a binary boolean flag stored as string literals 'True'/'False', with zero nulls across 122,611 rows. The dominant value is 'False' at 82.6% (101,319 occurrences), leaving 'True' at roughly 17.4% (21,292) — a moderately imbalanced split that may matter for classification tasks. The entropy ratio of 0.666 confirms meaningful but uneven information content.

column19 high anthropic:default

This column is a boolean flag stored as string literals 'True'/'False', covering all 122,611 rows with zero nulls. The distribution is heavily skewed: 'False' dominates at 87.2% (106,905 rows) versus 'True' at only 12.8% (15,706 rows). The low entropy of 0.552 confirms the imbalance. An analyst building a classifier on this as a target should anticipate class imbalance requiring resampling or adjusted class weights.

Numeric correlation

Show data table
Pearson correlation across 12 numeric columns (values clipped to 2 decimals).
column00column04column05column06column07column08column20column22column23column24column25column26
column00+1.00+0.19+nan-0.04-0.16-0.05-0.25+nan-0.43-0.49-0.06-0.29
column04+0.19+1.00+nan-0.04-0.06+0.38-0.03+nan+0.11-0.05-0.04-0.04
column05+nan+nan+nan+nan+nan+nan+nan+nan+nan+nan+nan+nan
column06-0.04-0.04+nan+1.00-0.17+0.10-0.08+nan+0.12-0.04-0.27+0.14
column07-0.16-0.06+nan-0.17+1.00+0.09-0.06+nan+0.10+0.26+0.01-0.04
column08-0.05+0.38+nan+0.10+0.09+1.00-0.07+nan+0.19+0.24-0.22+0.20
column20-0.25-0.03+nan-0.08-0.06-0.07+1.00+nan-0.09-0.07+0.20+0.08
column22+nan+nan+nan+nan+nan+nan+nan+nan+nan+nan+nan+nan
column23-0.43+0.11+nan+0.12+0.10+0.19-0.09+nan+1.00+0.71-0.10+0.21
column24-0.49-0.05+nan-0.04+0.26+0.24-0.07+nan+0.71+1.00-0.14+0.12
column25-0.06-0.04+nan-0.27+0.01-0.22+0.20+nan-0.10-0.14+1.00+0.08
column26-0.29-0.04+nan+0.14-0.04+0.20+0.08+nan+0.21+0.12+0.08+1.00

Languages detected

Per-string language detection across text columns (sampled).

Show data table
Per-language counts (total 18,468 detected strings).
langcountshare
en1841999.7%
da120.1%
de100.1%
zh90.0%
ja90.0%
es60.0%
pt10.0%
fr10.0%
ca10.0%

column00 numeric

rows122,611
null0 (0.0%)
unique122,611
min10.000
max4,264,350
mean1,985,386
median1,907,380
std1,087,595
q11,063,175
q32,869,560
iqr1,806,385
skew0.177
kurtosis-1.050
n_outliers0
outlier_rate0.000
zero_rate0.000
Show data table
Histogram bins for column00 (median: 1907380.0).
bincount
10 – 1.066e+051208
1.066e+05 – 2.132e+05280
2.132e+05 – 3.198e+052468
3.198e+05 – 4.264e+053789
4.264e+05 – 5.331e+053626
5.331e+05 – 6.397e+053651
6.397e+05 – 7.463e+054043
7.463e+05 – 8.529e+054024
8.529e+05 – 9.595e+053772
9.595e+05 – 1.066e+063926
1.066e+06 – 1.173e+064091
1.173e+06 – 1.279e+063895
1.279e+06 – 1.386e+063659
1.386e+06 – 1.493e+063825
1.493e+06 – 1.599e+063975
1.599e+06 – 1.706e+063815
1.706e+06 – 1.812e+063854
1.812e+06 – 1.919e+063820
1.919e+06 – 2.026e+063749
2.026e+06 – 2.132e+062991
2.132e+06 – 2.239e+063698
2.239e+06 – 2.345e+063661
2.345e+06 – 2.452e+063593
2.452e+06 – 2.559e+063305
2.559e+06 – 2.665e+063120
2.665e+06 – 2.772e+063232
2.772e+06 – 2.878e+063172
2.878e+06 – 2.985e+063134
2.985e+06 – 3.092e+063098
3.092e+06 – 3.198e+063011
3.198e+06 – 3.305e+062792
3.305e+06 – 3.411e+062880
3.411e+06 – 3.518e+062626
3.518e+06 – 3.625e+062386
3.625e+06 – 3.731e+062314
3.731e+06 – 3.838e+062229
3.838e+06 – 3.945e+062212
3.945e+06 – 4.051e+061778
4.051e+06 – 4.158e+061353
4.158e+06 – 4.264e+06556

column01 text

99.1% of rows are unique strings
rows122,611
null1 (0.0%)
unique121,454
len_min1
len_max413
len_mean18.070
len_median16.000
len_p9537.550
word_mean2.912
word_median3.000
n_empty0
n_duplicates1,156
duplicate_rate9.43e-03
vocab_size18,813
readability_flesch_mean52.874
emoji_rate2.59e-03
url_rate0.000
one_word_rate0.187
allcaps_rate0.067
boilerplate_rate4.08e-05
Show data table
Character-length distribution for column01 (mean: 18.069627273468722).
charscount
1 – 1134028
11 – 2253727
22 – 3222850
32 – 428502
42 – 522281
52 – 63858
63 – 73213
73 – 8369
83 – 9428
94 – 10421
104 – 11413
114 – 1258
125 – 1354
135 – 1453
145 – 1561
156 – 1661
166 – 1760
176 – 1861
186 – 1970
197 – 2070
207 – 2170
217 – 2280
228 – 2380
238 – 2480
248 – 2581
258 – 2690
269 – 2790
279 – 2890
289 – 3000
300 – 3100
310 – 3200
320 – 3310
331 – 3410
341 – 3510
351 – 3620
362 – 3720
372 – 3820
382 – 3920
392 – 4030
403 – 4131
Sample values (first 10)
  1. Ice Cream Fantasy
  2. Cornflower Corbin
  3. AXYOS: Battlecards
  4. Figure Skating Legends
  5. Terra Alia VR: A Multilingual Adventure
  6. The Cursed Tape
  7. Ellie's Travel Diary
  8. Elong Plug
  9. Pike and Shot : Campaigns
  10. Sushi Clicker

column02 text

95th-percentile length under 20 chars 95.9% duplicate strings
rows122,611
null0 (0.0%)
unique5,081
len_min11
len_max12
len_mean11.719
len_median12.000
len_p9512.000
word_mean3.000
word_median3.000
n_empty0
n_duplicates117,530
duplicate_rate0.959
vocab_size68
readability_flesch_mean98.604
emoji_rate0.000
url_rate0.000
one_word_rate0.000
allcaps_rate0.000
boilerplate_rate0.000
Show data table
Character-length distribution for column02 (mean: 11.718671244831214).
charscount
11 – 1134494
11 – 110
11 – 110
11 – 110
11 – 110
11 – 110
11 – 110
11 – 110
11 – 110
11 – 110
11 – 110
11 – 110
11 – 110
11 – 110
11 – 110
11 – 110
11 – 110
11 – 110
11 – 110
11 – 120
12 – 120
12 – 120
12 – 120
12 – 120
12 – 120
12 – 120
12 – 120
12 – 120
12 – 120
12 – 120
12 – 120
12 – 120
12 – 120
12 – 120
12 – 120
12 – 120
12 – 120
12 – 120
12 – 120
12 – 1288117
Sample values (first 10)
  1. Nov 22, 2019
  2. Aug 17, 2017
  3. Aug 13, 2020
  4. Jan 17, 2025
  5. Nov 4, 2024
  6. May 7, 2024
  7. Dec 7, 2020
  8. Feb 24, 2023
  9. Jul 12, 2025
  10. Dec 10, 2022

column03 categorical

rows122,611
null0 (0.0%)
unique14
top_value0 - 20000
top_rate0.615
cardinality14
entropy1.814
entropy_ratio0.476
Show data table
Top values for column03 (14 unique shown, of 14 total).
valuecountshare
0 - 200007540461.5%
0 - 02164117.7%
20000 - 50000113969.3%
50000 - 10000053554.4%
100000 - 20000034542.8%
200000 - 50000028532.3%
500000 - 100000011540.9%
1000000 - 20000007290.6%
2000000 - 50000004050.3%
5000000 - 100000001250.1%
10000000 - 20000000510.0%
20000000 - 50000000310.0%
50000000 - 10000000090.0%
100000000 - 20000000040.0%
Top values (rank 1–20)
  1. 0 - 20000 — 75,404
  2. 0 - 0 — 21,641
  3. 20000 - 50000 — 11,396
  4. 50000 - 100000 — 5,355
  5. 100000 - 200000 — 3,454
  6. 200000 - 500000 — 2,853
  7. 500000 - 1000000 — 1,154
  8. 1000000 - 2000000 — 729
  9. 2000000 - 5000000 — 405
  10. 5000000 - 10000000 — 125
  11. 10000000 - 20000000 — 51
  12. 20000000 - 50000000 — 31
  13. 50000000 - 100000000 — 9
  14. 100000000 - 200000000 — 4

column04 numeric

skew=+209.95 16.0% rows beyond 1.5 IQR
rows122,611
null0 (0.0%)
unique1,110
min0.000
max1,013,936
mean54.593
median0.000
std3,729
q10.000
q30.000
iqr0.000
skew209.951
kurtosis51,452
n_outliers19,676
outlier_rate0.160
zero_rate0.840
Show data table
Histogram bins for column04 (median: 0.0).
bincount
0 – 2.535e+04122573
2.535e+04 – 5.07e+0422
5.07e+04 – 7.605e+046
7.605e+04 – 1.014e+052
1.014e+05 – 1.267e+052
1.267e+05 – 1.521e+051
1.521e+05 – 1.774e+052
1.774e+05 – 2.028e+050
2.028e+05 – 2.281e+050
2.281e+05 – 2.535e+050
2.535e+05 – 2.788e+050
2.788e+05 – 3.042e+050
3.042e+05 – 3.295e+051
3.295e+05 – 3.549e+050
3.549e+05 – 3.802e+050
3.802e+05 – 4.056e+050
4.056e+05 – 4.309e+050
4.309e+05 – 4.563e+050
4.563e+05 – 4.816e+050
4.816e+05 – 5.07e+050
5.07e+05 – 5.323e+050
5.323e+05 – 5.577e+050
5.577e+05 – 5.83e+050
5.83e+05 – 6.084e+050
6.084e+05 – 6.337e+051
6.337e+05 – 6.591e+050
6.591e+05 – 6.844e+050
6.844e+05 – 7.098e+050
7.098e+05 – 7.351e+050
7.351e+05 – 7.605e+050
7.605e+05 – 7.858e+050
7.858e+05 – 8.111e+050
8.111e+05 – 8.365e+050
8.365e+05 – 8.618e+050
8.618e+05 – 8.872e+050
8.872e+05 – 9.125e+050
9.125e+05 – 9.379e+050
9.379e+05 – 9.632e+050
9.632e+05 – 9.886e+050
9.886e+05 – 1.014e+061

column05 numeric

skew=+9.88
rows122,611
null0 (0.0%)
unique15
min0.000
max21.000
mean0.168
median0.000
std1.654
q10.000
q30.000
iqr0.000
skew9.883
kurtosis96.519
n_outliers1,272
outlier_rate0.010
zero_rate0.990
Show data table
Histogram bins for column05 (median: 0.0).
bincount
0 – 0.525121339
0.525 – 1.052
1.05 – 1.5750
1.575 – 2.10
2.1 – 2.6250
2.625 – 3.156
3.15 – 3.6750
3.675 – 4.20
4.2 – 4.7250
4.725 – 5.250
5.25 – 5.7750
5.775 – 6.34
6.3 – 6.8250
6.825 – 7.355
7.35 – 7.8750
7.875 – 8.40
8.4 – 8.9250
8.925 – 9.450
9.45 – 9.9750
9.975 – 10.526
10.5 – 11.030
11.03 – 11.550
11.55 – 12.0823
12.08 – 12.60
12.6 – 13.12175
13.12 – 13.650
13.65 – 14.181
14.18 – 14.70
14.7 – 15.233
15.23 – 15.750
15.75 – 16.2832
16.28 – 16.80
16.8 – 17.32828
17.32 – 17.850
17.85 – 18.38164
18.38 – 18.90
18.9 – 19.430
19.43 – 19.950
19.95 – 20.481
20.48 – 212

column06 numeric

skew=+22.40 7.6% rows beyond 1.5 IQR
rows122,611
null0 (0.0%)
unique941
min0.000
max999.980
mean4.765
median2.240
std12.531
q10.550
q35.240
iqr4.690
skew22.404
kurtosis1,135
n_outliers9,297
outlier_rate0.076
zero_rate0.214
Show data table
Histogram bins for column06 (median: 2.24).
bincount
0 – 25120926
25 – 501081
50 – 75248
75 – 10047
100 – 1256
125 – 15013
150 – 1751
175 – 200282
200 – 2250
225 – 2500
250 – 2751
275 – 3002
300 – 3250
325 – 3500
350 – 3750
375 – 4000
400 – 4250
425 – 4500
450 – 4750
475 – 5000
500 – 5251
525 – 5500
550 – 5750
575 – 6000
600 – 6250
625 – 6500
650 – 6750
675 – 7000
700 – 7250
725 – 7500
750 – 7750
775 – 8000
800 – 8250
825 – 8500
850 – 8750
875 – 9000
900 – 9250
925 – 9500
950 – 9750
975 – 10003

column07 numeric

rows122,611
null0 (0.0%)
unique88
min0.000
max100.000
mean18.354
median0.000
std28.859
q10.000
q340.000
iqr40.000
skew1.220
kurtosis-0.051
n_outliers0
outlier_rate0.000
zero_rate0.668
Show data table
Histogram bins for column07 (median: 0.0).
bincount
0 – 2.581930
2.5 – 50
5 – 7.50
7.5 – 100
10 – 12.5620
12.5 – 1516
15 – 17.5437
17.5 – 2015
20 – 22.52394
22.5 – 25124
25 – 27.51471
27.5 – 3039
30 – 32.52689
32.5 – 35482
35 – 37.5955
37.5 – 4046
40 – 42.52358
42.5 – 4552
45 – 47.5419
47.5 – 5073
50 – 52.59742
52.5 – 5524
55 – 57.5534
57.5 – 6033
60 – 62.52493
62.5 – 6545
65 – 67.51150
67.5 – 70166
70 – 72.53120
72.5 – 7577
75 – 77.52918
77.5 – 80106
80 – 82.53754
82.5 – 85263
85 – 87.51353
87.5 – 90252
90 – 92.52217
92.5 – 9556
95 – 97.5182
97.5 – 1006

column08 numeric

skew=+171.83 14.5% rows beyond 1.5 IQR
rows122,611
null0 (0.0%)
unique117
min0.000
max3,703
mean0.546
median0.000
std14.516
q10.000
q30.000
iqr0.000
skew171.825
kurtosis38,360
n_outliers17,771
outlier_rate0.145
zero_rate0.855
Show data table
Histogram bins for column08 (median: 0.0).
bincount
0 – 92.58122551
92.58 – 185.239
185.2 – 277.76
277.7 – 370.33
370.3 – 462.92
462.9 – 555.50
555.5 – 6480
648 – 740.60
740.6 – 833.25
833.2 – 925.81
925.8 – 10181
1018 – 11111
1111 – 12030
1203 – 12960
1296 – 13890
1389 – 14810
1481 – 15740
1574 – 16660
1666 – 17590
1759 – 18520
1852 – 19440
1944 – 20371
2037 – 21290
2129 – 22220
2222 – 23140
2314 – 24070
2407 – 25000
2500 – 25920
2592 – 26850
2685 – 27770
2777 – 28700
2870 – 29620
2962 – 30550
3055 – 31480
3148 – 32400
3240 – 33330
3333 – 34250
3425 – 35180
3518 – 36100
3610 – 37031

column09 text

99.5% of rows are unique strings
rows122,611
null8,449 (6.9%)
unique113,556
len_min1
len_max89,665
len_mean1,297
len_median1,064
len_p952,966
word_mean214.310
word_median177.000
n_empty0
n_duplicates606
duplicate_rate5.31e-03
vocab_size105,903
readability_flesch_mean58.748
emoji_rate0.047
url_rate3.50e-04
one_word_rate4.03e-04
allcaps_rate7.56e-03
boilerplate_rate0.015
Show data table
Character-length distribution for column09 (mean: 1297.112095092938).
charscount
1 – 2243100879
2243 – 448411956
4484 – 67261006
6726 – 8967201
8967 – 1120966
11209 – 1345122
13451 – 1569214
15692 – 179347
17934 – 201750
20175 – 224173
22417 – 246590
24659 – 269001
26900 – 291420
29142 – 313831
31383 – 336250
33625 – 358670
35867 – 381080
38108 – 403500
40350 – 425910
42591 – 448330
44833 – 470751
47075 – 493160
49316 – 515580
51558 – 537990
53799 – 560410
56041 – 582830
58283 – 605240
60524 – 627661
62766 – 650070
65007 – 672490
67249 – 694911
69491 – 717320
71732 – 739740
73974 – 762150
76215 – 784570
78457 – 806990
80699 – 829400
82940 – 851820
85182 – 874230
87423 – 896653
Sample values (first 10)
  1. Ice Cream Fantasy is a puzzle game, in which all you need is to drag bricks, in order to make the pictures restored. But please remember, this game is aiming to make you relax and enjoy life. After finishing the puzzle you will be able to see the full image in the gallery.
  2. Get ready to charm, sabotage, and survive the most ridiculous speed dating showdown ever.  Will you win love - or get left in the dust? What Players Are Saying: 'I had to bite my tongue to not laugh when I saw your character turn around with that clown nose!' - Realsleepy, Discor…
  3. MARVEL Puzzle Quest is the ultimate Match 3 Super Hero RPG experience! Collect and level up your favorite characters to discover incredible strategic depth like no other Match-3 has to offer! Become a puzzle combat master as you learn to harness superpowers to bend the gem board …
  4. When a person becomes too lazy to work, they will try to simplify even the smallest task. Mankind invented Them - to simplify your life. Mechanoids to perform all of our mundane activities which finally completely replaced humans in the workspace. Creating the perfect workers tur…
  5. Mass Vector is a physics based game of skill and patience designed to be played by all the family. In fact it's really 2 games in one with 50 speed levels and 50 Hazard levels. The Speed levels are against the clock and are designed to reward the fastest players. These levels may…
  6. Light of Mine is one of the scariest experiences you will have in VR this Halloween. Explore an ancient temple where innocent looking statues come to life, but only when your back is turned. Use your candle in the darkened temple to find your way and outwit the horrors within. Yo…
  7. Cat vs Mouse! It's kawaii shoot'em up! Mice were everywhere in Japan in 2020 ...Because it was a mouse year! However, some have pursued mice since mythological times to stop it! He couldn't become the Zodiac because he was fooled by the mouse! A cat has just flew away with a Taiy…
  8. Panda in the clouds is a grid movement platform puzzle game where your challenge will be to reach the flag and accomplish all the level objectives. Find the right path to win. 30 levels with increasing difficulty. Relaxing soundtrack.
  9. In the card game, the tractor is also called :upgrade, double rise. Compared with other single-player upgrade games, this game has high artificial intelligence, and the computer licensing, licensing, has a considerable level. Those who like CARDS must not miss it. In the traditio…
  10. A fast paced explosive experience! This fast paced platformer without jumping (inspired by donkey kong's country barrel sections and downwell!) rewards quick thinking and good reflexes as you blast your way through rocks collecting precious gems to unlock the next sections , game…

column10 text

53.3% rows are a single word 84.4% duplicate strings
rows122,611
null0 (0.0%)
unique19,113
len_min2
len_max1,216
len_mean68.019
len_median11.000
len_p95224.000
word_mean6.889
word_median1.000
n_empty0
n_duplicates103,498
duplicate_rate0.844
vocab_size217
readability_flesch_mean14.071
emoji_rate0.000
url_rate0.000
one_word_rate0.533
allcaps_rate0.000
boilerplate_rate0.000
Show data table
Character-length distribution for column10 (mean: 68.01866064219361).
charscount
2 – 3279054
32 – 6313178
63 – 937232
93 – 1235289
123 – 1545081
154 – 1843884
184 – 2142265
214 – 2451157
245 – 275630
275 – 306318
306 – 336341
336 – 366555
366 – 3971037
397 – 427512
427 – 45749
457 – 48828
488 – 51812
518 – 5485
548 – 5799
579 – 6092
609 – 6390
639 – 6702
670 – 7000
700 – 7302
730 – 7610
761 – 7911
791 – 8212
821 – 8526
852 – 8821
882 – 9124
912 – 9438
943 – 9730
973 – 10041
1004 – 10342
1034 – 10645
1064 – 10951
1095 – 11254
1125 – 11556
1155 – 11861
1186 – 12161927
Sample values (first 10)
  1. ['English']
  2. ['English']
  3. ['English', 'Japanese']
  4. ['English']
  5. ['English', 'French', 'Italian', 'German', 'Spanish - Spain', 'Simplified Chinese', 'Korean', 'Japanese', 'Portuguese - Brazil', 'Russian']
  6. ['English', 'French', 'Italian', 'German', 'Spanish - Spain', 'Arabic', 'Bulgarian', 'Portuguese - Brazil', 'Hungarian', 'Vietnamese', 'Greek', 'Danish', 'Indonesian', 'Spanish - Latin America', 'Traditional Chinese', 'Simplified Chinese', 'Korean', 'Dutch', 'Norwegian', 'Polish'…
  7. ['English', 'Korean', 'Simplified Chinese']
  8. ['English']
  9. ['English']
  10. ['English']

column11 text

81.3% rows are a single word 97.0% duplicate strings
rows122,611
null0 (0.0%)
unique3,710
len_min2
len_max1,216
len_mean24.311
len_median2.000
len_p9546.000
word_mean2.854
word_median1.000
n_empty0
n_duplicates118,901
duplicate_rate0.970
vocab_size194
readability_flesch_mean8.003
emoji_rate0.000
url_rate0.000
one_word_rate0.813
allcaps_rate0.000
boilerplate_rate0.000
Show data table
Character-length distribution for column11 (mean: 24.311350531355263).
charscount
2 – 32110168
32 – 637499
63 – 931015
93 – 123742
123 – 154555
154 – 184422
184 – 214254
214 – 245161
245 – 27575
275 – 30632
306 – 33634
336 – 36685
366 – 397345
397 – 42770
427 – 45710
457 – 48811
488 – 5181
518 – 5481
548 – 5790
579 – 6090
609 – 6390
639 – 6700
670 – 7000
700 – 7301
730 – 7610
761 – 7910
791 – 8210
821 – 8525
852 – 8820
882 – 9120
912 – 9430
943 – 9730
973 – 10040
1004 – 10341
1034 – 10643
1064 – 10950
1095 – 11251
1125 – 11553
1155 – 11860
1186 – 12161117
Sample values (first 10)
  1. ['English']
  2. []
  3. ['English', 'Japanese']
  4. ['English']
  5. ['English', 'French', 'Italian', 'German', 'Spanish - Spain', 'Simplified Chinese', 'Korean', 'Japanese', 'Portuguese - Brazil', 'Russian']
  6. []
  7. ['Korean', 'Simplified Chinese']
  8. ['English']
  9. []
  10. ['English']

column12 text

98.5% of rows are unique strings 90.2% null
rows122,611
null110,541 (90.2%)
unique11,884
len_min3
len_max2,912
len_mean340.288
len_median295.000
len_p95763.000
word_mean57.371
word_median49.000
n_empty0
n_duplicates186
duplicate_rate0.015
vocab_size61,840
readability_flesch_mean57.834
emoji_rate0.016
url_rate0.000
one_word_rate0.000
allcaps_rate8.20e-03
boilerplate_rate0.000
Show data table
Character-length distribution for column12 (mean: 340.28823529411767).
charscount
3 – 76812
76 – 1481475
148 – 2211866
221 – 2941856
294 – 3671625
367 – 4391313
439 – 512954
512 – 585652
585 – 658495
658 – 730299
730 – 803226
803 – 876128
876 – 948104
948 – 102180
1021 – 109445
1094 – 116728
1167 – 123918
1239 – 131231
1312 – 138520
1385 – 145811
1458 – 15304
1530 – 16035
1603 – 16763
1676 – 17483
1748 – 18215
1821 – 18942
1894 – 19671
1967 – 20391
2039 – 21121
2112 – 21851
2185 – 22570
2257 – 23300
2330 – 24031
2403 – 24761
2476 – 25481
2548 – 26210
2621 – 26940
2694 – 27670
2767 – 28391
2839 – 29122
Sample values (first 10)
  1. “What I didn't expect was how relatable it would become. [...] This will hit you deeply.” a beautiful little game – the monotonist “About as bizarre and different as artsy videogames get.” weird – Clark “This game is Tamhsrebog approved, very nice” Tamhsrebog approved – Tamhsrebo…
  2. “It's an absolutely crazy multiband distortion, compression, EQ and filter, which pretty much lets you do anything.” Skrillex “I really dig the new Ohmicide plugin in its some heavy duty bizniz.” The Chemical Brothers “its safe to say Ohmicide:Melohman remains the best-sounding, …
  3. “While it's predominantly a happy-go-lucky experience, there are some poignant moments, and it's a game that could stay in your heart long after you beat it.” 9 – James Daly - GAMINGbible “It's perfect for snuggling under a blanket on a quiet evening with a scented candle and a m…
  4. “I never thought I’d be sitting here in the first half of the first month of 2023, saying that a game is a GOTY contender. But that’s exactly what I’m doing.” 10/10 – Digitally Downloaded “Excellent hybrid of VN/RPG mechanics, engaging story full of likable characters, wealth of …
  5. “With its brain-bending puzzles, delightful story, and all around impressive fairy-tale-ness, Beyond the Sky is a point-and-click adventure not to be missed.” 4.5/5 – Adventure Gamers
  6. “Ultimately, Auridia is for metroidvania and exploration fans, and other gamers who enjoy any good platformer or puzzle. It’s easy to pick up, and while it doesn’t hold your hand too much, it doesn’t need to. You are given a world full of wonders to explore in any way you’d like.…
  7. “Covid Racing: Drive against the virus spectacularly in a fairytale world” 10/10 – Nieuwspot
  8. “Element Games took a winning strategy and ran with it and they didn’t stop once they made it into the end-zone.” The Videogame Backlog “Sky Cannoneer is an excellent game that came out of nowhere. I never played the original title on which this game is based on but I really like…
  9. “The most original concept we’ve come across in years.” Gamezebo “An ingenious social puzzler.” Droid Gamers “You’re going to fall in love.” Android Central
  10. “I Am Your Beast is a fantastic action game, one that really hones in on speed, adaptability, and efficiency.” 9/10 – But Why Tho? “What I Am Your Beast pulls off is giving its players the feeling of being a professional soldier trained in improvisational chaos.” 9.5/10 – Video G…

column13 text

99.9% of rows are unique strings 100.0% rows are a single word 100.0% rows contain a URL
rows122,611
null81 (0.1%)
unique122,420
len_min93
len_max153
len_mean104.626
len_median98.000
len_p95139.000
word_mean1.000
word_median1.000
n_empty0
n_duplicates110
duplicate_rate8.98e-04
vocab_size19,992
readability_flesch_mean-834.337
emoji_rate0.000
url_rate1.000
one_word_rate1.000
allcaps_rate0.000
boilerplate_rate0.000
Show data table
Character-length distribution for column13 (mean: 104.62648331020975).
charscount
93 – 9429
94 – 96238
96 – 9827191
98 – 9974714
99 – 1000
100 – 1020
102 – 1040
104 – 1050
105 – 1060
106 – 1080
108 – 1100
110 – 1111
111 – 11216
112 – 1140
114 – 1160
116 – 1170
117 – 1180
118 – 1200
120 – 1220
122 – 1230
123 – 1240
124 – 1260
126 – 1280
128 – 1290
129 – 1300
130 – 1320
132 – 1340
134 – 1350
135 – 13611
136 – 13839
138 – 14019722
140 – 1410
141 – 1420
142 – 1440
144 – 1460
146 – 1470
147 – 1480
148 – 1500
150 – 15264
152 – 153505
Sample values (first 10)
  1. https://shared.akamai.steamstatic.com/store_item_assets/steam/apps/1190590/header.jpg?t=1608385385
  2. https://shared.akamai.steamstatic.com/store_item_assets/steam/apps/624780/header.jpg?t=1573098671
  3. https://shared.akamai.steamstatic.com/store_item_assets/steam/apps/1381370/header.jpg?t=1601925149
  4. https://shared.akamai.steamstatic.com/store_item_assets/steam/apps/3149440/header.jpg?t=1739582306
  5. https://shared.akamai.steamstatic.com/store_item_assets/steam/apps/3500800/6bf932d5c8d20957228ebef56606069b1180687a/header.jpg?t=1756980773
  6. https://shared.akamai.steamstatic.com/store_item_assets/steam/apps/1718400/header.jpg?t=1672054129
  7. https://shared.akamai.steamstatic.com/store_item_assets/steam/apps/1450020/header.jpg?t=1660634504
  8. https://shared.akamai.steamstatic.com/store_item_assets/steam/apps/380600/01c457c29370b3e68ccc91f1e49965727e9cd0f6/header_alt_assets_12.jpg?t=1766080107
  9. https://shared.akamai.steamstatic.com/store_item_assets/steam/apps/377520/header.jpg?t=1726824054
  10. https://shared.akamai.steamstatic.com/store_item_assets/steam/apps/4017920/f7de4d185442087727cd13dd5b301a94b40aa664/header.jpg?t=1764729889

column14 text

100.0% rows are a single word 100.0% rows contain a URL 59.5% null 20.1% duplicate strings
rows122,611
null72,935 (59.5%)
unique39,703
len_min7
len_max236
len_mean32.569
len_median29.000
len_p9556.000
word_mean1.000
word_median1.000
n_empty0
n_duplicates9,973
duplicate_rate0.201
vocab_size17,059
readability_flesch_mean-260.326
emoji_rate0.000
url_rate1.000
one_word_rate1.000
allcaps_rate0.000
boilerplate_rate0.000
Show data table
Character-length distribution for column14 (mean: 32.568805861985666).
charscount
7 – 132
13 – 181050
18 – 2410802
24 – 3013931
30 – 3610202
36 – 415397
41 – 473308
47 – 531755
53 – 591232
59 – 64791
64 – 70277
70 – 76245
76 – 81223
81 – 87149
87 – 9353
93 – 9934
99 – 10422
104 – 11029
110 – 11626
116 – 12233
122 – 12726
127 – 13318
133 – 13916
139 – 14420
144 – 1504
150 – 1566
156 – 16212
162 – 1673
167 – 1731
173 – 1797
179 – 1841
184 – 1900
190 – 1960
196 – 2020
202 – 2070
207 – 2130
213 – 2190
219 – 2250
225 – 2300
230 – 2361
Sample values (first 10)
  1. http://www.wavesofdeath.com
  2. https://pqube.co.uk/
  3. https://www.gagex.co.jp/extra/works/virgo/
  4. http://www.thylacinestudios.com
  5. https://www.lunarisgames.com
  6. https://www.youtube.com/@TegridyMadeGames
  7. http://MoviesTycoon.com/
  8. https://www.doesntmattergames.com
  9. https://www.solomonsboneyard.com
  10. https://www.aldorlea.org/

column15 text

99.9% rows are a single word 95.6% rows contain a URL 55.8% null 34.7% duplicate strings
rows122,611
null68,404 (55.8%)
unique35,399
len_min1
len_max851
len_mean31.185
len_median29.000
len_p9551.000
word_mean1.002
word_median1.000
n_empty0
n_duplicates18,808
duplicate_rate0.347
vocab_size14,875
readability_flesch_mean-245.098
emoji_rate0.000
url_rate0.956
one_word_rate0.999
allcaps_rate7.93e-04
boilerplate_rate0.000
Show data table
Character-length distribution for column15 (mean: 31.185455752947036).
charscount
1 – 229827
22 – 4438992
44 – 654610
65 – 86477
86 – 107155
107 – 12882
128 – 15034
150 – 17110
171 – 1922
192 – 21412
214 – 2351
235 – 2562
256 – 2770
277 – 2980
298 – 3201
320 – 3410
341 – 3621
362 – 3840
384 – 4050
405 – 4260
426 – 4470
447 – 4680
468 – 4900
490 – 5110
511 – 5320
532 – 5540
554 – 5750
575 – 5960
596 – 6170
617 – 6380
638 – 6600
660 – 6810
681 – 7020
702 – 7240
724 – 7450
745 – 7660
766 – 7870
787 – 8080
808 – 8300
830 – 8511
Sample values (first 10)
  1. https://www.herinteractive.com/support/contact-support/
  2. https://sekaiproject.com/contact/
  3. http://www.geertverhoeff.com
  4. https://pqube.co.uk/
  5. https://store.steampowered.com/curator/33024510
  6. https://slipgate.ca/tr12/
  7. www.laywoodgames.com
  8. http://www.carlsengames.com
  9. http://www.nightdivestudios.com/?utm_source=steampowered.com&utm_medium=product&utm_campaign=support%20-%20labyrinth%20of%20time
  10. http://www.cwdigames.com

column16 text

99.9% rows are a single word 39.7% duplicate strings
rows122,611
null22,243 (18.1%)
unique60,519
len_min1
len_max169
len_mean22.908
len_median23.000
len_p9531.000
word_mean1.004
word_median1.000
n_empty0
n_duplicates39,849
duplicate_rate0.397
vocab_size15,319
readability_flesch_mean-223.742
emoji_rate9.96e-06
url_rate3.91e-03
one_word_rate0.999
allcaps_rate1.02e-03
boilerplate_rate0.000
Show data table
Character-length distribution for column16 (mean: 22.90802845528455).
charscount
1 – 554
5 – 923
9 – 141014
14 – 1810969
18 – 2228430
22 – 2639631
26 – 3014560
30 – 353948
35 – 39954
39 – 43416
43 – 47184
47 – 5155
51 – 5661
56 – 6037
60 – 648
64 – 684
68 – 723
72 – 772
77 – 811
81 – 850
85 – 893
89 – 930
93 – 980
98 – 1023
102 – 1061
106 – 1103
110 – 1141
114 – 1190
119 – 1230
123 – 1270
127 – 1310
131 – 1350
135 – 1400
140 – 1440
144 – 1480
148 – 1520
152 – 1562
156 – 1610
161 – 1650
165 – 1691
Sample values (first 10)
  1. me@petraller.com
  2. fig@happy-figs.com
  3. support-fleet-steam@choiceofgames.com
  4. support@degaussgame.com
  5. raphael@minimolgames.com
  6. hikmetcelik@hotmail.com.tr
  7. RealGamesInteractive@tutamail.com
  8. rewindappstudio@gmail.com
  9. steam.kuryu0623@gmail.com
  10. contact@tako-studio.com

column17 categorical

top value is 100.0% of rows
rows122,611
null0 (0.0%)
unique2
top_valueTrue
top_rate1.000
cardinality2
entropy4.62e-03
entropy_ratio4.62e-03
Show data table
Top values for column17 (2 unique shown, of 2 total).
valuecountshare
True122567100.0%
False440.0%
Top values (rank 1–20)
  1. True — 122,567
  2. False — 44

column18 categorical

rows122,611
null0 (0.0%)
unique2
top_valueFalse
top_rate0.826
cardinality2
entropy0.666
entropy_ratio0.666
Show data table
Top values for column18 (2 unique shown, of 2 total).
valuecountshare
False10131982.6%
True2129217.4%
Top values (rank 1–20)
  1. False — 101,319
  2. True — 21,292

column19 categorical

rows122,611
null0 (0.0%)
unique2
top_valueFalse
top_rate0.872
cardinality2
entropy0.552
entropy_ratio0.552
Show data table
Top values for column19 (2 unique shown, of 2 total).
valuecountshare
False10690587.2%
True1570612.8%
Top values (rank 1–20)
  1. False — 106,905
  2. True — 15,706

column20 numeric

skew=+5.23
rows122,611
null0 (0.0%)
unique73
min0.000
max97.000
mean2.565
median0.000
std13.661
q10.000
q30.000
iqr0.000
skew5.227
kurtosis25.749
n_outliers4,256
outlier_rate0.035
zero_rate0.965
Show data table
Histogram bins for column20 (median: 0.0).
bincount
0 – 2.425118355
2.425 – 4.850
4.85 – 7.2751
7.275 – 9.70
9.7 – 12.120
12.12 – 14.550
14.55 – 16.970
16.97 – 19.40
19.4 – 21.821
21.82 – 24.252
24.25 – 26.670
26.67 – 29.12
29.1 – 31.522
31.52 – 33.953
33.95 – 36.388
36.38 – 38.86
38.8 – 41.2216
41.22 – 43.6514
43.65 – 46.0719
46.07 – 48.521
48.5 – 50.9229
50.92 – 53.3562
53.35 – 55.7756
55.77 – 58.2100
58.2 – 60.6285
60.62 – 63.05178
63.05 – 65.47154
65.47 – 67.9179
67.9 – 70.32398
70.32 – 72.75270
72.75 – 75.17514
75.17 – 77.6383
77.6 – 80.02605
80.02 – 82.45367
82.45 – 84.88283
84.88 – 87.3268
87.3 – 89.72109
89.72 – 92.1588
92.15 – 94.5724
94.57 – 979

column21 text

97.7% of rows are unique strings 100.0% rows are a single word 100.0% rows contain a URL 96.5% null
rows122,611
null118,355 (96.5%)
unique4,160
len_min42
len_max142
len_mean72.429
len_median70.000
len_p9591.000
word_mean1.000
word_median1.000
n_empty0
n_duplicates96
duplicate_rate0.023
vocab_size4,160
readability_flesch_mean-704.053
emoji_rate0.000
url_rate1.000
one_word_rate1.000
allcaps_rate0.000
boilerplate_rate0.000
Show data table
Character-length distribution for column21 (mean: 72.42857142857143).
charscount
42 – 442
44 – 471
47 – 500
50 – 525
52 – 545
54 – 571
57 – 6089
60 – 62209
62 – 64597
64 – 67444
67 – 70632
70 – 72377
72 – 74441
74 – 77238
77 – 80307
80 – 82189
82 – 84241
84 – 8794
87 – 90106
90 – 9274
92 – 9474
94 – 9733
97 – 10038
100 – 10217
102 – 10410
104 – 10710
107 – 11012
110 – 1123
112 – 1141
114 – 1170
117 – 1205
120 – 1220
122 – 1240
124 – 1270
127 – 1300
130 – 1320
132 – 1340
134 – 1370
137 – 1400
140 – 1421
Sample values (first 10)
  1. https://www.metacritic.com/game/pc/shadow-of-the-tomb-raider?ftag=MCD-06-10aaa1f
  2. https://www.metacritic.com/game/pc/disc-room?ftag=MCD-06-10aaa1f
  3. https://www.metacritic.com/game/pc/full-metal-furies?ftag=MCD-06-10aaa1f
  4. https://www.metacritic.com/game/pc/if-found?ftag=MCD-06-10aaa1f
  5. https://www.metacritic.com/game/pc/smoke-and-sacrifice?ftag=MCD-06-10aaa1f
  6. https://www.metacritic.com/game/pc/voidwrought?ftag=MCD-06-10aaa1f
  7. https://www.metacritic.com/game/pc/the-knight-witch?ftag=MCD-06-10aaa1f
  8. https://www.metacritic.com/game/pc/0rbitalis?ftag=MCD-06-10aaa1f
  9. https://www.metacritic.com/game/pc/hamiltons-great-adventure?ftag=MCD-06-10aaa1f
  10. https://www.metacritic.com/game/pc/a-fistful-of-gun?ftag=MCD-06-10aaa1f

column22 numeric

skew=+59.25
rows122,611
null0 (0.0%)
unique31
min0.000
max100.000
mean0.025
median0.000
std1.395
q10.000
q30.000
iqr0.000
skew59.247
kurtosis3,628
n_outliers40
outlier_rate3.26e-04
zero_rate1.000
Show data table
Histogram bins for column22 (median: 0.0).
bincount
0 – 2.5122571
2.5 – 50
5 – 7.50
7.5 – 100
10 – 12.50
12.5 – 150
15 – 17.50
17.5 – 200
20 – 22.50
22.5 – 250
25 – 27.50
27.5 – 300
30 – 32.50
32.5 – 350
35 – 37.51
37.5 – 400
40 – 42.50
42.5 – 450
45 – 47.52
47.5 – 500
50 – 52.52
52.5 – 551
55 – 57.52
57.5 – 600
60 – 62.52
62.5 – 651
65 – 67.52
67.5 – 703
70 – 72.51
72.5 – 751
75 – 77.53
77.5 – 801
80 – 82.53
82.5 – 853
85 – 87.51
87.5 – 901
90 – 92.51
92.5 – 951
95 – 97.53
97.5 – 1005

column23 numeric

skew=+177.84 17.0% rows beyond 1.5 IQR
rows122,611
null0 (0.0%)
unique5,540
min0.000
max7,642,084
mean1,045
median5.000
std28,092
q10.000
q337.000
iqr37.000
skew177.842
kurtosis45,296
n_outliers20,797
outlier_rate0.170
zero_rate0.345
Show data table
Histogram bins for column23 (median: 5.0).
bincount
0 – 1.911e+05122511
1.911e+05 – 3.821e+0557
3.821e+05 – 5.732e+0516
5.732e+05 – 7.642e+0510
7.642e+05 – 9.553e+056
9.553e+05 – 1.146e+065
1.146e+06 – 1.337e+061
1.337e+06 – 1.528e+062
1.528e+06 – 1.719e+060
1.719e+06 – 1.911e+061
1.911e+06 – 2.102e+061
2.102e+06 – 2.293e+060
2.293e+06 – 2.484e+060
2.484e+06 – 2.675e+060
2.675e+06 – 2.866e+060
2.866e+06 – 3.057e+060
3.057e+06 – 3.248e+060
3.248e+06 – 3.439e+060
3.439e+06 – 3.63e+060
3.63e+06 – 3.821e+060
3.821e+06 – 4.012e+060
4.012e+06 – 4.203e+060
4.203e+06 – 4.394e+060
4.394e+06 – 4.585e+060
4.585e+06 – 4.776e+060
4.776e+06 – 4.967e+060
4.967e+06 – 5.158e+060
5.158e+06 – 5.349e+060
5.349e+06 – 5.541e+060
5.541e+06 – 5.732e+060
5.732e+06 – 5.923e+060
5.923e+06 – 6.114e+060
6.114e+06 – 6.305e+060
6.305e+06 – 6.496e+060
6.496e+06 – 6.687e+060
6.687e+06 – 6.878e+060
6.878e+06 – 7.069e+060
7.069e+06 – 7.26e+060
7.26e+06 – 7.451e+060
7.451e+06 – 7.642e+061

column24 numeric

skew=+156.86 16.9% rows beyond 1.5 IQR
rows122,611
null0 (0.0%)
unique2,725
min0.000
max1,173,003
mean169.197
median1.000
std5,375
q10.000
q310.000
iqr10.000
skew156.863
kurtosis30,627
n_outliers20,696
outlier_rate0.169
zero_rate0.450
Show data table
Histogram bins for column24 (median: 1.0).
bincount
0 – 2.933e+04122529
2.933e+04 – 5.865e+0440
5.865e+04 – 8.798e+0418
8.798e+04 – 1.173e+059
1.173e+05 – 1.466e+053
1.466e+05 – 1.76e+053
1.76e+05 – 2.053e+050
2.053e+05 – 2.346e+051
2.346e+05 – 2.639e+052
2.639e+05 – 2.933e+051
2.933e+05 – 3.226e+051
3.226e+05 – 3.519e+051
3.519e+05 – 3.812e+050
3.812e+05 – 4.106e+050
4.106e+05 – 4.399e+050
4.399e+05 – 4.692e+051
4.692e+05 – 4.985e+050
4.985e+05 – 5.279e+050
5.279e+05 – 5.572e+050
5.572e+05 – 5.865e+050
5.865e+05 – 6.158e+050
6.158e+05 – 6.452e+050
6.452e+05 – 6.745e+050
6.745e+05 – 7.038e+050
7.038e+05 – 7.331e+050
7.331e+05 – 7.625e+050
7.625e+05 – 7.918e+050
7.918e+05 – 8.211e+050
8.211e+05 – 8.504e+050
8.504e+05 – 8.798e+050
8.798e+05 – 9.091e+050
9.091e+05 – 9.384e+050
9.384e+05 – 9.677e+050
9.677e+05 – 9.971e+050
9.971e+05 – 1.026e+060
1.026e+06 – 1.056e+061
1.056e+06 – 1.085e+060
1.085e+06 – 1.114e+060
1.114e+06 – 1.144e+060
1.144e+06 – 1.173e+061

column25 numeric

100.0% null
rows122,611
null122,571 (100.0%)
unique3
min98.000
max100.000
mean99.175
median99.000
std0.675
q199.000
q3100.000
iqr1.000
skew-0.215
kurtosis-0.787
n_outliers0
outlier_rate0.000
zero_rate0.000
Show data table
Histogram bins for column25 (median: 99.0).
bincount
98 – 98.336
98.33 – 98.670
98.67 – 990
99 – 99.3321
99.33 – 99.670
99.67 – 10013

column26 numeric

skew=+32.63 6.9% rows beyond 1.5 IQR
rows122,611
null0 (0.0%)
unique448
min0.000
max9,821
mean18.087
median2.000
std141.494
q10.000
q319.000
iqr19.000
skew32.631
kurtosis1,192
n_outliers8,433
outlier_rate0.069
zero_rate0.486
Show data table
Histogram bins for column26 (median: 2.0).
bincount
0 – 245.5122280
245.5 – 491.1109
491.1 – 736.644
736.6 – 982.116
982.1 – 122818
1228 – 147314
1473 – 171912
1719 – 19644
1964 – 221011
2210 – 24555
2455 – 27012
2701 – 29462
2946 – 31927
3192 – 34374
3437 – 36831
3683 – 39280
3928 – 41743
4174 – 44192
4419 – 46652
4665 – 49105
4910 – 515668
5156 – 54021
5402 – 56470
5647 – 58930
5893 – 61380
6138 – 63840
6384 – 66290
6629 – 68750
6875 – 71200
7120 – 73660
7366 – 76110
7611 – 78570
7857 – 81020
8102 – 83480
8348 – 85930
8593 – 88390
8839 – 90840
9084 – 93300
9330 – 95750
9575 – 98211

column27 numeric

skew=+113.91 17.1% rows beyond 1.5 IQR
rows122,611
null0 (0.0%)
unique5,332
min0.000
max4,830,455
mean961.825
median0.000
std21,879
q10.000
q30.000
iqr0.000
skew113.908
kurtosis20,875
n_outliers20,906
outlier_rate0.171
zero_rate0.829
Show data table
Histogram bins for column27 (median: 0.0).
bincount
0 – 1.208e+05122458
1.208e+05 – 2.415e+0585
2.415e+05 – 3.623e+0530
3.623e+05 – 4.83e+0511
4.83e+05 – 6.038e+052
6.038e+05 – 7.246e+054
7.246e+05 – 8.453e+058
8.453e+05 – 9.661e+052
9.661e+05 – 1.087e+062
1.087e+06 – 1.208e+061
1.208e+06 – 1.328e+065
1.328e+06 – 1.449e+060
1.449e+06 – 1.57e+060
1.57e+06 – 1.691e+060
1.691e+06 – 1.811e+061
1.811e+06 – 1.932e+061
1.932e+06 – 2.053e+060
2.053e+06 – 2.174e+060
2.174e+06 – 2.294e+060
2.294e+06 – 2.415e+060
2.415e+06 – 2.536e+060
2.536e+06 – 2.657e+060
2.657e+06 – 2.778e+060
2.778e+06 – 2.898e+060
2.898e+06 – 3.019e+060
3.019e+06 – 3.14e+060
3.14e+06 – 3.261e+060
3.261e+06 – 3.381e+060
3.381e+06 – 3.502e+060
3.502e+06 – 3.623e+060
3.623e+06 – 3.744e+060
3.744e+06 – 3.864e+060
3.864e+06 – 3.985e+060
3.985e+06 – 4.106e+060
4.106e+06 – 4.227e+060
4.227e+06 – 4.347e+060
4.347e+06 – 4.468e+060
4.468e+06 – 4.589e+060
4.589e+06 – 4.71e+060
4.71e+06 – 4.83e+061

column28 text

4 languages detected in sample 81.7% null
rows122,611
null100,152 (81.7%)
unique18,620
len_min2
len_max2,020
len_mean164.099
len_median124.000
len_p95445.000
word_mean25.735
word_median20.000
n_empty0
n_duplicates3,839
duplicate_rate0.171
vocab_size23,061
readability_flesch_mean44.381
emoji_rate7.12e-04
url_rate8.91e-05
one_word_rate9.04e-03
allcaps_rate8.19e-03
boilerplate_rate9.48e-03
Show data table
Character-length distribution for column28 (mean: 164.09902488979918).
charscount
2 – 524251
52 – 1035273
103 – 1533975
153 – 2043096
204 – 2541915
254 – 3051167
305 – 355787
355 – 406579
406 – 456389
456 – 506246
506 – 557175
557 – 607143
607 – 65891
658 – 70864
708 – 75960
759 – 80949
809 – 86034
860 – 91025
910 – 96132
961 – 101123
1011 – 106118
1061 – 111213
1112 – 116210
1162 – 12136
1213 – 12634
1263 – 13148
1314 – 13645
1364 – 14153
1415 – 14652
1465 – 15164
1516 – 15662
1566 – 16163
1616 – 16670
1667 – 17172
1717 – 17682
1768 – 18180
1818 – 18692
1869 – 19190
1919 – 19700
1970 – 20201
Sample values (first 10)
  1. Contains expressions related to suicide
  2. The Divine Speaker: The Sun and the Moon features consensual sex scenes between male adults.
  3. May feature veiled Nudity. Descriptions or depiction of Violence or Death.
  4. All character is over 18 year old
  5. The game contains strong language, nudity, sexual content, violence, gore and horror.
  6. Use of alcohol. Simulated gambling.
  7. Contains partial nudity of young ladies in an anime style. A large section of the gallery contains artwork with covered nudity, upskirt shots with visible panties, swimwear, bare legs, buttocks, and breasts.
  8. The game contains non-consensual acts, sexual harassment, hypnosis, interracial intercourse, sex trafficking, and BDSM.
  9. The mature content is exclusively female nudity, except for the final sex scenes which include male genitalia as well. Sex is consensual, no drugs, no alcohol.
  10. blood/gore/violence, strong language

column29 numeric

skew=+262.89 21.3% rows beyond 1.5 IQR
rows122,611
null0 (0.0%)
unique3,037
min0.000
max3,429,544
mean208.023
median0.000
std11,218
q10.000
q30.000
iqr0.000
skew262.895
kurtosis75,698
n_outliers26,119
outlier_rate0.213
zero_rate0.787
Show data table
Histogram bins for column29 (median: 0.0).
bincount
0 – 8.574e+04122594
8.574e+04 – 1.715e+0510
1.715e+05 – 2.572e+053
2.572e+05 – 3.43e+051
3.43e+05 – 4.287e+051
4.287e+05 – 5.144e+050
5.144e+05 – 6.002e+050
6.002e+05 – 6.859e+050
6.859e+05 – 7.716e+050
7.716e+05 – 8.574e+050
8.574e+05 – 9.431e+050
9.431e+05 – 1.029e+060
1.029e+06 – 1.115e+060
1.115e+06 – 1.2e+060
1.2e+06 – 1.286e+060
1.286e+06 – 1.372e+060
1.372e+06 – 1.458e+060
1.458e+06 – 1.543e+060
1.543e+06 – 1.629e+060
1.629e+06 – 1.715e+061
1.715e+06 – 1.801e+060
1.801e+06 – 1.886e+060
1.886e+06 – 1.972e+060
1.972e+06 – 2.058e+060
2.058e+06 – 2.143e+060
2.143e+06 – 2.229e+060
2.229e+06 – 2.315e+060
2.315e+06 – 2.401e+060
2.401e+06 – 2.486e+060
2.486e+06 – 2.572e+060
2.572e+06 – 2.658e+060
2.658e+06 – 2.744e+060
2.744e+06 – 2.829e+060
2.829e+06 – 2.915e+060
2.915e+06 – 3.001e+060
3.001e+06 – 3.087e+060
3.087e+06 – 3.172e+060
3.172e+06 – 3.258e+060
3.258e+06 – 3.344e+060
3.344e+06 – 3.43e+061

column30 numeric

skew=+51.68
rows122,611
null0 (0.0%)
unique993
min0.000
max20,088
mean13.789
median0.000
std270.378
q10.000
q30.000
iqr0.000
skew51.677
kurtosis3,253
n_outliers3,898
outlier_rate0.032
zero_rate0.968
Show data table
Histogram bins for column30 (median: 0.0).
bincount
0 – 502.2121943
502.2 – 1004377
1004 – 1507112
1507 – 200957
2009 – 251135
2511 – 301314
3013 – 35157
3515 – 40187
4018 – 45209
4520 – 50222
5022 – 55242
5524 – 60263
6026 – 65294
6529 – 70315
7031 – 75332
7533 – 80352
8035 – 85374
8537 – 90401
9040 – 95420
9542 – 1.004e+042
1.004e+04 – 1.055e+042
1.055e+04 – 1.105e+041
1.105e+04 – 1.155e+040
1.155e+04 – 1.205e+041
1.205e+04 – 1.256e+040
1.256e+04 – 1.306e+041
1.306e+04 – 1.356e+043
1.356e+04 – 1.406e+040
1.406e+04 – 1.456e+040
1.456e+04 – 1.507e+040
1.507e+04 – 1.557e+040
1.557e+04 – 1.607e+040
1.607e+04 – 1.657e+043
1.657e+04 – 1.707e+043
1.707e+04 – 1.758e+040
1.758e+04 – 1.808e+040
1.808e+04 – 1.858e+040
1.858e+04 – 1.908e+040
1.908e+04 – 1.959e+040
1.959e+04 – 2.009e+049

column31 numeric

skew=+263.99 21.3% rows beyond 1.5 IQR
rows122,611
null0 (0.0%)
unique2,511
min0.000
max3,429,544
mean173.571
median0.000
std11,203
q10.000
q30.000
iqr0.000
skew263.990
kurtosis76,112
n_outliers26,119
outlier_rate0.213
zero_rate0.787
Show data table
Histogram bins for column31 (median: 0.0).
bincount
0 – 8.574e+04122592
8.574e+04 – 1.715e+0512
1.715e+05 – 2.572e+053
2.572e+05 – 3.43e+051
3.43e+05 – 4.287e+051
4.287e+05 – 5.144e+050
5.144e+05 – 6.002e+050
6.002e+05 – 6.859e+050
6.859e+05 – 7.716e+050
7.716e+05 – 8.574e+050
8.574e+05 – 9.431e+050
9.431e+05 – 1.029e+060
1.029e+06 – 1.115e+060
1.115e+06 – 1.2e+060
1.2e+06 – 1.286e+060
1.286e+06 – 1.372e+060
1.372e+06 – 1.458e+060
1.458e+06 – 1.543e+060
1.543e+06 – 1.629e+060
1.629e+06 – 1.715e+061
1.715e+06 – 1.801e+060
1.801e+06 – 1.886e+060
1.886e+06 – 1.972e+060
1.972e+06 – 2.058e+060
2.058e+06 – 2.143e+060
2.143e+06 – 2.229e+060
2.229e+06 – 2.315e+060
2.315e+06 – 2.401e+060
2.401e+06 – 2.486e+060
2.486e+06 – 2.572e+060
2.572e+06 – 2.658e+060
2.658e+06 – 2.744e+060
2.744e+06 – 2.829e+060
2.829e+06 – 2.915e+060
2.915e+06 – 3.001e+060
3.001e+06 – 3.087e+060
3.087e+06 – 3.172e+060
3.172e+06 – 3.258e+060
3.258e+06 – 3.344e+060
3.344e+06 – 3.43e+061

column32 numeric

skew=+48.91
rows122,611
null0 (0.0%)
unique993
min0.000
max20,088
mean14.722
median0.000
std294.510
q10.000
q30.000
iqr0.000
skew48.909
kurtosis2,848
n_outliers3,898
outlier_rate0.032
zero_rate0.968
Show data table
Histogram bins for column32 (median: 0.0).
bincount
0 – 502.2121952
502.2 – 1004342
1004 – 1507114
1507 – 200966
2009 – 251135
2511 – 301319
3013 – 35155
3515 – 40188
4018 – 452012
4520 – 50226
5022 – 55242
5524 – 60264
6026 – 65295
6529 – 70315
7031 – 75331
7533 – 80351
8035 – 85373
8537 – 90401
9040 – 95420
9542 – 1.004e+040
1.004e+04 – 1.055e+042
1.055e+04 – 1.105e+041
1.105e+04 – 1.155e+041
1.155e+04 – 1.205e+041
1.205e+04 – 1.256e+041
1.256e+04 – 1.306e+041
1.306e+04 – 1.356e+043
1.356e+04 – 1.406e+040
1.406e+04 – 1.456e+040
1.456e+04 – 1.507e+041
1.507e+04 – 1.557e+040
1.557e+04 – 1.607e+041
1.607e+04 – 1.657e+043
1.657e+04 – 1.707e+044
1.707e+04 – 1.758e+040
1.758e+04 – 1.808e+040
1.808e+04 – 1.858e+040
1.858e+04 – 1.908e+040
1.908e+04 – 1.959e+041
1.959e+04 – 2.009e+0410

column33 text

31.8% rows are a single word 38.0% duplicate strings
rows122,611
null8,431 (6.9%)
unique70,816
len_min1
len_max584
len_mean14.366
len_median13.000
len_p9527.000
word_mean2.019
word_median2.000
n_empty0
n_duplicates43,364
duplicate_rate0.380
vocab_size18,429
readability_flesch_mean38.733
emoji_rate8.93e-04
url_rate2.19e-04
one_word_rate0.318
allcaps_rate0.080
boilerplate_rate0.000
Show data table
Character-length distribution for column33 (mean: 14.365659485023647).
charscount
1 – 1677264
16 – 3033047
30 – 452696
45 – 59611
59 – 74241
74 – 88129
88 – 10371
103 – 11834
118 – 13218
132 – 14715
147 – 16110
161 – 1767
176 – 1907
190 – 2058
205 – 2202
220 – 2344
234 – 2491
249 – 2634
263 – 2781
278 – 2922
292 – 3072
307 – 3220
322 – 3361
336 – 3510
351 – 3650
365 – 3801
380 – 3950
395 – 4090
409 – 4241
424 – 4380
438 – 4530
453 – 4670
467 – 4820
482 – 4970
497 – 5110
511 – 5260
526 – 5401
540 – 5551
555 – 5690
569 – 5841
Sample values (first 10)
  1. Rafael Farias
  2. Denis Galewski
  3. Denatsu Games
  4. Flux-Soul
  5. Element Studios
  6. Areta Watanabe
  7. Immergity LLC
  8. Attack Studio
  9. SK Soft
  10. Justin K. Touchstone

column34 text

31.8% rows are a single word 44.9% duplicate strings
rows122,611
null8,833 (7.2%)
unique62,689
len_min1
len_max164
len_mean13.825
len_median13.000
len_p9526.000
word_mean1.988
word_median2.000
n_empty0
n_duplicates51,089
duplicate_rate0.449
vocab_size15,765
readability_flesch_mean40.218
emoji_rate9.14e-04
url_rate2.29e-04
one_word_rate0.318
allcaps_rate0.082
boilerplate_rate0.000
Show data table
Character-length distribution for column34 (mean: 13.824825537450122).
charscount
1 – 56733
5 – 922067
9 – 1332481
13 – 1728105
17 – 2113359
21 – 255059
25 – 302901
30 – 341309
34 – 38764
38 – 42388
42 – 46199
46 – 50132
50 – 5458
54 – 5890
58 – 6246
62 – 6635
66 – 7011
70 – 7415
74 – 787
78 – 825
82 – 872
87 – 911
91 – 951
95 – 992
99 – 1033
103 – 1071
107 – 1110
111 – 1150
115 – 1190
119 – 1230
123 – 1271
127 – 1311
131 – 1350
135 – 1400
140 – 1440
144 – 1480
148 – 1520
152 – 1560
156 – 1600
160 – 1642
Sample values (first 10)
  1. Petraller Studios
  2. Two and a Half Studios
  3. Baruhara
  4. Zhongce Games
  5. Element Studios
  6. Areta Watanabe
  7. Toblue
  8. TaoJeruen
  9. Electronic Arts
  10. SOFTON ENTERTAINMENT

column35 text

88.3% duplicate strings
rows122,611
null8,953 (7.3%)
unique13,291
len_min3
len_max534
len_mean71.584
len_median59.000
len_p95178.000
word_mean5.089
word_median4.000
n_empty0
n_duplicates100,367
duplicate_rate0.883
vocab_size589
readability_flesch_mean-105.855
emoji_rate0.000
url_rate0.000
one_word_rate0.040
allcaps_rate8.80e-06
boilerplate_rate0.000
Show data table
Character-length distribution for column35 (mean: 71.58431434654841).
charscount
3 – 164980
16 – 3024221
30 – 436419
43 – 5620213
56 – 6912806
69 – 8310422
83 – 969469
96 – 1095839
109 – 1223729
122 – 1362863
136 – 1492700
149 – 1622033
162 – 1761967
176 – 1891499
189 – 2021190
202 – 215830
215 – 229625
229 – 242475
242 – 255363
255 – 268282
268 – 282175
282 – 295137
295 – 308105
308 – 32270
322 – 33566
335 – 34854
348 – 36138
361 – 37519
375 – 38821
388 – 40118
401 – 4156
415 – 4287
428 – 4413
441 – 4544
454 – 4681
468 – 4816
481 – 4941
494 – 5070
507 – 5210
521 – 5342
Sample values (first 10)
  1. Single-player,Steam Achievements,Full controller support,Steam Trading Cards,Family Sharing
  2. Single-player,Family Sharing
  3. Single-player,Family Sharing
  4. Multi-player,Co-op,Shared/Split Screen Co-op,Shared/Split Screen,Steam Achievements,Remote Play Together
  5. Single-player,Family Sharing
  6. Single-player,Steam Achievements,Family Sharing
  7. Single-player,Steam Achievements,Camera Comfort,Custom Volume Controls,Adjustable Difficulty,Playable without Timed Input,Stereo Sound,Steam Leaderboards,Family Sharing
  8. Single-player,Steam Achievements,Family Sharing
  9. Single-player,Steam Achievements,Steam Cloud,Stats,Family Sharing
  10. Single-player,Multi-player,Co-op,Shared/Split Screen Co-op,Shared/Split Screen,Steam Achievements,Full controller support,Steam Trading Cards,Remote Play Together,Family Sharing

column36 text

78.9% rows are a single word 97.5% duplicate strings
rows122,611
null8,413 (6.9%)
unique2,894
len_min3
len_max236
len_mean22.205
len_median21.000
len_p9545.000
word_mean1.364
word_median1.000
n_empty0
n_duplicates111,304
duplicate_rate0.975
vocab_size940
readability_flesch_mean-206.050
emoji_rate0.000
url_rate0.000
one_word_rate0.789
allcaps_rate9.78e-03
boilerplate_rate0.000
Show data table
Character-length distribution for column36 (mean: 22.205064887301003).
charscount
3 – 912259
9 – 1521084
15 – 2022318
20 – 2625837
26 – 3212284
32 – 388026
38 – 445596
44 – 502995
50 – 551587
55 – 61848
61 – 67593
67 – 73229
73 – 79196
79 – 85137
85 – 9071
90 – 9635
96 – 10237
102 – 10813
108 – 11415
114 – 12010
120 – 1256
125 – 1316
131 – 1374
137 – 1432
143 – 1492
149 – 1541
154 – 1600
160 – 1661
166 – 1724
172 – 1780
178 – 1840
184 – 1890
189 – 1950
195 – 2010
201 – 2070
207 – 2131
213 – 2190
219 – 2240
224 – 2300
230 – 2361
Sample values (first 10)
  1. Indie,Strategy
  2. Action,Indie,Strategy
  3. Action,Adventure,Indie,RPG,Simulation
  4. RPG,Strategy
  5. Casual
  6. Adventure,Indie,Simulation
  7. Action
  8. Indie,RPG,Strategy
  9. Simulation
  10. Free To Play,Massively Multiplayer,RPG

column37 text

9 languages detected in sample 32.0% null
rows122,611
null39,265 (32.0%)
unique77,179
len_min3
len_max295
len_mean141.315
len_median163.000
len_p95228.000
word_mean4.923
word_median5.000
n_empty0
n_duplicates6,167
duplicate_rate0.074
vocab_size57,260
readability_flesch_mean-449.744
emoji_rate0.000
url_rate0.000
one_word_rate0.123
allcaps_rate4.80e-05
boilerplate_rate0.000
Show data table
Character-length distribution for column37 (mean: 141.31500011998176).
charscount
3 – 10584
10 – 181637
18 – 252337
25 – 323056
32 – 402371
40 – 472362
47 – 542419
54 – 611862
61 – 691883
69 – 761806
76 – 831905
83 – 911584
91 – 981595
98 – 1051879
105 – 1121648
112 – 1201657
120 – 1271916
127 – 1341722
134 – 1421741
142 – 1491845
149 – 1562108
156 – 1641958
164 – 1712426
171 – 1783392
178 – 1863991
186 – 1934902
193 – 2006178
200 – 2075471
207 – 2154901
215 – 2223666
222 – 2292999
229 – 2371612
237 – 244918
244 – 251599
251 – 258239
258 – 266120
266 – 27337
273 – 28013
280 – 2885
288 – 2952
Sample values (first 10)
  1. Puzzle-Platformer,FPS,Singleplayer,Adventure,3D,Puzzle,Action-Adventure,Colorful,Arcade,Experimental,Interactive Fiction,Platformer,3D Platformer,First-Person,Stylized,Action,Physics,Runner,Early Access,Indie
  2. Adventure,Action,Arcade,Platformer,Action-Adventure,Shooter,2D Platformer,Idler,Side Scroller,Exploration,Looter Shooter,Arena Shooter,2D,Cute,Minimalist,Stylized,Space,Mars,Dark,Science
  3. Simulation,Education,3D,Outbreak Sim,First-Person,Flight,Linear,Indie,Early Access,Conversation,Tutorial,Singleplayer
  4. Word Game,RPG,Adventure,Puzzle-Platformer,Pixel Graphics,Text-Based,Singleplayer,Strategy,Indie,Casual
  5. Rogue-lite,Action Roguelike,2D,RPG,Rogue-like,Action,Casual,Action RPG,Narration,Colorful,Hand-drawn,Arcade,Bullet Hell,Top-Down,Comedy,Fantasy,Magic,Atmospheric,Singleplayer,Pixel Graphics
  6. Strategy,RTS,Sci-fi,Real Time Tactics,Classic
  7. Card Battler,Strategy,RPG,Solitaire,Martial Arts,PvP,Card Game,Indie,Simulation,2D,Turn-Based Combat,Pixel Graphics,Multiplayer,Early Access,Singleplayer,Logic,Turn-Based Strategy,Turn-Based Tactics,Stylized,Crafting
  8. 2D Platformer,Side Scroller,Precision Platformer,Collectathon,Exploration,Platformer,Action,Adventure,Difficult,Female Protagonist,Puzzle-Platformer,Action-Adventure,Cute,Cartoony,Colorful,Pixel Graphics,Cats,Retro,Singleplayer,Controller
  9. Strategy,Adventure,Casual,Simulation,3D Vision,Cartoony,Colorful,Cute,Stylized,Singleplayer,Indie,Flight,Job Simulator,Shooter,Top-Down Shooter
  10. Strategy,RPG,Traditional Roguelike,Dungeon Crawler,Turn-Based Tactics,Turn-Based Strategy,Strategy RPG,Top-Down,Third Person,Indie,Dark Fantasy,Cartoon,Dark,Singleplayer

column38 text

99.9% of rows are unique strings 100.0% rows are a single word 100.0% rows contain a URL
rows122,611
null6,018 (4.9%)
unique116,483
len_min144
len_max29,132
len_mean1,319
len_median1,039
len_p952,773
word_mean1.000
word_median1.000
n_empty0
n_duplicates110
duplicate_rate9.43e-04
vocab_size19,994
readability_flesch_mean-5,099
emoji_rate0.000
url_rate1.000
one_word_rate1.000
allcaps_rate0.000
boilerplate_rate0.000
Show data table
Character-length distribution for column38 (mean: 1318.9448423147187).
charscount
144 – 86928597
869 – 159359377
1593 – 231818118
2318 – 30436216
3043 – 37682157
3768 – 4492959
4492 – 5217470
5217 – 5942274
5942 – 6666167
6666 – 739195
7391 – 811647
8116 – 884029
8840 – 956525
9565 – 1029020
10290 – 110147
11014 – 1173910
11739 – 124642
12464 – 131894
13189 – 139132
13913 – 146380
14638 – 153631
15363 – 160875
16087 – 168122
16812 – 175371
17537 – 182620
18262 – 189861
18986 – 197110
19711 – 204360
20436 – 211604
21160 – 218850
21885 – 226100
22610 – 233340
23334 – 240590
24059 – 247841
24784 – 255080
25508 – 262330
26233 – 269581
26958 – 276830
27683 – 284070
28407 – 291321
Sample values (first 10)
  1. https://shared.akamai.steamstatic.com/store_item_assets/steam/apps/426590/ss_1f3bdb17454e203684f2aa2a73c24e0fb8d5e5b3.1920x1080.jpg?t=1460225792,https://shared.akamai.steamstatic.com/store_item_assets/steam/apps/426590/ss_65f40c99802fba66479e2b7d9ebad8465647a63f.1920x1080.jpg?t=1…
  2. https://shared.akamai.steamstatic.com/store_item_assets/steam/apps/840020/ss_fb02f9d34edacf2227a29393716977cb4cfbd68b.1920x1080.jpg?t=1724977094,https://shared.akamai.steamstatic.com/store_item_assets/steam/apps/840020/ss_625edf1cee6615c41232e447ba7b6c9649b2b5bf.1920x1080.jpg?t=1…
  3. https://shared.akamai.steamstatic.com/store_item_assets/steam/apps/3383290/ss_35709f1b169961d1188300616ee89ceaef276997.1920x1080.jpg?t=1736909574,https://shared.akamai.steamstatic.com/store_item_assets/steam/apps/3383290/ss_b4d95ecad6f07d306499ce02e9af21da7bbbc547.1920x1080.jpg?t…
  4. https://shared.akamai.steamstatic.com/store_item_assets/steam/apps/4141620/ac47cf5cb77d1ec5fb8bbabc3bd5fc8c28ecc705/ss_ac47cf5cb77d1ec5fb8bbabc3bd5fc8c28ecc705.1920x1080.jpg?t=1765182578,https://shared.akamai.steamstatic.com/store_item_assets/steam/apps/4141620/45cf37abbe0dbf3b3f…
  5. https://shared.akamai.steamstatic.com/store_item_assets/steam/apps/3500800/469927cb9830ca9cbb11b16b4ba57d63a94e3d98/ss_469927cb9830ca9cbb11b16b4ba57d63a94e3d98.1920x1080.jpg?t=1756980773,https://shared.akamai.steamstatic.com/store_item_assets/steam/apps/3500800/71fef1f7c6a42d506c…
  6. https://shared.akamai.steamstatic.com/store_item_assets/steam/apps/1718400/ss_b0facb8ebc7c80b860890be01e6cabf317a5b423.1920x1080.jpg?t=1672054129,https://shared.akamai.steamstatic.com/store_item_assets/steam/apps/1718400/ss_336a4af9306399fc16cc6bdd630dadc2bfd5e582.1920x1080.jpg?t…
  7. https://shared.akamai.steamstatic.com/store_item_assets/steam/apps/105300/ss_079d1826ba05a220d9787783d3d9f57844dd6d69.1920x1080.jpg?t=1447354297,https://shared.akamai.steamstatic.com/store_item_assets/steam/apps/105300/ss_84a7a161234e2c9ebd403dcfd564a87e5322a51f.1920x1080.jpg?t=1…
  8. https://shared.akamai.steamstatic.com/store_item_assets/steam/apps/3118010/ss_abc47dd657c140deff04453cfbee7516292221e2.1920x1080.jpg?t=1750959926,https://shared.akamai.steamstatic.com/store_item_assets/steam/apps/3118010/ss_e82a7ee610ec4137204409cdb2f7e4531b0003cf.1920x1080.jpg?t…
  9. https://shared.akamai.steamstatic.com/store_item_assets/steam/apps/2475730/ss_7846e950463aaf8d66602274d3eaecaf6cdb5a05.1920x1080.jpg?t=1718801988,https://shared.akamai.steamstatic.com/store_item_assets/steam/apps/2475730/ss_9d7b54a4e4b58bfa5f45fc99f7d232e75aee6dd8.1920x1080.jpg?t…
  10. https://shared.akamai.steamstatic.com/store_item_assets/steam/apps/2252550/ss_fdd69097fd7aafba7a189619c0b59b348983d234.1920x1080.jpg?t=1760994400,https://shared.akamai.steamstatic.com/store_item_assets/steam/apps/2252550/ss_2d1184f97df203304a9fe43df502049feda069e9.1920x1080.jpg?t…

column39 unknown

no profiler for kind=unknown
rows122,611
null0 (0.0%)