Summary confidence: high
This dataset is a 2000-row index from a WLASL (Word-Level American Sign Language) source, with two columns: 'gloss' (text labels) and 'instances' (a field the profiler did not parse, likely nested data). The 'gloss' column is essentially a vocabulary list: all 2000 values are unique, 97.75% are single words, and the mean length is just 6 characters. Because 'instances' was skipped by the profiler, it warrants manual inspection; it likely contains the actual sign-language sample records keyed to each gloss. Start by looking at the gloss length distribution to confirm the single-word pattern, then inspect the structure of 'instances' separately.
citing: row_count · column_count · columns[gloss].n_unique · columns[gloss].stats.one_word_rate · columns[gloss].stats.len_mean · columns[gloss].stats.len_max · columns[gloss].stats.word_mean · columns[gloss].stats.vocab_size · columns[gloss].top_words · columns[instances].alerts
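
The two follow-up checks suggested in the summary could be sketched roughly as below. This is a minimal sketch, not the profiler's own method. It assumes the index can be loaded as a list of records, each with a 'gloss' string and an 'instances' value. The helper names (`load_records`, `gloss_length_summary`, `instances_structure`), the JSON file layout, and the keys inside the sample 'instances' entries are all hypothetical and should be adjusted to the real file.

```python
import json
from collections import Counter
from statistics import mean


def load_records(path):
    """Load the index, assuming (hypothetically) a JSON array of records."""
    with open(path, encoding="utf-8") as fh:
        return json.load(fh)


def gloss_length_summary(records):
    """Recompute the 'gloss' statistics quoted in the summary."""
    glosses = [str(r["gloss"]) for r in records]
    char_lens = [len(g) for g in glosses]
    word_counts = [len(g.split()) for g in glosses]
    return {
        "rows": len(glosses),
        "unique": len(set(glosses)),
        "len_mean": mean(char_lens),
        "len_max": max(char_lens),
        # Share of glosses made of exactly one whitespace-delimited word.
        "one_word_rate": sum(1 for w in word_counts if w == 1) / len(glosses),
        # Character length -> number of glosses with that length.
        "len_histogram": dict(sorted(Counter(char_lens).items())),
    }


def instances_structure(records):
    """Describe what 'instances' holds without assuming its schema."""
    type_counts = Counter(type(r.get("instances")).__name__ for r in records)
    key_counts = Counter()
    sizes = []
    for r in records:
        inst = r.get("instances")
        if isinstance(inst, list):
            sizes.append(len(inst))
            for item in inst:
                if isinstance(item, dict):
                    key_counts.update(item.keys())
    return {
        "types": dict(type_counts),  # Python type of each 'instances' value
        "sizes": sizes,              # entries per gloss, where it is a list
        "keys": dict(key_counts),    # keys seen inside dict entries
    }


# Tiny made-up sample so the sketch runs without the real file.
sample = [
    {"gloss": "book", "instances": [{"video_id": "a1", "split": "train"}]},
    {"gloss": "thank you", "instances": [{"video_id": "b1"}, {"video_id": "b2"}]},
    {"gloss": "drink", "instances": []},
]

print(gloss_length_summary(sample))
print(instances_structure(sample))
```

On the real data, `gloss_length_summary(load_records(path))` should reproduce the cited row count, uniqueness, and one-word rate if the assumptions hold. The `types` and `keys` counts from `instances_structure` are a quick way to see whether 'instances' is uniformly a list of dicts before writing a proper parser for it.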