{"columns":[{"alerts":[{"code":"multilingual","level":"info","message":"31 languages detected in sample"},{"code":"allcaps","level":"info","message":"16.9% rows are all-caps"}],"column":"text","extras":{"language_counts":{"__engine":"fasttext:4,768","als":2,"ar":9,"bg":3,"ca":6,"cs":10,"de":108,"el":11,"en":3309,"eo":4,"es":71,"et":2,"fi":13,"fr":78,"hi":2,"id":11,"it":30,"ja":656,"ko":125,"nl":46,"no":3,"pl":12,"pt":78,"ru":30,"sr":3,"sv":7,"th":34,"tr":33,"uk":3,"vi":7,"zh":37},"language_sample_size":5000,"length_histogram":{"counts":[11042,12354,11078,8177,7049,6309,5146,4495,3865,3459,3244,2592,2219,2065,1867,1708,1624,1467,1439,1439,1563,1657,4862,33,261,3,5,1,2,9,3,0,1,1,0,0,0,0,0,1],"edges":[1.0,14.1,27.2,40.3,53.4,66.5,79.6,92.7,105.8,118.89999999999999,132.0,145.1,158.2,171.29999999999998,184.4,197.5,210.6,223.7,236.79999999999998,249.9,263.0,276.09999999999997,289.2,302.3,315.4,328.5,341.59999999999997,354.7,367.8,380.9,394.0,407.09999999999997,420.2,433.3,446.4,459.5,472.59999999999997,485.7,498.8,511.9,525.0]},"near_unique":false,"sample":["Un client arr\u00eat\u00e9 apr\u00e8s avoir poignard\u00e9 son livreur de repas\n\n\"Un livreur de repas a \u00e9t\u00e9 gri\u00e8vement bless\u00e9 lors d\u2019une tentative de meurtre dans la nuit de dimanche \u00e0 lundi, \u00e0 B\u00fclach. Le...\"\n\nhttps://www.20min.ch/fr/story/buelach-zh-un-client-arrete-apres-avoir-poignarde-son-livreur-de-repas-103470448","F\u0131nd\u0131\u011f\u0131m bu giboyla ilgili itiraf etmek istedi\u011fin bir \u015fey varsa tam vakti \ud83d\ude05 Dm kutuma gel anlat,s\u00f6z bende kalacak anlatt\u0131klar\u0131n \ud83d\ude05\ud83d\ude05","-#strongertogether","Tu b\u2019Shevat is approaching rapidly. Nit to put any pressure on you, but\u2026","Someone mentioned it in my timeline, and so I just rewatched \"The Blue Carbuncle\", from The Adventures of Sherlock Holmes (1984) with Jeremy Brett.  
\n\nIt's a great Christmas story and Jeremy Brett is without question the best Holmes ever.\n\nI found it on Britbox","https://trecome.info/articles/89cfe941-6cf1-45ae-8626-cc0241375b46\n\u3010\u65b0\u7740\u8a18\u4e8b\u3011\n\u5b87\u5b99\u30b9\u30c6\u30fc\u30b7\u30e7\u30f3\u306f\uff62\u7d44\u307f\u7acb\u3066\u308b\uff63\u6642\u4ee3\u304b\u3089\uff62\u4e00\u767a\u3067\u5e83\u3052\u308b\uff63\u6642\u4ee3\u3078\uff1f","Happy Christmas Eve, sweet Flanoy! \n\ud83c\udf84\ud83e\udde1\ud83d\udc08","THESE FUCKERS SERIOUSLY COULDN\u2019T WAIT ONE DAY!? /vneg\n#dandysworld","\uc5b4\ub51c \uac00\uc9c0.....","Made in hckr.fr \ud83c\udff4\u200d\u2620\ufe0f\ud83d\udda4 Le genre de petit message qui me fait chaud au c\u0153ur.","\"Why would you have to shrink me to swallow me?\"\n\"It's a lot to swallow. I would know\"\n\na","Yea Newage is great: it's pricey, but you get what you pay for and the quality and engineering are top notch \n\nThe dinobots are great, as is their Jetfire mold and their Blaster mold\n\nWould advise avoiding their Galvatron mold, though: unfortunately he's a bit of a dud and the paint chips easily","we are Energy we are Power we can conquer the world","Un oui que le Tessin redoute\n\n\"L\u2019acceptation de l\u2019initiative visant \u00e0 diviser par deux le financement de la SSR aurait des r\u00e9percussions \u00e9conomiques plus importantes en Suisse italienne qu\u2019en Suisse romande et en Suisse al\u00e9manique, ...\"\n\nhttps://lecourrier.ch/2025/12/21/un-oui-que-le-tessin-redoute/","\u660e\u65e5\u3001\u96e8\u3068\u98a8\u304c\u5f37\u3081\u306a\u306e\u304b\u3001Flood Watch\u3068Wind Advisory\u304b\u3089\u901a\u77e5\u304c\u3067\u3066\u305f\u3002\u505c\u96fb\u306b\u306f\u306a\u3089\u306a\u3044\u307b\u3069\u3067\u3042\u3063\u3066\u307b\u3057\u3044\u3002","No no you've mentioned it before. I had some problems with my kidneys not too long ago thanks to not taking care of my blood pressure. I know it's not the same but the danger I was told I was in was... 
yanno.\n\nWish I knew what you looked like so I could imagine what more loss would do to you \ud83d\udc40","And people say 10 void is unbeatable \ud83e\udd2d we like the thicc boys (and Illaoi)","\ud83c\udfb6 So much for a \"Merry Christmas\" \ud83c\udfb6","Pr\u016fzkum na dne\u0161n\u00ed den: V\u00e1noce mezi n\u00e1mi\n\n#pruzkum","(FOTD, cont.) there\u2019s no room for those Keith and Jerry solos in a fast FOTD.\n\nTypically great Brown-Eyed Women and then a remarkable Let It Grow, in some ways the quintessential Let It Grow for me\n\nGetting to the 2nd set\u2014 love Jerry\u2019s power chord that kicks off PITB out of Sunrise","Meanwhile, Seattle can't stop homeless deaths; over 200 this year\nHow ironic","#pareridiparte #ritardi #manovra","No president\u2026no matter how powerful or popular\u2026is above the law. \n\nopen.substack.com/pub/thejackh...","\u6614wiifit\u304b\u306a\u3093\u304b\u3067\u30c0\u30a4\u30d3\u30f3\u30b0\u30b2\u30fc\u30e0\u3084\u3063\u305f\u3068\u304d\u306b\u30d2\u30c8\u30c7\u304c\u6016\u3044\u3053\u3068\u306b\u6c17\u4ed8\u3044\u305f \u306a\u305c\u304b\u306f\u308f\u304b\u3089\u306a\u3044 \u661f\u306e\u7802\u3068\u304b\u3082\u82e6\u624b\u3060\u304b\u3089\u3001\u81ea\u7136\u7269\u306a\u306e\u306b\u661f\u578b\u3057\u3066\u308b\u306e\u304c\u6c17\u6301\u3061\u60aa\u304b\u3063\u305f\u306e\u304b\u306a","Feed: \"Daily Post Nigeria\"\nBy: Winner James on Tuesday, December 23, 2025","\u3042\u3042\u3042\u3042\u3042\u3042\u3042\u3042\u3042\u3042\u3042\u3042\u3042\u3042\u3042\u3042\u3042\u3042\u3042\u3042","\u307f\u3066\u30fc\u30fc\u30fc\u30fc\uff01\uff01\uff01\uff01\u304b\u308f\u3044\u3044\u30fc\u30fc\u30fc\u30fc\uff01\uff01\uff01\uff01\uff01","who wants to take bets on how long before dementia donny slips up and says something like \"[it's] not a crime. 
and if it was a crime, it doesn't matter because i'm the president and the supreme court says anything i do is legal\" ?\nand yet, even if he did confess, the cult would defend him","@gothtatertot.bsky.social"," ","\u6b63\u76f4\u6614\u304b\u3089\u6b32\u3057\u304b\u3063\u305f\u306e\u3067\u3069\u3063\u304b\u306e\u30bf\u30a4\u30df\u30f3\u30b0\u3067\u53d6\u308a\u305f\u3044\u306a\uff5e\u3063\u3066\u6c17\u6301\u3061\u306f\u3042\u308a\u307e\u3059(\u306a\u304a\u30d0\u30a4\u30c8\u00d72\u3068\u30d0\u30f3\u30c9\u3067\u305d\u3093\u306a\u4f59\u88d5\u304c\u7121\u3044)","Same, but without the SPY","on the suit give her a pin of a random pride flag that would fit what anyone has said so far","That\u2019s a marvelously innovative solution to Phoenix\u2019s water shortage!","Yup, planning to power through the game and hopefully listen by early Jan\ud83e\udd1e \n\nHappy holidays, Fine Time crew! \ud83c\udf84\ud83c\udf81\n\nMy thoughts are with Aaron rn, as I'm sure yours are too. Wishing him the best in this tough time.","Tyler Kolek delivers Knicks silver lining with career night in road loss","Yes.\n\nI also learned from my parents that gifts are quite transactional at times.  And that you have to keep score.  Because reciprocation is a statement, but obviously, non-reciprocation is also a statement.\n\nOf course, exceptions are made for families in \u55aa\u4e2d (mourning).","\ud83d\ude29 what a beauty!!! Perfection!","The man is simply not that smart. 
Arrogance and ignorance are often the two sides of the same coin.","hi :3","\u571f\u65b9\u3055\u3093\u306f\u52d5\u63fa\u3059\u308b\u3053\u3068\u3042\u308b\u306e\u304b\u306a\u301c\u3002\u3059\u3093\u3054\u3044\u6b63\u6c17\u3092\u5931\u3046\u307b\u3069\u306e\u52d5\u63fa\u3092\u3059\u308c\u3070\u3044\u3044\u3068\u601d\u3044\u307e\u3059\u3045\uff08\uff1f\uff1f\uff1f\uff1f\n\uff08\u65e9\u304f\u571f\u65b9\u3055\u3093\u30eb\u30fc\u30c8\u898b\u306b\u884c\u3051\u3070\u3044\u3044\u306e\u306b\u79c1\uff09\n#\u3078\u304d\u3055\u308a\u3085\u3046\u307f\u3053","Yes YES AND YES. \ud83d\udc99\ud83d\udc99\ud83d\udc99\ud83d\udc99\ud83d\udc99 I only post Blue hearts b/c red hearts don't exist. I wish Bluesky gave us this option.","Unlocking Innovation and Securing Knowledge: The Xerox-Stack Overflow Blueprint for Enterprise\u00a0Collaboration\n\nAt AllSafeUs Research Labs, we constantly monitor trends that shape the future of enterprise security and operational excellence. The recent insights into how Xerox leveraged an internal\u2026","Literally all of my emails and phone calls are about late fees and that I have no money, every bank is angry at me, etc. the ratio of that signal to noise with radio silence from people and the great two weeks of doom that is Christmas is a lot. I continue to eke out rent via credit. It\u2019s inevitable","\u72e9\u308a\u30c7\u3059\u3002","i finally have a laptop new enough to use my ipad as a screen extender so i can use csp on there lfggggggg","I'll wait a thousand years just to see you smile again.","Damn it. I've GOT to stop them. But what if I can't? There was no point thinking about it, but he did anyway. Could he run for it? He'd run from Diavolo, knowing he didn't stand a chance. But Giorno Giovanna had defeated Diavolo. 
Could he get away from him?","Romance is where the big money is, they will try the worst possible of the crap on you first.","\u5168\u4eba\u985e\u5e7c\u5973\u306b\u306a\u3089\u306d\u3047\u304b\u306a\u3041"],"top_values":[["\u0e40\u0e27\u0e47\u0e1a\u0e15\u0e23\u0e07\u0e17\u0e35\u0e48\u0e19\u0e48\u0e32\u0e40\u0e0a\u0e37\u0e48\u0e2d\u0e16\u0e37\u0e2d API \u0e41\u0e17\u0e49 \u0e2d\u0e31\u0e19\u0e14\u0e31\u0e1a1\u0e02\u0e2d\u0e07\u0e44\u0e17\u0e22 \u0e1d\u0e32\u0e01 \u0e16\u0e2d\u0e19 \u0e14\u0e49\u0e27\u0e22\u0e23\u0e30\u0e1a\u0e1a\u0e2d\u0e2d\u0e42\u0e15\u0e49\n\u0e41\u0e2d\u0e14\u0e21\u0e34\u0e19\u0e1a\u0e23\u0e34\u0e01\u0e32\u0e23 24\u0e0a\u0e21.\n\u0e2a\u0e21\u0e31\u0e04\u0e23\u0e23\u0e31\u0e1a\u0e1f\u0e23\u0e35 422 \u0e1a\u0e32\u0e17 rebrand.ly/889a531\n\u0e2a\u0e21\u0e31\u0e04\u0e23\u0e23\u0e31\u0e1a\u0e1f\u0e23\u0e35 422 \u0e1a\u0e32\u0e17 rebrand.ly/889a531\n#\u0e23\u0e31\u0e1a\u0e1f\u0e23\u0e35\n#422\n#\u0e40\u0e04\u0e23\u0e14\u0e34\u0e15\u0e1f\u0e23\u0e35\n#\u0e41\u0e08\u0e01\u0e40\u0e04\u0e23\u0e14\u0e34\u0e15\u0e1f\u0e23\u0e35\n#\u0e04\u0e32\u0e2a\u0e34\u0e42\u0e19\u0e2a\u0e14\n#\u0e40\u0e04\u0e23\u0e14\u0e34\u0e15\u0e1f\u0e23\u0e35\u0e41\u0e04\u0e48\u0e2a\u0e21\u0e31\u0e04\u0e23 #\u0e42\u0e04\u0e49\u0e14\u0e40\u0e04\u0e23\u0e14\u0e34\u0e15\u0e1f\u0e23\u0e35 #\u0e40\u0e04\u0e23\u0e14\u0e34\u0e15\u0e1f\u0e23\u0e35\u0e44\u0e21\u0e48\u0e15\u0e49\u0e2d\u0e07\u0e1d\u0e32\u0e01\u0e44\u0e21\u0e48\u0e15\u0e49\u0e2d\u0e07\u0e41\u0e0a\u0e23\u0e4c #\u0e40\u0e27\u0e47\u0e1a\u0e15\u0e23\u0e07\u0e2a\u0e25\u0e47\u0e2d\u0e15 #\u0e40\u0e04\u0e23\u0e14\u0e34\u0e15\u0e1f\u0e23\u0e3550 #\u0e2a\u0e25\u0e47\u0e2d\u0e15\u0e41\u0e15\u0e01\u0e2b\u0e19\u0e31\u0e01 
#\u0e2a\u0e25\u0e47\u0e2d\u0e15\u0e1f\u0e23\u0e35",256],["\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc
11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11",224],["bsky.app/profile/dark...",64],[" ",57],["If you missed the pulled 60 Minutes report on CECOT, you can view it here. \n\nPlease share widely. \n\narchive.org/details/60mi...",42],["Christmas (Please Come Home)/Darlene Love bass & vocal by Cara\n\nVery easy on bass, challenging to sing! 
youtu.be/c4Bg3DPpOU8 \nI'm going to taking 24th-25th off so Merry Christmas and Happy Holidays  #christmas #darlenelove #philspector #ronniespector",41],["\ud83d\ude02",40],["\ud83d\udc40",37],["Can I trouble you to subscribe to my nature-comedy YouTube channel? \ud83d\ude4f \n\nyoutu.be/R9CdYuf-JaE?...",33],["\u2764\ufe0f",31],["\ud83e\udd23",30],["\ud83d\ude0d",27],["Hold our Navy responsible! open.substack.com/pub/growingu...",27],["G\u00fcnayd\u0131n \u2728\ufe0f\ud83d\ude0a\u2728\ufe0f\u2615\ufe0f\u2728\ufe0f",26],["\ud83e\udec2",24],["\ud83e\udd23\ud83e\udd23\ud83e\udd23",22],["bsky.app/profile/phoe...",22],["\ud83c\udff7\ufe0f Special Price: Product\n\u2728 Save money today!\n\n\ud83d\udc46 Check it out!",21],["#shadowicexploit #shadowic #shadowintegration #shadows #shadowsense #shadowiclineexploit #lastpostabouthtelines #shadowictrinity\n#shadowicazazel #shadowicelizabeth #shadowicthomas #shadowicsolomon #shadowiccrowley\n#shadowic2020 #shadowic2025  #shadowix #shift #massshift #anon #shadowanon 
#anonymous",21],["\ud83d\ude02\ud83d\ude02\ud83d\ude02",20]],"top_words":[["the",7424],["a",4857],["to",4818],["i",4158],["and",4144],["of",3490],["in",2730],["is",2670],["for",2358],["you",1965],["it",1860],["that",1796],["on",1745],["this",1526],["my",1435],["with",1374],["but",1150],["be",1082],["so",1036],["have",977],["was",966],["at",939],["-",913],["not",903],["are",897]],"vocab_skipped":null,"word_histogram":{"counts":[43756,17836,13187,7566,6713,3921,3682,2614,1521,182,40,8,4,3,1,2,0,0,0,0,0,1,0,0,0,0,1,0,1,1],"edges":[1.0,7.466666666666667,13.933333333333334,20.4,26.866666666666667,33.333333333333336,39.8,46.266666666666666,52.733333333333334,59.2,65.66666666666667,72.13333333333334,78.6,85.06666666666666,91.53333333333333,98.0,104.46666666666667,110.93333333333334,117.4,123.86666666666667,130.33333333333334,136.8,143.26666666666668,149.73333333333335,156.2,162.66666666666666,169.13333333333333,175.6,182.06666666666666,188.53333333333333,195.0]}},"kind":"text","n":101040,"n_null":0,"n_unique":95935,"null_rate":0.0,"stats":{"allcaps_rate":0.1690716547901821,"boilerplate_rate":0.001049089469517023,"duplicate_rate":0.05052454473475851,"emoji_rate":0.1832343626286619,"len_max":525,"len_mean":97.62657363420428,"len_median":68.0,"len_min":1,"len_p95":290.0,"n_duplicates":5105,"n_empty":0,"one_word_rate":0.189944576405384,"readability_flesch_mean":64.09147328214267,"url_rate":0.07586104513064133,"vocab_size":77183,"word_mean":14.234619952494063,"word_median":10.0}},{"alerts":[{"code":"duplicates","level":"warn","message":"56.5% duplicate strings"},{"code":"short_text","level":"info","message":"95th-percentile length under 20 chars"},{"code":"one_word","level":"warn","message":"100.0% rows are a single 
word"}],"column":"author_did_hash","extras":{"language_counts":{},"language_sample_size":5000,"length_histogram":{"counts":[0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,101040,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0],"edges":[15.5,15.525,15.55,15.575,15.6,15.625,15.65,15.675,15.7,15.725,15.75,15.775,15.8,15.825,15.85,15.875,15.9,15.925,15.95,15.975,16.0,16.025,16.05,16.075,16.1,16.125,16.15,16.175,16.2,16.225,16.25,16.275,16.3,16.325,16.35,16.375,16.4,16.425,16.45,16.475,16.5]},"near_unique":false,"sample":["203b2f94ca34ad57","e3fb7462b68ce168","8b80d746cd58f608","74e2cbc89edd37a6","ed4f29630f55ae1d","039de54e8bef8899","61ee7267320497ee","b464fb16192641fa","7422e82a369d2ace","8dc83ec255dde07a","6d6760c7aa86e8e3","66ad99c8fcf5f811","9e4716810429aeca","203b2f94ca34ad57","760203400659f56f","a647051605b55e99","65a0490266af6cea","adeb848bd5142712","fa2043d9fdf56be2","bb678261b563da06","631c8517a01c3e9b","afdcf7421bf5ff34","43b058474ce580d4","e219c042f81df881","634772531b7617e2","05ca1924909285a8","79c19a068331cdf9","5d691986621a5b29","c9abf11307ab8693","95a5305808625469","52806bc5b3077c99","552942e53108b579","510f724fbec56084","236843991ae6f5ff","81c0251ad5a83dce","02f769fbad1b3d05","28e076f5430b3198","b92dfe84ea4b3589","eb452ce2479900da","d1df157966160af1","2d1566f8fd35d0f1","e0a88bbddc0314d0","0e73a5b8648607e1","8b3de10e35e304a9","14d93f819a37b7a8","658d8e90ac98f9ba","af96045733cb7caf","b015d70803283e70","4bf2aa20366a489e","05ca1924909285a8"],"top_values":[["634772531b7617e2",1016],["fb4e916ee2673591",726],["7bef67724621686b",590],["6c53c0fac294c5a4",549],["203b2f94ca34ad57",455],["2ea2a3bb1eb67cd7",411],["31165f7346de9da8",391],["c161bb58161ffd89",255],["ae71c0ad4484309f",232],["5d2b4f7adcec93f3",227],["f96d8b230da452cf",193],["b1df12534bde5974",172],["779ea61a1d359785",157],["4f563f4b2171b9c5",140],["3e5f82241b71d8e7",90],["c63b71e5907c2f34",77],["ce5433548d273f29",73],["6531a5cf8f5fe8d0",73],["0fb2807d4a383d70",72],["2f4d663477771c99",70]],"top_words":[["634772531b7617e2",
181],["fb4e916ee2673591",148],["6c53c0fac294c5a4",113],["7bef67724621686b",108],["203b2f94ca34ad57",86],["31165f7346de9da8",72],["2ea2a3bb1eb67cd7",71],["5d2b4f7adcec93f3",50],["ae71c0ad4484309f",50],["c161bb58161ffd89",38],["b1df12534bde5974",32],["779ea61a1d359785",31],["f96d8b230da452cf",30],["4f563f4b2171b9c5",21],["f0ff207f7a7a0939",20],["2f4d663477771c99",19],["6531a5cf8f5fe8d0",19],["2221340be707fd97",18],["76ff98d8b589b65c",18],["1bbb376d1f618f2c",17],["f5c2ce0d362d0ebd",16],["63fc19a8e5da7ebf",16],["2b6aeb9dd9232866",15],["2ff5866b1c8ec5c3",15],["b2a3bb0389254922",15]],"vocab_skipped":null,"word_histogram":{"counts":[0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,101040,0,0,0,0,0,0,0,0,0,0,0,0,0,0],"edges":[0.5,0.5333333333333333,0.5666666666666667,0.6,0.6333333333333333,0.6666666666666666,0.7,0.7333333333333334,0.7666666666666666,0.8,0.8333333333333333,0.8666666666666667,0.9,0.9333333333333333,0.9666666666666667,1.0,1.0333333333333332,1.0666666666666667,1.1,1.1333333333333333,1.1666666666666665,1.2,1.2333333333333334,1.2666666666666666,1.3,1.3333333333333335,1.3666666666666667,1.4,1.4333333333333333,1.4666666666666668,1.5]}},"kind":"text","n":101040,"n_null":0,"n_unique":43998,"null_rate":0.0,"stats":{"allcaps_rate":0.0003463974663499604,"boilerplate_rate":0.0,"duplicate_rate":0.5645486935866983,"emoji_rate":0.0,"len_max":16,"len_mean":16.0,"len_median":16.0,"len_min":16,"len_p95":16.0,"n_duplicates":57042,"n_empty":0,"one_word_rate":1.0,"readability_flesch_mean":68.34500000000003,"url_rate":0.0,"vocab_size":13938,"word_mean":1.0,"word_median":1.0}},{"alerts":[{"code":"near_unique","level":"info","message":"100.0% of rows are unique strings"},{"code":"short_text","level":"info","message":"95th-percentile length under 20 chars"},{"code":"one_word","level":"warn","message":"100.0% rows are a single 
word"}],"column":"uri_hash","extras":{"language_counts":{},"language_sample_size":5000,"length_histogram":{"counts":[0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,101039,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0],"edges":[15.5,15.525,15.55,15.575,15.6,15.625,15.65,15.675,15.7,15.725,15.75,15.775,15.8,15.825,15.85,15.875,15.9,15.925,15.95,15.975,16.0,16.025,16.05,16.075,16.1,16.125,16.15,16.175,16.2,16.225,16.25,16.275,16.3,16.325,16.35,16.375,16.4,16.425,16.45,16.475,16.5]},"near_unique":true,"sample":["1a925eae4a68e954","9c7b35c448e9f56a","00000da0008897eb","cd823fdacd02b11c","0a525ed50b0474f2","175ab73228973fa3","60f3a1a69409b7ae","e9e0e481dbe7f266","d49a9bf37ba42904","ef51be69a5ee76f8","c60d77c71a6e8368","9e5ad9a403761b25","f783b2ea36e56ac1","5bf20312746a2a9a","421b2c913026eb9c","826a317da4197bb7","5df1f3a26eb43121","ccb51fe4f8c3f397","1f54bdfdf263b300","7cf22e593d9bc6ed","4f766a3bb7833dfa","4ee72d67c35a55cb","f703f2dc01779a54","29cd0823f2d3c76b","f67222e907209d80","e95d5a4652824780","a06c815cda779e65","7ae4569d81570275","9f59f75daa68b0ee","fafcc6b305bc62cd","d21e0678f85c5ec4","f081102ab401665e","720720a55becacb2","086c1ebf93747eeb","2b3f0aa42419125e","54c7b27aad00789d","55eeaadb1cab926e","beaef59a094b6c60","c55b838089921f4a","1f4cdfd2fa3601a0","dbf5ae46203822ae","7e17c82d0b973d79","f4a242fe88642345","a1f7780887eefee9","6baebdcc74bfec02","c3f9d4f4eb24f31b","a42dd84d3bfea1c4","ecb6795f67203eb1","3bc0cf5cc09c1830","9c97e61614ef1444"],"top_values":[],"top_words":[["5684957e422e6ba3",1],["0f88f07348d28f4d",1],["82a16f69d6d20ec1",1],["694ec42695658643",1],["c8f5155b33e96150",1],["df10ce1838069b85",1],["87a923ccc4d8705d",1],["518b0a65f315d1b7",1],["2eb75fb69c2ac4c1",1],["365e1a015347b48d",1],["2cc514a3b35c378a",1],["14166f86704326c6",1],["f10c29ce1abdce0c",1],["cc8f953e97d31653",1],["1ce8d0e81f36c45e",1],["7e4f909df5fe925d",1],["87c68c39a12afe72",1],["434a59da1aefd3ce",1],["e7a81bac740bd1dd",1],["f64c88672bc950e7",1],["1a5b2bfe43eafef9",1],["3a8f1dcd008d5fcd",1],["182fa17deab05a
1c",1],["21188534628a908f",1],["15b9277b7f3fda32",1]],"vocab_skipped":null,"word_histogram":{"counts":[0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,101039,0,0,0,0,0,0,0,0,0,0,0,0,0,0],"edges":[0.5,0.5333333333333333,0.5666666666666667,0.6,0.6333333333333333,0.6666666666666666,0.7,0.7333333333333334,0.7666666666666666,0.8,0.8333333333333333,0.8666666666666667,0.9,0.9333333333333333,0.9666666666666667,1.0,1.0333333333333332,1.0666666666666667,1.1,1.1333333333333333,1.1666666666666665,1.2,1.2333333333333334,1.2666666666666666,1.3,1.3333333333333335,1.3666666666666667,1.4,1.4333333333333333,1.4666666666666668,1.5]}},"kind":"text","n":101040,"n_null":1,"n_unique":101039,"null_rate":9.897070467141727e-06,"stats":{"allcaps_rate":0.0005245499262660953,"boilerplate_rate":0.0,"duplicate_rate":0.0,"emoji_rate":0.0,"len_max":16,"len_mean":16.0,"len_median":16.0,"len_min":16,"len_p95":16.0,"n_duplicates":0,"n_empty":0,"one_word_rate":1.0,"readability_flesch_mean":69.61400000000003,"url_rate":0.0,"vocab_size":20000,"word_mean":1.0,"word_median":1.0}},{"alerts":[{"code":"short_text","level":"info","message":"95th-percentile length under 20 chars"},{"code":"null_rate","level":"warn","message":"57.7% null"},{"code":"one_word","level":"warn","message":"100.0% rows are a single 
word"}],"column":"reply_parent_hash","extras":{"language_counts":{},"language_sample_size":5000,"length_histogram":{"counts":[0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,42770,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0],"edges":[15.5,15.525,15.55,15.575,15.6,15.625,15.65,15.675,15.7,15.725,15.75,15.775,15.8,15.825,15.85,15.875,15.9,15.925,15.95,15.975,16.0,16.025,16.05,16.075,16.1,16.125,16.15,16.175,16.2,16.225,16.25,16.275,16.3,16.325,16.35,16.375,16.4,16.425,16.45,16.475,16.5]},"near_unique":false,"sample":["fc2267f29dd1a492","6b56ce9644d8dcfc","701912916dd3aecb","f16b66c1507d3da9","63ea68b3eabeb6c5","2e341c64d79713f6","bfbbd6900834f900","dd990c5f31cc4ea6","3b2f41bfb941204a","a5ba750d7bf30263","8f8158886219d809","a1c0dc65878bddf4","761a8b8fdccf274c","ff199266a712c936","9654108e92a10dc3","0d63acd9fb1cd064","edd2fa096e8f1acd","979cb6872a8381c3","c3a452f029201028","03eb863e04aa2809","d138f778bc427e59","bdfd3b4bdea8e6f5","be3960ec846e3ec7","a0653755bcad3a93","0d00d9c76bb2df35","ce3faccd3dc1fbf4","61a677bb4fe941b8","cad0660ae64abe4d","c7b4674638507471","1dcc2e279114d2c4","049b10017b1065bf","63f9f4b87bcc0e12","0c16af442fea883c","8a6fcf8af1666afa","114dcf9d942a7f30","32c4ebcad16e8dae","c925c7cd193e26f8","68063c9592df8549","a313886af42c0c38","5f45c8d8175af57d","d4a33f28a6573229","6167f628ae7f437f","0195c50ea2389a16","88666396c1ed98f4","cdf55822380da919","3fea641a8d29d974","219cfd754b18c6f8","86176d3c50983c85","d45a442f16b20217","0e25a874dc39e212"],"top_values":[["04a1db17fbc9ff3a",121],["63f9f4b87bcc0e12",120],["5d60ee5d282843bb",72],["c66d243660d15680",66],["701912916dd3aecb",63],["64481eb4185ef487",58],["3fea641a8d29d974",55],["ece7c6a36c75292d",54],["8071d6d751bc0d25",40],["3783af2182fe114c",36],["2b421be421c362bc",36],["c5e129a05bcd1d8e",35],["b80da3d51026ffef",34],["9340e77013ea61e3",34],["b86f724d032846fa",34],["c376b13fa813b268",33],["93cf5eee050ee084",33],["c7c829a8ea452941",33],["0cc534242e254bf0",31],["9b02c4bf6a573e76",30]],"top_words":[["04a1db17fbc9ff3a",55],["63f9f4
b87bcc0e12",46],["5d60ee5d282843bb",37],["64481eb4185ef487",30],["c66d243660d15680",29],["701912916dd3aecb",28],["3fea641a8d29d974",26],["ece7c6a36c75292d",24],["b86f724d032846fa",20],["8071d6d751bc0d25",20],["2b421be421c362bc",19],["c7c829a8ea452941",17],["3783af2182fe114c",17],["c376b13fa813b268",16],["c5e129a05bcd1d8e",16],["93cf5eee050ee084",15],["9b02c4bf6a573e76",15],["86b70746bf58709b",14],["0cc534242e254bf0",14],["de84adf5c370090f",14],["b80da3d51026ffef",12],["6905d954d2274dbb",12],["7f1bd1feeb3a697d",12],["f1ee1162bcc48dd2",12],["9340e77013ea61e3",11]],"vocab_skipped":null,"word_histogram":{"counts":[0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,42770,0,0,0,0,0,0,0,0,0,0,0,0,0,0],"edges":[0.5,0.5333333333333333,0.5666666666666667,0.6,0.6333333333333333,0.6666666666666666,0.7,0.7333333333333334,0.7666666666666666,0.8,0.8333333333333333,0.8666666666666667,0.9,0.9333333333333333,0.9666666666666667,1.0,1.0333333333333332,1.0666666666666667,1.1,1.1333333333333333,1.1666666666666665,1.2,1.2333333333333334,1.2666666666666666,1.3,1.3333333333333335,1.3666666666666667,1.4,1.4333333333333333,1.4666666666666668,1.5]}},"kind":"text","n":101040,"n_null":58270,"n_unique":34738,"null_rate":0.5767022961203484,"stats":{"allcaps_rate":0.0007949497311199439,"boilerplate_rate":0.0,"duplicate_rate":0.18779518353986438,"emoji_rate":0.0,"len_max":16,"len_mean":16.0,"len_median":16.0,"len_min":16,"len_p95":16.0,"n_duplicates":8032,"n_empty":0,"one_word_rate":1.0,"readability_flesch_mean":71.72900000000003,"url_rate":0.0,"vocab_size":17415,"word_mean":1.0,"word_median":1.0}},{"alerts":[{"code":"duplicates","level":"warn","message":"50.3% duplicate strings"},{"code":"short_text","level":"info","message":"95th-percentile length under 20 chars"},{"code":"null_rate","level":"warn","message":"57.7% null"},{"code":"one_word","level":"warn","message":"100.0% rows are a single 
word"}],"column":"reply_root_hash","extras":{"language_counts":{},"language_sample_size":5000,"length_histogram":{"counts":[0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,42770,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0],"edges":[15.5,15.525,15.55,15.575,15.6,15.625,15.65,15.675,15.7,15.725,15.75,15.775,15.8,15.825,15.85,15.875,15.9,15.925,15.95,15.975,16.0,16.025,16.05,16.075,16.1,16.125,16.15,16.175,16.2,16.225,16.25,16.275,16.3,16.325,16.35,16.375,16.4,16.425,16.45,16.475,16.5]},"near_unique":false,"sample":["fc2267f29dd1a492","6b56ce9644d8dcfc","701912916dd3aecb","f16b66c1507d3da9","63ea68b3eabeb6c5","2e341c64d79713f6","dc0cf00aab42248a","65f573012a42f37a","152ff36a17b9ab54","2da15f2e55e9a171","8f8158886219d809","a1c0dc65878bddf4","761a8b8fdccf274c","ff199266a712c936","e192ff4b5e6a63cf","0d63acd9fb1cd064","edd2fa096e8f1acd","979cb6872a8381c3","c3a452f029201028","03eb863e04aa2809","8bcd5caa97e16650","c6e9611badba68fb","b7f10d1f67a22882","38da10ae5b7b9ca1","0d00d9c76bb2df35","ce3faccd3dc1fbf4","61a677bb4fe941b8","cad0660ae64abe4d","c7b4674638507471","647c7d83cbf87fd4","049b10017b1065bf","63f9f4b87bcc0e12","0c16af442fea883c","e1fe8c748a6157e7","114dcf9d942a7f30","32c4ebcad16e8dae","d82c366ef0df9d4f","68063c9592df8549","41e8804cc09291c2","5f45c8d8175af57d","d3976f32e7582cdd","ccd2eb68f0536ea5","66fe7374cc0d2ebd","cb704fda8f95f3c3","cdf55822380da919","3fea641a8d29d974","219cfd754b18c6f8","86176d3c50983c85","07afed23dc936e21","0e25a874dc39e212"],"top_values":[["63f9f4b87bcc0e12",151],["04a1db17fbc9ff3a",148],["5d60ee5d282843bb",103],["7af1a48b8d39ed44",101],["20b004b78a60a470",93],["64481eb4185ef487",92],["c66d243660d15680",76],["701912916dd3aecb",76],["c5e129a05bcd1d8e",74],["3783af2182fe114c",73],["9aa8c958e3601e71",69],["3fea641a8d29d974",65],["5e11823c64750ffb",59],["ece7c6a36c75292d",59],["a8b36c1e17ab6e21",57],["5ec6a225f7cdc6e4",57],["19e98969b38fd9fd",57],["a1257c51645e9ed7",57],["c376b13fa813b268",53],["b85f83c856e20b05",53]],"top_words":[["04a1db17fbc9ff3a",66],["63f9f4
b87bcc0e12",58],["7af1a48b8d39ed44",56],["5d60ee5d282843bb",54],["64481eb4185ef487",49],["20b004b78a60a470",41],["701912916dd3aecb",35],["3783af2182fe114c",33],["c66d243660d15680",33],["9aa8c958e3601e71",31],["c5e129a05bcd1d8e",31],["5e11823c64750ffb",29],["19e98969b38fd9fd",29],["3fea641a8d29d974",29],["b85f83c856e20b05",28],["ece7c6a36c75292d",25],["8071d6d751bc0d25",25],["f53a5087ebb15eb9",25],["a1257c51645e9ed7",24],["9b02c4bf6a573e76",23],["c376b13fa813b268",23],["fe819c2f3a5967f2",23],["c7c829a8ea452941",22],["2b421be421c362bc",22],["b80da3d51026ffef",21]],"vocab_skipped":null,"word_histogram":{"counts":[0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,42770,0,0,0,0,0,0,0,0,0,0,0,0,0,0],"edges":[0.5,0.5333333333333333,0.5666666666666667,0.6,0.6333333333333333,0.6666666666666666,0.7,0.7333333333333334,0.7666666666666666,0.8,0.8333333333333333,0.8666666666666667,0.9,0.9333333333333333,0.9666666666666667,1.0,1.0333333333333332,1.0666666666666667,1.1,1.1333333333333333,1.1666666666666665,1.2,1.2333333333333334,1.2666666666666666,1.3,1.3333333333333335,1.3666666666666667,1.4,1.4333333333333333,1.4666666666666668,1.5]}},"kind":"text","n":101040,"n_null":58270,"n_unique":21277,"null_rate":0.5767022961203484,"stats":{"allcaps_rate":0.0008183306055646482,"boilerplate_rate":0.0,"duplicate_rate":0.5025251344400281,"emoji_rate":0.0,"len_max":16,"len_mean":16.0,"len_median":16.0,"len_min":16,"len_p95":16.0,"n_duplicates":21493,"n_empty":0,"one_word_rate":1.0,"readability_flesch_mean":77.22800000000002,"url_rate":0.0,"vocab_size":12498,"word_mean":1.0,"word_median":1.0}},{"alerts":[],"column":"sentiment","extras":{"singletons":0,"top_values":[["neutral",48981],["positive",34622],["negative",17437]]},"kind":"categorical","n":101040,"n_null":0,"n_unique":3,"null_rate":0.0,"stats":{"cardinality":3,"entropy":1.4732925089885032,"entropy_ratio":0.9295440796347906,"top_rate":0.4847684085510689,"top_value":"neutral"}},{"alerts":[{"code":"outliers","level":"warn","message":"5.7% rows beyond 1.5 
IQR"}],"column":"sentiment_score","extras":{"histogram":{"counts":[247,541,705,822,812,874,1081,1067,1130,1234,1387,1127,895,1120,1585,792,726,672,624,48620,358,740,623,781,1426,1343,1581,2179,3463,2801,1918,2382,2518,2004,1886,1870,2266,2011,1758,1071],"edges":[-0.998,-0.94805,-0.8981,-0.84815,-0.7982,-0.74825,-0.6982999999999999,-0.64835,-0.5984,-0.54845,-0.4985,-0.44855,-0.39859999999999995,-0.34865,-0.29869999999999997,-0.24875000000000003,-0.19879999999999998,-0.14884999999999993,-0.09889999999999999,-0.04894999999999994,0.0010000000000000009,0.05095000000000005,0.10089999999999999,0.15084999999999993,0.2008000000000001,0.25075000000000003,0.30069999999999997,0.35065000000000013,0.40060000000000007,0.45055,0.5005,0.5504500000000001,0.6004,0.65035,0.7003000000000001,0.7502500000000001,0.8002,0.85015,0.9001000000000001,0.9500500000000001,1.0]},"sample":[0.44,0.557,0.0,0.735,0.0,-0.296,0.0,0.128,-0.67,0.0,0.509,-0.735,0.612,0.226,0.077,-0.962,0.0,0.0,-0.226,0.527,0.026,0.557,0.0,0.907,0.786,0.0,0.0,-0.572,0.0,-0.557,0.0,0.34,-0.959,0.026,0.0,0.372,0.0,0.0,0.0,0.542,0.0,0.0,0.0,0.0,0.557,0.524,0.0,0.0,-0.309,0.0,-0.178,0.421,0.44,0.0,0.421,0.0,0.296,-0.103,0.0,0.617,0.0,0.637,0.0,0.0,0.0,0.0,0.689,0.67,0.0,0.865,0.0,0.0,0.0,-0.459,-0.571,-0.318,0.0,0.0,-0.318,0.0,0.0,0.0,0.924,0.44,0.0,0.004,0.0,0.0,0.402,0.0,0.0,0.957,0.0,0.542,0.0,0.0,0.742,0.0,0.834,0.0,0.34,0.0,0.0,0.0,0.0,0.0,-0.599,-0.538,0.0,0.0,0.0,0.691,0.0,0.0,0.0,0.402,0.0,0.863,0.0,0.542,0.0,-0.604,0.542,0.0,-0.179,0.128,-0.178,0.0,0.494,0.802,0.0,0.0,0.0,0.0,-0.445,-0.226,0.224,0.0,-0.153,0.34,-0.515,0.0,0.0,0.44,0.318,0.0,0.957,0.0,0.026,0.984,0.0,0.0,-0.057,0.0,0.637,0.188,0.0,0.0,0.0,0.991,0.44,0.226,-0.34,0.0,0.0,0.67,0.612,-0.89,0.0,-0.128,0.0,0.0,0.0,0.0,-0.318,0.266,0.0,-0.922,0.361,0.718,0.0,0.0,0.872,0.0,-0.511,-0.296,0.0,0.0,-0.026,0.0,0.0,0.44,0.115,0.0,0.633,0.0,0.494,0.494,-0.455,0.0,0.026,0.077,0.0,0.0,-0.542,-0.42,-0.951,0.0,0.421,0.0,0.827,0.459,0.0,0.0,0.0,0.904,0.0,0.44,0.0,0.557,0.52
7,0.459,0.226,0.81,0.361,0.0,-0.273,0.0,-0.783,0.0,0.285,0.0,-0.871,0.296,0.0,0.0,0.0,0.394,0.0,0.0,0.0,0.0,0.867,0.727,0.296,0.781,0.0,0.202,-0.866,0.807,0.0,0.0,0.718,0.0,0.0,0.599,0.0,0.674,0.697,-0.361,0.0,0.0,0.0,-0.599,0.0,0.0,-0.359,-0.178,0.0,0.0,0.0,0.0,0.0,0.0,0.0,-0.557,0.0,-0.525,-0.542,0.0,0.872,0.0,0.0,-0.572,0.42,0.0,0.0,0.67,0.077,0.542,0.0,0.0,0.44,0.0,0.0,0.0,0.226,0.0,0.599,0.077,0.0,0.095,0.527,0.0,0.0,-0.2,0.852,0.0,0.0,0.421,0.0,0.337,0.361,0.827,0.052,0.0,0.0,0.0,0.0,0.12,0.34,0.649,0.0,0.511,-0.681,0.128,0.421,0.0,0.0,0.369,0.61,-0.296,0.0,0.0,-0.718,0.85,0.0,0.0,-0.477,0.072,0.0,0.648,0.0,0.077,0.0,0.0,0.433,0.459,-0.557,0.0,0.296,0.612,0.0,0.0,0.0,0.0,0.764,0.0,0.519,0.202,-0.625,-0.261,0.585,0.67,0.0,0.9,0.98,0.0,-0.482,0.625,0.0,0.421,0.0,0.25,0.586,0.34,0.71,0.0,0.0,0.318,0.0,0.0,-0.272,-0.153,-0.772,0.597,0.0,0.979,0.0,0.103,0.296,0.0,0.0,0.0,0.0,0.625,-0.494,0.0,0.0,0.0,-0.402,0.0,-0.421,0.0,0.0,-0.625,0.0,0.0,0.494,0.0,0.845,0.0,-0.987,0.511,-0.929,0.511,0.625,0.0,0.961,0.0,0.0,0.0,-0.772,0.44,0.273,0.95,0.494,0.459,0.103,0.0,0.0,0.0,-0.813,0.0,0.0,0.44,0.329,0.0,0.879,0.452,0.361,0.0,0.318,0.0,0.0,0.494,0.541,-0.599,0.0,0.0,0.875,0.0,0.0,0.813,0.0,0.0,-0.494,0.0,0.957,0.0,0.252,0.1,0.026,0.0,-0.832,0.572,0.572,0.493,0.44,0.0,-0.625,-0.202,0.0,0.887,0.0,0.586,0.0,-0.475,0.0,0.128,0.0,-0.471,0.0,0.0,0.0,0.0,0.916,0.0,0.696,0.0,0.425,0.542,0.637,0.0,0.0,0.0,0.0,0.0,-0.421,0.74]},"kind":"numeric","n":101040,"n_null":0,"n_unique":1928,"null_rate":0.0,"stats":{"iqr":0.402,"kurtosis":0.01774018153532797,"max":1.0,"mean":0.10737404988123513,"median":0.0,"min":-0.998,"n_outliers":5763,"outlier_rate":0.05703681710213777,"q1":0.0,"q3":0.402,"skew":0.01861160445986652,"std":0.41035286023047746,"zero_rate":0.4779790182106097}},{"alerts":[{"code":"near_unique","level":"info","message":"95.6% of rows are unique strings"},{"code":"one_word","level":"warn","message":"100.0% rows are a single word"},{"code":"allcaps","level":"info","message":"100.0% 
rows are all-caps"}],"column":"created_at","extras":{"language_counts":{},"language_sample_size":5000,"length_histogram":{"counts":[2195,0,0,0,0,1,0,0,3,0,88967,0,0,2823,0,0,1,0,2202,0,0,118,0,0,1342,0,61,0,0,0,0,0,3296,0,28,0,0,0,0,3],"edges":[20.0,20.375,20.75,21.125,21.5,21.875,22.25,22.625,23.0,23.375,23.75,24.125,24.5,24.875,25.25,25.625,26.0,26.375,26.75,27.125,27.5,27.875,28.25,28.625,29.0,29.375,29.75,30.125,30.5,30.875,31.25,31.625,32.0,32.375,32.75,33.125,33.5,33.875,34.25,34.625,35.0]},"near_unique":true,"sample":["2025-12-15T15:02:56.000000Z","2025-12-24T05:46:28.199Z","2025-12-24T05:53:52.540Z","2025-12-24T05:51:06.770Z","2025-12-24T05:24:05.186Z","2025-12-24T06:00:49.556+00:00","2025-12-24T05:25:08.535Z","2025-12-24T05:56:08.507Z","2025-12-24T05:51:12.695Z","2025-12-24T05:00:11.869Z","2025-12-24T05:36:17.493Z","2025-12-24T05:31:13.198Z","2025-12-24T05:14:27.095Z","2025-12-21T18:10:24.000000Z","2025-12-24T05:15:16.096Z","2025-12-24T05:08:42.328Z","2025-12-24T05:03:04.652Z","2025-12-24T05:52:01.731Z","2025-12-24T07:00:05+01:00","2025-12-24T05:22:31.315Z","2025-12-24T05:16:06.574Z","2025-12-24T05:00:15.066Z","2025-12-24T05:07:13.958Z","2025-12-24T05:03:23.481Z","2025-12-24T05:42:34.395429+00:00","2025-12-24T05:47:06.407Z","2025-12-24T05:32:40.305Z","2025-12-24T05:08:58.084Z","2025-12-24T05:01:58.611Z","2025-12-24T06:00:44.69651400Z","2025-12-24T05:39:36.521Z","2025-12-24T05:40:02.871Z","2025-12-24T05:40:02.315Z","2025-12-24T05:21:42.322Z","2025-12-24T05:19:48.019Z","2025-12-24T05:24:45.786Z","2025-12-24T05:58:53.732Z","2025-12-24T05:29:49.642Z","2025-12-24T05:25:29.364Z","2025-12-24T05:30:56.414Z","2025-12-24T05:50:22.290Z","2025-12-24T05:58:02.196Z","2025-12-24T05:14:13Z","2025-12-24T05:57:56.486Z","2025-12-24T06:00:03.501Z","2025-12-24T06:01:00.722Z","2025-12-24T05:18:33.75536200Z","2025-12-24T05:21:42.89654100Z","2025-12-24T05:06:56.863Z","2025-12-24T05:55:01.488Z"],"top_values":[],"top_words":[["2025-12-24t05:30:00.000000z",6],["2025-12-24t05:00:18+00
:00",5],["2025-12-24t05:00:07+00:00",5],["2025-12-24t05:00:00.000z",5],["2025-12-24t06:00:12+00:00",4],["2025-12-24t05:01:25+00:00",4],["2025-12-24t05:10:25.000z",4],["2025-12-24t05:05:26.344z",3],["2025-12-24t05:16:57.000z",3],["2025-12-24t05:00:11+00:00",3],["2025-12-24t05:10:02.000z",3],["2025-12-24t05:15:11+00:00",3],["2025-12-24t05:57:17+00:00",3],["2025-12-24t05:30:19+00:00",3],["2025-12-24t05:10:00.000z",3],["2025-12-24t05:26:24+00:00",3],["2025-12-15t20:40:01.000000z",3],["2025-12-24t05:00:09+00:00",3],["2025-12-24t05:01:24+00:00",3],["2025-12-24t05:00:23+00:00",3],["2025-12-24t05:36:13z",3],["2025-12-24t05:30:10+00:00",3],["2025-12-24t06:00:58z",3],["2025-12-24t05:00:00.000000z",3],["2025-12-24t05:20:09+00:00",3]],"vocab_skipped":null,"word_histogram":{"counts":[0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,101040,0,0,0,0,0,0,0,0,0,0,0,0,0,0],"edges":[0.5,0.5333333333333333,0.5666666666666667,0.6,0.6333333333333333,0.6666666666666666,0.7,0.7333333333333334,0.7666666666666666,0.8,0.8333333333333333,0.8666666666666667,0.9,0.9333333333333333,0.9666666666666667,1.0,1.0333333333333332,1.0666666666666667,1.1,1.1333333333333333,1.1666666666666665,1.2,1.2333333333333334,1.2666666666666666,1.3,1.3333333333333335,1.3666666666666667,1.4,1.4333333333333333,1.4666666666666668,1.5]}},"kind":"text","n":101040,"n_null":0,"n_unique":96576,"null_rate":0.0,"stats":{"allcaps_rate":1.0,"boilerplate_rate":0.0,"duplicate_rate":0.044180522565320665,"emoji_rate":0.0,"len_max":35,"len_mean":24.344883214568487,"len_median":24.0,"len_min":20,"len_p95":27.0,"n_duplicates":4464,"n_empty":0,"one_word_rate":1.0,"readability_flesch_mean":121.22000000000004,"url_rate":0.0,"vocab_size":19720,"word_mean":1.0,"word_median":1.0}},{"alerts":[{"code":"near_unique","level":"info","message":"100.0% of rows are unique strings"},{"code":"one_word","level":"warn","message":"100.0% rows are a single word"},{"code":"allcaps","level":"info","message":"100.0% rows are 
all-caps"}],"column":"timestamp","extras":{"language_counts":{},"language_sample_size":5000,"length_histogram":{"counts":[0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,101040,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0],"edges":[25.5,25.525,25.55,25.575,25.6,25.625,25.65,25.675,25.7,25.725,25.75,25.775,25.8,25.825,25.85,25.875,25.9,25.925,25.95,25.975,26.0,26.025,26.05,26.075,26.1,26.125,26.15,26.175,26.2,26.225,26.25,26.275,26.3,26.325,26.35,26.375,26.4,26.425,26.45,26.475,26.5]},"near_unique":true,"sample":["2025-12-23T23:35:20.812256","2025-12-23T23:46:29.113253","2025-12-23T23:53:52.818216","2025-12-23T23:51:06.721420","2025-12-23T23:24:08.130284","2025-12-24T00:00:49.619695","2025-12-23T23:25:09.314207","2025-12-23T23:56:13.728686","2025-12-23T23:51:13.117978","2025-12-23T23:00:12.134916","2025-12-23T23:36:17.036168","2025-12-23T23:31:13.714937","2025-12-23T23:14:26.511690","2025-12-23T23:36:02.614701","2025-12-23T23:15:16.232765","2025-12-23T23:08:43.419178","2025-12-23T23:03:06.224482","2025-12-23T23:52:03.525787","2025-12-24T00:00:06.710285","2025-12-23T23:22:31.725584","2025-12-23T23:16:07.221539","2025-12-23T23:00:18.828637","2025-12-23T23:07:15.210597","2025-12-23T23:04:37.914250","2025-12-23T23:42:37.623367","2025-12-23T23:47:21.225655","2025-12-23T23:32:40.130904","2025-12-23T23:08:58.632159","2025-12-23T23:11:00.411999","2025-12-24T00:00:50.418800","2025-12-23T23:39:37.115449","2025-12-23T23:40:03.222485","2025-12-23T23:40:03.622812","2025-12-23T23:21:42.521784","2025-12-23T23:19:46.611876","2025-12-23T23:24:46.315084","2025-12-23T23:58:54.123171","2025-12-23T23:29:50.524379","2025-12-23T23:25:29.911828","2025-12-23T23:30:57.425562","2025-12-23T23:50:20.512413","2025-12-23T23:58:04.230868","2025-12-23T23:14:16.720933","2025-12-23T23:57:56.821107","2025-12-24T00:00:05.228046","2025-12-24T00:01:00.915232","2025-12-23T23:18:39.211864","2025-12-23T23:21:46.340984","2025-12-23T23:06:57.019142","2025-12-23T23:55:23.124122"],"top_values":[],"top_words":[["2025-12-23t23:3
1:14.814458",1],["2025-12-24t00:01:00.814493",1],["2025-12-24t00:01:32.810712",1],["2025-12-23t23:10:53.027823",1],["2025-12-23t23:50:31.015750",1],["2025-12-23t23:51:36.411245",1],["2025-12-23t23:05:29.141589",1],["2025-12-23t23:52:35.225405",1],["2025-12-23t23:22:39.629314",1],["2025-12-23t23:53:49.623856",1],["2025-12-23t23:21:40.228097",1],["2025-12-23t23:18:01.109980",1],["2025-12-23t23:54:43.722945",1],["2025-12-23t23:45:49.317370",1],["2025-12-23t23:34:32.230216",1],["2025-12-23t23:31:39.424766",1],["2025-12-23t23:38:48.919134",1],["2025-12-24t00:00:06.926022",1],["2025-12-23t23:27:27.413944",1],["2025-12-23t23:21:37.016189",1],["2025-12-23t23:07:36.630496",1],["2025-12-24t00:00:10.634942",1],["2025-12-23t23:49:05.730511",1],["2025-12-23t23:59:15.810600",1],["2025-12-23t23:46:14.226083",1]],"vocab_skipped":null,"word_histogram":{"counts":[0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,101040,0,0,0,0,0,0,0,0,0,0,0,0,0,0],"edges":[0.5,0.5333333333333333,0.5666666666666667,0.6,0.6333333333333333,0.6666666666666666,0.7,0.7333333333333334,0.7666666666666666,0.8,0.8333333333333333,0.8666666666666667,0.9,0.9333333333333333,0.9666666666666667,1.0,1.0333333333333332,1.0666666666666667,1.1,1.1333333333333333,1.1666666666666665,1.2,1.2333333333333334,1.2666666666666666,1.3,1.3333333333333335,1.3666666666666667,1.4,1.4333333333333333,1.4666666666666668,1.5]}},"kind":"text","n":101040,"n_null":0,"n_unique":101040,"null_rate":0.0,"stats":{"allcaps_rate":1.0,"boilerplate_rate":0.0,"duplicate_rate":0.0,"emoji_rate":0.0,"len_max":26,"len_mean":26.0,"len_median":26.0,"len_min":26,"len_p95":26.0,"n_duplicates":0,"n_empty":0,"one_word_rate":1.0,"readability_flesch_mean":121.22000000000004,"url_rate":0.0,"vocab_size":20000,"word_mean":1.0,"word_median":1.0}},{"alerts":[],"column":"language","extras":{"singletons":10,"top_values":[["en",61468],["ja",12607],["unknown",11481],["en-US",3617],["ko",2406],["de",1821],["pt",1295],["es",1153],["fr",746],["th",612],["tr",548],["nl",525],["zh",315],["it",2
76],["ru",213],["fi",193],["ja-JP",170],["id",158],["pl",139],["el",116]]},"kind":"categorical","n":101040,"n_null":0,"n_unique":90,"null_rate":0.0,"stats":{"cardinality":90,"entropy":2.178453126992494,"entropy_ratio":0.33556722474575634,"top_rate":0.6083531274742676,"top_value":"en"}},{"alerts":[],"column":"char_count","extras":{"histogram":{"counts":[11042,12354,11078,8177,7049,6309,5146,4495,3865,3459,3244,2592,2219,2065,1867,1708,1624,1467,1439,1439,1563,1657,4862,33,261,3,5,1,2,9,3,0,1,1,0,0,0,0,0,1],"edges":[1.0,14.1,27.2,40.3,53.4,66.5,79.6,92.7,105.8,118.89999999999999,132.0,145.1,158.2,171.29999999999998,184.4,197.5,210.6,223.7,236.79999999999998,249.9,263.0,276.09999999999997,289.2,302.3,315.4,328.5,341.59999999999997,354.7,367.8,380.9,394.0,407.09999999999997,420.2,433.3,446.4,459.5,472.59999999999997,485.7,498.8,511.9,525.0]},"sample":[72.0,131.0,116.0,108.0,87.0,18.0,33.0,194.0,48.0,15.0,19.0,39.0,70.0,221.0,65.0,273.0,80.0,15.0,89.0,45.0,76.0,251.0,205.0,60.0,63.0,31.0,27.0,249.0,45.0,97.0,111.0,132.0,272.0,215.0,29.0,294.0,53.0,25.0,143.0,41.0,49.0,163.0,85.0,47.0,285.0,53.0,56.0,15.0,73.0,30.0,152.0,73.0,40.0,21.0,73.0,17.0,5.0,165.0,23.0,95.0,25.0,49.0,47.0,24.0,31.0,176.0,193.0,2.0,18.0,134.0,34.0,194.0,47.0,128.0,22.0,126.0,71.0,146.0,187.0,13.0,34.0,30.0,75.0,75.0,186.0,60.0,16.0,28.0,44.0,109.0,75.0,299.0,31.0,51.0,119.0,42.0,17.0,119.0,88.0,97.0,26.0,295.0,4.0,6.0,21.0,116.0,152.0,86.0,27.0,24.0,150.0,72.0,24.0,62.0,132.0,63.0,271.0,108.0,24.0,17.0,24.0,147.0,34.0,49.0,299.0,26.0,160.0,33.0,1.0,78.0,62.0,10.0,36.0,1.0,156.0,15.0,267.0,36.0,209.0,87.0,76.0,29.0,103.0,262.0,59.0,108.0,234.0,8.0,107.0,234.0,12.0,3.0,117.0,298.0,123.0,84.0,59.0,62.0,50.0,44.0,39.0,69.0,12.0,139.0,54.0,18.0,18.0,273.0,5.0,56.0,261.0,31.0,42.0,44.0,152.0,294.0,60.0,161.0,99.0,133.0,40.0,106.0,169.0,28.0,19.0,253.0,104.0,14.0,89.0,22.0,4.0,10.0,48.0,23.0,37.0,143.0,79.0,74.0,59.0,38.0,143.0,265.0,95.0,5.0,97.0,88.0,270.0,175.0,43.0,37.0,7.0,300.0,122.0,100.0,1.0,39.0,
54.0,20.0,300.0,93.0,295.0,29.0,175.0,165.0,112.0,24.0,69.0,26.0,232.0,41.0,126.0,49.0,96.0,92.0,65.0,47.0,37.0,223.0,8.0,101.0,4.0,86.0,49.0,276.0,35.0,197.0,286.0,158.0,233.0,38.0,33.0,37.0,60.0,20.0,6.0,300.0,81.0,18.0,101.0,55.0,34.0,89.0,6.0,139.0,54.0,8.0,289.0,59.0,9.0,33.0,52.0,59.0,46.0,50.0,28.0,292.0,11.0,111.0,17.0,34.0,27.0,150.0,15.0,58.0,148.0,98.0,67.0,275.0,35.0,54.0,22.0,81.0,153.0,19.0,29.0,23.0,111.0,99.0,55.0,57.0,315.0,118.0,139.0,100.0,60.0,98.0,146.0,7.0,46.0,66.0,87.0,300.0,10.0,3.0,61.0,237.0,37.0,104.0,12.0,67.0,157.0,40.0,178.0,280.0,142.0,101.0,193.0,25.0,80.0,17.0,184.0,137.0,11.0,58.0,12.0,47.0,116.0,163.0,28.0,254.0,292.0,116.0,16.0,5.0,156.0,25.0,63.0,52.0,42.0,82.0,186.0,30.0,64.0,7.0,29.0,45.0,163.0,225.0,168.0,144.0,76.0,280.0,29.0,280.0,18.0,138.0,9.0,39.0,105.0,98.0,226.0,26.0,64.0,35.0,3.0,135.0,112.0,65.0,30.0,79.0,68.0,38.0,70.0,23.0,10.0,40.0,56.0,81.0,13.0,75.0,262.0,84.0,51.0,55.0,12.0,37.0,111.0,255.0,22.0,76.0,128.0,26.0,50.0,3.0,10.0,132.0,14.0,300.0,35.0,34.0,265.0,46.0,187.0,194.0,119.0,84.0,9.0,55.0,93.0,71.0,2.0,300.0,106.0,57.0,88.0,4.0,34.0,83.0,228.0,299.0,108.0,9.0,128.0,25.0,151.0,57.0,16.0,42.0,23.0,105.0,27.0,106.0,49.0,32.0,64.0,9.0,9.0,32.0,1.0,29.0,300.0,5.0,249.0,269.0,299.0,23.0,173.0,5.0,208.0,35.0,299.0,35.0,69.0,56.0,38.0,9.0,58.0,24.0,3.0,327.0,24.0,151.0,101.0,135.0,7.0,101.0,172.0,252.0,14.0,256.0,80.0,187.0,162.0,8.0,16.0,7.0,307.0,283.0,172.0,9.0,94.0,20.0,13.0,62.0,20.0,75.0,208.0,110.0,94.0,143.0]},"kind":"numeric","n":101040,"n_null":0,"n_unique":341,"null_rate":0.0,"stats":{"iqr":113.0,"kurtosis":-0.05732629898733377,"max":525.0,"mean":97.62657363420428,"median":68.0,"min":1.0,"n_outliers":289,"outlier_rate":0.002860253365003959,"q1":30.0,"q3":143.0,"skew":1.0177078866571954,"std":86.05175233030357,"zero_rate":0.0}},{"alerts":[],"column":"word_count","extras":{"histogram":{"counts":[21450,9307,8043,7043,6459,5882,4926,4074,3582,3218,2640,2512,2280,3311,1923,1805,1633,1314,1144,1057,1076,1046,
994,965,918,805,910,334,189,88,31,23,21,7,10,14,3,1,0,2],"edges":[0.0,2.075,4.15,6.2250000000000005,8.3,10.375,12.450000000000001,14.525000000000002,16.6,18.675,20.75,22.825000000000003,24.900000000000002,26.975,29.050000000000004,31.125000000000004,33.2,35.275000000000006,37.35,39.425000000000004,41.5,43.575,45.650000000000006,47.725,49.800000000000004,51.87500000000001,53.95,56.025000000000006,58.10000000000001,60.175000000000004,62.25000000000001,64.325,66.4,68.47500000000001,70.55000000000001,72.625,74.7,76.775,78.85000000000001,80.92500000000001,83.0]},"sample":[10.0,25.0,17.0,11.0,4.0,5.0,2.0,33.0,8.0,1.0,4.0,1.0,14.0,42.0,9.0,51.0,5.0,1.0,10.0,9.0,14.0,34.0,22.0,12.0,12.0,1.0,6.0,42.0,8.0,17.0,18.0,26.0,51.0,45.0,5.0,51.0,11.0,2.0,25.0,8.0,9.0,24.0,3.0,8.0,44.0,10.0,9.0,3.0,14.0,1.0,28.0,14.0,8.0,3.0,12.0,2.0,2.0,36.0,4.0,18.0,3.0,10.0,7.0,1.0,6.0,9.0,34.0,1.0,2.0,23.0,6.0,31.0,8.0,24.0,3.0,23.0,13.0,16.0,33.0,2.0,6.0,7.0,15.0,11.0,29.0,9.0,2.0,5.0,9.0,2.0,10.0,56.0,7.0,4.0,8.0,7.0,4.0,8.0,16.0,5.0,5.0,6.0,1.0,1.0,5.0,6.0,20.0,15.0,1.0,1.0,14.0,12.0,1.0,8.0,20.0,12.0,41.0,18.0,1.0,3.0,4.0,24.0,7.0,8.0,65.0,5.0,33.0,1.0,1.0,14.0,11.0,1.0,5.0,1.0,27.0,3.0,43.0,9.0,39.0,18.0,12.0,6.0,17.0,58.0,11.0,15.0,38.0,1.0,17.0,50.0,1.0,1.0,21.0,54.0,19.0,16.0,12.0,9.0,9.0,3.0,4.0,13.0,3.0,23.0,15.0,5.0,3.0,52.0,1.0,9.0,10.0,1.0,5.0,10.0,11.0,49.0,10.0,31.0,19.0,26.0,4.0,1.0,29.0,6.0,4.0,39.0,11.0,2.0,10.0,3.0,1.0,2.0,9.0,5.0,6.0,27.0,13.0,13.0,11.0,1.0,26.0,42.0,13.0,1.0,20.0,11.0,44.0,24.0,6.0,9.0,2.0,46.0,10.0,1.0,1.0,6.0,7.0,4.0,42.0,18.0,54.0,7.0,32.0,30.0,16.0,1.0,13.0,6.0,43.0,8.0,25.0,4.0,13.0,9.0,11.0,8.0,5.0,37.0,1.0,11.0,1.0,14.0,13.0,8.0,7.0,39.0,50.0,13.0,43.0,8.0,1.0,7.0,11.0,2.0,1.0,54.0,14.0,4.0,21.0,9.0,1.0,8.0,1.0,16.0,4.0,1.0,46.0,10.0,3.0,1.0,8.0,11.0,4.0,1.0,2.0,43.0,3.0,21.0,3.0,2.0,5.0,3.0,3.0,10.0,19.0,14.0,10.0,52.0,8.0,9.0,1.0,3.0,8.0,4.0,4.0,5.0,22.0,2.0,10.0,10.0,9.0,19.0,17.0,12.0,1.0,18.0,27.0,2.0,2.0,14.0,17.0,59.0,2.0,1.0,9.0,29.0,8.0,14.0,3
.0,11.0,26.0,7.0,31.0,45.0,26.0,17.0,29.0,5.0,13.0,4.0,34.0,8.0,3.0,5.0,2.0,8.0,15.0,6.0,4.0,53.0,1.0,17.0,3.0,2.0,6.0,1.0,10.0,11.0,7.0,14.0,37.0,6.0,11.0,1.0,6.0,9.0,23.0,51.0,34.0,24.0,14.0,57.0,4.0,57.0,1.0,24.0,1.0,1.0,18.0,14.0,28.0,6.0,1.0,6.0,1.0,7.0,18.0,8.0,1.0,11.0,10.0,7.0,8.0,3.0,2.0,7.0,7.0,15.0,2.0,13.0,50.0,13.0,6.0,2.0,1.0,6.0,17.0,42.0,4.0,12.0,22.0,4.0,9.0,1.0,2.0,22.0,3.0,1.0,8.0,1.0,51.0,1.0,32.0,36.0,24.0,15.0,2.0,1.0,15.0,17.0,1.0,54.0,16.0,14.0,10.0,1.0,6.0,18.0,44.0,5.0,2.0,1.0,26.0,6.0,16.0,6.0,2.0,7.0,5.0,14.0,1.0,14.0,10.0,5.0,10.0,3.0,1.0,7.0,1.0,1.0,46.0,1.0,32.0,28.0,47.0,3.0,17.0,1.0,35.0,1.0,53.0,8.0,14.0,9.0,7.0,1.0,13.0,4.0,1.0,29.0,4.0,31.0,12.0,22.0,2.0,20.0,26.0,45.0,3.0,23.0,17.0,34.0,18.0,3.0,1.0,1.0,33.0,4.0,22.0,2.0,19.0,3.0,4.0,1.0,1.0,16.0,41.0,6.0,9.0,11.0]},"kind":"numeric","n":101040,"n_null":0,"n_unique":79,"null_rate":0.0,"stats":{"iqr":19.0,"kurtosis":0.6990225572196218,"max":83.0,"mean":14.674574425969913,"median":10.0,"min":0.0,"n_outliers":2882,"outlier_rate":0.028523357086302454,"q1":3.0,"q3":22.0,"skew":1.208503579975851,"std":14.223127548992764,"zero_rate":0.0006037212984956453}},{"alerts":[{"code":"high_skew","level":"info","message":"skew=+2.12"},{"code":"outliers","level":"warn","message":"13.6% rows beyond 1.5 
IQR"}],"column":"has_images","extras":{"histogram":{"counts":[87272,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,13768],"edges":[0.0,0.025,0.05,0.07500000000000001,0.1,0.125,0.15000000000000002,0.17500000000000002,0.2,0.225,0.25,0.275,0.30000000000000004,0.325,0.35000000000000003,0.375,0.4,0.42500000000000004,0.45,0.47500000000000003,0.5,0.525,0.55,0.5750000000000001,0.6000000000000001,0.625,0.65,0.675,0.7000000000000001,0.7250000000000001,0.75,0.775,0.8,0.8250000000000001,0.8500000000000001,0.875,0.9,0.925,0.9500000000000001,0.9750000000000001,1.0]},"sample":[0.0,0.0,0.0,1.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,1.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,1.0,1.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,1.0,0.0,1.0,0.0,1.0,1.0,0.0,0.0,1.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,1.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.
0,1.0,0.0,0.0,0.0,0.0,1.0,1.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,1.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,1.0,0.0,1.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0]},"kind":"numeric","n":101040,"n_null":0,"n_unique":2,"null_rate":0.0,"stats":{"iqr":0.0,"kurtosis":2.496516184894217,"max":1.0,"mean":0.1362628661916073,"median":0.0,"min":0.0,"n_outliers":13768,"outlier_rate":0.1362628661916073,"q1":0.0,"q3":0.0,"skew":2.1204990414744866,"std":0.3430691801066323,"zero_rate":0.8637371338083927}},{"alerts":[{"code":"high_skew","level":"info","message":"skew=+8.50"}],"column":"has_video","extras":{"histogram":{"counts":[99696,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1344],"edges":[0.0,0.025,0.05,0.07500000000000001,0.1,0.125,0.15000000000000002,0.17500000000000002,0.2,0.225,0.25,0.275,0.30000000000000004,0.325,0.35000000000000003,0.375,0.4,0.42500000000000004,0.45,0.47500000000000003,0.5,0.525,0.55,0.5750000000000001,0.6000000000000001,0.625,0.65,0.675,0.7000000000000001,0.7250000000000001,0.75,0.775,0.8,0.8250000000000001,0.8500000000000001,0.875,0.9,0.925,0.9500000000000001,0.9750000000000001,1.0]},"sample":[0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.
0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0]},"kind":"numeric","n":101040,"n_null":0,"n_unique":2,"null_rate":0.0,"stats":{"iqr":0.0,"kurtosis":70.19205241075723,"max":1.0,"mean":0.01330166270783848,"median":0.0,"min":0.0,"n_outliers":1344,"outlier_rate":0.01330166270783848,"q1":0.0,"q3":0.0,"skew":8.49659063452849,"std":0.1145637742687172,"zero_rate":0.9866983372921615}},{"alerts":[{"code":"outliers","level":"warn","message":"18.0% rows beyond 1.5 
IQR"}],"column":"has_link","extras":{"histogram":{"counts":[82900,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,18140],"edges":[0.0,0.025,0.05,0.07500000000000001,0.1,0.125,0.15000000000000002,0.17500000000000002,0.2,0.225,0.25,0.275,0.30000000000000004,0.325,0.35000000000000003,0.375,0.4,0.42500000000000004,0.45,0.47500000000000003,0.5,0.525,0.55,0.5750000000000001,0.6000000000000001,0.625,0.65,0.675,0.7000000000000001,0.7250000000000001,0.75,0.775,0.8,0.8250000000000001,0.8500000000000001,0.875,0.9,0.925,0.9500000000000001,0.9750000000000001,1.0]},"sample":[0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,1.0,0.0,0.0,0.0,1.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,1.0,0.0,0.0,1.0,0.0,0.0,1.0,0.0,1.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,1.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,1.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,1.0,1.0,1.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,1.0,0.0,0.0,1.0,1.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,1.0,1.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,1.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,1.0,0.0,0.0,0.0,1.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,1.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,1.0,1.0,0.0,0.0,0.0,1.0,1.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,1.0,0.0,0.0,0.0,0.0,1.0,0.0,1.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,
0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,1.0,1.0,1.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,1.0,0.0,1.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,1.0,1.0,1.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,1.0,1.0]},"kind":"numeric","n":101040,"n_null":0,"n_unique":2,"null_rate":0.0,"stats":{"iqr":0.0,"kurtosis":0.7888288781930632,"max":1.0,"mean":0.1795328582739509,"median":0.0,"min":0.0,"n_outliers":18140,"outlier_rate":0.1795328582739509,"q1":0.0,"q3":0.0,"skew":1.6699787059100677,"std":0.3837997771428118,"zero_rate":0.8204671417260491}},{"alerts":[{"code":"null_rate","level":"warn","message":"61.2% null"}],"column":"embed_type","extras":{"singletons":0,"top_values":[["app.bsky.embed.external",18140],["app.bsky.embed.images",13768],["app.bsky.embed.record",5126],["app.bsky.embed.video",1344],["app.bsky.embed.recordWithMedia",871]]},"kind":"categorical","n":101040,"n_null":61791,"n_unique":5,"null_rate":0.6115498812351544,"stats":{"cardinality":5,"entropy":1.716940999074542,"entropy_ratio":0.7394462398965165,"top_rate":0.46217738031542205,"top_value":"app.bsky.embed.external"}},{"alerts":[{"code":"duplicates","level":"warn","message":"90.0% duplicate strings"},{"code":"one_word","level":"warn","message":"90.2% rows are a single 
word"}],"column":"hashtags","extras":{"language_counts":{},"language_sample_size":5000,"length_histogram":{"counts":[92111,3305,2893,1061,466,316,183,120,90,321,42,28,47,20,5,5,5,6,0,4,5,0,1,1,0,0,1,1,0,1,1,0,0,0,0,0,0,0,0,1],"edges":[2.0,30.0,58.0,86.0,114.0,142.0,170.0,198.0,226.0,254.0,282.0,310.0,338.0,366.0,394.0,422.0,450.0,478.0,506.0,534.0,562.0,590.0,618.0,646.0,674.0,702.0,730.0,758.0,786.0,814.0,842.0,870.0,898.0,926.0,954.0,982.0,1010.0,1038.0,1066.0,1094.0,1122.0]},"near_unique":false,"sample":["[]","[]","[\"#strongertogether\"]","[]","[]","[]","[]","[\"#dandysworld\"]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[\"#pruzkum\"]","[]","[]","[\"#pareridiparte\", \"#ritardi\", \"#manovra\"]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[\"#\\u3078\\u304d\\u3055\\u308a\\u3085\\u3046\\u307f\\u3053\"]","[]","[]","[]","[]","[]","[]","[]","[]","[]"],"top_values":[["[]",87318],["[\"#\\u0e23\", \"#422\", \"#\\u0e40\\u0e04\\u0e23\\u0e14\", \"#\\u0e41\\u0e08\\u0e01\\u0e40\\u0e04\\u0e23\\u0e14\", \"#\\u0e04\\u0e32\\u0e2a\", \"#\\u0e40\\u0e04\\u0e23\\u0e14\", \"#\\u0e42\\u0e04\", \"#\\u0e40\\u0e04\\u0e23\\u0e14\", \"#\\u0e40\\u0e27\", \"#\\u0e40\\u0e04\\u0e23\\u0e14\", \"#\\u0e2a\\u0e25\", \"#\\u0e2a\\u0e25\"]",256],["[\"#Berlin\", \"#Verkehr\", \"#Baustelle\", \"#Sperrung\", \"#St\\u00f6rung\", \"#Stra\\u00dfe\"]",253],["[\"#NowPlaying\"]",115],["[\"#nowplaying\"]",85],["[\"#dpdi\", \"#dpdi\"]",60],["[\"#\\u30e9\\u30f3\\u30c0\\u30e0\\u6587\\u5b57\"]",46],["[\"#a\"]",46],["[\"#christmas\", \"#darlenelove\", \"#philspector\", \"#ronniespector\"]",41],["[\"#OhBrookeOhTaylorOhBrooke\"]",38],["[\"#1\"]",33],["[\"#uk\", \"#news\", \"#uknews\"]",29],["[\"#rva\"]",27],["[\"#pixiv\"]",27],["[\"#fr\", \"#france\"]",27],["[\"#nba\"]",26],["[\"#ohbrookeohtaylorohbrooke\"]",24],["[\"#Tetsujin28FX\"]",23],["[\"#de\", 
\"#deutschland\"]",23],["[\"#1649\"]",22]],"top_words":[["[]",17275],["\"#\\u0e40\\u0e04\\u0e23\\u0e14\",",191],["[\"#\\u0e23\",",48],["\"#\\u0e41\\u0e08\\u0e01\\u0e40\\u0e04\\u0e23\\u0e14\",",48],["\"#\\u0e04\\u0e32\\u0e2a\",",48],["\"#\\u0e42\\u0e04\",",48],["\"#\\u0e40\\u0e27\",",48],["\"#\\u0e2a\\u0e25\",",48],["\"#\\u0e2a\\u0e25\"]",48],["\"#422\",",47],["[\"#nowplaying\"]",40],["[\"#berlin\",",38],["\"#verkehr\",",38],["\"#baustelle\",",38],["\"#sperrung\",",38],["\"#st\\u00f6rung\",",38],["\"#stra\\u00dfe\"]",38],["[\"#nowplaying\",",33],["[\"#art\",",21],["\"#nsfw\",",20],["\"#art\",",20],["[\"#\\u30a2\\u30de\\u30be\\u30f3\",",18],["\"#oc\",",16],["[\"#\\u6771\\u4eac\\u90fd\",",15],["[\"#dpdi\",",15]],"vocab_skipped":null,"word_histogram":{"counts":[95104,3116,1657,287,195,392,98,62,28,37,25,12,7,2,7,5,2,0,2,1,0,0,0,0,0,0,0,0,0,1],"edges":[1.0,3.033333333333333,5.066666666666666,7.1,9.133333333333333,11.166666666666666,13.2,15.233333333333333,17.266666666666666,19.299999999999997,21.333333333333332,23.366666666666667,25.4,27.43333333333333,29.466666666666665,31.5,33.53333333333333,35.56666666666666,37.599999999999994,39.63333333333333,41.666666666666664,43.699999999999996,45.733333333333334,47.766666666666666,49.8,51.83333333333333,53.86666666666666,55.9,57.93333333333333,59.96666666666666,62.0]}},"kind":"text","n":101040,"n_null":0,"n_unique":10103,"null_rate":0.0,"stats":{"allcaps_rate":0.005700712589073635,"boilerplate_rate":0.0,"duplicate_rate":0.9000098970704672,"emoji_rate":0.0,"len_max":1122,"len_mean":10.377622723673792,"len_median":2.0,"len_min":2,"len_p95":63.0,"n_duplicates":90937,"n_empty":0,"one_word_rate":0.9019596199524941,"readability_flesch_mean":2.751632142857146,"url_rate":0.0,"vocab_size":7036,"word_mean":1.3841745843230404,"word_median":1.0}},{"alerts":[{"code":"duplicates","level":"warn","message":"98.1% duplicate strings"},{"code":"short_text","level":"info","message":"95th-percentile length under 20 
chars"},{"code":"one_word","level":"warn","message":"99.6% rows are a single word"}],"column":"mentions","extras":{"language_counts":{},"language_sample_size":5000,"length_histogram":{"counts":[98659,1998,0,180,0,45,0,33,0,20,0,21,0,21,0,18,0,14,20,0,5,0,3,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,3],"edges":[2.0,12.45,22.9,33.349999999999994,43.8,54.25,64.69999999999999,75.14999999999999,85.6,96.05,106.5,116.94999999999999,127.39999999999999,137.85,148.29999999999998,158.75,169.2,179.64999999999998,190.1,200.54999999999998,211.0,221.45,231.89999999999998,242.35,252.79999999999998,263.25,273.7,284.15,294.59999999999997,305.04999999999995,315.5,325.95,336.4,346.84999999999997,357.29999999999995,367.75,378.2,388.65,399.09999999999997,409.54999999999995,420.0]},"near_unique":false,"sample":["[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[\"602826fc65fa6aa0\"]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]"],"top_values":[["[]",98659],["[\"f212c35005158af4\"]",16],["[\"f50c0569501abab1\"]",15],["[\"c2c301341c84573d\"]",15],["[\"10b5533239dcddce\"]",13],["[\"10b60c157e2d651d\"]",11],["[\"607bca3b4bc405e9\"]",9],["[\"3ffed484fcc4f856\"]",9],["[\"1792f17a90a3d717\"]",9],["[\"d38369e879391173\"]",9],["[\"77d7e860f21c08fc\"]",7],["[\"f9f2b4c905347f44\"]",7],["[\"ceaa893232d29639\"]",7],["[\"84bfdfccb5651fd6\"]",7],["[\"b8d36411702552c5\"]",7],["[\"29630d82ffa90155\"]",6],["[\"0df2feb382b3c760\"]",6],["[\"a97c657865a0ea62\"]",6],["[\"90a0897c0263ae9a\"]",6],["[\"1719171f2f3c519d\"]",5]],"top_words":[["[]",19542],["[\"f50c0569501abab1\"]",6],["[\"84bfdfccb5651fd6\"]",4],["[\"10b60c157e2d651d\"]",3],["[\"3f2e7922962fd5fa\"]",3],["[\"f212c35005158af4\"]",3],["\"660dbd4b7e3cff3e\",",2],["[\"adb95d00bcba3bdb\",",2],["\"ffc3e7fe15f5b095\"]",2],["[\"423f7a6ffdd9e437\"]",2],["[\"3ffed484fcc4f856\"]",2],["[\"1719171f2f3c519d\"]",2],["[\"9186e9ed
09e370a0\"]",2],["[\"d5e601007771e2a9\"]",2],["[\"90179da90188b233\"]",2],["\"e18cad2b21bbb448\",",2],["[\"a97c657865a0ea62\"]",2],["[\"6736f8a1dfb08d71\"]",2],["[\"77d7e860f21c08fc\"]",2],["[\"29167ae562816f3c\"]",2],["[\"71739959f21ce92e\"]",2],["[\"eceadd1b29f52cbd\"]",2],["[\"ceaa893232d29639\"]",2],["[\"65a3a93838fb7b0b\"]",2],["[\"0df2feb382b3c760\"]",2]],"vocab_skipped":null,"word_histogram":{"counts":[100657,180,0,45,33,0,20,21,0,21,18,0,14,20,0,5,3,0,0,0,0,0,0,0,0,0,0,0,0,3],"edges":[1.0,1.6666666666666665,2.333333333333333,3.0,3.6666666666666665,4.333333333333333,5.0,5.666666666666666,6.333333333333333,7.0,7.666666666666666,8.333333333333332,9.0,9.666666666666666,10.333333333333332,11.0,11.666666666666666,12.333333333333332,13.0,13.666666666666666,14.333333333333332,15.0,15.666666666666666,16.333333333333332,17.0,17.666666666666664,18.333333333333332,19.0,19.666666666666664,20.333333333333332,21.0]}},"kind":"text","n":101040,"n_null":0,"n_unique":1921,"null_rate":0.0,"stats":{"allcaps_rate":3.9588281868566906e-05,"boilerplate_rate":0.0,"duplicate_rate":0.9809877276326208,"emoji_rate":0.0,"len_max":420,"len_mean":2.6698139350752177,"len_median":2.0,"len_min":2,"len_p95":2.0,"n_duplicates":99119,"n_empty":0,"one_word_rate":0.9962094220110848,"readability_flesch_mean":0.7019500000000006,"url_rate":0.0,"vocab_size":660,"word_mean":1.012282264449723,"word_median":1.0}},{"alerts":[{"code":"duplicates","level":"warn","message":"96.3% duplicate strings"},{"code":"short_text","level":"info","message":"95th-percentile length under 20 chars"},{"code":"one_word","level":"warn","message":"99.8% rows are a single 
word"}],"column":"links","extras":{"language_counts":{},"language_sample_size":5000,"length_histogram":{"counts":[96211,0,44,293,447,1282,347,301,208,252,218,133,109,185,116,158,169,119,139,84,33,43,40,39,18,19,13,1,5,1,5,0,2,1,0,1,0,0,1,3],"edges":[2.0,8.6,15.2,21.799999999999997,28.4,35.0,41.599999999999994,48.199999999999996,54.8,61.4,68.0,74.6,81.19999999999999,87.8,94.39999999999999,101.0,107.6,114.19999999999999,120.8,127.39999999999999,134.0,140.6,147.2,153.79999999999998,160.39999999999998,167.0,173.6,180.2,186.79999999999998,193.39999999999998,200.0,206.6,213.2,219.79999999999998,226.39999999999998,233.0,239.6,246.2,252.79999999999998,259.4,266.0]},"near_unique":false,"sample":["[\"https://www.20min.ch/fr/story/buelach-zh-un-client-arrete-apres-avoir-poignarde-son-livreur-de-repas-103470448\"]","[]","[]","[]","[]","[\"https://trecome.info/articles/89cfe941-6cf1-45ae-8626-cc0241375b46\"]","[]","[]","[]","[]","[]","[]","[]","[\"https://lecourrier.ch/2025/12/21/un-oui-que-le-tessin-redoute/\"]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]"],"top_values":[["[]",96211],["[\"https://www.bbc.com/news/arti...\"]",43],["[\"https://www.radiofrance.fr/fip\"]",37],["[\"https://www.kbradio.online\"]",32],["[\"https://sphynx.radio-progres.fr/listen/radio_progres/radio.mp3\", \"https://radio-progres.fr\"]",28],["[\"https://radio.913aycltfm.com/listen/91.3_ayclt_fm_hd3/bluesky\"]",23],["[\"https://radiotempete.com/\"]",21],["[\"https://bvf.wtf\"]",20],["[\"https://radio.913aycltfm.com/listen/91.3_ayclt_fm_hd2/bluesky\"]",19],["[\"https://streaming.shoutcast.com/tiorr3\", 
\"https://listen.openstream.co/6128/audio\"]",18],["[\"https://oakgroveradio.com/player\"]",17],["[\"https://www.hot21radio.com\"]",17],["[\"https://www.radiosouvenirsfm.com\"]",16],["[\"https://radiofonico.it\"]",14],["[\"https://www.enlaradio.cl\"]",14],["[\"https://adachi-fm.com/\"]",14],["[\"https://radio.913aycltfm.com/listen/91.3_ayclt_fm/bluesky\"]",14],["[\"https://trance.ie\"]",12],["[\"https://untidyradio.com\"]",11],["[\"https://amasale.newif.net/ranking/kdetail/299\"]",10]],"top_words":[["[]",19055],["[\"https://radio.913aycltfm.com/listen/91.3_ayclt_fm_hd2/bluesky\"]",6],["[\"https://www.radiofrance.fr/fip\"]",5],["[\"https://radio.913aycltfm.com/listen/91.3_ayclt_fm_hd3/bluesky\"]",5],["[\"https://streaming.shoutcast.com/tiorr3\",",4],["\"https://listen.openstream.co/6128/audio\"]",4],["[\"https://www.kbradio.online\"]",4],["[\"https://www.hot21radio.com\"]",4],["[\"https://bvf.wtf\"]",4],["[\"https://radiotempete.com/\"]",4],["[\"https://sphynx.radio-progres.fr/listen/radio_progres/radio.mp3\",",3],["\"https://radio-progres.fr\"]",3],["[\"https://adachi-fm.com/\"]",3],["[\"https://oakgroveradio.com/player\"]",3],["[\"https://radiofonico.it\"]",3],["[\"https://www.bbc.com/news/arti...\"]",3],["[\"https://www.ume2001.com/support/labo/amesh-v2.html?&tm=202512241400\"]",3],["[\"https://www.20min.ch/fr/story/zurich-un-chauffage-a-23-milliards-de-la-folie-climatique-pour-l-udc-103470364\"]",3],["[\"https://t.co/jcedttntn9\",",3],["\"https://t.co/d6j014jtlf\"]",3],["[\"https://www.project-anime.com/1315216/\"]",3],["[\"https://www.ume2001.com/support/labo/amesh-v2.html?&tm=202512241435\"]",3],["[\"https://thexwgxx.radio12345.com\"]",3],["[\"https://jeffro.radio\"]",3],["[\"https://www.enlaradio.cl\"]",3]],"vocab_skipped":null,"word_histogram":{"counts":[100818,0,0,0,0,0,0,185,0,0,0,0,0,0,0,26,0,0,0,0,0,0,8,0,0,0,0,0,0,3],"edges":[1.0,1.1333333333333333,1.2666666666666666,1.4,1.5333333333333332,1.6666666666666665,1.8,1.9333333333333333,2.0666666666666664,2.2,
2.333333333333333,2.466666666666667,2.6,2.7333333333333334,2.8666666666666667,3.0,3.1333333333333333,3.2666666666666666,3.4,3.533333333333333,3.6666666666666665,3.8,3.933333333333333,4.066666666666666,4.2,4.333333333333334,4.466666666666667,4.6,4.733333333333333,4.866666666666667,5.0]}},"kind":"text","n":101040,"n_null":0,"n_unique":3771,"null_rate":0.0,"stats":{"allcaps_rate":0.0,"boilerplate_rate":0.0,"duplicate_rate":0.9626781472684085,"emoji_rate":0.0,"len_max":266,"len_mean":4.95048495645289,"len_median":2.0,"len_min":2,"len_p95":2.0,"n_duplicates":97269,"n_empty":0,"one_word_rate":0.9978028503562946,"readability_flesch_mean":-20.1082,"url_rate":0.047792953285827396,"vocab_size":904,"word_mean":1.0027019002375297,"word_median":1.0}}],"insights":{"errors":[],"insights":[{"confidence":"high","critiques":[],"evidence_keys":["row_count","column_count","language.top_values","sentiment.top_values","embed_type.top_values","embed_type.null_rate","has_images.stats.mean","has_link.stats.mean","has_video.stats.mean","reply_root_hash.null_rate","char_count.stats","text.language_counts"],"featured_charts":[{"caption":"See how heavily English dominates and where Japanese, Korean, and 'unknown' sit in the long tail.","column":"language","kind":"bar"},{"caption":"Check the neutral-heavy split with positive roughly twice negative.","column":"sentiment","kind":"donut"},{"caption":"Among posts that have an embed, compare external links, images, quoted records, and video shares.","column":"embed_type","kind":"bar"},{"caption":"Look at the right-skewed length distribution \u2014 median 68 chars but a long tail out to 525.","column":"char_count","kind":"histogram"},{"caption":"Inspect the spike at zero (~48%) and the modest positive lean in the continuous score.","column":"sentiment_score","kind":"histogram"}],"model":"anthropic:claude-opus-4-7","narrative":"This dataset captures 101,040 anonymized Bluesky firehose posts from late December 2025, with 19 columns covering post 
hashes, authorship, timestamps, content text, embeds, hashtags, mentions, links, language, and sentiment. The text column is richly multilingual \u2014 English dominates at ~61% of posts, followed by Japanese (~12.6k) and a sizable 'unknown' bucket (~11.5k) \u2014 and sentiment skews neutral (48.5%) with positive outweighing negative roughly 2:1. Engagement-style features are heavily zero-inflated: only ~13.6% of posts include images, ~18% include links, and just ~1.3% include video, so most posts are plain text. About 58% of posts have no reply_root_hash, suggesting top-level posts dominate over threaded replies. The most useful first cuts are language mix, sentiment distribution, embed_type composition, and post-length shape via char_count.","scope":"dataset","target":"__global__"},{"confidence":"high","critiques":[],"evidence_keys":["n","n_unique","language_counts","stats.len_max","stats.len_median","stats.word_median","stats.emoji_rate","stats.allcaps_rate","stats.one_word_rate","stats.duplicate_rate","stats.n_duplicates","stats.url_rate","top_values","top_words","alerts"],"model":"anthropic:claude-opus-4-7","narrative":"Short user-generated posts (likely social/Bluesky given the bsky.app top value and hashtag/emoji patterns), with median 10 words and a 525-character cap suggesting a platform limit. Heavily English-skewed (3309 of the 5000-row language sample) but genuinely multilingual with sizeable Japanese (656) and Korean (125) tails, plus 18.3% emoji rate, 16.9% all-caps lines, and 19% one-word entries. 
Note 5105 duplicates (5.05%) including spam-like Thai promo and repeated sheep-emoji posts among the top values.","role":"free_text","scope":"column","target":"text","treatment":"Deduplicate, language-detect and route per language, then tokenize/embed for modelling."},{"confidence":"high","critiques":[],"evidence_keys":["n","n_unique","null_rate","stats.duplicate_rate","stats.n_duplicates","stats.len_min","stats.len_max","stats.len_mean","stats.one_word_rate","top_values"],"model":"anthropic:claude-opus-4-7","narrative":"Fixed 16-character single-token hex strings, almost certainly hashed author DIDs acting as a pseudonymous user identifier. Across 101,040 rows there are only 43,998 unique values and a 56.5% duplicate rate, with the top author appearing 1,016 times \u2014 so this is a foreign-key-style author handle, not a per-row id. No nulls or empties, and length is constant at 16.","role":"foreign_key","scope":"column","target":"author_did_hash","treatment":"Treat as a categorical author key; left-join on this to author-level features rather than feeding the raw hash into a model."},{"confidence":"high","critiques":[],"evidence_keys":["n","n_unique","null_rate","stats.len_min","stats.len_max","stats.len_median","stats.one_word_rate","stats.duplicate_rate","stats.n_duplicates","top_words"],"model":"anthropic:claude-opus-4-7","narrative":"Fixed 16-character single-token strings with 101,039 unique values across 101,040 rows and zero nulls \u2014 almost certainly hex digests (16 hex chars = 64-bit hash) of URIs, used as row identifiers. The column is effectively a primary key: one_word_rate is 1.0, length is exactly 16 at min/median/max, and duplicate_rate is 0.0. 
No analytic signal lives here beyond identity.","role":"identifier","scope":"column","target":"uri_hash","treatment":"drop from modelling; retain only as a join key or deduplication handle."},{"confidence":"high","critiques":[],"evidence_keys":["null_rate","len_min","len_max","one_word_rate","n_unique","n","duplicate_rate","top_values"],"model":"anthropic:claude-opus-4-7","narrative":"Every non-null value is a single 16-character token (len_min=len_max=16, one_word_rate=1.0), strongly indicating a hex hash identifier pointing to a parent post. 57.67% of rows are null, consistent with a reply-only field where most posts are top-level rather than replies. Among populated rows, 34,738 unique hashes cover the roughly 42,770 non-null entries (of 101,040 rows) with an 18.78% duplicate rate, so some parents attract many replies (top hash appears 121 times).","role":"foreign_key","scope":"column","target":"reply_parent_hash","treatment":"Treat as a foreign key to the parent post; left-join on this hash and ignore for modelling."},{"confidence":"high","critiques":[],"evidence_keys":["null_rate","n_unique","stats.duplicate_rate","stats.len_min","stats.len_max","stats.one_word_rate","top_values"],"model":"anthropic:claude-opus-4-7","narrative":"This is a 16-character hex hash identifying the root post of a reply thread, with every value being a single token of fixed length 16. 
About 57.67% of rows are null (likely top-level posts with no reply root) and 50.25% of the non-null values are duplicates across 21,277 unique hashes, consistent with many replies sharing the same thread root.","role":"foreign_key","scope":"column","target":"reply_root_hash","treatment":"left-join on this id to the parent post table; do not feature-encode."},{"confidence":"high","critiques":[],"evidence_keys":["n","n_unique","null_rate","stats.top_value","stats.top_rate","stats.entropy_ratio","top_values"],"model":"anthropic:claude-opus-4-7","narrative":"A three-class sentiment label across 101040 rows with no nulls and only 3 unique values: neutral, positive, negative. Distribution is uneven \u2014 neutral leads at 48.5%, followed by positive (34622) and negative (17437), so negatives are roughly half as common as positives. Entropy ratio of 0.93 indicates the classes are reasonably spread but not balanced.","role":"label","scope":"column","target":"sentiment","treatment":"Use as a categorical target; consider class weighting to offset the under-represented negative class."},{"confidence":"high","critiques":[],"evidence_keys":["min","max","mean","median","std","skew","kurtosis","zero_rate","outlier_rate","n_outliers","q1","q3","iqr","n_unique"],"model":"anthropic:claude-opus-4-7","narrative":"This is a bounded sentiment polarity score in [-0.998, 1.0], typical of lexicon- or model-based sentiment scoring. Distribution is roughly symmetric (skew 0.019, kurtosis 0.018) but heavily zero-inflated: 47.8% of rows are exactly 0 and the median is 0, suggesting many neutral or unscoreable texts. 
Despite the symmetry, 5,763 rows (5.7%) are flagged as outliers, indicating fat tails of strong polarity at both ends.","role":"feature","scope":"column","target":"sentiment_score","treatment":"Treat zeros as a separate 'neutral' indicator and use the raw score as a feature; no transform needed given symmetric bounded range."},{"confidence":"high","critiques":[],"evidence_keys":["n","n_unique","stats.len_min","stats.len_max","stats.duplicate_rate","stats.n_duplicates","stats.one_word_rate","top_words"],"model":"anthropic:claude-opus-4-7","narrative":"This is a creation timestamp column stored as ISO 8601 strings (lengths 20-35, single-token), not parsed datetimes. Values are near-unique (96576/101040) yet 4464 duplicates exist and the top values cluster tightly around 2025-12-24T05:00, suggesting a narrow ingestion window or batch insert. Format is inconsistent across rows, mixing '+00:00', '.000Z', and microsecond-precision 'Z' suffixes, which will break naive string sorting.","role":"timestamp","scope":"column","target":"created_at","treatment":"Parse to a normalized UTC datetime before any temporal analysis or joins."},{"confidence":"high","critiques":[],"evidence_keys":["n","n_unique","null_rate","stats.len_min","stats.len_max","stats.len_mean","stats.one_word_rate","stats.allcaps_rate","top_words"],"model":"anthropic:claude-opus-4-7","narrative":"This is an ISO-8601 timestamp column stored as text, with every one of the 101040 values unique and exactly 26 characters long. Sampled values cluster on 2025-12-23 and 2025-12-24, suggesting a narrow capture window rather than a broad historical range. 
The 'allcaps' and 'one_word' alerts are artefacts of the ISO format (the literal 'T' separator and no whitespace), not a data quality issue.","role":"timestamp","scope":"column","target":"timestamp","treatment":"Parse to datetime and derive features (hour, day, delta) instead of using as text."},{"confidence":"high","critiques":[],"evidence_keys":["n","n_unique","null_rate","stats.top_value","stats.top_rate","top_values"],"model":"anthropic:claude-opus-4-7","narrative":"Language tag of each record, using ISO-style codes across 90 distinct values with no nulls. English dominates at 60.8% of rows, followed by Japanese (12,607) and a sizeable 'unknown' bucket (11,481) that signals missing-data leakage into the category itself. Note the inconsistent granularity: bare 'en' coexists with locale-specific 'en-US' (3,617), so codes need normalisation before grouping.","role":"feature","scope":"column","target":"language","treatment":"Normalise locale codes (collapse en-US into en), treat 'unknown' as missing, then one-hot or target-encode."},{"confidence":"high","critiques":[],"evidence_keys":["n","n_unique","null_rate","stats.min","stats.max","stats.mean","stats.median","stats.q1","stats.q3","stats.iqr","stats.std","stats.skew","stats.kurtosis","stats.outlier_rate","stats.zero_rate"],"model":"anthropic:claude-opus-4-7","narrative":"This is almost certainly a per-row character count of some text field, ranging from 1 to 525 with a median of 68 and mean of 97.6. The distribution is right-skewed (skew 1.02) with a wide IQR of 113, but only 0.29% of values flag as outliers and there are no nulls or zeros. 
With just 341 unique integer values across 101,040 rows, the field is discrete and well-behaved.","role":"feature","scope":"column","target":"char_count","treatment":"Consider a log or sqrt transform before regression to tame the right skew."},{"confidence":"high","critiques":[],"evidence_keys":["n","n_unique","null_rate","stats.min","stats.max","stats.mean","stats.median","stats.q1","stats.q3","stats.iqr","stats.std","stats.skew","stats.kurtosis","stats.outlier_rate","stats.zero_rate"],"model":"anthropic:claude-opus-4-7","narrative":"This is a per-row word count, ranging from 0 to 83 with a median of 10 and mean of 14.67, indicating most entries are short snippets rather than long documents. The distribution is right-skewed (skew 1.21) with a wide IQR of 19 and ~2.85% outliers, suggesting a long tail of unusually verbose rows. Only 79 unique values across 101,040 rows and a near-zero zero_rate (0.06%) confirm it's a bounded discrete count with virtually no empty texts.","role":"feature","scope":"column","target":"word_count","treatment":"Log-transform or bin before modelling to dampen the right skew."},{"confidence":"high","critiques":[],"evidence_keys":["n_unique","min","max","mean","zero_rate","outlier_rate","n"],"model":"anthropic:claude-opus-4-7","narrative":"This is a binary indicator (only 2 unique values, min 0, max 1) flagging whether a record has images. It's heavily imbalanced: 86.4% zeros and a mean of 0.136, meaning only ~13.6% of rows have images. 
The 'outliers' alert simply reflects the minority class rather than anomalous values.","role":"feature","scope":"column","target":"has_images","treatment":"Treat as a boolean flag; no transformation needed, but watch class imbalance if used as a target."},{"confidence":"high","critiques":[],"evidence_keys":["n","n_unique","null_rate","stats.mean","stats.zero_rate","stats.skew","stats.kurtosis","stats.n_outliers"],"model":"anthropic:claude-opus-4-7","narrative":"Binary flag indicating whether a record has an associated video, stored as 0/1 with no nulls across 101040 rows. The positive class is rare: 98.67% are zero and only 1.33% are one, producing extreme skew (8.50) and kurtosis (70.19). The 1344 ones are flagged as outliers purely because of the imbalance, not because they are anomalous.","role":"feature","scope":"column","target":"has_video","treatment":"Treat as a boolean indicator; expect minimal signal given 98.67% zeros and consider class-imbalance handling if used as a target."},{"confidence":"high","critiques":[],"evidence_keys":["n","n_unique","null_rate","stats.mean","stats.zero_rate","stats.min","stats.max","stats.n_outliers"],"model":"anthropic:claude-opus-4-7","narrative":"Binary 0/1 flag indicating whether a record contains a link, with 82.0% zeros and a mean of 0.180 across 101,040 rows. The 18,140 'outliers' are simply the minority positive class, not anomalies \u2014 IQR-based outlier detection misfires on binary data. No nulls, exactly 2 unique values.","role":"feature","scope":"column","target":"has_link","treatment":"Use directly as a boolean feature; ignore the outlier alert."},{"confidence":"high","critiques":[],"evidence_keys":["null_rate","n_unique","stats.top_value","stats.top_rate","stats.entropy_ratio","top_values"],"model":"anthropic:claude-opus-4-7","narrative":"This column tags the embed type attached to a Bluesky post, with 5 distinct AT Protocol lexicon values like app.bsky.embed.external and app.bsky.embed.images. 
61.15% of rows are null, consistent with most posts having no embed, and among the populated rows external links (46.2%) and images dominate while video and recordWithMedia are rare. Entropy ratio of 0.74 indicates a moderately concentrated but not degenerate distribution.","role":"feature","scope":"column","target":"embed_type","treatment":"Treat nulls as a 'no embed' category and one-hot encode the 5 levels."},{"confidence":"high","critiques":[],"evidence_keys":["n","n_unique","null_rate","stats.duplicate_rate","stats.one_word_rate","stats.len_median","stats.word_mean","stats.vocab_size","top_values","top_words"],"model":"anthropic:claude-opus-4-7","narrative":"This column stores serialized JSON arrays of hashtags extracted from social posts, with 87,318 of 101,040 rows holding an empty list `[]` and only 10,103 distinct values overall (duplicate_rate 0.90). When hashtags are present they are short \u2014 word_mean 1.38 and len_median 2 \u2014 and span multiple scripts (Thai, Japanese, German, English), so any text processing must be Unicode-aware. The mix of `#NowPlaying` (115) and `#nowplaying` (85) shows case is not normalized.","role":"feature","scope":"column","target":"hashtags","treatment":"Parse the JSON list, lowercase, and one-hot or count-encode the most frequent tags; treat `[]` as an explicit no-tag category."},{"confidence":"high","critiques":[],"evidence_keys":["n","n_unique","duplicate_rate","one_word_rate","top_values","len_mean","len_max","vocab_size"],"model":"anthropic:claude-opus-4-7","narrative":"This column stores a serialized JSON array of mention IDs (hex tokens) attached to each record, but 98,659 of 101,040 rows hold the empty list `[]`, giving a 0.98 duplicate rate and 0.996 one-word rate. When mentions do appear, they are almost always single-element arrays referencing 16-character hex IDs, with the most-cited ID occurring only 16 times. 
Effectively a sparse foreign-key list dominated by absence.","role":"foreign_key","scope":"column","target":"mentions","treatment":"Parse the JSON array and explode to a mention-id join table, or collapse to a binary has_mentions flag given the 98% empty rate."},{"confidence":"high","critiques":[],"evidence_keys":["n","n_unique","duplicate_rate","n_duplicates","url_rate","len_median","len_p95","top_values"],"model":"anthropic:claude-opus-4-7","narrative":"This column stores JSON-encoded arrays of URLs, but 96,211 of 101,040 rows hold the empty literal \"[]\", driving a 96.3% duplicate rate and a median length of 2 characters. Only 3,771 unique values exist and just 4.8% contain a URL, so the field is overwhelmingly absent rather than informative. When populated, entries point to radio/streaming sites (BBC, Radio France, shoutcast), suggesting a sparse list-of-links attribute.","role":"metadata","scope":"column","target":"links","treatment":"Parse the JSON arrays, derive a has_links boolean and link count, and skip the raw string for 
modelling."}],"providers":["anthropic:claude-opus-4-7"],"total_usage":{"completion_tokens":6051,"prompt_tokens":40264,"total_tokens":46315}},"language_counts":{"als":2,"ar":9,"bg":3,"ca":6,"cs":10,"de":108,"el":11,"en":3309,"eo":4,"es":71,"et":2,"fi":13,"fr":78,"hi":2,"id":11,"it":30,"ja":656,"ko":125,"nl":46,"no":3,"pl":12,"pt":78,"ru":30,"sr":3,"sv":7,"th":34,"tr":33,"uk":3,"vi":7,"zh":37},"meta":{"generated_at":"2026-04-22T05:56:37+00:00","mode":"full","row_count":101040,"sampled_rows":101040,"seed":42,"source":"/home/coolhand/datasets/bsky-firehose-anonymized-dec-2025/bluesky_posts.csv"},"notes":[],"saturn_version":"0.2.0","schema":{"author_did_hash":"text","char_count":"numeric","created_at":"text","embed_type":"categorical","has_images":"numeric","has_link":"numeric","has_video":"numeric","hashtags":"text","language":"categorical","links":"text","mentions":"text","reply_parent_hash":"text","reply_root_hash":"text","sentiment":"categorical","sentiment_score":"numeric","text":"text","timestamp":"text","uri_hash":"text","word_count":"numeric"}}
