{"columns":[{"alerts":[{"code":"multilingual","level":"info","message":"31 languages detected in sample"},{"code":"allcaps","level":"info","message":"16.9% rows are all-caps"}],"column":"text","extras":{"language_counts":{"__engine":"fasttext:4,768","als":2,"ar":9,"bg":3,"ca":6,"cs":10,"de":108,"el":11,"en":3309,"eo":4,"es":71,"et":2,"fi":13,"fr":78,"hi":2,"id":11,"it":30,"ja":656,"ko":125,"nl":46,"no":3,"pl":12,"pt":78,"ru":30,"sr":3,"sv":7,"th":34,"tr":33,"uk":3,"vi":7,"zh":37},"language_sample_size":5000,"length_histogram":{"counts":[11042,12354,11078,8177,7049,6309,5146,4495,3865,3459,3244,2592,2219,2065,1867,1708,1624,1467,1439,1439,1563,1657,4862,33,261,3,5,1,2,9,3,0,1,1,0,0,0,0,0,1],"edges":[1.0,14.1,27.2,40.3,53.4,66.5,79.6,92.7,105.8,118.89999999999999,132.0,145.1,158.2,171.29999999999998,184.4,197.5,210.6,223.7,236.79999999999998,249.9,263.0,276.09999999999997,289.2,302.3,315.4,328.5,341.59999999999997,354.7,367.8,380.9,394.0,407.09999999999997,420.2,433.3,446.4,459.5,472.59999999999997,485.7,498.8,511.9,525.0]},"near_unique":false,"sample":["Un client arr\u00eat\u00e9 apr\u00e8s avoir poignard\u00e9 son livreur de repas\n\n\"Un livreur de repas a \u00e9t\u00e9 gri\u00e8vement bless\u00e9 lors d\u2019une tentative de meurtre dans la nuit de dimanche \u00e0 lundi, \u00e0 B\u00fclach. Le...\"\n\nhttps://www.20min.ch/fr/story/buelach-zh-un-client-arrete-apres-avoir-poignarde-son-livreur-de-repas-103470448","F\u0131nd\u0131\u011f\u0131m bu giboyla ilgili itiraf etmek istedi\u011fin bir \u015fey varsa tam vakti \ud83d\ude05 Dm kutuma gel anlat,s\u00f6z bende kalacak anlatt\u0131klar\u0131n \ud83d\ude05\ud83d\ude05","-#strongertogether","Tu b\u2019Shevat is approaching rapidly. Nit to put any pressure on you, but\u2026","Someone mentioned it in my timeline, and so I just rewatched \"The Blue Carbuncle\", from The Adventures of Sherlock Holmes (1984) with Jeremy Brett.  \n\nIt's a great Christmas story and Jeremy Brett is without question the best Holmes ever.\n\nI found it on Britbox","https://trecome.info/articles/89cfe941-6cf1-45ae-8626-cc0241375b46\n\u3010\u65b0\u7740\u8a18\u4e8b\u3011\n\u5b87\u5b99\u30b9\u30c6\u30fc\u30b7\u30e7\u30f3\u306f\uff62\u7d44\u307f\u7acb\u3066\u308b\uff63\u6642\u4ee3\u304b\u3089\uff62\u4e00\u767a\u3067\u5e83\u3052\u308b\uff63\u6642\u4ee3\u3078\uff1f","Happy Christmas Eve, sweet Flanoy! \n\ud83c\udf84\ud83e\udde1\ud83d\udc08","THESE FUCKERS SERIOUSLY COULDN\u2019T WAIT ONE DAY!? /vneg\n#dandysworld","\uc5b4\ub51c \uac00\uc9c0.....","Made in hckr.fr \ud83c\udff4\u200d\u2620\ufe0f\ud83d\udda4 Le genre de petit message qui me fait chaud au c\u0153ur.","\"Why would you have to shrink me to swallow me?\"\n\"It's a lot to swallow. I would know\"\n\na","Yea Newage is great: it's pricey, but you get what you pay for and the quality and engineering are top notch \n\nThe dinobots are great, as is their Jetfire mold and their Blaster mold\n\nWould advise avoiding their Galvatron mold, though: unfortunately he's a bit of a dud and the paint chips easily","we are Energy we are Power we can conquer the world","Un oui que le Tessin redoute\n\n\"L\u2019acceptation de l\u2019initiative visant \u00e0 diviser par deux le financement de la SSR aurait des r\u00e9percussions \u00e9conomiques plus importantes en Suisse italienne qu\u2019en Suisse romande et en Suisse al\u00e9manique, ...\"\n\nhttps://lecourrier.ch/2025/12/21/un-oui-que-le-tessin-redoute/","\u660e\u65e5\u3001\u96e8\u3068\u98a8\u304c\u5f37\u3081\u306a\u306e\u304b\u3001Flood Watch\u3068Wind Advisory\u304b\u3089\u901a\u77e5\u304c\u3067\u3066\u305f\u3002\u505c\u96fb\u306b\u306f\u306a\u3089\u306a\u3044\u307b\u3069\u3067\u3042\u3063\u3066\u307b\u3057\u3044\u3002","No no you've mentioned it before. I had some problems with my kidneys not too long ago thanks to not taking care of my blood pressure. I know it's not the same but the danger I was told I was in was... yanno.\n\nWish I knew what you looked like so I could imagine what more loss would do to you \ud83d\udc40","And people say 10 void is unbeatable \ud83e\udd2d we like the thicc boys (and Illaoi)","\ud83c\udfb6 So much for a \"Merry Christmas\" \ud83c\udfb6","Pr\u016fzkum na dne\u0161n\u00ed den: V\u00e1noce mezi n\u00e1mi\n\n#pruzkum","(FOTD, cont.) there\u2019s no room for those Keith and Jerry solos in a fast FOTD.\n\nTypically great Brown-Eyed Women and then a remarkable Let It Grow, in some ways the quintessential Let It Grow for me\n\nGetting to the 2nd set\u2014 love Jerry\u2019s power chord that kicks off PITB out of Sunrise","Meanwhile, Seattle can't stop homeless deaths; over 200 this year\nHow ironic","#pareridiparte #ritardi #manovra","No president\u2026no matter how powerful or popular\u2026is above the law. \n\nopen.substack.com/pub/thejackh...","\u6614wiifit\u304b\u306a\u3093\u304b\u3067\u30c0\u30a4\u30d3\u30f3\u30b0\u30b2\u30fc\u30e0\u3084\u3063\u305f\u3068\u304d\u306b\u30d2\u30c8\u30c7\u304c\u6016\u3044\u3053\u3068\u306b\u6c17\u4ed8\u3044\u305f \u306a\u305c\u304b\u306f\u308f\u304b\u3089\u306a\u3044 \u661f\u306e\u7802\u3068\u304b\u3082\u82e6\u624b\u3060\u304b\u3089\u3001\u81ea\u7136\u7269\u306a\u306e\u306b\u661f\u578b\u3057\u3066\u308b\u306e\u304c\u6c17\u6301\u3061\u60aa\u304b\u3063\u305f\u306e\u304b\u306a","Feed: \"Daily Post Nigeria\"\nBy: Winner James on Tuesday, December 23, 2025","\u3042\u3042\u3042\u3042\u3042\u3042\u3042\u3042\u3042\u3042\u3042\u3042\u3042\u3042\u3042\u3042\u3042\u3042\u3042\u3042","\u307f\u3066\u30fc\u30fc\u30fc\u30fc\uff01\uff01\uff01\uff01\u304b\u308f\u3044\u3044\u30fc\u30fc\u30fc\u30fc\uff01\uff01\uff01\uff01\uff01","who wants to take bets on how long before dementia donny slips up and says something like \"[it's] not a crime. and if it was a crime, it doesn't matter because i'm the president and the supreme court says anything i do is legal\" ?\nand yet, even if he did confess, the cult would defend him","@gothtatertot.bsky.social"," ","\u6b63\u76f4\u6614\u304b\u3089\u6b32\u3057\u304b\u3063\u305f\u306e\u3067\u3069\u3063\u304b\u306e\u30bf\u30a4\u30df\u30f3\u30b0\u3067\u53d6\u308a\u305f\u3044\u306a\uff5e\u3063\u3066\u6c17\u6301\u3061\u306f\u3042\u308a\u307e\u3059(\u306a\u304a\u30d0\u30a4\u30c8\u00d72\u3068\u30d0\u30f3\u30c9\u3067\u305d\u3093\u306a\u4f59\u88d5\u304c\u7121\u3044)","Same, but without the SPY","on the suit give her a pin of a random pride flag that would fit what anyone has said so far","That\u2019s a marvelously innovative solution to Phoenix\u2019s water shortage!","Yup, planning to power through the game and hopefully listen by early Jan\ud83e\udd1e \n\nHappy holidays, Fine Time crew! \ud83c\udf84\ud83c\udf81\n\nMy thoughts are with Aaron rn, as I'm sure yours are too. Wishing him the best in this tough time.","Tyler Kolek delivers Knicks silver lining with career night in road loss","Yes.\n\nI also learned from my parents that gifts are quite transactional at times.  And that you have to keep score.  Because reciprocation is a statement, but obviously, non-reciprocation is also a statement.\n\nOf course, exceptions are made for families in \u55aa\u4e2d (mourning).","\ud83d\ude29 what a beauty!!! Perfection!","The man is simply not that smart. Arrogance and ignorance are often the two sides of the same coin.","hi :3","\u571f\u65b9\u3055\u3093\u306f\u52d5\u63fa\u3059\u308b\u3053\u3068\u3042\u308b\u306e\u304b\u306a\u301c\u3002\u3059\u3093\u3054\u3044\u6b63\u6c17\u3092\u5931\u3046\u307b\u3069\u306e\u52d5\u63fa\u3092\u3059\u308c\u3070\u3044\u3044\u3068\u601d\u3044\u307e\u3059\u3045\uff08\uff1f\uff1f\uff1f\uff1f\n\uff08\u65e9\u304f\u571f\u65b9\u3055\u3093\u30eb\u30fc\u30c8\u898b\u306b\u884c\u3051\u3070\u3044\u3044\u306e\u306b\u79c1\uff09\n#\u3078\u304d\u3055\u308a\u3085\u3046\u307f\u3053","Yes YES AND YES. \ud83d\udc99\ud83d\udc99\ud83d\udc99\ud83d\udc99\ud83d\udc99 I only post Blue hearts b/c red hearts don't exist. I wish Bluesky gave us this option.","Unlocking Innovation and Securing Knowledge: The Xerox-Stack Overflow Blueprint for Enterprise\u00a0Collaboration\n\nAt AllSafeUs Research Labs, we constantly monitor trends that shape the future of enterprise security and operational excellence. The recent insights into how Xerox leveraged an internal\u2026","Literally all of my emails and phone calls are about late fees and that I have no money, every bank is angry at me, etc. the ratio of that signal to noise with radio silence from people and the great two weeks of doom that is Christmas is a lot. I continue to eke out rent via credit. It\u2019s inevitable","\u72e9\u308a\u30c7\u3059\u3002","i finally have a laptop new enough to use my ipad as a screen extender so i can use csp on there lfggggggg","I'll wait a thousand years just to see you smile again.","Damn it. I've GOT to stop them. But what if I can't? There was no point thinking about it, but he did anyway. Could he run for it? He'd run from Diavolo, knowing he didn't stand a chance. But Giorno Giovanna had defeated Diavolo. Could he get away from him?","Romance is where the big money is, they will try the worst possible of the crap on you first.","\u5168\u4eba\u985e\u5e7c\u5973\u306b\u306a\u3089\u306d\u3047\u304b\u306a\u3041"],"top_values":[["\u0e40\u0e27\u0e47\u0e1a\u0e15\u0e23\u0e07\u0e17\u0e35\u0e48\u0e19\u0e48\u0e32\u0e40\u0e0a\u0e37\u0e48\u0e2d\u0e16\u0e37\u0e2d API \u0e41\u0e17\u0e49 \u0e2d\u0e31\u0e19\u0e14\u0e31\u0e1a1\u0e02\u0e2d\u0e07\u0e44\u0e17\u0e22 \u0e1d\u0e32\u0e01 \u0e16\u0e2d\u0e19 \u0e14\u0e49\u0e27\u0e22\u0e23\u0e30\u0e1a\u0e1a\u0e2d\u0e2d\u0e42\u0e15\u0e49\n\u0e41\u0e2d\u0e14\u0e21\u0e34\u0e19\u0e1a\u0e23\u0e34\u0e01\u0e32\u0e23 24\u0e0a\u0e21.\n\u0e2a\u0e21\u0e31\u0e04\u0e23\u0e23\u0e31\u0e1a\u0e1f\u0e23\u0e35 422 \u0e1a\u0e32\u0e17 rebrand.ly/889a531\n\u0e2a\u0e21\u0e31\u0e04\u0e23\u0e23\u0e31\u0e1a\u0e1f\u0e23\u0e35 422 \u0e1a\u0e32\u0e17 rebrand.ly/889a531\n#\u0e23\u0e31\u0e1a\u0e1f\u0e23\u0e35\n#422\n#\u0e40\u0e04\u0e23\u0e14\u0e34\u0e15\u0e1f\u0e23\u0e35\n#\u0e41\u0e08\u0e01\u0e40\u0e04\u0e23\u0e14\u0e34\u0e15\u0e1f\u0e23\u0e35\n#\u0e04\u0e32\u0e2a\u0e34\u0e42\u0e19\u0e2a\u0e14\n#\u0e40\u0e04\u0e23\u0e14\u0e34\u0e15\u0e1f\u0e23\u0e35\u0e41\u0e04\u0e48\u0e2a\u0e21\u0e31\u0e04\u0e23 #\u0e42\u0e04\u0e49\u0e14\u0e40\u0e04\u0e23\u0e14\u0e34\u0e15\u0e1f\u0e23\u0e35 #\u0e40\u0e04\u0e23\u0e14\u0e34\u0e15\u0e1f\u0e23\u0e35\u0e44\u0e21\u0e48\u0e15\u0e49\u0e2d\u0e07\u0e1d\u0e32\u0e01\u0e44\u0e21\u0e48\u0e15\u0e49\u0e2d\u0e07\u0e41\u0e0a\u0e23\u0e4c #\u0e40\u0e27\u0e47\u0e1a\u0e15\u0e23\u0e07\u0e2a\u0e25\u0e47\u0e2d\u0e15 #\u0e40\u0e04\u0e23\u0e14\u0e34\u0e15\u0e1f\u0e23\u0e3550 #\u0e2a\u0e25\u0e47\u0e2d\u0e15\u0e41\u0e15\u0e01\u0e2b\u0e19\u0e31\u0e01 #\u0e2a\u0e25\u0e47\u0e2d\u0e15\u0e1f\u0e23\u0e35",256],["\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11\ud83d\udc11",224],["bsky.app/profile/dark...",64],[" ",57],["If you missed the pulled 60 Minutes report on CECOT, you can view it here. \n\nPlease share widely. \n\narchive.org/details/60mi...",42],["Christmas (Please Come Home)/Darlene Love bass & vocal by Cara\n\nVery easy on bass, challenging to sing! youtu.be/c4Bg3DPpOU8 \nI'm going to taking 24th-25th off so Merry Christmas and Happy Holidays  #christmas #darlenelove #philspector #ronniespector",41],["\ud83d\ude02",40],["\ud83d\udc40",37],["Can I trouble you to subscribe to my nature-comedy YouTube channel? \ud83d\ude4f \n\nyoutu.be/R9CdYuf-JaE?...",33],["\u2764\ufe0f",31],["\ud83e\udd23",30],["\ud83d\ude0d",27],["Hold our Navy responsible! open.substack.com/pub/growingu...",27],["G\u00fcnayd\u0131n \u2728\ufe0f\ud83d\ude0a\u2728\ufe0f\u2615\ufe0f\u2728\ufe0f",26],["\ud83e\udec2",24],["\ud83e\udd23\ud83e\udd23\ud83e\udd23",22],["bsky.app/profile/phoe...",22],["\ud83c\udff7\ufe0f Special Price: Product\n\u2728 Save money today!\n\n\ud83d\udc46 Check it out!",21],["#shadowicexploit #shadowic #shadowintegration #shadows #shadowsense #shadowiclineexploit #lastpostabouthtelines #shadowictrinity\n#shadowicazazel #shadowicelizabeth #shadowicthomas #shadowicsolomon #shadowiccrowley\n#shadowic2020 #shadowic2025  #shadowix #shift #massshift #anon #shadowanon #anonymous",21],["\ud83d\ude02\ud83d\ude02\ud83d\ude02",20]],"top_words":[["the",7424],["a",4857],["to",4818],["i",4158],["and",4144],["of",3490],["in",2730],["is",2670],["for",2358],["you",1965],["it",1860],["that",1796],["on",1745],["this",1526],["my",1435],["with",1374],["but",1150],["be",1082],["so",1036],["have",977],["was",966],["at",939],["-",913],["not",903],["are",897]],"vocab_skipped":null,"word_histogram":{"counts":[43756,17836,13187,7566,6713,3921,3682,2614,1521,182,40,8,4,3,1,2,0,0,0,0,0,1,0,0,0,0,1,0,1,1],"edges":[1.0,7.466666666666667,13.933333333333334,20.4,26.866666666666667,33.333333333333336,39.8,46.266666666666666,52.733333333333334,59.2,65.66666666666667,72.13333333333334,78.6,85.06666666666666,91.53333333333333,98.0,104.46666666666667,110.93333333333334,117.4,123.86666666666667,130.33333333333334,136.8,143.26666666666668,149.73333333333335,156.2,162.66666666666666,169.13333333333333,175.6,182.06666666666666,188.53333333333333,195.0]}},"kind":"text","n":101040,"n_null":0,"n_unique":95935,"null_rate":0.0,"stats":{"allcaps_rate":0.1690716547901821,"boilerplate_rate":0.001049089469517023,"duplicate_rate":0.05052454473475851,"emoji_rate":0.1832343626286619,"len_max":525,"len_mean":97.62657363420428,"len_median":68.0,"len_min":1,"len_p95":290.0,"n_duplicates":5105,"n_empty":0,"one_word_rate":0.189944576405384,"readability_flesch_mean":64.09147328214267,"url_rate":0.07586104513064133,"vocab_size":77183,"word_mean":14.234619952494063,"word_median":10.0}},{"alerts":[{"code":"one_word","level":"warn","message":"100.0% rows are a single word"},{"code":"short_text","level":"info","message":"95th-percentile length under 20 chars"},{"code":"duplicates","level":"warn","message":"56.5% duplicate strings"}],"column":"author_did_hash","extras":{"language_counts":{},"language_sample_size":5000,"length_histogram":{"counts":[0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,101040,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0],"edges":[15.5,15.525,15.55,15.575,15.6,15.625,15.65,15.675,15.7,15.725,15.75,15.775,15.8,15.825,15.85,15.875,15.9,15.925,15.95,15.975,16.0,16.025,16.05,16.075,16.1,16.125,16.15,16.175,16.2,16.225,16.25,16.275,16.3,16.325,16.35,16.375,16.4,16.425,16.45,16.475,16.5]},"near_unique":false,"sample":["203b2f94ca34ad57","e3fb7462b68ce168","8b80d746cd58f608","74e2cbc89edd37a6","ed4f29630f55ae1d","039de54e8bef8899","61ee7267320497ee","b464fb16192641fa","7422e82a369d2ace","8dc83ec255dde07a","6d6760c7aa86e8e3","66ad99c8fcf5f811","9e4716810429aeca","203b2f94ca34ad57","760203400659f56f","a647051605b55e99","65a0490266af6cea","adeb848bd5142712","fa2043d9fdf56be2","bb678261b563da06","631c8517a01c3e9b","afdcf7421bf5ff34","43b058474ce580d4","e219c042f81df881","634772531b7617e2","05ca1924909285a8","79c19a068331cdf9","5d691986621a5b29","c9abf11307ab8693","95a5305808625469","52806bc5b3077c99","552942e53108b579","510f724fbec56084","236843991ae6f5ff","81c0251ad5a83dce","02f769fbad1b3d05","28e076f5430b3198","b92dfe84ea4b3589","eb452ce2479900da","d1df157966160af1","2d1566f8fd35d0f1","e0a88bbddc0314d0","0e73a5b8648607e1","8b3de10e35e304a9","14d93f819a37b7a8","658d8e90ac98f9ba","af96045733cb7caf","b015d70803283e70","4bf2aa20366a489e","05ca1924909285a8"],"top_values":[["634772531b7617e2",1016],["fb4e916ee2673591",726],["7bef67724621686b",590],["6c53c0fac294c5a4",549],["203b2f94ca34ad57",455],["2ea2a3bb1eb67cd7",411],["31165f7346de9da8",391],["c161bb58161ffd89",255],["ae71c0ad4484309f",232],["5d2b4f7adcec93f3",227],["f96d8b230da452cf",193],["b1df12534bde5974",172],["779ea61a1d359785",157],["4f563f4b2171b9c5",140],["3e5f82241b71d8e7",90],["c63b71e5907c2f34",77],["ce5433548d273f29",73],["6531a5cf8f5fe8d0",73],["0fb2807d4a383d70",72],["2f4d663477771c99",70]],"top_words":[["634772531b7617e2",181],["fb4e916ee2673591",148],["6c53c0fac294c5a4",113],["7bef67724621686b",108],["203b2f94ca34ad57",86],["31165f7346de9da8",72],["2ea2a3bb1eb67cd7",71],["5d2b4f7adcec93f3",50],["ae71c0ad4484309f",50],["c161bb58161ffd89",38],["b1df12534bde5974",32],["779ea61a1d359785",31],["f96d8b230da452cf",30],["4f563f4b2171b9c5",21],["f0ff207f7a7a0939",20],["2f4d663477771c99",19],["6531a5cf8f5fe8d0",19],["2221340be707fd97",18],["76ff98d8b589b65c",18],["1bbb376d1f618f2c",17],["f5c2ce0d362d0ebd",16],["63fc19a8e5da7ebf",16],["2b6aeb9dd9232866",15],["2ff5866b1c8ec5c3",15],["b2a3bb0389254922",15]],"vocab_skipped":null,"word_histogram":{"counts":[0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,101040,0,0,0,0,0,0,0,0,0,0,0,0,0,0],"edges":[0.5,0.5333333333333333,0.5666666666666667,0.6,0.6333333333333333,0.6666666666666666,0.7,0.7333333333333334,0.7666666666666666,0.8,0.8333333333333333,0.8666666666666667,0.9,0.9333333333333333,0.9666666666666667,1.0,1.0333333333333332,1.0666666666666667,1.1,1.1333333333333333,1.1666666666666665,1.2,1.2333333333333334,1.2666666666666666,1.3,1.3333333333333335,1.3666666666666667,1.4,1.4333333333333333,1.4666666666666668,1.5]}},"kind":"text","n":101040,"n_null":0,"n_unique":43998,"null_rate":0.0,"stats":{"allcaps_rate":0.0003463974663499604,"boilerplate_rate":0.0,"duplicate_rate":0.5645486935866983,"emoji_rate":0.0,"len_max":16,"len_mean":16.0,"len_median":16.0,"len_min":16,"len_p95":16.0,"n_duplicates":57042,"n_empty":0,"one_word_rate":1.0,"readability_flesch_mean":68.34500000000003,"url_rate":0.0,"vocab_size":13938,"word_mean":1.0,"word_median":1.0}},{"alerts":[{"code":"near_unique","level":"info","message":"100.0% of rows are unique strings"},{"code":"one_word","level":"warn","message":"100.0% rows are a single word"},{"code":"short_text","level":"info","message":"95th-percentile length under 20 chars"}],"column":"uri_hash","extras":{"language_counts":{},"language_sample_size":5000,"length_histogram":{"counts":[0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,101039,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0],"edges":[15.5,15.525,15.55,15.575,15.6,15.625,15.65,15.675,15.7,15.725,15.75,15.775,15.8,15.825,15.85,15.875,15.9,15.925,15.95,15.975,16.0,16.025,16.05,16.075,16.1,16.125,16.15,16.175,16.2,16.225,16.25,16.275,16.3,16.325,16.35,16.375,16.4,16.425,16.45,16.475,16.5]},"near_unique":true,"sample":["1a925eae4a68e954","9c7b35c448e9f56a","00000da0008897eb","cd823fdacd02b11c","0a525ed50b0474f2","175ab73228973fa3","60f3a1a69409b7ae","e9e0e481dbe7f266","d49a9bf37ba42904","ef51be69a5ee76f8","c60d77c71a6e8368","9e5ad9a403761b25","f783b2ea36e56ac1","5bf20312746a2a9a","421b2c913026eb9c","826a317da4197bb7","5df1f3a26eb43121","ccb51fe4f8c3f397","1f54bdfdf263b300","7cf22e593d9bc6ed","4f766a3bb7833dfa","4ee72d67c35a55cb","f703f2dc01779a54","29cd0823f2d3c76b","f67222e907209d80","e95d5a4652824780","a06c815cda779e65","7ae4569d81570275","9f59f75daa68b0ee","fafcc6b305bc62cd","d21e0678f85c5ec4","f081102ab401665e","720720a55becacb2","086c1ebf93747eeb","2b3f0aa42419125e","54c7b27aad00789d","55eeaadb1cab926e","beaef59a094b6c60","c55b838089921f4a","1f4cdfd2fa3601a0","dbf5ae46203822ae","7e17c82d0b973d79","f4a242fe88642345","a1f7780887eefee9","6baebdcc74bfec02","c3f9d4f4eb24f31b","a42dd84d3bfea1c4","ecb6795f67203eb1","3bc0cf5cc09c1830","9c97e61614ef1444"],"top_values":[],"top_words":[["5684957e422e6ba3",1],["0f88f07348d28f4d",1],["82a16f69d6d20ec1",1],["694ec42695658643",1],["c8f5155b33e96150",1],["df10ce1838069b85",1],["87a923ccc4d8705d",1],["518b0a65f315d1b7",1],["2eb75fb69c2ac4c1",1],["365e1a015347b48d",1],["2cc514a3b35c378a",1],["14166f86704326c6",1],["f10c29ce1abdce0c",1],["cc8f953e97d31653",1],["1ce8d0e81f36c45e",1],["7e4f909df5fe925d",1],["87c68c39a12afe72",1],["434a59da1aefd3ce",1],["e7a81bac740bd1dd",1],["f64c88672bc950e7",1],["1a5b2bfe43eafef9",1],["3a8f1dcd008d5fcd",1],["182fa17deab05a1c",1],["21188534628a908f",1],["15b9277b7f3fda32",1]],"vocab_skipped":null,"word_histogram":{"counts":[0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,101039,0,0,0,0,0,0,0,0,0,0,0,0,0,0],"edges":[0.5,0.5333333333333333,0.5666666666666667,0.6,0.6333333333333333,0.6666666666666666,0.7,0.7333333333333334,0.7666666666666666,0.8,0.8333333333333333,0.8666666666666667,0.9,0.9333333333333333,0.9666666666666667,1.0,1.0333333333333332,1.0666666666666667,1.1,1.1333333333333333,1.1666666666666665,1.2,1.2333333333333334,1.2666666666666666,1.3,1.3333333333333335,1.3666666666666667,1.4,1.4333333333333333,1.4666666666666668,1.5]}},"kind":"text","n":101040,"n_null":1,"n_unique":101039,"null_rate":9.897070467141727e-06,"stats":{"allcaps_rate":0.0005245499262660953,"boilerplate_rate":0.0,"duplicate_rate":0.0,"emoji_rate":0.0,"len_max":16,"len_mean":16.0,"len_median":16.0,"len_min":16,"len_p95":16.0,"n_duplicates":0,"n_empty":0,"one_word_rate":1.0,"readability_flesch_mean":69.61400000000003,"url_rate":0.0,"vocab_size":20000,"word_mean":1.0,"word_median":1.0}},{"alerts":[{"code":"one_word","level":"warn","message":"100.0% rows are a single word"},{"code":"null_rate","level":"warn","message":"57.7% null"},{"code":"short_text","level":"info","message":"95th-percentile length under 20 chars"}],"column":"reply_parent_hash","extras":{"language_counts":{},"language_sample_size":5000,"length_histogram":{"counts":[0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,42770,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0],"edges":[15.5,15.525,15.55,15.575,15.6,15.625,15.65,15.675,15.7,15.725,15.75,15.775,15.8,15.825,15.85,15.875,15.9,15.925,15.95,15.975,16.0,16.025,16.05,16.075,16.1,16.125,16.15,16.175,16.2,16.225,16.25,16.275,16.3,16.325,16.35,16.375,16.4,16.425,16.45,16.475,16.5]},"near_unique":false,"sample":["fc2267f29dd1a492","6b56ce9644d8dcfc","701912916dd3aecb","f16b66c1507d3da9","63ea68b3eabeb6c5","2e341c64d79713f6","bfbbd6900834f900","dd990c5f31cc4ea6","3b2f41bfb941204a","a5ba750d7bf30263","8f8158886219d809","a1c0dc65878bddf4","761a8b8fdccf274c","ff199266a712c936","9654108e92a10dc3","0d63acd9fb1cd064","edd2fa096e8f1acd","979cb6872a8381c3","c3a452f029201028","03eb863e04aa2809","d138f778bc427e59","bdfd3b4bdea8e6f5","be3960ec846e3ec7","a0653755bcad3a93","0d00d9c76bb2df35","ce3faccd3dc1fbf4","61a677bb4fe941b8","cad0660ae64abe4d","c7b4674638507471","1dcc2e279114d2c4","049b10017b1065bf","63f9f4b87bcc0e12","0c16af442fea883c","8a6fcf8af1666afa","114dcf9d942a7f30","32c4ebcad16e8dae","c925c7cd193e26f8","68063c9592df8549","a313886af42c0c38","5f45c8d8175af57d","d4a33f28a6573229","6167f628ae7f437f","0195c50ea2389a16","88666396c1ed98f4","cdf55822380da919","3fea641a8d29d974","219cfd754b18c6f8","86176d3c50983c85","d45a442f16b20217","0e25a874dc39e212"],"top_values":[["04a1db17fbc9ff3a",121],["63f9f4b87bcc0e12",120],["5d60ee5d282843bb",72],["c66d243660d15680",66],["701912916dd3aecb",63],["64481eb4185ef487",58],["3fea641a8d29d974",55],["ece7c6a36c75292d",54],["8071d6d751bc0d25",40],["3783af2182fe114c",36],["2b421be421c362bc",36],["c5e129a05bcd1d8e",35],["b80da3d51026ffef",34],["9340e77013ea61e3",34],["b86f724d032846fa",34],["c376b13fa813b268",33],["93cf5eee050ee084",33],["c7c829a8ea452941",33],["0cc534242e254bf0",31],["9b02c4bf6a573e76",30]],"top_words":[["04a1db17fbc9ff3a",55],["63f9f4b87bcc0e12",46],["5d60ee5d282843bb",37],["64481eb4185ef487",30],["c66d243660d15680",29],["701912916dd3aecb",28],["3fea641a8d29d974",26],["ece7c6a36c75292d",24],["b86f724d032846fa",20],["8071d6d751bc0d25",20],["2b421be421c362bc",19],["c7c829a8ea452941",17],["3783af2182fe114c",17],["c376b13fa813b268",16],["c5e129a05bcd1d8e",16],["93cf5eee050ee084",15],["9b02c4bf6a573e76",15],["86b70746bf58709b",14],["0cc534242e254bf0",14],["de84adf5c370090f",14],["b80da3d51026ffef",12],["6905d954d2274dbb",12],["7f1bd1feeb3a697d",12],["f1ee1162bcc48dd2",12],["9340e77013ea61e3",11]],"vocab_skipped":null,"word_histogram":{"counts":[0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,42770,0,0,0,0,0,0,0,0,0,0,0,0,0,0],"edges":[0.5,0.5333333333333333,0.5666666666666667,0.6,0.6333333333333333,0.6666666666666666,0.7,0.7333333333333334,0.7666666666666666,0.8,0.8333333333333333,0.8666666666666667,0.9,0.9333333333333333,0.9666666666666667,1.0,1.0333333333333332,1.0666666666666667,1.1,1.1333333333333333,1.1666666666666665,1.2,1.2333333333333334,1.2666666666666666,1.3,1.3333333333333335,1.3666666666666667,1.4,1.4333333333333333,1.4666666666666668,1.5]}},"kind":"text","n":101040,"n_null":58270,"n_unique":34738,"null_rate":0.5767022961203484,"stats":{"allcaps_rate":0.0007949497311199439,"boilerplate_rate":0.0,"duplicate_rate":0.18779518353986438,"emoji_rate":0.0,"len_max":16,"len_mean":16.0,"len_median":16.0,"len_min":16,"len_p95":16.0,"n_duplicates":8032,"n_empty":0,"one_word_rate":1.0,"readability_flesch_mean":71.72900000000003,"url_rate":0.0,"vocab_size":17415,"word_mean":1.0,"word_median":1.0}},{"alerts":[{"code":"one_word","level":"warn","message":"100.0% rows are a single word"},{"code":"null_rate","level":"warn","message":"57.7% null"},{"code":"short_text","level":"info","message":"95th-percentile length under 20 chars"},{"code":"duplicates","level":"warn","message":"50.3% duplicate strings"}],"column":"reply_root_hash","extras":{"language_counts":{},"language_sample_size":5000,"length_histogram":{"counts":[0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,42770,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0],"edges":[15.5,15.525,15.55,15.575,15.6,15.625,15.65,15.675,15.7,15.725,15.75,15.775,15.8,15.825,15.85,15.875,15.9,15.925,15.95,15.975,16.0,16.025,16.05,16.075,16.1,16.125,16.15,16.175,16.2,16.225,16.25,16.275,16.3,16.325,16.35,16.375,16.4,16.425,16.45,16.475,16.5]},"near_unique":false,"sample":["fc2267f29dd1a492","6b56ce9644d8dcfc","701912916dd3aecb","f16b66c1507d3da9","63ea68b3eabeb6c5","2e341c64d79713f6","dc0cf00aab42248a","65f573012a42f37a","152ff36a17b9ab54","2da15f2e55e9a171","8f8158886219d809","a1c0dc65878bddf4","761a8b8fdccf274c","ff199266a712c936","e192ff4b5e6a63cf","0d63acd9fb1cd064","edd2fa096e8f1acd","979cb6872a8381c3","c3a452f029201028","03eb863e04aa2809","8bcd5caa97e16650","c6e9611badba68fb","b7f10d1f67a22882","38da10ae5b7b9ca1","0d00d9c76bb2df35","ce3faccd3dc1fbf4","61a677bb4fe941b8","cad0660ae64abe4d","c7b4674638507471","647c7d83cbf87fd4","049b10017b1065bf","63f9f4b87bcc0e12","0c16af442fea883c","e1fe8c748a6157e7","114dcf9d942a7f30","32c4ebcad16e8dae","d82c366ef0df9d4f","68063c9592df8549","41e8804cc09291c2","5f45c8d8175af57d","d3976f32e7582cdd","ccd2eb68f0536ea5","66fe7374cc0d2ebd","cb704fda8f95f3c3","cdf55822380da919","3fea641a8d29d974","219cfd754b18c6f8","86176d3c50983c85","07afed23dc936e21","0e25a874dc39e212"],"top_values":[["63f9f4b87bcc0e12",151],["04a1db17fbc9ff3a",148],["5d60ee5d282843bb",103],["7af1a48b8d39ed44",101],["20b004b78a60a470",93],["64481eb4185ef487",92],["c66d243660d15680",76],["701912916dd3aecb",76],["c5e129a05bcd1d8e",74],["3783af2182fe114c",73],["9aa8c958e3601e71",69],["3fea641a8d29d974",65],["5e11823c64750ffb",59],["ece7c6a36c75292d",59],["a8b36c1e17ab6e21",57],["5ec6a225f7cdc6e4",57],["19e98969b38fd9fd",57],["a1257c51645e9ed7",57],["c376b13fa813b268",53],["b85f83c856e20b05",53]],"top_words":[["04a1db17fbc9ff3a",66],["63f9f4b87bcc0e12",58],["7af1a48b8d39ed44",56],["5d60ee5d282843bb",54],["64481eb4185ef487",49],["20b004b78a60a470",41],["701912916dd3aecb",35],["3783af2182fe114c",33],["c66d243660d15680",33],["9aa8c958e3601e71",31],["c5e129a05bcd1d8e",31],["5e11823c64750ffb",29],["19e98969b38fd9fd",29],["3fea641a8d29d974",29],["b85f83c856e20b05",28],["ece7c6a36c75292d",25],["8071d6d751bc0d25",25],["f53a5087ebb15eb9",25],["a1257c51645e9ed7",24],["9b02c4bf6a573e76",23],["c376b13fa813b268",23],["fe819c2f3a5967f2",23],["c7c829a8ea452941",22],["2b421be421c362bc",22],["b80da3d51026ffef",21]],"vocab_skipped":null,"word_histogram":{"counts":[0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,42770,0,0,0,0,0,0,0,0,0,0,0,0,0,0],"edges":[0.5,0.5333333333333333,0.5666666666666667,0.6,0.6333333333333333,0.6666666666666666,0.7,0.7333333333333334,0.7666666666666666,0.8,0.8333333333333333,0.8666666666666667,0.9,0.9333333333333333,0.9666666666666667,1.0,1.0333333333333332,1.0666666666666667,1.1,1.1333333333333333,1.1666666666666665,1.2,1.2333333333333334,1.2666666666666666,1.3,1.3333333333333335,1.3666666666666667,1.4,1.4333333333333333,1.4666666666666668,1.5]}},"kind":"text","n":101040,"n_null":58270,"n_unique":21277,"null_rate":0.5767022961203484,"stats":{"allcaps_rate":0.0008183306055646482,"boilerplate_rate":0.0,"duplicate_rate":0.5025251344400281,"emoji_rate":0.0,"len_max":16,"len_mean":16.0,"len_median":16.0,"len_min":16,"len_p95":16.0,"n_duplicates":21493,"n_empty":0,"one_word_rate":1.0,"readability_flesch_mean":77.22800000000002,"url_rate":0.0,"vocab_size":12498,"word_mean":1.0,"word_median":1.0}},{"alerts":[],"column":"sentiment","extras":{"singletons":0,"top_values":[["neutral",48981],["positive",34622],["negative",17437]]},"kind":"categorical","n":101040,"n_null":0,"n_unique":3,"null_rate":0.0,"stats":{"cardinality":3,"entropy":1.4732925089885032,"entropy_ratio":0.9295440796347906,"top_rate":0.4847684085510689,"top_value":"neutral"}},{"alerts":[{"code":"outliers","level":"warn","message":"5.7% rows beyond 1.5 IQR"}],"column":"sentiment_score","extras":{"histogram":{"counts":[247,541,705,822,812,874,1081,1067,1130,1234,1387,1127,895,1120,1585,792,726,672,624,48620,358,740,623,781,1426,1343,1581,2179,3463,2801,1918,2382,2518,2004,1886,1870,2266,2011,1758,1071],"edges":[-0.998,-0.94805,-0.8981,-0.84815,-0.7982,-0.74825,-0.6982999999999999,-0.64835,-0.5984,-0.54845,-0.4985,-0.44855,-0.39859999999999995,-0.34865,-0.29869999999999997,-0.24875000000000003,-0.19879999999999998,-0.14884999999999993,-0.09889999999999999,-0.04894999999999994,0.0010000000000000009,0.05095000000000005,0.10089999999999999,0.15084999999999993,0.2008000000000001,0.25075000000000003,0.30069999999999997,0.35065000000000013,0.40060000000000007,0.45055,0.5005,0.5504500000000001,0.6004,0.65035,0.7003000000000001,0.7502500000000001,0.8002,0.85015,0.9001000000000001,0.9500500000000001,1.0]},"sample":[0.44,0.557,0.0,0.735,0.0,-0.296,0.0,0.128,-0.67,0.0,0.509,-0.735,0.612,0.226,0.077,-0.962,0.0,0.0,-0.226,0.527,0.026,0.557,0.0,0.907,0.786,0.0,0.0,-0.572,0.0,-0.557,0.0,0.34,-0.959,0.026,0.0,0.372,0.0,0.0,0.0,0.542,0.0,0.0,0.0,0.0,0.557,0.524,0.0,0.0,-0.309,0.0,-0.178,0.421,0.44,0.0,0.421,0.0,0.296,-0.103,0.0,0.617,0.0,0.637,0.0,0.0,0.0,0.0,0.689,0.67,0.0,0.865,0.0,0.0,0.0,-0.459,-0.571,-0.318,0.0,0.0,-0.318,0.0,0.0,0.0,0.924,0.44,0.0,0.004,0.0,0.0,0.402,0.0,0.0,0.957,0.0,0.542,0.0,0.0,0.742,0.0,0.834,0.0,0.34,0.0,0.0,0.0,0.0,0.0,-0.599,-0.538,0.0,0.0,0.0,0.691,0.0,0.0,0.0,0.402,0.0,0.863,0.0,0.542,0.0,-0.604,0.542,0.0,-0.179,0.128,-0.178,0.0,0.494,0.802,0.0,0.0,0.0,0.0,-0.445,-0.226,0.224,0.0,-0.153,0.34,-0.515,0.0,0.0,0.44,0.318,0.0,0.957,0.0,0.026,0.984,0.0,0.0,-0.057,0.0,0.637,0.188,0.0,0.0,0.0,0.991,0.44,0.226,-0.34,0.0,0.0,0.67,0.612,-0.89,0.0,-0.128,0.0,0.0,0.0,0.0,-0.318,0.266,0.0,-0.922,0.361,0.718,0.0,0.0,0.872,0.0,-0.511,-0.296,0.0,0.0,-0.026,0.0,0.0,0.44,0.115,0.0,0.633,0.0,0.494,0.494,-0.455,0.0,0.026,0.077,0.0,0.0,-0.542,-0.42,-0.951,0.0,0.421,0.0,0.827,0.459,0.0,0.0,0.0,0.904,0.0,0.44,0.0,0.557,0.527,0.459,0.226,0.81,0.361,0.0,-0.273,0.0,-0.783,0.0,0.285,0.0,-0.871,0.296,0.0,0.0,0.0,0.394,0.0,0.0,0.0,0.0,0.867,0.727,0.296,0.781,0.0,0.202,-0.866,0.807,0.0,0.0,0.718,0.0,0.0,0.599,0.0,0.674,0.697,-0.361,0.0,0.0,0.0,-0.599,0.0,0.0,-0.359,-0.178,0.0,0.0,0.0,0.0,0.0,0.0,0.0,-0.557,0.0,-0.525,-0.542,0.0,0.872,0.0,0.0,-0.572,0.42,0.0,0.0,0.67,0.077,0.542,0.0,0.0,0.44,0.0,0.0,0.0,0.226,0.0,0.599,0.077,0.0,0.095,0.527,0.0,0.0,-0.2,0.852,0.0,0.0,0.421,0.0,0.337,0.361,0.827,0.052,0.0,0.0,0.0,0.0,0.12,0.34,0.649,0.0,0.511,-0.681,0.128,0.421,0.0,0.0,0.369,0.61,-0.296,0.0,0.0,-0.718,0.85,0.0,0.0,-0.477,0.072,0.0,0.648,0.0,0.077,0.0,0.0,0.433,0.459,-0.557,0.0,0.296,0.612,0.0,0.0,0.0,0.0,0.764,0.0,0.519,0.202,-0.625,-0.261,0.585,0.67,0.0,0.9,0.98,0.0,-0.482,0.625,0.0,0.421,0.0,0.25,0.586,0.34,0.71,0.0,0.0,0.318,0.0,0.0,-0.272,-0.153,-0.772,0.597,0.0,0.979,0.0,0.103,0.296,0.0,0.0,0.0,0.0,0.625,-0.494,0.0,0.0,0.0,-0.402,0.0,-0.421,0.0,0.0,-0.625,0.0,0.0,0.494,0.0,0.845,0.0,-0.987,0.511,-0.929,0.511,0.625,0.0,0.961,0.0,0.0,0.0,-0.772,0.44,0.273,0.95,0.494,0.459,0.103,0.0,0.0,0.0,-0.813,0.0,0.0,0.44,0.329,0.0,0.879,0.452,0.361,0.0,0.318,0.0,0.0,0.494,0.541,-0.599,0.0,0.0,0.875,0.0,0.0,0.813,0.0,0.0,-0.494,0.0,0.957,0.0,0.252,0.1,0.026,0.0,-0.832,0.572,0.572,0.493,0.44,0.0,-0.625,-0.202,0.0,0.887,0.0,0.586,0.0,-0.475,0.0,0.128,0.0,-0.471,0.0,0.0,0.0,0.0,0.916,0.0,0.696,0.0,0.425,0.542,0.637,0.0,0.0,0.0,0.0,0.0,-0.421,0.74]},"kind":"numeric","n":101040,"n_null":0,"n_unique":1928,"null_rate":0.0,"stats":{"iqr":0.402,"kurtosis":0.01774018153532797,"max":1.0,"mean":0.10737404988123514,"median":0.0,"min":-0.998,"n_outliers":5763,"outlier_rate":0.05703681710213777,"q1":0.0,"q3":0.402,"skew":0.01861160445986652,"std":0.41035286023047746,"zero_rate":0.4779790182106097}},{"alerts":[{"code":"near_unique","level":"info","message":"95.6% of rows are unique strings"},{"code":"one_word","level":"warn","message":"100.0% rows are a single word"},{"code":"allcaps","level":"info","message":"100.0% rows are all-caps"}],"column":"created_at","extras":{"language_counts":{},"language_sample_size":5000,"length_histogram":{"counts":[2195,0,0,0,0,1,0,0,3,0,88967,0,0,2823,0,0,1,0,2202,0,0,118,0,0,1342,0,61,0,0,0,0,0,3296,0,28,0,0,0,0,3],"edges":[20.0,20.375,20.75,21.125,21.5,21.875,22.25,22.625,23.0,23.375,23.75,24.125,24.5,24.875,25.25,25.625,26.0,26.375,26.75,27.125,27.5,27.875,28.25,28.625,29.0,29.375,29.75,30.125,30.5,30.875,31.25,31.625,32.0,32.375,32.75,33.125,33.5,33.875,34.25,34.625,35.0]},"near_unique":true,"sample":["2025-12-15T15:02:56.000000Z","2025-12-24T05:46:28.199Z","2025-12-24T05:53:52.540Z","2025-12-24T05:51:06.770Z","2025-12-24T05:24:05.186Z","2025-12-24T06:00:49.556+00:00","2025-12-24T05:25:08.535Z","2025-12-24T05:56:08.507Z","2025-12-24T05:51:12.695Z","2025-12-24T05:00:11.869Z","2025-12-24T05:36:17.493Z","2025-12-24T05:31:13.198Z","2025-12-24T05:14:27.095Z","2025-12-21T18:10:24.000000Z","2025-12-24T05:15:16.096Z","2025-12-24T05:08:42.328Z","2025-12-24T05:03:04.652Z","2025-12-24T05:52:01.731Z","2025-12-24T07:00:05+01:00","2025-12-24T05:22:31.315Z","2025-12-24T05:16:06.574Z","2025-12-24T05:00:15.066Z","2025-12-24T05:07:13.958Z","2025-12-24T05:03:23.481Z","2025-12-24T05:42:34.395429+00:00","2025-12-24T05:47:06.407Z","2025-12-24T05:32:40.305Z","2025-12-24T05:08:58.084Z","2025-12-24T05:01:58.611Z","2025-12-24T06:00:44.69651400Z","2025-12-24T05:39:36.521Z","2025-12-24T05:40:02.871Z","2025-12-24T05:40:02.315Z","2025-12-24T05:21:42.322Z","2025-12-24T05:19:48.019Z","2025-12-24T05:24:45.786Z","2025-12-24T05:58:53.732Z","2025-12-24T05:29:49.642Z","2025-12-24T05:25:29.364Z","2025-12-24T05:30:56.414Z","2025-12-24T05:50:22.290Z","2025-12-24T05:58:02.196Z","2025-12-24T05:14:13Z","2025-12-24T05:57:56.486Z","2025-12-24T06:00:03.501Z","2025-12-24T06:01:00.722Z","2025-12-24T05:18:33.75536200Z","2025-12-24T05:21:42.89654100Z","2025-12-24T05:06:56.863Z","2025-12-24T05:55:01.488Z"],"top_values":[],"top_words":[["2025-12-24t05:30:00.000000z",6],["2025-12-24t05:00:18+00:00",5],["2025-12-24t05:00:07+00:00",5],["2025-12-24t05:00:00.000z",5],["2025-12-24t06:00:12+00:00",4],["2025-12-24t05:01:25+00:00",4],["2025-12-24t05:10:25.000z",4],["2025-12-24t05:05:26.344z",3],["2025-12-24t05:16:57.000z",3],["2025-12-24t05:00:11+00:00",3],["2025-12-24t05:10:02.000z",3],["2025-12-24t05:15:11+00:00",3],["2025-12-24t05:57:17+00:00",3],["2025-12-24t05:30:19+00:00",3],["2025-12-24t05:10:00.000z",3],["2025-12-24t05:26:24+00:00",3],["2025-12-15t20:40:01.000000z",3],["2025-12-24t05:00:09+00:00",3],["2025-12-24t05:01:24+00:00",3],["2025-12-24t05:00:23+00:00",3],["2025-12-24t05:36:13z",3],["2025-12-24t05:30:10+00:00",3],["2025-12-24t06:00:58z",3],["2025-12-24t05:00:00.000000z",3],["2025-12-24t05:20:09+00:00",3]],"vocab_skipped":null,"word_histogram":{"counts":[0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,101040,0,0,0,0,0,0,0,0,0,0,0,0,0,0],"edges":[0.5,0.5333333333333333,0.5666666666666667,0.6,0.6333333333333333,0.6666666666666666,0.7,0.7333333333333334,0.7666666666666666,0.8,0.8333333333333333,0.8666666666666667,0.9,0.9333333333333333,0.9666666666666667,1.0,1.0333333333333332,1.0666666666666667,1.1,1.1333333333333333,1.1666666666666665,1.2,1.2333333333333334,1.2666666666666666,1.3,1.3333333333333335,1.3666666666666667,1.4,1.4333333333333333,1.4666666666666668,1.5]}},"kind":"text","n":101040,"n_null":0,"n_unique":96576,"null_rate":0.0,"stats":{"allcaps_rate":1.0,"boilerplate_rate":0.0,"duplicate_rate":0.044180522565320665,"emoji_rate":0.0,"len_max":35,"len_mean":24.344883214568487,"len_median":24.0,"len_min":20,"len_p95":27.0,"n_duplicates":4464,"n_empty":0,"one_word_rate":1.0,"readability_flesch_mean":121.22000000000004,"url_rate":0.0,"vocab_size":19720,"word_mean":1.0,"word_median":1.0}},{"alerts":[{"code":"near_unique","level":"info","message":"100.0% of rows are unique strings"},{"code":"one_word","level":"warn","message":"100.0% rows are a single word"},{"code":"allcaps","level":"info","message":"100.0% rows are all-caps"}],"column":"timestamp","extras":{"language_counts":{},"language_sample_size":5000,"length_histogram":{"counts":[0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,101040,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0],"edges":[25.5,25.525,25.55,25.575,25.6,25.625,25.65,25.675,25.7,25.725,25.75,25.775,25.8,25.825,25.85,25.875,25.9,25.925,25.95,25.975,26.0,26.025,26.05,26.075,26.1,26.125,26.15,26.175,26.2,26.225,26.25,26.275,26.3,26.325,26.35,26.375,26.4,26.425,26.45,26.475,26.5]},"near_unique":true,"sample":["2025-12-23T23:35:20.812256","2025-12-23T23:46:29.113253","2025-12-23T23:53:52.818216","2025-12-23T23:51:06.721420","2025-12-23T23:24:08.130284","2025-12-24T00:00:49.619695","2025-12-23T23:25:09.314207","2025-12-23T23:56:13.728686","2025-12-23T23:51:13.117978","2025-12-23T23:00:12.134916","2025-12-23T23:36:17.036168","2025-12-23T23:31:13.714937","2025-12-23T23:14:26.511690","2025-12-23T23:36:02.614701","2025-12-23T23:15:16.232765","2025-12-23T23:08:43.419178","2025-12-23T23:03:06.224482","2025-12-23T23:52:03.525787","2025-12-24T00:00:06.710285","2025-12-23T23:22:31.725584","2025-12-23T23:16:07.221539","2025-12-23T23:00:18.828637","2025-12-23T23:07:15.210597","2025-12-23T23:04:37.914250","2025-12-23T23:42:37.623367","2025-12-23T23:47:21.225655","2025-12-23T23:32:40.130904","2025-12-23T23:08:58.632159","2025-12-23T23:11:00.411999","2025-12-24T00:00:50.418800","2025-12-23T23:39:37.115449","2025-12-23T23:40:03.222485","2025-12-23T23:40:03.622812","2025-12-23T23:21:42.521784","2025-12-23T23:19:46.611876","2025-12-23T23:24:46.315084","2025-12-23T23:58:54.123171","2025-12-23T23:29:50.524379","2025-12-23T23:25:29.911828","2025-12-23T23:30:57.425562","2025-12-23T23:50:20.512413","2025-12-23T23:58:04.230868","2025-12-23T23:14:16.720933","2025-12-23T23:57:56.821107","2025-12-24T00:00:05.228046","2025-12-24T00:01:00.915232","2025-12-23T23:18:39.211864","2025-12-23T23:21:46.340984","2025-12-23T23:06:57.019142","2025-12-23T23:55:23.124122"],"top_values":[],"top_words":[["2025-12-23t23:31:14.814458",1],["2025-12-24t00:01:00.814493",1],["2025-12-24t00:01:32.810712",1],["2025-12-23t23:10:53.027823",1],["2025-12-23t23:50:31.015750",1],["2025-12-23t23:51:36.411245",1],["2025-12-23t23:05:29.141589",1],["2025-12-23t23:52:35.225405",1],["2025-12-23t23:22:39.629314",1],["2025-12-23t23:53:49.623856",1],["2025-12-23t23:21:40.228097",1],["2025-12-23t23:18:01.109980",1],["2025-12-23t23:54:43.722945",1],["2025-12-23t23:45:49.317370",1],["2025-12-23t23:34:32.230216",1],["2025-12-23t23:31:39.424766",1],["2025-12-23t23:38:48.919134",1],["2025-12-24t00:00:06.926022",1],["2025-12-23t23:27:27.413944",1],["2025-12-23t23:21:37.016189",1],["2025-12-23t23:07:36.630496",1],["2025-12-24t00:00:10.634942",1],["2025-12-23t23:49:05.730511",1],["2025-12-23t23:59:15.810600",1],["2025-12-23t23:46:14.226083",1]],"vocab_skipped":null,"word_histogram":{"counts":[0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,101040,0,0,0,0,0,0,0,0,0,0,0,0,0,0],"edges":[0.5,0.5333333333333333,0.5666666666666667,0.6,0.6333333333333333,0.6666666666666666,0.7,0.7333333333333334,0.7666666666666666,0.8,0.8333333333333333,0.8666666666666667,0.9,0.9333333333333333,0.9666666666666667,1.0,1.0333333333333332,1.0666666666666667,1.1,1.1333333333333333,1.1666666666666665,1.2,1.2333333333333334,1.2666666666666666,1.3,1.3333333333333335,1.3666666666666667,1.4,1.4333333333333333,1.4666666666666668,1.5]}},"kind":"text","n":101040,"n_null":0,"n_unique":101040,"null_rate":0.0,"stats":{"allcaps_rate":1.0,"boilerplate_rate":0.0,"duplicate_rate":0.0,"emoji_rate":0.0,"len_max":26,"len_mean":26.0,"len_median":26.0,"len_min":26,"len_p95":26.0,"n_duplicates":0,"n_empty":0,"one_word_rate":1.0,"readability_flesch_mean":121.22000000000004,"url_rate":0.0,"vocab_size":20000,"word_mean":1.0,"word_median":1.0}},{"alerts":[],"column":"language","extras":{"singletons":10,"top_values":[["en",61468],["ja",12607],["unknown",11481],["en-US",3617],["ko",2406],["de",1821],["pt",1295],["es",1153],["fr",746],["th",612],["tr",548],["nl",525],["zh",315],["it",276],["ru",213],["fi",193],["ja-JP",170],["id",158],["pl",139],["el",116]]},"kind":"categorical","n":101040,"n_null":0,"n_unique":90,"null_rate":0.0,"stats":{"cardinality":90,"entropy":2.178453126992494,"entropy_ratio":0.33556722474575634,"top_rate":0.6083531274742676,"top_value":"en"}},{"alerts":[],"column":"char_count","extras":{"histogram":{"counts":[11042,12354,11078,8177,7049,6309,5146,4495,3865,3459,3244,2592,2219,2065,1867,1708,1624,1467,1439,1439,1563,1657,4862,33,261,3,5,1,2,9,3,0,1,1,0,0,0,0,0,1],"edges":[1.0,14.1,27.2,40.3,53.4,66.5,79.6,92.7,105.8,118.89999999999999,132.0,145.1,158.2,171.29999999999998,184.4,197.5,210.6,223.7,236.79999999999998,249.9,263.0,276.09999999999997,289.2,302.3,315.4,328.5,341.59999999999997,354.7,367.8,380.9,394.0,407.09999999999997,420.2,433.3,446.4,459.5,472.59999999999997,485.7,498.8,511.9,525.0]},"sample":[72.0,131.0,116.0,108.0,87.0,18.0,33.0,194.0,48.0,15.0,19.0,39.0,70.0,221.0,65.0,273.0,80.0,15.0,89.0,45.0,76.0,251.0,205.0,60.0,63.0,31.0,27.0,249.0,45.0,97.0,111.0,132.0,272.0,215.0,29.0,294.0,53.0,25.0,143.0,41.0,49.0,163.0,85.0,47.0,285.0,53.0,56.0,15.0,73.0,30.0,152.0,73.0,40.0,21.0,73.0,17.0,5.0,165.0,23.0,95.0,25.0,49.0,47.0,24.0,31.0,176.0,193.0,2.0,18.0,134.0,34.0,194.0,47.0,128.0,22.0,126.0,71.0,146.0,187.0,13.0,34.0,30.0,75.0,75.0,186.0,60.0,16.0,28.0,44.0,109.0,75.0,299.0,31.0,51.0,119.0,42.0,17.0,119.0,88.0,97.0,26.0,295.0,4.0,6.0,21.0,116.0,152.0,86.0,27.0,24.0,150.0,72.0,24.0,62.0,132.0,63.0,271.0,108.0,24.0,17.0,24.0,147.0,34.0,49.0,299.0,26.0,160.0,33.0,1.0,78.0,62.0,10.0,36.0,1.0,156.0,15.0,267.0,36.0,209.0,87.0,76.0,29.0,103.0,262.0,59.0,108.0,234.0,8.0,107.0,234.0,12.0,3.0,117.0,298.0,123.0,84.0,59.0,62.0,50.0,44.0,39.0,69.0,12.0,139.0,54.0,18.0,18.0,273.0,5.0,56.0,261.0,31.0,42.0,44.0,152.0,294.0,60.0,161.0,99.0,133.0,40.0,106.0,169.0,28.0,19.0,253.0,104.0,14.0,89.0,22.0,4.0,10.0,48.0,23.0,37.0,143.0,79.0,74.0,59.0,38.0,143.0,265.0,95.0,5.0,97.0,88.0,270.0,175.0,43.0,37.0,7.0,300.0,122.0,100.0,1.0,39.0,54.0,20.0,300.0,93.0,295.0,29.0,175.0,165.0,112.0,24.0,69.0,26.0,232.0,41.0,126.0,49.0,96.0,92.0,65.0,47.0,37.0,223.0,8.0,101.0,4.0,86.0,49.0,276.0,35.0,197.0,286.0,158.0,233.0,38.0,33.0,37.0,60.0,20.0,6.0,300.0,81.0,18.0,101.0,55.0,34.0,89.0,6.0,139.0,54.0,8.0,289.0,59.0,9.0,33.0,52.0,59.0,46.0,50.0,28.0,292.0,11.0,111.0,17.0,34.0,27.0,150.0,15.0,58.0,148.0,98.0,67.0,275.0,35.0,54.0,22.0,81.0,153.0,19.0,29.0,23.0,111.0,99.0,55.0,57.0,315.0,118.0,139.0,100.0,60.0,98.0,146.0,7.0,46.0,66.0,87.0,300.0,10.0,3.0,61.0,237.0,37.0,104.0,12.0,67.0,157.0,40.0,178.0,280.0,142.0,101.0,193.0,25.0,80.0,17.0,184.0,137.0,11.0,58.0,12.0,47.0,116.0,163.0,28.0,254.0,292.0,116.0,16.0,5.0,156.0,25.0,63.0,52.0,42.0,82.0,186.0,30.0,64.0,7.0,29.0,45.0,163.0,225.0,168.0,144.0,76.0,280.0,29.0,280.0,18.0,138.0,9.0,39.0,105.0,98.0,226.0,26.0,64.0,35.0,3.0,135.0,112.0,65.0,30.0,79.0,68.0,38.0,70.0,23.0,10.0,40.0,56.0,81.0,13.0,75.0,262.0,84.0,51.0,55.0,12.0,37.0,111.0,255.0,22.0,76.0,128.0,26.0,50.0,3.0,10.0,132.0,14.0,300.0,35.0,34.0,265.0,46.0,187.0,194.0,119.0,84.0,9.0,55.0,93.0,71.0,2.0,300.0,106.0,57.0,88.0,4.0,34.0,83.0,228.0,299.0,108.0,9.0,128.0,25.0,151.0,57.0,16.0,42.0,23.0,105.0,27.0,106.0,49.0,32.0,64.0,9.0,9.0,32.0,1.0,29.0,300.0,5.0,249.0,269.0,299.0,23.0,173.0,5.0,208.0,35.0,299.0,35.0,69.0,56.0,38.0,9.0,58.0,24.0,3.0,327.0,24.0,151.0,101.0,135.0,7.0,101.0,172.0,252.0,14.0,256.0,80.0,187.0,162.0,8.0,16.0,7.0,307.0,283.0,172.0,9.0,94.0,20.0,13.0,62.0,20.0,75.0,208.0,110.0,94.0,143.0]},"kind":"numeric","n":101040,"n_null":0,"n_unique":341,"null_rate":0.0,"stats":{"iqr":113.0,"kurtosis":-0.05732629898733377,"max":525.0,"mean":97.62657363420428,"median":68.0,"min":1.0,"n_outliers":289,"outlier_rate":0.002860253365003959,"q1":30.0,"q3":143.0,"skew":1.0177078866571954,"std":86.05175233030359,"zero_rate":0.0}},{"alerts":[],"column":"word_count","extras":{"histogram":{"counts":[21450,9307,8043,7043,6459,5882,4926,4074,3582,3218,2640,2512,2280,3311,1923,1805,1633,1314,1144,1057,1076,1046,994,965,918,805,910,334,189,88,31,23,21,7,10,14,3,1,0,2],"edges":[0.0,2.075,4.15,6.2250000000000005,8.3,10.375,12.450000000000001,14.525000000000002,16.6,18.675,20.75,22.825000000000003,24.900000000000002,26.975,29.050000000000004,31.125000000000004,33.2,35.275000000000006,37.35,39.425000000000004,41.5,43.575,45.650000000000006,47.725,49.800000000000004,51.87500000000001,53.95,56.025000000000006,58.10000000000001,60.175000000000004,62.25000000000001,64.325,66.4,68.47500000000001,70.55000000000001,72.625,74.7,76.775,78.85000000000001,80.92500000000001,83.0]},"sample":[10.0,25.0,17.0,11.0,4.0,5.0,2.0,33.0,8.0,1.0,4.0,1.0,14.0,42.0,9.0,51.0,5.0,1.0,10.0,9.0,14.0,34.0,22.0,12.0,12.0,1.0,6.0,42.0,8.0,17.0,18.0,26.0,51.0,45.0,5.0,51.0,11.0,2.0,25.0,8.0,9.0,24.0,3.0,8.0,44.0,10.0,9.0,3.0,14.0,1.0,28.0,14.0,8.0,3.0,12.0,2.0,2.0,36.0,4.0,18.0,3.0,10.0,7.0,1.0,6.0,9.0,34.0,1.0,2.0,23.0,6.0,31.0,8.0,24.0,3.0,23.0,13.0,16.0,33.0,2.0,6.0,7.0,15.0,11.0,29.0,9.0,2.0,5.0,9.0,2.0,10.0,56.0,7.0,4.0,8.0,7.0,4.0,8.0,16.0,5.0,5.0,6.0,1.0,1.0,5.0,6.0,20.0,15.0,1.0,1.0,14.0,12.0,1.0,8.0,20.0,12.0,41.0,18.0,1.0,3.0,4.0,24.0,7.0,8.0,65.0,5.0,33.0,1.0,1.0,14.0,11.0,1.0,5.0,1.0,27.0,3.0,43.0,9.0,39.0,18.0,12.0,6.0,17.0,58.0,11.0,15.0,38.0,1.0,17.0,50.0,1.0,1.0,21.0,54.0,19.0,16.0,12.0,9.0,9.0,3.0,4.0,13.0,3.0,23.0,15.0,5.0,3.0,52.0,1.0,9.0,10.0,1.0,5.0,10.0,11.0,49.0,10.0,31.0,19.0,26.0,4.0,1.0,29.0,6.0,4.0,39.0,11.0,2.0,10.0,3.0,1.0,2.0,9.0,5.0,6.0,27.0,13.0,13.0,11.0,1.0,26.0,42.0,13.0,1.0,20.0,11.0,44.0,24.0,6.0,9.0,2.0,46.0,10.0,1.0,1.0,6.0,7.0,4.0,42.0,18.0,54.0,7.0,32.0,30.0,16.0,1.0,13.0,6.0,43.0,8.0,25.0,4.0,13.0,9.0,11.0,8.0,5.0,37.0,1.0,11.0,1.0,14.0,13.0,8.0,7.0,39.0,50.0,13.0,43.0,8.0,1.0,7.0,11.0,2.0,1.0,54.0,14.0,4.0,21.0,9.0,1.0,8.0,1.0,16.0,4.0,1.0,46.0,10.0,3.0,1.0,8.0,11.0,4.0,1.0,2.0,43.0,3.0,21.0,3.0,2.0,5.0,3.0,3.0,10.0,19.0,14.0,10.0,52.0,8.0,9.0,1.0,3.0,8.0,4.0,4.0,5.0,22.0,2.0,10.0,10.0,9.0,19.0,17.0,12.0,1.0,18.0,27.0,2.0,2.0,14.0,17.0,59.0,2.0,1.0,9.0,29.0,8.0,14.0,3.0,11.0,26.0,7.0,31.0,45.0,26.0,17.0,29.0,5.0,13.0,4.0,34.0,8.0,3.0,5.0,2.0,8.0,15.0,6.0,4.0,53.0,1.0,17.0,3.0,2.0,6.0,1.0,10.0,11.0,7.0,14.0,37.0,6.0,11.0,1.0,6.0,9.0,23.0,51.0,34.0,24.0,14.0,57.0,4.0,57.0,1.0,24.0,1.0,1.0,18.0,14.0,28.0,6.0,1.0,6.0,1.0,7.0,18.0,8.0,1.0,11.0,10.0,7.0,8.0,3.0,2.0,7.0,7.0,15.0,2.0,13.0,50.0,13.0,6.0,2.0,1.0,6.0,17.0,42.0,4.0,12.0,22.0,4.0,9.0,1.0,2.0,22.0,3.0,1.0,8.0,1.0,51.0,1.0,32.0,36.0,24.0,15.0,2.0,1.0,15.0,17.0,1.0,54.0,16.0,14.0,10.0,1.0,6.0,18.0,44.0,5.0,2.0,1.0,26.0,6.0,16.0,6.0,2.0,7.0,5.0,14.0,1.0,14.0,10.0,5.0,10.0,3.0,1.0,7.0,1.0,1.0,46.0,1.0,32.0,28.0,47.0,3.0,17.0,1.0,35.0,1.0,53.0,8.0,14.0,9.0,7.0,1.0,13.0,4.0,1.0,29.0,4.0,31.0,12.0,22.0,2.0,20.0,26.0,45.0,3.0,23.0,17.0,34.0,18.0,3.0,1.0,1.0,33.0,4.0,22.0,2.0,19.0,3.0,4.0,1.0,1.0,16.0,41.0,6.0,9.0,11.0]},"kind":"numeric","n":101040,"n_null":0,"n_unique":79,"null_rate":0.0,"stats":{"iqr":19.0,"kurtosis":0.6990225572196218,"max":83.0,"mean":14.674574425969913,"median":10.0,"min":0.0,"n_outliers":2882,"outlier_rate":0.028523357086302454,"q1":3.0,"q3":22.0,"skew":1.208503579975851,"std":14.223127548992759,"zero_rate":0.0006037212984956453}},{"alerts":[{"code":"high_skew","level":"info","message":"skew=+2.12"},{"code":"outliers","level":"warn","message":"13.6% rows beyond 1.5 IQR"}],"column":"has_images","extras":{"histogram":{"counts":[87272,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,13768],"edges":[0.0,0.025,0.05,0.07500000000000001,0.1,0.125,0.15000000000000002,0.17500000000000002,0.2,0.225,0.25,0.275,0.30000000000000004,0.325,0.35000000000000003,0.375,0.4,0.42500000000000004,0.45,0.47500000000000003,0.5,0.525,0.55,0.5750000000000001,0.6000000000000001,0.625,0.65,0.675,0.7000000000000001,0.7250000000000001,0.75,0.775,0.8,0.8250000000000001,0.8500000000000001,0.875,0.9,0.925,0.9500000000000001,0.9750000000000001,1.0]},"sample":[0.0,0.0,0.0,1.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,1.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,1.0,1.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,1.0,0.0,1.0,0.0,1.0,1.0,0.0,0.0,1.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,1.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,1.0,1.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,1.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,1.0,0.0,1.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0]},"kind":"numeric","n":101040,"n_null":0,"n_unique":2,"null_rate":0.0,"stats":{"iqr":0.0,"kurtosis":2.496516184894217,"max":1.0,"mean":0.1362628661916073,"median":0.0,"min":0.0,"n_outliers":13768,"outlier_rate":0.1362628661916073,"q1":0.0,"q3":0.0,"skew":2.1204990414744866,"std":0.3430691801066323,"zero_rate":0.8637371338083927}},{"alerts":[{"code":"high_skew","level":"info","message":"skew=+8.50"}],"column":"has_video","extras":{"histogram":{"counts":[99696,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1344],"edges":[0.0,0.025,0.05,0.07500000000000001,0.1,0.125,0.15000000000000002,0.17500000000000002,0.2,0.225,0.25,0.275,0.30000000000000004,0.325,0.35000000000000003,0.375,0.4,0.42500000000000004,0.45,0.47500000000000003,0.5,0.525,0.55,0.5750000000000001,0.6000000000000001,0.625,0.65,0.675,0.7000000000000001,0.7250000000000001,0.75,0.775,0.8,0.8250000000000001,0.8500000000000001,0.875,0.9,0.925,0.9500000000000001,0.9750000000000001,1.0]},"sample":[0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0]},"kind":"numeric","n":101040,"n_null":0,"n_unique":2,"null_rate":0.0,"stats":{"iqr":0.0,"kurtosis":70.19205241075723,"max":1.0,"mean":0.01330166270783848,"median":0.0,"min":0.0,"n_outliers":1344,"outlier_rate":0.01330166270783848,"q1":0.0,"q3":0.0,"skew":8.49659063452849,"std":0.1145637742687172,"zero_rate":0.9866983372921615}},{"alerts":[{"code":"outliers","level":"warn","message":"18.0% rows beyond 1.5 IQR"}],"column":"has_link","extras":{"histogram":{"counts":[82900,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,18140],"edges":[0.0,0.025,0.05,0.07500000000000001,0.1,0.125,0.15000000000000002,0.17500000000000002,0.2,0.225,0.25,0.275,0.30000000000000004,0.325,0.35000000000000003,0.375,0.4,0.42500000000000004,0.45,0.47500000000000003,0.5,0.525,0.55,0.5750000000000001,0.6000000000000001,0.625,0.65,0.675,0.7000000000000001,0.7250000000000001,0.75,0.775,0.8,0.8250000000000001,0.8500000000000001,0.875,0.9,0.925,0.9500000000000001,0.9750000000000001,1.0]},"sample":[0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,1.0,0.0,0.0,0.0,1.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,1.0,0.0,0.0,1.0,0.0,0.0,1.0,0.0,1.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,1.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,1.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,1.0,1.0,1.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,1.0,0.0,0.0,1.0,1.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,1.0,1.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,1.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,1.0,0.0,0.0,0.0,1.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,1.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,1.0,1.0,0.0,0.0,0.0,1.0,1.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,1.0,0.0,0.0,0.0,0.0,1.0,0.0,1.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,1.0,1.0,1.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,1.0,0.0,1.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,1.0,1.0,1.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,1.0,1.0]},"kind":"numeric","n":101040,"n_null":0,"n_unique":2,"null_rate":0.0,"stats":{"iqr":0.0,"kurtosis":0.7888288781930632,"max":1.0,"mean":0.1795328582739509,"median":0.0,"min":0.0,"n_outliers":18140,"outlier_rate":0.1795328582739509,"q1":0.0,"q3":0.0,"skew":1.6699787059100677,"std":0.3837997771428118,"zero_rate":0.8204671417260491}},{"alerts":[{"code":"null_rate","level":"warn","message":"61.2% null"}],"column":"embed_type","extras":{"singletons":0,"top_values":[["app.bsky.embed.external",18140],["app.bsky.embed.images",13768],["app.bsky.embed.record",5126],["app.bsky.embed.video",1344],["app.bsky.embed.recordWithMedia",871]]},"kind":"categorical","n":101040,"n_null":61791,"n_unique":5,"null_rate":0.6115498812351544,"stats":{"cardinality":5,"entropy":1.716940999074542,"entropy_ratio":0.7394462398965165,"top_rate":0.46217738031542205,"top_value":"app.bsky.embed.external"}},{"alerts":[{"code":"one_word","level":"warn","message":"90.2% rows are a single word"},{"code":"duplicates","level":"warn","message":"90.0% duplicate strings"}],"column":"hashtags","extras":{"language_counts":{},"language_sample_size":5000,"length_histogram":{"counts":[92111,3305,2893,1061,466,316,183,120,90,321,42,28,47,20,5,5,5,6,0,4,5,0,1,1,0,0,1,1,0,1,1,0,0,0,0,0,0,0,0,1],"edges":[2.0,30.0,58.0,86.0,114.0,142.0,170.0,198.0,226.0,254.0,282.0,310.0,338.0,366.0,394.0,422.0,450.0,478.0,506.0,534.0,562.0,590.0,618.0,646.0,674.0,702.0,730.0,758.0,786.0,814.0,842.0,870.0,898.0,926.0,954.0,982.0,1010.0,1038.0,1066.0,1094.0,1122.0]},"near_unique":false,"sample":["[]","[]","[\"#strongertogether\"]","[]","[]","[]","[]","[\"#dandysworld\"]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[\"#pruzkum\"]","[]","[]","[\"#pareridiparte\", \"#ritardi\", \"#manovra\"]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[\"#\\u3078\\u304d\\u3055\\u308a\\u3085\\u3046\\u307f\\u3053\"]","[]","[]","[]","[]","[]","[]","[]","[]","[]"],"top_values":[["[]",87318],["[\"#\\u0e23\", \"#422\", \"#\\u0e40\\u0e04\\u0e23\\u0e14\", \"#\\u0e41\\u0e08\\u0e01\\u0e40\\u0e04\\u0e23\\u0e14\", \"#\\u0e04\\u0e32\\u0e2a\", \"#\\u0e40\\u0e04\\u0e23\\u0e14\", \"#\\u0e42\\u0e04\", \"#\\u0e40\\u0e04\\u0e23\\u0e14\", \"#\\u0e40\\u0e27\", \"#\\u0e40\\u0e04\\u0e23\\u0e14\", \"#\\u0e2a\\u0e25\", \"#\\u0e2a\\u0e25\"]",256],["[\"#Berlin\", \"#Verkehr\", \"#Baustelle\", \"#Sperrung\", \"#St\\u00f6rung\", \"#Stra\\u00dfe\"]",253],["[\"#NowPlaying\"]",115],["[\"#nowplaying\"]",85],["[\"#dpdi\", \"#dpdi\"]",60],["[\"#\\u30e9\\u30f3\\u30c0\\u30e0\\u6587\\u5b57\"]",46],["[\"#a\"]",46],["[\"#christmas\", \"#darlenelove\", \"#philspector\", \"#ronniespector\"]",41],["[\"#OhBrookeOhTaylorOhBrooke\"]",38],["[\"#1\"]",33],["[\"#uk\", \"#news\", \"#uknews\"]",29],["[\"#rva\"]",27],["[\"#pixiv\"]",27],["[\"#fr\", \"#france\"]",27],["[\"#nba\"]",26],["[\"#ohbrookeohtaylorohbrooke\"]",24],["[\"#Tetsujin28FX\"]",23],["[\"#de\", \"#deutschland\"]",23],["[\"#1649\"]",22]],"top_words":[["[]",17275],["\"#\\u0e40\\u0e04\\u0e23\\u0e14\",",191],["[\"#\\u0e23\",",48],["\"#\\u0e41\\u0e08\\u0e01\\u0e40\\u0e04\\u0e23\\u0e14\",",48],["\"#\\u0e04\\u0e32\\u0e2a\",",48],["\"#\\u0e42\\u0e04\",",48],["\"#\\u0e40\\u0e27\",",48],["\"#\\u0e2a\\u0e25\",",48],["\"#\\u0e2a\\u0e25\"]",48],["\"#422\",",47],["[\"#nowplaying\"]",40],["[\"#berlin\",",38],["\"#verkehr\",",38],["\"#baustelle\",",38],["\"#sperrung\",",38],["\"#st\\u00f6rung\",",38],["\"#stra\\u00dfe\"]",38],["[\"#nowplaying\",",33],["[\"#art\",",21],["\"#nsfw\",",20],["\"#art\",",20],["[\"#\\u30a2\\u30de\\u30be\\u30f3\",",18],["\"#oc\",",16],["[\"#\\u6771\\u4eac\\u90fd\",",15],["[\"#dpdi\",",15]],"vocab_skipped":null,"word_histogram":{"counts":[95104,3116,1657,287,195,392,98,62,28,37,25,12,7,2,7,5,2,0,2,1,0,0,0,0,0,0,0,0,0,1],"edges":[1.0,3.033333333333333,5.066666666666666,7.1,9.133333333333333,11.166666666666666,13.2,15.233333333333333,17.266666666666666,19.299999999999997,21.333333333333332,23.366666666666667,25.4,27.43333333333333,29.466666666666665,31.5,33.53333333333333,35.56666666666666,37.599999999999994,39.63333333333333,41.666666666666664,43.699999999999996,45.733333333333334,47.766666666666666,49.8,51.83333333333333,53.86666666666666,55.9,57.93333333333333,59.96666666666666,62.0]}},"kind":"text","n":101040,"n_null":0,"n_unique":10103,"null_rate":0.0,"stats":{"allcaps_rate":0.005700712589073635,"boilerplate_rate":0.0,"duplicate_rate":0.9000098970704672,"emoji_rate":0.0,"len_max":1122,"len_mean":10.377622723673792,"len_median":2.0,"len_min":2,"len_p95":63.0,"n_duplicates":90937,"n_empty":0,"one_word_rate":0.9019596199524941,"readability_flesch_mean":2.751632142857146,"url_rate":0.0,"vocab_size":7036,"word_mean":1.3841745843230404,"word_median":1.0}},{"alerts":[{"code":"one_word","level":"warn","message":"99.6% rows are a single word"},{"code":"short_text","level":"info","message":"95th-percentile length under 20 chars"},{"code":"duplicates","level":"warn","message":"98.1% duplicate strings"}],"column":"mentions","extras":{"language_counts":{},"language_sample_size":5000,"length_histogram":{"counts":[98659,1998,0,180,0,45,0,33,0,20,0,21,0,21,0,18,0,14,20,0,5,0,3,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,3],"edges":[2.0,12.45,22.9,33.349999999999994,43.8,54.25,64.69999999999999,75.14999999999999,85.6,96.05,106.5,116.94999999999999,127.39999999999999,137.85,148.29999999999998,158.75,169.2,179.64999999999998,190.1,200.54999999999998,211.0,221.45,231.89999999999998,242.35,252.79999999999998,263.25,273.7,284.15,294.59999999999997,305.04999999999995,315.5,325.95,336.4,346.84999999999997,357.29999999999995,367.75,378.2,388.65,399.09999999999997,409.54999999999995,420.0]},"near_unique":false,"sample":["[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[\"602826fc65fa6aa0\"]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]"],"top_values":[["[]",98659],["[\"f212c35005158af4\"]",16],["[\"f50c0569501abab1\"]",15],["[\"c2c301341c84573d\"]",15],["[\"10b5533239dcddce\"]",13],["[\"10b60c157e2d651d\"]",11],["[\"607bca3b4bc405e9\"]",9],["[\"3ffed484fcc4f856\"]",9],["[\"1792f17a90a3d717\"]",9],["[\"d38369e879391173\"]",9],["[\"77d7e860f21c08fc\"]",7],["[\"f9f2b4c905347f44\"]",7],["[\"ceaa893232d29639\"]",7],["[\"84bfdfccb5651fd6\"]",7],["[\"b8d36411702552c5\"]",7],["[\"29630d82ffa90155\"]",6],["[\"0df2feb382b3c760\"]",6],["[\"a97c657865a0ea62\"]",6],["[\"90a0897c0263ae9a\"]",6],["[\"1719171f2f3c519d\"]",5]],"top_words":[["[]",19542],["[\"f50c0569501abab1\"]",6],["[\"84bfdfccb5651fd6\"]",4],["[\"10b60c157e2d651d\"]",3],["[\"3f2e7922962fd5fa\"]",3],["[\"f212c35005158af4\"]",3],["\"660dbd4b7e3cff3e\",",2],["[\"adb95d00bcba3bdb\",",2],["\"ffc3e7fe15f5b095\"]",2],["[\"423f7a6ffdd9e437\"]",2],["[\"3ffed484fcc4f856\"]",2],["[\"1719171f2f3c519d\"]",2],["[\"9186e9ed09e370a0\"]",2],["[\"d5e601007771e2a9\"]",2],["[\"90179da90188b233\"]",2],["\"e18cad2b21bbb448\",",2],["[\"a97c657865a0ea62\"]",2],["[\"6736f8a1dfb08d71\"]",2],["[\"77d7e860f21c08fc\"]",2],["[\"29167ae562816f3c\"]",2],["[\"71739959f21ce92e\"]",2],["[\"eceadd1b29f52cbd\"]",2],["[\"ceaa893232d29639\"]",2],["[\"65a3a93838fb7b0b\"]",2],["[\"0df2feb382b3c760\"]",2]],"vocab_skipped":null,"word_histogram":{"counts":[100657,180,0,45,33,0,20,21,0,21,18,0,14,20,0,5,3,0,0,0,0,0,0,0,0,0,0,0,0,3],"edges":[1.0,1.6666666666666665,2.333333333333333,3.0,3.6666666666666665,4.333333333333333,5.0,5.666666666666666,6.333333333333333,7.0,7.666666666666666,8.333333333333332,9.0,9.666666666666666,10.333333333333332,11.0,11.666666666666666,12.333333333333332,13.0,13.666666666666666,14.333333333333332,15.0,15.666666666666666,16.333333333333332,17.0,17.666666666666664,18.333333333333332,19.0,19.666666666666664,20.333333333333332,21.0]}},"kind":"text","n":101040,"n_null":0,"n_unique":1921,"null_rate":0.0,"stats":{"allcaps_rate":3.9588281868566906e-05,"boilerplate_rate":0.0,"duplicate_rate":0.9809877276326208,"emoji_rate":0.0,"len_max":420,"len_mean":2.6698139350752177,"len_median":2.0,"len_min":2,"len_p95":2.0,"n_duplicates":99119,"n_empty":0,"one_word_rate":0.9962094220110848,"readability_flesch_mean":0.7019500000000006,"url_rate":0.0,"vocab_size":660,"word_mean":1.012282264449723,"word_median":1.0}},{"alerts":[{"code":"one_word","level":"warn","message":"99.8% rows are a single word"},{"code":"short_text","level":"info","message":"95th-percentile length under 20 chars"},{"code":"duplicates","level":"warn","message":"96.3% duplicate strings"}],"column":"links","extras":{"language_counts":{},"language_sample_size":5000,"length_histogram":{"counts":[96211,0,44,293,447,1282,347,301,208,252,218,133,109,185,116,158,169,119,139,84,33,43,40,39,18,19,13,1,5,1,5,0,2,1,0,1,0,0,1,3],"edges":[2.0,8.6,15.2,21.799999999999997,28.4,35.0,41.599999999999994,48.199999999999996,54.8,61.4,68.0,74.6,81.19999999999999,87.8,94.39999999999999,101.0,107.6,114.19999999999999,120.8,127.39999999999999,134.0,140.6,147.2,153.79999999999998,160.39999999999998,167.0,173.6,180.2,186.79999999999998,193.39999999999998,200.0,206.6,213.2,219.79999999999998,226.39999999999998,233.0,239.6,246.2,252.79999999999998,259.4,266.0]},"near_unique":false,"sample":["[\"https://www.20min.ch/fr/story/buelach-zh-un-client-arrete-apres-avoir-poignarde-son-livreur-de-repas-103470448\"]","[]","[]","[]","[]","[\"https://trecome.info/articles/89cfe941-6cf1-45ae-8626-cc0241375b46\"]","[]","[]","[]","[]","[]","[]","[]","[\"https://lecourrier.ch/2025/12/21/un-oui-que-le-tessin-redoute/\"]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]","[]"],"top_values":[["[]",96211],["[\"https://www.bbc.com/news/arti...\"]",43],["[\"https://www.radiofrance.fr/fip\"]",37],["[\"https://www.kbradio.online\"]",32],["[\"https://sphynx.radio-progres.fr/listen/radio_progres/radio.mp3\", \"https://radio-progres.fr\"]",28],["[\"https://radio.913aycltfm.com/listen/91.3_ayclt_fm_hd3/bluesky\"]",23],["[\"https://radiotempete.com/\"]",21],["[\"https://bvf.wtf\"]",20],["[\"https://radio.913aycltfm.com/listen/91.3_ayclt_fm_hd2/bluesky\"]",19],["[\"https://streaming.shoutcast.com/tiorr3\", \"https://listen.openstream.co/6128/audio\"]",18],["[\"https://oakgroveradio.com/player\"]",17],["[\"https://www.hot21radio.com\"]",17],["[\"https://www.radiosouvenirsfm.com\"]",16],["[\"https://radiofonico.it\"]",14],["[\"https://www.enlaradio.cl\"]",14],["[\"https://adachi-fm.com/\"]",14],["[\"https://radio.913aycltfm.com/listen/91.3_ayclt_fm/bluesky\"]",14],["[\"https://trance.ie\"]",12],["[\"https://untidyradio.com\"]",11],["[\"https://amasale.newif.net/ranking/kdetail/299\"]",10]],"top_words":[["[]",19055],["[\"https://radio.913aycltfm.com/listen/91.3_ayclt_fm_hd2/bluesky\"]",6],["[\"https://www.radiofrance.fr/fip\"]",5],["[\"https://radio.913aycltfm.com/listen/91.3_ayclt_fm_hd3/bluesky\"]",5],["[\"https://streaming.shoutcast.com/tiorr3\",",4],["\"https://listen.openstream.co/6128/audio\"]",4],["[\"https://www.kbradio.online\"]",4],["[\"https://www.hot21radio.com\"]",4],["[\"https://bvf.wtf\"]",4],["[\"https://radiotempete.com/\"]",4],["[\"https://sphynx.radio-progres.fr/listen/radio_progres/radio.mp3\",",3],["\"https://radio-progres.fr\"]",3],["[\"https://adachi-fm.com/\"]",3],["[\"https://oakgroveradio.com/player\"]",3],["[\"https://radiofonico.it\"]",3],["[\"https://www.bbc.com/news/arti...\"]",3],["[\"https://www.ume2001.com/support/labo/amesh-v2.html?&tm=202512241400\"]",3],["[\"https://www.20min.ch/fr/story/zurich-un-chauffage-a-23-milliards-de-la-folie-climatique-pour-l-udc-103470364\"]",3],["[\"https://t.co/jcedttntn9\",",3],["\"https://t.co/d6j014jtlf\"]",3],["[\"https://www.project-anime.com/1315216/\"]",3],["[\"https://www.ume2001.com/support/labo/amesh-v2.html?&tm=202512241435\"]",3],["[\"https://thexwgxx.radio12345.com\"]",3],["[\"https://jeffro.radio\"]",3],["[\"https://www.enlaradio.cl\"]",3]],"vocab_skipped":null,"word_histogram":{"counts":[100818,0,0,0,0,0,0,185,0,0,0,0,0,0,0,26,0,0,0,0,0,0,8,0,0,0,0,0,0,3],"edges":[1.0,1.1333333333333333,1.2666666666666666,1.4,1.5333333333333332,1.6666666666666665,1.8,1.9333333333333333,2.0666666666666664,2.2,2.333333333333333,2.466666666666667,2.6,2.7333333333333334,2.8666666666666667,3.0,3.1333333333333333,3.2666666666666666,3.4,3.533333333333333,3.6666666666666665,3.8,3.933333333333333,4.066666666666666,4.2,4.333333333333334,4.466666666666667,4.6,4.733333333333333,4.866666666666667,5.0]}},"kind":"text","n":101040,"n_null":0,"n_unique":3771,"null_rate":0.0,"stats":{"allcaps_rate":0.0,"boilerplate_rate":0.0,"duplicate_rate":0.9626781472684085,"emoji_rate":0.0,"len_max":266,"len_mean":4.95048495645289,"len_median":2.0,"len_min":2,"len_p95":2.0,"n_duplicates":97269,"n_empty":0,"one_word_rate":0.9978028503562946,"readability_flesch_mean":-20.1082,"url_rate":0.047792953285827396,"vocab_size":904,"word_mean":1.0027019002375297,"word_median":1.0}}],"insights":{"errors":[],"insights":[{"confidence":"high","critiques":[],"evidence_keys":["row_count","column_count","language","sentiment","char_count","has_link","has_images","has_video","embed_type","reply_root_hash","text"],"featured_charts":[{"caption":"Shows English dominance alongside a long multilingual tail led by Japanese and Korean.","column":"language","kind":"bar"},{"caption":"Reveals the neutral-heavy split with positive roughly double the negative share.","column":"sentiment","kind":"donut"},{"caption":"Highlights the right-skewed post length distribution clustered well below the 525-char ceiling.","column":"char_count","kind":"histogram"},{"caption":"Among the ~39% of posts with an embed, external links and images are the dominant types.","column":"embed_type","kind":"bar"},{"caption":"Check the spike at zero (~48% of rows) versus the spread of non-neutral scores.","column":"sentiment_score","kind":"histogram"}],"model":"anthropic:claude-opus-4-7","narrative":"This is an anonymized Bluesky firehose snapshot of 101,040 posts from late December 2025, with 19 columns covering hashed identifiers, post text, timestamps, embed metadata, language, and sentiment. The content is heavily multilingual: English dominates at roughly 61% of posts, but Japanese, Korean, German, and Portuguese also have meaningful presence, alongside an 'unknown' bucket worth investigating. Sentiment skews neutral (~48%) with positive outweighing negative roughly 2:1, and post length is right-skewed (median 68 chars, max 525). Engagement features are sparse \u2014 about 18% of posts carry a link, 14% have images, and only 1.3% have video \u2014 and the embed_type field is null for ~61% of rows, which is the biggest data-quality flag to check first. Reply hashes are null for ~58% of rows, suggesting most posts are top-level rather than replies.","scope":"dataset","target":"__global__"},{"confidence":"high","critiques":[],"evidence_keys":["n","n_unique","stats.len_mean","stats.len_median","stats.word_mean","language_counts","stats.emoji_rate","stats.allcaps_rate","stats.duplicate_rate","n_duplicates","stats.url_rate","top_values","alerts"],"model":"anthropic:claude-opus-4-7","narrative":"Short user-generated posts (likely social media, given bsky.app URLs and hashtag-heavy entries), averaging 14 words and a median of 68 characters. The corpus is predominantly English (3309) but spans 30 detected languages with notable Japanese (656) and Korean (125) presence, and 18.3% of rows contain emoji while 16.9% are flagged all-caps. Watch the 5,105 duplicates (5.05%) and the top entries dominated by emoji spam (224 sheep emojis) and repeated promotional Thai text.","role":"free_text","scope":"column","target":"text","treatment":"Deduplicate, normalize casing/emoji, and apply a multilingual tokenizer or embedding model before downstream NLP."},{"confidence":"high","critiques":[],"evidence_keys":["n","n_unique","len_min","len_max","len_mean","one_word_rate","duplicate_rate","n_duplicates","top_values"],"model":"anthropic:claude-opus-4-7","narrative":"This column holds 16-character hexadecimal hashes of author DIDs, with every value exactly 16 chars long and a single token (one_word_rate 1.0). Across 101040 rows there are only 43998 unique authors and a 56.5% duplicate rate, with the top author hash appearing 1016 times \u2014 so a small number of authors generate a disproportionate share of records.","role":"foreign_key","scope":"column","target":"author_did_hash","treatment":"Treat as an author identifier; group/join on it rather than feeding the raw hash to a model."},{"confidence":"high","critiques":[],"evidence_keys":["n","n_unique","null_rate","stats.len_min","stats.len_max","stats.len_mean","stats.one_word_rate","stats.duplicate_rate","top_words"],"model":"anthropic:claude-opus-4-7","narrative":"This column holds 16-character single-token strings that look like truncated hex digests of URIs, with 101,039 unique values across 101,040 rows and zero nulls. It is effectively a primary key \u2014 duplicate_rate is 0.0 and every entry is exactly 16 characters long. No textual signal is present beyond the hash itself.","role":"identifier","scope":"column","target":"uri_hash","treatment":"Use as a row key or join key; drop before modelling."},{"confidence":"high","critiques":[],"evidence_keys":["n","n_unique","null_rate","stats.len_min","stats.len_max","stats.len_mean","stats.one_word_rate","stats.duplicate_rate","stats.n_duplicates","top_values"],"model":"anthropic:claude-opus-4-7","narrative":"This column is a 16-character hexadecimal hash pointing at a parent message, populated only when a row is a reply \u2014 57.67% of the 101,040 values are null. Every non-null entry is exactly 16 chars and one token, with 34,738 unique hashes and an 18.78% duplicate rate (8,032 duplicates), indicating popular parents that attract many replies (top hash 04a1db17fbc9ff3a appears 121 times). The flesch_mean of 71.73 is meaningless here since the strings are opaque IDs.","role":"foreign_key","scope":"column","target":"reply_parent_hash","treatment":"Treat as a self-referential foreign key to the message hash; use for thread reconstruction and leave nulls as 'not a reply'."},{"confidence":"high","critiques":[],"evidence_keys":["kind","n","n_unique","null_rate","stats.len_min","stats.len_max","stats.len_mean","stats.one_word_rate","stats.word_mean","stats.duplicate_rate","stats.n_duplicates","top_values"],"model":"anthropic:claude-opus-4-7","narrative":"This column holds 16-character hexadecimal hashes identifying the root of a reply thread, with every value exactly 16 chars and a single token. 57.67% of rows are null (consistent with non-reply posts being root-less), and among the 42.33% populated rows there are 21,277 unique hashes with a 50.25% duplicate rate, indicating many replies share common thread roots.","role":"foreign_key","scope":"column","target":"reply_root_hash","treatment":"left-join on this id to retrieve the root post; do not feature-engineer the hash itself."},{"confidence":"high","critiques":[],"evidence_keys":["n","n_unique","null_rate","stats.cardinality","stats.top_value","stats.top_rate","stats.entropy_ratio","top_values"],"model":"anthropic:claude-opus-4-7","narrative":"This is a three-class sentiment label with values neutral, positive, and negative across 101040 rows and no nulls. The distribution is uneven: neutral dominates at 48.5% (top_rate 0.4847684085510689), positive accounts for 34622, and negative trails at 17437 \u2014 roughly half the positive count. Entropy ratio of 0.93 still indicates a fairly balanced spread despite the negative class being underrepresented.","role":"label","scope":"column","target":"sentiment","treatment":"Use as classification target; consider class weighting or resampling to offset the underrepresented negative class."},{"confidence":"high","critiques":[],"evidence_keys":["min","max","mean","median","q1","q3","std","skew","kurtosis","zero_rate","n_outliers","outlier_rate","n_unique"],"model":"anthropic:claude-opus-4-7","narrative":"Continuous sentiment polarity bounded in [-0.998, 1.0], consistent with a tool like VADER/TextBlob compound score. Distribution is roughly symmetric (skew 0.019, kurtosis 0.018) but dominated by a neutral spike: 47.8% of rows are exactly 0 and the median is 0, pulling Q1 to 0 as well. Despite the symmetry, 5,763 rows (5.7%) flag as outliers, suggesting heavy tails of strongly polarised text alongside the neutral mass.","role":"feature","scope":"column","target":"sentiment_score","treatment":"Treat the zero-mass separately (e.g., add an is_neutral flag) before using the score as a numeric feature."},{"confidence":"high","critiques":[],"evidence_keys":["column","n","n_unique","stats.len_min","stats.len_max","stats.word_mean","stats.duplicate_rate","stats.n_duplicates","top_words"],"model":"anthropic:claude-opus-4-7","narrative":"This is a creation timestamp stored as ISO-8601 strings rather than a native datetime, with lengths from 20 to 35 characters and a single token per value. The format is inconsistent: top values mix offset suffixes like `+00:00` with `Z` and microsecond `.000000Z` variants, which will break naive lexical comparisons. Cardinality is high (96576 unique of 101040) but 4464 duplicates (4.4%) suggest concurrent events sharing timestamps.","role":"timestamp","scope":"column","target":"created_at","treatment":"Parse to a normalized UTC datetime before any time-based join or sort."},{"confidence":"high","critiques":[],"evidence_keys":["n","n_unique","null_rate","stats.len_min","stats.len_max","stats.len_mean","stats.one_word_rate","top_words"],"model":"anthropic:claude-opus-4-7","narrative":"This column holds ISO-8601 timestamps with microsecond precision, stored as text rather than a native datetime type. All 101040 values are unique with zero nulls and a fixed length of 26 characters, and the sampled values cluster on 2025-12-23 and 2025-12-24. The text-profile alerts (near_unique, one_word, allcaps) are artefacts of the string representation, not genuine quality issues.","role":"timestamp","scope":"column","target":"timestamp","treatment":"parse to datetime and use for time-based ordering or feature extraction"},{"confidence":"high","critiques":[],"evidence_keys":["n","n_unique","null_rate","stats.top_value","stats.top_rate","stats.entropy_ratio","top_values"],"model":"anthropic:claude-opus-4-7","narrative":"Language code of the record, dominated by English ('en' at 60.8% of 101,040 rows) with Japanese ('ja') a distant second at 12,607. Notably, 11,481 rows are literally 'unknown' and the codes mix bare ISO ('en') with locale variants ('en-US', 3,617), so the 90 distinct values overstate true language diversity. No nulls, but entropy ratio of 0.34 confirms heavy concentration at the top.","role":"feature","scope":"column","target":"language","treatment":"Normalize locale variants to base codes and treat 'unknown' as missing before one-hot or target encoding."},{"confidence":"high","critiques":[],"evidence_keys":["n","n_unique","null_rate","stats.min","stats.max","stats.mean","stats.median","stats.q1","stats.q3","stats.iqr","stats.skew","stats.std","stats.outlier_rate","stats.zero_rate"],"model":"anthropic:claude-opus-4-7","narrative":"This is almost certainly a character-count feature for some text field, with all 101040 rows populated and only 341 distinct integer values between 1 and 525. The distribution is right-skewed (skew 1.02) with median 68 well below the mean of 97.6 and an IQR of 30-143, indicating most texts are short but a tail stretches out. Outliers are minimal (0.29%) and there are no zero-length entries.","role":"feature","scope":"column","target":"char_count","treatment":"Consider a log or sqrt transform before modelling to tame the right skew."},{"confidence":"high","critiques":[],"evidence_keys":["n","n_unique","null_rate","stats.min","stats.max","stats.mean","stats.median","stats.q1","stats.q3","stats.iqr","stats.std","stats.skew","stats.kurtosis","stats.outlier_rate","stats.n_outliers","stats.zero_rate"],"model":"anthropic:claude-opus-4-7","narrative":"This is a numeric feature counting words per record, ranging from 0 to 83 with a median of 10 and mean of 14.67. The distribution is right-skewed (skew 1.21) with an IQR of 19 and about 2.85% of values flagged as outliers, suggesting a long tail of unusually long entries. Only 79 unique values across 101,040 rows and a near-zero zero_rate (0.06%) indicate counts cluster in a narrow integer range.","role":"feature","scope":"column","target":"word_count","treatment":"Consider a log or sqrt transform before modelling to tame the right skew."},{"confidence":"high","critiques":[],"evidence_keys":["n","n_unique","null_rate","stats.mean","stats.zero_rate","stats.skew","stats.n_outliers","stats.min","stats.max"],"model":"anthropic:claude-opus-4-7","narrative":"Binary flag indicating whether a record has images, encoded as 0/1 with only 2 unique values across 101040 rows and no nulls. The positive class is rare at mean 0.136 (zero_rate 0.864), which is why the profiler flags high skew (2.12) and treats the 13768 ones as 'outliers'.","role":"feature","scope":"column","target":"has_images","treatment":"Treat as a boolean feature; ignore the outlier flag since it just reflects class imbalance."},{"confidence":"high","critiques":[],"evidence_keys":["n","n_unique","null_rate","stats.min","stats.max","stats.mean","stats.zero_rate","stats.skew","stats.kurtosis","alerts"],"model":"anthropic:claude-opus-4-7","narrative":"This is a binary flag indicating whether a record has a video, stored numerically with only 2 unique values (0 and 1) and no nulls across 101040 rows. The positive class is rare: 98.67% are zero and only 1.33% are ones, producing the flagged high skew (8.50) and extreme kurtosis (70.19).","role":"feature","scope":"column","target":"has_video","treatment":"Cast to boolean and keep as-is, but expect class imbalance when using it as a predictor or target."},{"confidence":"high","critiques":[],"evidence_keys":["n","n_unique","null_rate","stats.mean","stats.zero_rate","stats.min","stats.max","stats.n_outliers","stats.iqr"],"model":"anthropic:claude-opus-4-7","narrative":"This is a binary 0/1 flag indicating whether a record contains a link, with no nulls across 101,040 rows. About 17.95% are 1s and 82.05% are 0s, which is why the IQR is 0 and the minority class gets labelled as 18,140 'outliers' \u2014 that's a quirk of applying numeric outlier logic to a Bernoulli variable, not a data issue.","role":"feature","scope":"column","target":"has_link","treatment":"Cast to boolean and use directly as a binary feature; ignore the outlier flag."},{"confidence":"high","critiques":[],"evidence_keys":["null_rate","n_unique","stats.top_value","stats.top_rate","stats.entropy_ratio","top_values"],"model":"anthropic:claude-opus-4-7","narrative":"This column tags the embed type attached to Bluesky posts, with five AT Protocol values like app.bsky.embed.external and app.bsky.embed.images. Note the 61.15% null rate \u2014 most posts carry no embed at all \u2014 and among posts that do, external links dominate at 46.22%, with video and recordWithMedia being rare tails. Entropy ratio of 0.74 shows reasonable spread across the five categories when present.","role":"feature","scope":"column","target":"embed_type","treatment":"One-hot encode with an explicit \"none\" category for nulls."},{"confidence":"high","critiques":[],"evidence_keys":["n","n_unique","null_rate","stats.duplicate_rate","stats.one_word_rate","stats.len_median","stats.word_median","stats.vocab_size","top_values"],"model":"anthropic:claude-opus-4-7","narrative":"This column stores per-row hashtag lists serialised as JSON arrays, with `[]` (empty list) appearing 87,318 times \u2014 the dominant value in a 101,040-row column. Duplicates are extreme (duplicate_rate 0.90, n_unique 10,103) and 90% of values are effectively one token, consistent with most rows having zero or one hashtag (word_median 1, len_median 2). The non-empty examples mix scripts (Thai, Japanese, German, English) including bot-like patterns such as `#NowPlaying` and repeated `#dpdi`, so any text treatment must be multilingual.","role":"feature","scope":"column","target":"hashtags","treatment":"Parse the JSON list, derive features like hashtag_count and presence flags, and treat empty lists as a distinct category."},{"confidence":"high","critiques":[],"evidence_keys":["n","n_unique","duplicate_rate","n_duplicates","one_word_rate","len_max","len_p95","word_mean","vocab_size","top_values"],"model":"anthropic:claude-opus-4-7","narrative":"This column stores serialized JSON arrays of mentioned entity IDs (16-hex tokens), but the overwhelming majority are empty: '[]' appears 98,659 of 101,040 times and the duplicate rate is 0.981. When non-empty, arrays almost always hold a single ID (word_mean 1.01, len_p95 2.0), though len_max reaches 420 indicating occasional multi-mention rows. Vocabulary is small (660) across 1,921 unique values, so signal is sparse and ID-like rather than textual.","role":"foreign_key","scope":"column","target":"mentions","treatment":"Parse the JSON array and explode to a mentions bridge table keyed by these hex IDs; treat empty as no-mention rather than null."},{"confidence":"high","critiques":[],"evidence_keys":["n","n_unique","duplicate_rate","n_duplicates","len_median","len_p95","url_rate","top_values","one_word_rate"],"model":"anthropic:claude-opus-4-7","narrative":"This column holds JSON-encoded arrays of URLs, but 96,211 of 101,040 rows are the empty array '[]', driving a 96.3% duplicate rate and a median length of 2 characters. Only 3,771 unique values exist across 101,040 rows, and just 4.8% of rows actually contain a URL. The non-empty entries point to streaming radio and news endpoints (BBC, Radio France, shoutcast streams).","role":"metadata","scope":"column","target":"links","treatment":"Parse the JSON array and convert to a has_link boolean or explode for per-URL analysis; the column is too sparse to use as-is."}],"providers":["anthropic:claude-opus-4-7"],"total_usage":{"completion_tokens":5992,"prompt_tokens":40264,"total_tokens":46256}},"language_counts":{"als":2,"ar":9,"bg":3,"ca":6,"cs":10,"de":108,"el":11,"en":3309,"eo":4,"es":71,"et":2,"fi":13,"fr":78,"hi":2,"id":11,"it":30,"ja":656,"ko":125,"nl":46,"no":3,"pl":12,"pt":78,"ru":30,"sr":3,"sv":7,"th":34,"tr":33,"uk":3,"vi":7,"zh":37},"meta":{"generated_at":"2026-05-01T23:26:26+00:00","mode":"full","row_count":101040,"sampled_rows":101040,"seed":42,"source":"/home/coolhand/datasets/bsky-firehose-anonymized-dec-2025/bluesky_posts.csv"},"notes":[],"saturn_version":"0.2.0","schema":{"author_did_hash":"text","char_count":"numeric","created_at":"text","embed_type":"categorical","has_images":"numeric","has_link":"numeric","has_video":"numeric","hashtags":"text","language":"categorical","links":"text","mentions":"text","reply_parent_hash":"text","reply_root_hash":"text","sentiment":"categorical","sentiment_score":"numeric","text":"text","timestamp":"text","uri_hash":"text","word_count":"numeric"}}
