← Back to the gallery

About the alt text tools

A quick guide to which tool to use, what you'll need, and what happens to your data.

What is this?

A small collection of tools for generating image alt text — from a simple drop-and-go web page to multi-pass analyzers, a safety scanner, and an art critic. All of them run in your browser. Some talk to commercial vision models, some talk to a local model on your own machine, and one runs entirely client-side.

Open the gallery, pick a tool, drop an image in. Most tools accept drag-and-drop, paste (Ctrl+V / Cmd+V), or a click to upload.

Which one should I pick?

If you want…Use this
A quick, accessible descriptionAlt generator — tuned or the Visual Describer
More thoroughness (image split into regions)Visual Describer
A description focused on art and compositionThe Art Critic
A safety scan of an imageSafe Spaces
Several models compared on the same imageVariant describer
A specific vision modelPick a model section
A local LM Studio server as the backendLM Studio backend
Everything in your browser, no serverOffline generator
A desktop app that runs a model on your computerDesktop download
Alt text for a Tenor GIFTenor GIF alt generator

What do I need?

Most of the web tools need an API key for whichever vision model they use. You paste the key into the page and it stays in your browser's local storage — nothing gets uploaded anywhere except direct calls to the model provider you chose.

A few tools talk to a small proxy on this server instead of calling the APIs directly. If one of those loads but the Generate button doesn't return anything, the proxy is probably offline — try a different tool, use the offline generator, or grab the desktop version.

What happens to my data?

Desktop app

If you'd rather keep everything on your own computer, there's a desktop version at dr.eamer.dev/downloads/alt_text/. It bundles an Electron frontend with Ollama for local vision models, so your images never leave your machine.

Source code: github.com/lukeslp/alt-text-local-llm (public).

Bluesky alt-text dataset

A free dataset of 279,000 curated image-description pairs, scraped from 489 validated Bluesky accounts. Good for training, evaluation, or research: dr.eamer.dev/downloads/bluesky-alt-text/.

Reporting a problem or contributing

Something broken? A description looks wrong? Want a new model or feature? Email luke@lukesteuber.com or reach out on Bluesky @lukesteuber.com. The source for this gallery lives in a private repository; if you'd like access for serious contributions, mention that in your email.

Credits

Built by Luke Steuber. The describers use vision models from xAI, Anthropic, OpenAI, Mistral, Hugging Face, Google (Gemma), and Ollama. The Bluesky dataset was collected from the public firehose. Accessibility follows WCAG 2.2 AA.

Site: lukesteuber.com · Bluesky: @lukesteuber.com