Alt text generators

A collection of image describers, from a simple drop-and-go page to multi-pass analyzers, a safety scanner, and an art critic. Pick whichever fits what you're doing. Every tool below is live.

Main describers

Drop or paste an image, get alt text. Start here.

Visual Describer

/alt/advanced/ · created 2025-06-05

The flagship. Splits your image into quadrants and describes each region for more thorough results. Works in light and dark mode.

Open ↗

Alt generator — tuned

/alt/index.bak.html · created 2026-03-21

The original single-page generator — click, drop, or paste an image and get alt text back. Fastest option if you just need a description.

Open ↗

Alt Text Next

/alt/test/ · created 2025-06-05

A preview of the next major version. Functionally complete, still being polished.

Open ↗

Offline generator

/alt/local_storage/ · created 2025-06-05

Runs entirely in your browser, saves nothing to a server. Bring your own API key and nothing leaves your machine except the image going to the model.

Open ↗

Advanced tools

Specialised tools for when plain alt text isn't enough — deeper analysis, longer critiques, safety scans, and model comparisons.

The Art Critic

/alt/advanced/artcritic.html · created 2025-06-05

Critiques an image as art — composition, mood, technique — then gives a subjective Hot Take. Great for artwork and photography; overkill for screenshots.

Open ↗

Safe Spaces

/alt/advanced/safespaces.html · created 2025-06-05

Scans an image for unsafe content and flags what it finds by category. Meant as a moderation aid, not a gatekeeper.

Open ↗

Multi-scan describer

/alt/advanced/multi_scan.html · created 2025-06-05

Runs the image through three passes — overview, detailed scan, and final synthesis — then merges them into one description.

Open ↗

Parallel scan

/alt/advanced/parallel_scan.html · created 2025-06-05

The three-pass describer but all passes run at once. Faster, uses more API quota.

Open ↗

Sequential describer

/alt/advanced/sequential_describer.html · created 2025-06-05

A three-stage pipeline where you can pick a different model for each stage — overview, detailed analysis, and synthesis. Good for mixing a cheap model with a precise one.

Open ↗

Variant describer

/alt/advanced/variant_desriber.html · created 2025-06-05

Runs the same image through several models side by side so you can compare their outputs and pick the one you like.

Open ↗

LM Studio local backend

/alt/advanced/alt_lm_studio.html · created 2025-06-05

Uses your own LM Studio server as the vision backend. Keep everything on your machine, use whichever model you've loaded locally.

Open ↗

Pick a specific model

Same simple describer, one vision model each. Use whichever you have an API key for, or try a few and compare how they describe the same image.

xAI Grok Vision

/alt/xai/ · created 2025-06-05

Grok's vision model. Fast, conversational, decent on photographs.

Open ↗

Claude (Anthropic)

/alt/anthropic/ · created 2025-06-05

Claude's vision capability. Tends toward careful, thorough descriptions.

Open ↗

GPT-4 Vision (OpenAI)

/alt/openai/ · created 2025-06-05

OpenAI's vision. Good general-purpose baseline — often the fallback if another model struggles.

Open ↗

Pixtral (Mistral)

/alt/pixtral/ · created 2025-06-05

Mistral's Pixtral vision model. Experimental — sometimes excellent, occasionally patchy.

Open ↗

Pixtral — saved history

/alt/pixtral/storage/ · created 2026-03-21

A variant of the Pixtral describer that keeps a history of images and their generated alt text.

Open ↗

Pixtral — scratch

/alt/pixtral/storage/temp_alt/ · created 2026-03-21

A smaller Pixtral sandbox for quick experiments. Same model, stripped-down UI.

Open ↗

Hugging Face vision

/alt/huggingface/ · created 2025-06-05 · updated 2025-11-10

Uses Hugging Face Inference for a smaller, open vision model. Cheaper and slower than the commercial ones above.

Open ↗

Camina (via Coze)

/alt/_coze/ · created 2025-06-05

A describer routed through a Coze bot tuned for accessible, ethical image descriptions. Slower, more careful output.

Open ↗

Gemma 3 (not working)

/alt/_gemma/ · created 2025-06-05

Google's Gemma 3 vision variant. Currently broken — kept here so the work isn't lost.

Open ↗

Work in progress

Development copies kept live so they can be tested in the open. Expect rough edges.

Dev mirror — Visual Describer

/alt/advanced/dev_index.html · created 2025-06-05 · updated 2025-11-10

Bleeding-edge copy of the main Visual Describer. Same tool, newer code.

Open ↗

Dev mirror — Alt generator

/alt/dev/ · created 2025-06-05 · updated 2025-06-09

Dev copy of the tuned single-page generator.

Open ↗

Modular rebuild test

/alt/dev/test.html · created 2025-06-05

A rebuild of the alt generator with the UI split into smaller reusable pieces. For testing, not production use.

Open ↗

Original intermission page

/alt/_placeholder/intermission.html · 2025-11-07

The "be right back" page that used to live at /alt/. Kept for continuity.

Open ↗

Desktop apps & source code

If you'd rather not use a web tool, these run on your own computer. Linked to their downloads or GitHub repositories.

Alt Text Generator — desktop

/downloads/alt_text/ · created 2025-12-18

Downloadable desktop app. Runs a local vision model so your images never leave your computer.

Open ↗

alt-text-local-llm — source

github.com/lukeslp/alt-text-local-llm · public

Source code for the desktop app. Electron + Ollama. Clone and build your own.

Accessible Image Viewer — source

github.com/lukeslp/image-viewer · private

A full image viewer with EXIF parsing, Imgur integration, and alt-text generation built in. Private repo — ask for access.

Bluesky Alt Text Dataset — source

github.com/lukeslp/alt-text-dataset · private

The scraper that built the dataset above. Collects and validates alt text from the Bluesky firehose.