A collection of image describers, from a simple drop-and-go page to multi-pass analyzers, a safety scanner, and an art critic. Pick whichever fits what you're doing. Every tool below is live.
Drop or paste an image, get alt text. Start here.
Visual Describer
The flagship. Splits your image into quadrants and describes each region for more thorough results. Works in light and dark mode.
The original single-page generator — click, drop, or paste an image and get alt text back. Fastest option if you just need a description.
A preview of the next major version. Functionally complete, still being polished.
Runs entirely in your browser and saves nothing to a server. Bring your own API key; nothing leaves your machine except the image going to the model.
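For the curious, the mechanics behind that privacy claim are simple. Here is a minimal sketch of the pattern, not any tool's actual code: it assumes an OpenAI-compatible vision endpoint, and the model name and prompt are placeholders.

```ts
// Browser-side sketch: the key and image stay in the page; the only
// network request is the one carrying the image to the model provider.

async function fileToDataUrl(file: File): Promise<string> {
  return new Promise((resolve, reject) => {
    const reader = new FileReader();
    reader.onload = () => resolve(reader.result as string);
    reader.onerror = () => reject(reader.error);
    reader.readAsDataURL(file); // yields "data:image/png;base64,..."
  });
}

async function describeImage(file: File, apiKey: string): Promise<string> {
  const res = await fetch("https://api.openai.com/v1/chat/completions", {
    method: "POST",
    headers: {
      "Content-Type": "application/json",
      // User-supplied key, sent to the model provider and nowhere else.
      Authorization: `Bearer ${apiKey}`,
    },
    body: JSON.stringify({
      model: "gpt-4o-mini", // placeholder: any vision-capable model
      messages: [{
        role: "user",
        content: [
          { type: "text", text: "Write concise alt text for this image." },
          { type: "image_url", image_url: { url: await fileToDataUrl(file) } },
        ],
      }],
    }),
  });
  if (!res.ok) throw new Error(`API error: ${res.status}`);
  const data = await res.json();
  return data.choices[0].message.content; // the generated alt text
}
```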
Specialised tools for when plain alt text isn't enough — deeper analysis, longer critiques, safety scans, and model comparisons.
Critiques an image as art — composition, mood, technique — then gives a subjective Hot Take. Great for artwork and photography; overkill for screenshots.
Scans an image for unsafe content and flags what it finds by category. Meant as a moderation aid, not a gatekeeper.
Runs the image through three passes: an overview, a detailed scan, and a final synthesis that merges them into one description.
The three-pass describer with all passes running at once. Faster, but it uses more API quota.
A three-stage pipeline where you can pick a different model for each stage: overview, detailed analysis, and synthesis. Good for mixing a cheap model with a precise one. The pattern is sketched after this section.
Runs the same image through several models side by side so you can compare their outputs and pick the one you like.
Uses your own LM Studio server as the vision backend. Keep everything on your machine, use whichever model you've loaded locally.
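To show how the multi-pass tools fit together, here is a minimal sketch rather than the tools' actual code. The ask() helper, endpoint, model names, and prompts are all illustrative; it assumes the same OpenAI-compatible request shape as above.

```ts
const ENDPOINT = "https://api.openai.com/v1/chat/completions"; // illustrative
const KEY = "sk-..."; // user-supplied, as above

// Hypothetical helper: one vision request in, one text reply out.
async function ask(model: string, prompt: string, imageUrl: string): Promise<string> {
  const res = await fetch(ENDPOINT, {
    method: "POST",
    headers: { "Content-Type": "application/json", Authorization: `Bearer ${KEY}` },
    body: JSON.stringify({
      model,
      messages: [{
        role: "user",
        content: [
          { type: "text", text: prompt },
          { type: "image_url", image_url: { url: imageUrl } },
        ],
      }],
    }),
  });
  return (await res.json()).choices[0].message.content;
}

// One model per stage; pass the same model three times for the plain
// three-pass tool, or mix cheap and precise models per stage.
interface StageModels { overview: string; detail: string; synthesis: string }

// Sequential: each pass can build on the one before it.
async function threePass(image: string, m: StageModels): Promise<string> {
  const overview = await ask(m.overview, "Give a one-sentence overview of this image.", image);
  const detail = await ask(m.detail, `Overview so far: ${overview}\nNow describe each region in detail.`, image);
  return ask(m.synthesis, `Merge these notes into one description:\n${overview}\n${detail}`, image);
}

// Concurrent: overview and detail run at once. Finishes sooner, but
// always spends the quota for both calls.
async function threePassConcurrent(image: string, m: StageModels): Promise<string> {
  const [overview, detail] = await Promise.all([
    ask(m.overview, "Give a one-sentence overview of this image.", image),
    ask(m.detail, "Describe each region of this image in detail.", image),
  ]);
  return ask(m.synthesis, `Merge these notes into one description:\n${overview}\n${detail}`, image);
}
```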
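The LM Studio-backed page is the same request pointed at your own machine, since LM Studio's local server speaks an OpenAI-compatible protocol. A sketch assuming the default local address; the model field names whatever you have loaded:

```ts
// Same request shape as above, aimed at LM Studio's local server.
// Default port 1234 assumed; no real API key is needed.
const LOCAL = "http://localhost:1234/v1/chat/completions";

async function describeLocally(imageUrl: string, modelId: string): Promise<string> {
  const res = await fetch(LOCAL, {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify({
      model: modelId, // whichever vision model is loaded locally
      messages: [{
        role: "user",
        content: [
          { type: "text", text: "Write concise alt text for this image." },
          { type: "image_url", image_url: { url: imageUrl } },
        ],
      }],
    }),
  });
  return (await res.json()).choices[0].message.content;
}
```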
The same simple describer, each page backed by a single vision model. Use whichever you have an API key for, or try a few and compare how they describe the same image.
Grok's vision model. Fast, conversational, decent on photographs.
Claude's vision capability. Tends toward careful, thorough descriptions.
OpenAI's vision model. A good general-purpose baseline, often the fallback if another model struggles.
Mistral's Pixtral vision model. Experimental — sometimes excellent, occasionally patchy.
A variant of the Pixtral describer that keeps a history of images and their generated alt text.
A smaller Pixtral sandbox for quick experiments. Same model, stripped-down UI.
Uses Hugging Face Inference for a smaller, open vision model. Cheaper and slower than the commercial ones above.
A describer routed through a Coze bot tuned for accessible, ethical image descriptions. Slower, more careful output.
Google's Gemma 3 vision variant. Currently broken — kept here so the work isn't lost.
Development copies kept live so they can be tested in the open. Expect rough edges.
Bleeding-edge copy of the main Visual Describer. Same tool, newer code.
Dev copy of the tuned single-page generator.
A rebuild of the alt generator with the UI split into smaller reusable pieces. For testing, not production use.
The "be right back" page that used to live at /alt/. Kept for continuity.
If you'd rather not use a web tool, these run on your own computer. Linked to their downloads or GitHub repositories.
Downloadable desktop app. Runs a local vision model so your images never leave your computer.
Source code for the desktop app. Electron + Ollama. Clone and build your own.
A full image viewer with EXIF parsing, Imgur integration, and alt-text generation built in. Private repo — ask for access.
The scraper that built the dataset above. Collects and validates alt text from the Bluesky firehose.
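For a sense of what the scraper does, here is a minimal sketch of the collection side only. It assumes Bluesky's public Jetstream JSON feed rather than the raw firehose, the instance URL and filtering are illustrative, and the real scraper also validates and deduplicates what it keeps.

```ts
// Subscribe to post-creation events and keep human-written alt text.
const JETSTREAM =
  "wss://jetstream1.us-east.bsky.network/subscribe?wantedCollections=app.bsky.feed.post";

const ws = new WebSocket(JETSTREAM);

ws.onmessage = (ev) => {
  const msg = JSON.parse(ev.data as string);
  // Only newly created posts are interesting.
  if (msg.kind !== "commit" || msg.commit?.operation !== "create") return;
  const embed = msg.commit.record?.embed;
  // Posts with image embeds carry an alt field per image.
  if (embed?.$type !== "app.bsky.embed.images") return;
  for (const img of embed.images) {
    if (img.alt && img.alt.trim().length > 0) {
      // A candidate pair: the author's DID and their alt text.
      console.log(msg.did, JSON.stringify(img.alt));
    }
  }
};
```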