Home/Directory/AI & ML/AI Vision

AI Vision

@tan-yong-sheng · github.com/tan-yong-sheng/ai-vision-mcp

Multimodal AI vision MCP server for image, video, and object detection analysis. Enables UI/UX evaluation, visual regression testing, and interface understanding using Google Gemini and Vertex AI.

ClaimedAI & MLstdiolocal auth
GitHub

Install one command

terminal
$ npx ai-vision-mcp

Or add it with the Claude CLI: claude mcp add tan-yong-sheng-ai-vision-mcp

Tools exposed 0 tool

Configuration claude_desktop_config.json

claude_desktop_config.json
{
  "mcpServers": {
    "tan-yong-sheng-ai-vision-mcp": {
      "command": "npx",
      "args": ["ai-vision-mcp"],
      "env": {
        "GEMINI_API_KEY": "<your-gemini-api-key>",
        "MAX_TOKENS_FOR_DETECT_OBJECTS_IN_IMAGE": "<your-max-tokens-for-detect-objects-in-image>",
        "MAX_TOKENS": "<your-max-tokens>"
      }
    }
  }
}

Paste under mcpServers in your config, then restart Claude Desktop. For Claude Code, reference this server in your project's CLAUDE.md.

Related

Similar servers

More in AI & ML
Compare
@microsoft

MCP tool access to MarkItDown a library that converts many file formats (local or remote) to Markdown for LLM consumption.

AI & MLstdio no-auth
149.5k3w ago0 toolslocal
$pip install -e 'packages/markitdown[all]'
Compare
@upstash

Up-to-date code documentation for LLMs and AI code editors.

AI & MLstdio auth
57.1k5d ago0 toolslocal
$npx ctx7 setup
Compare
@eyaltoledano

AI-powered task management system for AI-driven development.

AI & MLstdio auth
27.4k2mo ago15 toolslocal
$npx task-master init