YouTube Vision MCP Server (

YouTube Vision MCP Server (

By minbang930 GitHub

MCP (Model Context Protocol) server that utilizes the Google Gemini Vision API to interact with YouTube videos.

Overview

What is YouTube Vision MCP?

YouTube Vision MCP is a server that utilizes the Google Gemini Vision API to interact with YouTube videos, allowing users to obtain descriptions, summaries, answers to questions, and extract key moments from videos.

How to use YouTube Vision MCP?

To use the server, you can install it via npx or manually from the source. You need to set up your Google Gemini API key and configure the server in your MCP client's settings.

Key features of YouTube Vision MCP?

  • Analyzes YouTube videos using the Gemini Vision API.
  • Provides tools for general description, summarization, and key moment extraction.
  • Lists available Gemini models supporting content generation.
  • Configurable via environment variables.

Use cases of YouTube Vision MCP?

  1. Generating summaries of educational videos.
  2. Extracting key moments from tutorials for quick reference.
  3. Answering specific questions about video content.

FAQ from YouTube Vision MCP?

  • What do I need to use this server?

You need Node.js (version 18 or higher) and a Google Gemini API key.

  • Is there a recommended way to install it?

Yes, using npx is the easiest method for quick use.

  • Can I modify the code?

Yes, you can clone the repository and modify the code as needed.

No tools information available.
No content found.