Florence-2 MCP Server

Florence-2 MCP Server

By jkawamoto GitHub

An MCP server for processing images using Florence-2

python florence-2
Overview

what is Florence-2 MCP Server?

Florence-2 MCP Server is a server designed for processing images using the Florence-2 model, which is capable of performing various image processing tasks.

how to use Florence-2 MCP Server?

To use the Florence-2 MCP Server, you need to configure it with either Claude Desktop or Goose CLI by editing the respective configuration files to include the server settings. After configuration, restart the application to start using the server.

key features of Florence-2 MCP Server?

  • Optical Character Recognition (OCR) capabilities for image files.
  • Ability to generate detailed captions for images.
  • Supports processing of both local image files and image URLs.

use cases of Florence-2 MCP Server?

  1. Extracting text from scanned documents using OCR.
  2. Generating captions for images in applications like social media or content management systems.
  3. Automating image processing tasks in various workflows.

FAQ from Florence-2 MCP Server?

  • What types of images can be processed?

The server can process various image formats as long as they are accessible via file paths or URLs.

  • Is there a limit to the number of images processed at once?

The server can handle multiple images, but performance may vary based on the server's configuration and resources.

  • Is the Florence-2 MCP Server free to use?

Yes, the server is open-source and available for free under the MIT License.

Content

Florence-2 MCP Server

Python Application GitHub License pre-commit Ruff smithery badge

An MCP server for processing images using Florence-2.

You can process images or PDF files stored on a local or web server to extract text using OCR (Optical Character Recognition) or generate descriptive captions summarizing the content of the images.

Installation

For Claude Desktop

To configure this server for Claude Desktop, edit the claude_desktop_config.json file with the following entry under mcpServers:

{
  "mcpServers": {
    "florence-2": {
      "command": "uvx",
      "args": [
        "--from",
        "git+https://github.com/jkawamoto/mcp-florence2",
        "mcp-florence2"
      ]
    }
  }
}

After editing, restart the application. For more information, see: For Claude Desktop Users - Model Context Protocol.

For Goose CLI

To enable the Bear extension in Goose CLI, edit the configuration file ~/.config/goose/config.yaml to include the following entry:

extensions:
  bear:
    name: Florence-2
    cmd: uvx
    args: [ --from, git+https://github.com/jkawamoto/mcp-florence2, mcp-florence2 ]
    enabled: true
    type: stdio

For Goose Desktop

Add a new extension with the following settings:

  • Type: Standard IO
  • ID: florence-2
  • Name: Florence-2
  • Description: An MCP server for processing images using Florence-2
  • Command: uvx --from git+https://github.com/jkawamoto/mcp-florence2 mcp-florence2

For more details on configuring MCP servers in Goose Desktop, refer to the documentation: Using Extensions - MCP Servers.

Tools

ocr

Process an image file or URL using OCR to extract text.

Arguments:

  • src: A file path or URL to the image file that needs to be processed.

caption

Processes an image file and generates captions for the image.

Arguments:

  • src: A file path or URL to the image file that needs to be processed.

License

This application is licensed under the MIT License. See the LICENSE file for more details.

No tools information available.

The open-source multi-agent chat interface that lets you manage multiple agents in one dynamic conversation and add MCP servers for deep research

python typescript
View Details

AI assistant built with Streamlit, NVIDIA NIM (LLaMa 3.3:70B) / Ollama, and Model Control Protocol (MCP).

python mcp
View Details

MCP Client Implementation Using LangChain ReAct Agent / Python

python mcp
View Details

Real-time stock API with Python, MCP server example, yfinance stock analysis dashboard

python fastapi
View Details

YouTube MCP Server is an AI-powered solution designed to revolutionize your YouTube experience. It empowers users to search for YouTube videos, retrieve detailed transcripts, and perform semantic searches over video content—all without relying on the official API. By integrating with a vector database, this server streamlines content discovery.

python machine-learning
View Details

A simple MCP server for weather

python mcp
View Details

The OpenAPI to Model Context Protocol (MCP) proxy server bridges the gap between AI agents and external APIs by dynamically translating OpenAPI specifications into standardized MCP tools. This simplifies the integration process, significantly reducing development time and complexity associated with custom API wrappers.

python ai
View Details