
Florence-2 MCP Server
An MCP server for processing images using Florence-2
what is Florence-2 MCP Server?
Florence-2 MCP Server is a server designed for processing images using the Florence-2 model, which is capable of performing various image processing tasks.
how to use Florence-2 MCP Server?
To use the Florence-2 MCP Server, you need to configure it with either Claude Desktop or Goose CLI by editing the respective configuration files to include the server settings. After configuration, restart the application to start using the server.
key features of Florence-2 MCP Server?
- Optical Character Recognition (OCR) capabilities for image files.
- Ability to generate detailed captions for images.
- Supports processing of both local image files and image URLs.
use cases of Florence-2 MCP Server?
- Extracting text from scanned documents using OCR.
- Generating captions for images in applications like social media or content management systems.
- Automating image processing tasks in various workflows.
FAQ from Florence-2 MCP Server?
- What types of images can be processed?
The server can process various image formats as long as they are accessible via file paths or URLs.
- Is there a limit to the number of images processed at once?
The server can handle multiple images, but performance may vary based on the server's configuration and resources.
- Is the Florence-2 MCP Server free to use?
Yes, the server is open-source and available for free under the MIT License.
Florence-2 MCP Server
An MCP server for processing images using Florence-2.
You can process images or PDF files stored on a local or web server to extract text using OCR (Optical Character Recognition) or generate descriptive captions summarizing the content of the images.
Installation
For Claude Desktop
To configure this server for Claude Desktop, edit the claude_desktop_config.json
file with the following entry under
mcpServers
:
{
"mcpServers": {
"florence-2": {
"command": "uvx",
"args": [
"--from",
"git+https://github.com/jkawamoto/mcp-florence2",
"mcp-florence2"
]
}
}
}
After editing, restart the application. For more information, see: For Claude Desktop Users - Model Context Protocol.
For Goose CLI
To enable the Bear extension in Goose CLI,
edit the configuration file ~/.config/goose/config.yaml
to include the following entry:
extensions:
bear:
name: Florence-2
cmd: uvx
args: [ --from, git+https://github.com/jkawamoto/mcp-florence2, mcp-florence2 ]
enabled: true
type: stdio
For Goose Desktop
Add a new extension with the following settings:
- Type: Standard IO
- ID: florence-2
- Name: Florence-2
- Description: An MCP server for processing images using Florence-2
- Command:
uvx --from git+https://github.com/jkawamoto/mcp-florence2 mcp-florence2
For more details on configuring MCP servers in Goose Desktop, refer to the documentation: Using Extensions - MCP Servers.
Tools
ocr
Process an image file or URL using OCR to extract text.
Arguments:
- src: A file path or URL to the image file that needs to be processed.
caption
Processes an image file and generates captions for the image.
Arguments:
- src: A file path or URL to the image file that needs to be processed.
License
This application is licensed under the MIT License. See the LICENSE file for more details.