What is Google OCR MCP Server?
Google OCR MCP Server is a server application designed to facilitate the storage and summarization of notes using a custom note protocol. It leverages Optical Character Recognition (OCR) technology to manage and summarize notes effectively.
How to use Google OCR MCP Server?
To use the server, you can install it via Smithery or configure it manually on your local machine. Once set up, you can add notes and request summaries through the provided prompts.
Key features of Google OCR MCP Server?
- Custom note storage system with a unique URI scheme for accessing notes.
- Ability to create summaries of stored notes with adjustable detail levels.
- Simple command-line interface for adding notes and managing the server.
Use cases of Google OCR MCP Server?
- Storing and summarizing lecture notes for easy review.
- Managing personal notes and generating concise summaries for quick reference.
- Assisting in research by summarizing large amounts of text data.
FAQ from Google OCR MCP Server?
- Can I access my notes from different devices?
Yes! As long as the server is running, you can access your notes from any device configured to connect to it.
- Is there a limit to the number of notes I can store?
No, you can store as many notes as your server's storage capacity allows.
- How do I install the server?
You can install it via Smithery or manually configure it on your local machine following the provided instructions.
📸 Google OCR MCP server 📸
Components
Resources
The server implements a simple note storage system with:
- Custom note:// URI scheme for accessing individual notes
- Each note resource has a name, description and text/plain mimetype
Prompts
The server provides a single prompt:
- summarize-notes: Creates summaries of all stored notes
- Optional "style" argument to control detail level (brief/detailed)
- Generates prompt combining all current notes with style preference
Tools
The server implements one tool:
- add-note: Adds a new note to the server
- Takes "name" and "content" as required string arguments
- Updates server state and notifies clients of resource changes
Configuration
[TODO: Add configuration details specific to your implementation]
Quickstart
Install
Claude Desktop
- On MacOS:
~/Library/Application\ Support/Claude/claude_desktop_config.json
- On Windows:
%APPDATA%/Claude/claude_desktop_config.json
Development/Unpublished Servers Configuration
{
"mcpServers": {
"google-ocr-mcp-server": {
"command": "uv",
"args": ["run", "google-ocr-mcp-server"],
"env": {
"GOOGLE_APPLICATION_CREDENTIALS": "/path/to/google-application-credentials.json",
"SAVE_RESULTS": false
}
}
}
}
Published Servers Configuration
{
"mcpServers": {
"google-ocr-mcp-server": {
"command": "uvx",
"args": ["google-ocr-mcp-server"],
"env": {
"GOOGLE_APPLICATION_CREDENTIALS": "/path/to/google-application-credentials.json",
"SAVE_RESULTS": false
}
}
}
}
Installing via Smithery
To install google-ocr-mcp-server for Claude Desktop automatically via Smithery:
npx -y @smithery/cli install @Zerohertz/google-ocr-mcp-server --client claude
Development
Building and Publishing
To prepare the package for distribution:
- Sync dependencies and update lockfile:
uv sync
- Build package distributions:
uv build
This will create source and wheel distributions in the dist/
directory.
- Publish to PyPI:
uv publish
Note: You'll need to set PyPI credentials via environment variables or command flags:
- Token:
--token
orUV_PUBLISH_TOKEN
- Or username/password:
--username
/UV_PUBLISH_USERNAME
and--password
/UV_PUBLISH_PASSWORD
Debugging
Since MCP servers run over stdio, debugging can be challenging. For the best debugging experience, we strongly recommend using the MCP Inspector.
You can launch the MCP Inspector via npm
with this command:
$ npx @modelcontextprotocol/inspector uv --directory /Users/zerohertz/Downloads/google-ocr-mcp-server run google-ocr-mcp-server
Starting MCP inspector...
⚙️ Proxy server listening on port 6277
🔍 MCP Inspector is up and running at http://127.0.0.1:6274 🚀
Upon launching, the Inspector will display a URL that you can access in your browser to begin debugging.
