Librarian MCP Server

By SegaraRai GitHub

Librarian is a Model Context Protocol (MCP) server that provides an API for listing, searching, and retrieving markdown files stored in a structured manner.

mcp mcp-server

Overview

What is Librarian MCP Server?

Librarian MCP Server is a Model Context Protocol (MCP) server that provides an API for listing, searching, and retrieving markdown files stored in a structured manner, serving as a knowledge base for Large Language Models (LLMs).

How to use Librarian MCP Server?

To use the Librarian MCP Server, clone the repository, install dependencies, and start the server. You can then interact with the API to retrieve documents, search content, and list tags.

Key features of Librarian MCP Server?

Structured document organization by sections.
Tag-based filtering and hierarchical tag inheritance.
Flexible search capabilities including string and regex searches.
Efficient document retrieval by path.
Integration with the Model Context Protocol (MCP).

Use cases of Librarian MCP Server?

Providing structured documentation for LLMs.
Enabling efficient search and retrieval of markdown files.
Serving as a knowledge base for various applications.

FAQ from Librarian MCP Server?

Can Librarian handle all markdown files?

Yes, it is designed to work with markdown files organized in a structured manner.

Is Librarian free to use?

Yes, it is open-source and free to use under the MIT License.

How does tag inheritance work?

Tags are inherited from parent directories, allowing for a hierarchical organization of tags.

Content

Librarian MCP Server

Knowledge at your fingertips

Overview

Librarian is a Model Context Protocol (MCP) server that provides an API for listing, searching, and retrieving markdown files stored in a structured manner. It serves as a knowledge base for Large Language Models (LLMs), providing them with the information they need on demand.

Librarian does not provide any writing operations - it is a read-only service designed to efficiently deliver document content to LLMs through the MCP framework.

Features

Structured Document Organization: Documents are organized by section (e.g., daisyui/components/button.md)
Tag-Based Filtering: Filter documents by tags defined in frontmatter
Hierarchical Tag Inheritance: Tags are inherited from parent directories
Flexible Search Capabilities:
- Simple string searches (case insensitive)
- Regular expression searches with customizable flags
Efficient Document Retrieval: Quickly access specific documents by path
Tag Discovery: List all available tags with usage counts and optional file paths
MCP Integration: Seamlessly integrates with the Model Context Protocol

Project Structure

The Librarian MCP server is organized into modular components:

src/lib/config.ts: Type definitions and loader for configuration
src/lib/load.ts: Document loading and processing functionality
src/lib/librarian.ts: Core librarian implementation with schemas
src/lib/util.ts: Formatting utilities for plaintext responses
src/lib/server.ts: MCP server implementation
src/bin.ts: CLI entry point
src/index.ts: Library entry point

This modular design allows for easy extension and maintenance, with clear separation of concerns.

Installation

Prerequisites

Node.js (v14 or higher)
npm or pnpm

Installation Steps

# Clone the repository
git clone https://github.com/yourusername/mcp-server-librarian.git
cd mcp-server-librarian

# Install dependencies
pnpm install

# Build the project
pnpm build

Configuration

The document folder path can be configured using the following methods (in order of priority):

Command-line Arguments

node dist/bin.js --docs-root /path/to/your/docs

Environment Variables

LIBRARIAN_DOCS_ROOT=/path/to/your/docs node dist/bin.js

Document Structure

Organization

Documents are organized by section, following a hierarchical structure:

<docs_root>/
├── daisyui/
│   ├── index.md              # DaisyUI section info with tags
│   └── components/
│       ├── index.md          # Components section info with tags
│       ├── button.md         # Document with specific tags
│       └── card.md           # Another document with specific tags
└── tailwind4/
    ├── index.md              # Tailwind section info with tags
    └── getting-started.md    # Document with specific tags

Frontmatter

Each markdown document can include frontmatter with a tags field:

---
tags: ["frontend", "ui", "button"]
---

# Button Component

This document describes the button component...

Tag Inheritance

Tags are inherited through the folder hierarchy:

A document inherits all tags from index.md files in its parent directories
Tags are merged from the most general (root) to the most specific (document)

For example, if we have:

/daisyui/index.md with tags: ["ui"]
/daisyui/components/index.md with tags: ["components"]
/daisyui/components/button.md with tags: ["interactive", "form"]

Then /daisyui/components/button.md will effectively have all tags: ["documentation", "ui", "components", "interactive", "form"]

Folder Description

You can describe a folder itself by creating an index.md file within that folder. This file can contain both frontmatter (with tags) and content describing the purpose of that section.

API Reference

The Librarian MCP server provides the following tools:

getDocuments

Retrieves multiple documents by their paths.

Parameters:

filepaths: Array of file paths to retrieve

Response:

**/path/to/document1.md**
- tags: tag1, tag2, tag3
======
Document 1 content
======

**/path/to/document2.md**
- tags: tag1, tag4, tag5
======
Document 2 content
======

getDocument

Retrieves a specific document by path.

Parameters:

filepath: The path to the document

Response:

**/path/to/document.md**
- tags: tag1, tag2, tag3
======
Document content
======

listDocuments

Lists all documents with optional filtering by directory and tags.

Parameters:

directory (optional): The directory path to list documents from (default: "/")
tags (optional): Array of tags to filter by (default: [])
includeContents (optional): Whether to include document contents in results (default: false)
depth (optional): Maximum directory depth to traverse (-1 for infinite, default: -1)

Response: (When includeContents is false)

- /path/to/document1.md
  - tags: tag1, tag2, tag3
- /path/to/document2.md
  - tags: tag1, tag4, tag5
...

Response: (When includeContents is true)

**/path/to/document1.md**
- tags: tag1, tag2, tag3
======
Document 1 content
======

**/path/to/document2.md**
- tags: tag1, tag4, tag5
======
Document 2 content
======

searchDocuments

Searches document content using string or regex patterns.

Parameters:

query: The search query (string or regex pattern)
mode (optional): Search mode ("string" or "regex", default: "string")
caseSensitive (optional): Whether the search should be case-sensitive (default: false)
directory (optional): The directory path to search in (default: "/")
tags (optional): Array of tags to filter by (default: [])
includeContents (optional): Whether to include document contents in results (default: false)
depth (optional): Maximum directory depth to traverse (-1 for infinite, default: -1)

Response: (When includeContents is false)

- /path/to/document1.md
  - tags: tag1, tag2, tag3
- /path/to/document2.md
  - tags: tag1, tag4, tag5
...

Response: (When includeContents is true)

**/path/to/document1.md**
- tags: tag1, tag2, tag3
======
Document 1 content
======

**/path/to/document2.md**
- tags: tag1, tag4, tag5
======
Document 2 content
======

Lists all tags with counts and optional filepaths.

Parameters:

directory (optional): The directory path to list tags from (default: "/")
includeFilepaths (optional): Whether to include filepaths in results (default: false)
depth (optional): Maximum directory depth to traverse (-1 for infinite, default: -1)

Response: (When includeFilepaths is false)

- tag1 (5)
- tag2 (3)
- tag3 (2)
...

Response: (When includeFilepaths is true)

- tag1 (5)
  - /path/to/document1.md
  - /path/to/document2.md
  - ...
- tag2 (3)
  - /path/to/document3.md
  - ...

Usage Examples

Starting the Server

# Start with default configuration
node dist/bin.js

# Start with custom docs directory
node dist/bin.js --docs-root ./my-documentation

# Start with environment variable
LIBRARIAN_DOCS_ROOT=./my-documentation node dist/bin.js

Example Queries

Listing Documents

// List all documents
const allDocs = await mcp.useTool("librarian", "listDocuments", {});

// List documents in a specific directory
const uiDocs = await mcp.useTool("librarian", "listDocuments", {
  directory: "/daisyui/components",
});

// List documents with specific tags
const buttonDocs = await mcp.useTool("librarian", "listDocuments", {
  tags: ["button", "interactive"],
});

// List documents with depth limit
const topLevelDocs = await mcp.useTool("librarian", "listDocuments", {
  directory: "/daisyui",
  depth: 1, // Only include direct children, not nested subdirectories
});

Searching Documents

// Simple string search
const results = await mcp.useTool("librarian", "searchDocuments", {
  query: "button styling",
});

// Regex search
const regexResults = await mcp.useTool("librarian", "searchDocuments", {
  query: "\\bbutton\\b.*\\bstyle\\b",
  mode: "regex",
  caseSensitive: true,
  includeContents: true,
});

// Search with tag filtering
const filteredResults = await mcp.useTool("librarian", "searchDocuments", {
  query: "installation",
  tags: ["tutorial"],
  directory: "/tailwind4",
});

// Search with depth limit
const topLevelResults = await mcp.useTool("librarian", "searchDocuments", {
  query: "component",
  directory: "/daisyui",
  depth: 1, // Only search in direct children, not nested subdirectories
});

Retrieving a Document

// Get a specific document
const document = await mcp.useTool("librarian", "getDocument", {
  filepath: "/daisyui/components/button.md",
});

Retrieving Multiple Documents

// Get multiple specific documents
const documents = await mcp.useTool("librarian", "getDocuments", {
  filepaths: ["/daisyui/components/button.md", "/daisyui/components/card.md"],
});

Listing Tags

// List all tags
const allTags = await mcp.useTool("librarian", "listTags", {});

// List tags in a specific directory
const tailwindTags = await mcp.useTool("librarian", "listTags", {
  directory: "/tailwind",
});

// List tags with filepaths
const tagsWithFiles = await mcp.useTool("librarian", "listTags", {
  includeFilepaths: true,
});

// List tags with depth limit
const topLevelTags = await mcp.useTool("librarian", "listTags", {
  directory: "/daisyui",
  depth: 1, // Only include tags from direct children, not nested subdirectories
});

Integration with LLMs

Librarian is designed to work seamlessly with LLMs through the Model Context Protocol. Here's how an LLM might use Librarian:

Tag Discovery: The LLM can list available tags to understand the knowledge taxonomy
Document Discovery: The LLM can list available documents to understand what knowledge is available
Search: When the LLM needs specific information, it can search across documents
Retrieval: Once the LLM identifies a relevant document, it can retrieve its full content
Context Building: The LLM can use the retrieved content to build context for generating responses

Error Handling

Librarian uses standard MCP error responses with appropriate error codes and messages:

INVALID_ARGUMENT: When provided parameters are invalid
NOT_FOUND: When a requested document or directory doesn't exist
INTERNAL: For unexpected server errors

Each error response includes:

An error code
A descriptive message
Optional details for debugging

Troubleshooting

Common Issues

Document Not Found

If you're getting NOT_FOUND errors:

Check that the document path is correct
Verify that the --docs-root points to the correct directory
Ensure file permissions allow the server to read the files

Search Returns No Results

If searches aren't returning expected results:

Check that the query syntax is correct (especially for regex searches)
Verify that the documents contain the expected content
Try broadening your search terms or using simpler patterns

Tag Filtering Not Working

If tag filtering isn't working as expected:

Verify that the tags are correctly defined in the frontmatter
Check the inheritance hierarchy to understand which tags apply to which documents
Ensure tag names match exactly (tags are case-sensitive)