PDF Reader MCP Server

A Model Context Protocol (MCP) server for local PDF files. It extracts text, searches document content, and retrieves metadata, each with configurable options.

Installation

NPM Install

npm install

Configuration

Cursor

{
  "mcpServers": {
    "mcp-gp-pdf-reader": {
      "command": "node",
      "args": ["/absolute/path/to/mcp_gp_pdf_reader/index.js"]
    }
  }
}

Available Tools

Tool          Description
read-pdf      Extract text from PDF files with customizable options, including page ranges, metadata inclusion...
search-pdf    Search for specific text in PDF documents with search options for case sensitivity, whole words, and regex...
pdf-metadata  Retrieve comprehensive metadata from PDF files, including author, title, creation date, keywords, and more...
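
To make the search-pdf options concrete, here is a minimal sketch of how case sensitivity, whole-word matching, and regex mode could be combined into one pattern. The helper names (buildSearchPattern, findMatches) and the option names (caseSensitive, wholeWord, regex) are assumptions for illustration. They are not taken from the server's source, which may work differently.

```javascript
// Hypothetical helper: the option names are assumptions, not the server's actual API.
function buildSearchPattern(query, { caseSensitive = false, wholeWord = false, regex = false } = {}) {
  // Treat the query as literal text unless regex mode is requested.
  let source = regex ? query : query.replace(/[.*+?^${}()|[\]\\]/g, "\\$&");
  // Wrap in word boundaries so "cat" does not match inside "concat".
  if (wholeWord) source = `\\b(?:${source})\\b`;
  // The "g" flag is required by String.prototype.matchAll.
  return new RegExp(source, caseSensitive ? "g" : "gi");
}

// Return every match with its character offset in the extracted text.
function findMatches(text, query, options) {
  const pattern = buildSearchPattern(query, options);
  return [...text.matchAll(pattern)].map((m) => ({ index: m.index, match: m[0] }));
}

const sample = "Quarterly results. quarterly Results!";
console.log(findMatches(sample, "quarterly results"));
console.log(findMatches(sample, "quarterly results", { caseSensitive: true }));
```

Escaping the query by default keeps characters such as "." or "(" in a search phrase from being read as regex syntax. Regex mode opts out of that escaping.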

Features

  • Text extraction from PDF files with customizable options
  • Advanced text search with case sensitivity, whole words, and regex options
  • Comprehensive metadata extraction (author, title, creation date, keywords)
  • Page-by-page processing and content extraction from specific page ranges
  • Text cleaning and normalization capabilities
  • File size protection (50 MB limit)
  • Asynchronous processing for non-blocking file operations
  • File validation and path sanitization for security
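
The file validation and size limit in the list above could look roughly like the sketch below. It is an illustration under stated assumptions, not the server's implementation. The function name validatePdfRequest, the individual checks, and reading 50MB as 50 * 1024 * 1024 bytes are all hypothetical. The caller is assumed to have already obtained the file size, for example from fs.stat.

```javascript
// Assumption: "50MB" is read as 50 * 1024 * 1024 bytes.
const MAX_PDF_BYTES = 50 * 1024 * 1024;

// Hypothetical validator: checks and messages are illustrative only.
function validatePdfRequest(filePath, sizeBytes) {
  if (typeof filePath !== "string" || filePath.length === 0) {
    return { ok: false, reason: "path must be a non-empty string" };
  }
  if (filePath.includes("\0")) {
    return { ok: false, reason: "path contains a null byte" };
  }
  // Reject parent-directory segments on both POSIX and Windows separators.
  if (filePath.split(/[\\/]+/).includes("..")) {
    return { ok: false, reason: "path traversal is not allowed" };
  }
  if (!filePath.toLowerCase().endsWith(".pdf")) {
    return { ok: false, reason: "file must have a .pdf extension" };
  }
  if (!Number.isFinite(sizeBytes) || sizeBytes < 0) {
    return { ok: false, reason: "file size is unknown" };
  }
  if (sizeBytes > MAX_PDF_BYTES) {
    return { ok: false, reason: "file exceeds the 50MB limit" };
  }
  return { ok: true };
}

console.log(validatePdfRequest("/documents/report.pdf", 1024));
console.log(validatePdfRequest("/documents/../etc/passwd.pdf", 1024));
```

Checking the size before parsing means an oversized file is rejected without being loaded into memory.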

Usage Examples

Extract all text from /documents/report.pdf
Search for 'quarterly results' in /documents/financial-report.pdf
Get metadata from /documents/contract.pdf
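
An MCP client would typically turn prompts like these into tools/call requests over JSON-RPC 2.0. The sketch below builds one request per prompt. The method name and the params shape (name plus arguments) follow the MCP specification. The argument names file_path and query are assumptions, since the tool input schemas are not shown here.

```javascript
// Build a JSON-RPC 2.0 "tools/call" request as defined by the MCP specification.
let nextId = 1;
function buildToolCall(name, args) {
  return {
    jsonrpc: "2.0",
    id: nextId++,
    method: "tools/call",
    params: { name, arguments: args },
  };
}

// Assumption: "file_path" and "query" are hypothetical argument names.
const requests = [
  buildToolCall("read-pdf", { file_path: "/documents/report.pdf" }),
  buildToolCall("search-pdf", {
    file_path: "/documents/financial-report.pdf",
    query: "quarterly results",
  }),
  buildToolCall("pdf-metadata", { file_path: "/documents/contract.pdf" }),
];

console.log(JSON.stringify(requests, null, 2));
```

In practice the client reads the real argument names from the server's tools/list response rather than hard-coding them.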

Notes

Requires Node.js 18.0.0 or higher. Planned future features include OCR support, image extraction, table detection, form data extraction, password-protected PDF support, and batch processing capabilities.