Docsray MCP
Advanced Document Perception for Claude - Extract Everything from Any Document
# Install
pip install docsray-mcp
# Run with uvx (recommended for MCP clients)
uvx docsray-mcp
# Configure in Claude Desktop or Cursor, then use:
"Xray document.pdf with provider llama-parse"
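To make the "configure in Claude Desktop" step concrete, here is a minimal sketch that prints an `mcpServers` entry you could paste into Claude Desktop's `claude_desktop_config.json`. The `uvx` command and `docsray-mcp` argument come from the commands above. The server label `docsray` is an arbitrary name chosen for illustration, and any provider API key settings are left out because their names are not given here.

```python
import json


def build_mcp_config(server_name: str = "docsray") -> dict:
    """Return an MCP client config that launches Docsray through uvx.

    server_name is only a label shown by the client (an assumption here).
    """
    return {
        "mcpServers": {
            server_name: {
                "command": "uvx",          # launcher from the install steps
                "args": ["docsray-mcp"],   # package name from the install steps
            }
        }
    }


# Serialize the entry so it can be pasted into the client's config file.
config_json = json.dumps(build_mcp_config(), indent=2)
print(config_json)
```

Restart the client after editing its config file so the new server entry is picked up.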
Comprehensive Extraction
Extract EVERYTHING from documents - text, tables, images, entities, layouts, metadata, and more with a single command using AI-powered analysis.
Multi-Provider Support
Choose between LlamaParse for deep AI analysis or PyMuPDF for lightning-fast extraction. Auto-selection picks the best provider for your needs.
Intelligent Caching
All extractions are cached locally, so a document is processed once and retrieved instantly afterwards. Cache entries are invalidated automatically when the document changes.
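One common way to get "process once, invalidate on change" behavior is to key the cache on a hash of the file contents. The sketch below illustrates that idea only; it is not Docsray's actual cache code, and `cache_key`, `cached_extract`, and `demo` are hypothetical names.

```python
import hashlib
import json
import tempfile
from pathlib import Path
from typing import Callable


def cache_key(doc_path: Path, provider: str) -> str:
    """Hash the provider name plus the file bytes, so any edit yields a new key."""
    digest = hashlib.sha256()
    digest.update(provider.encode("utf-8"))
    digest.update(b"\0")
    digest.update(doc_path.read_bytes())
    return digest.hexdigest()


def cached_extract(doc_path: Path, provider: str, cache_dir: Path,
                   extract: Callable[[Path], dict]) -> dict:
    """Return the cached result if present, otherwise extract and store it."""
    cache_dir.mkdir(parents=True, exist_ok=True)
    entry = cache_dir / f"{cache_key(doc_path, provider)}.json"
    if entry.exists():
        return json.loads(entry.read_text(encoding="utf-8"))
    result = extract(doc_path)
    entry.write_text(json.dumps(result), encoding="utf-8")
    return result


def demo() -> list:
    """Extract twice (second call hits the cache), then edit the file and extract again."""
    calls = []

    def fake_extract(path: Path) -> dict:
        # Stand-in for a real provider; records each time it actually runs.
        calls.append(path.name)
        return {"text": path.read_text(encoding="utf-8")}

    with tempfile.TemporaryDirectory() as tmp:
        doc = Path(tmp) / "doc.txt"
        cache = Path(tmp) / "cache"
        doc.write_text("v1", encoding="utf-8")
        first = cached_extract(doc, "pymupdf", cache, fake_extract)
        second = cached_extract(doc, "pymupdf", cache, fake_extract)
        doc.write_text("v2", encoding="utf-8")  # changing the file changes the key
        third = cached_extract(doc, "pymupdf", cache, fake_extract)
    return [first["text"], second["text"], third["text"], len(calls)]
```

Because the key depends on the file bytes, a changed document never matches a stale entry, so no separate invalidation step is needed in this sketch.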
MCP Native
Built specifically for Claude and the MCP ecosystem. Five powerful tools that work seamlessly with natural language prompts.
Production Ready
52+ passing tests, comprehensive error handling, timeout protection, and battle-tested with real-world documents.
Universal Format Support
PDF, DOCX, PPTX, XLSX, HTML, Markdown, and more. Works with local files and URLs. Handles everything from invoices to research papers.
Choose Your Provider
Provider | Speed | Capabilities | Best For |
---|---|---|---|
LlamaParse 🧠 | 5-30s | AI analysis, entities, tables, images, layouts, custom instructions | Comprehensive extraction, deep analysis |
PyMuPDF ⚡ | <1s | Text, basic markdown, fast extraction | Quick text retrieval, simple documents |
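The table suggests a simple rule of thumb for auto-selection: use the fast provider unless the request needs something only the AI provider offers. The sketch below is one possible reading of that rule, not Docsray's real selection logic. `choose_provider`, the capability names, and the `pymupdf` identifier are assumptions; only `llama-parse` appears in the prompts above.

```python
# Capabilities the table lists only for LlamaParse (names are illustrative).
LLAMA_PARSE_ONLY = {"entities", "tables", "images", "layouts", "custom_instructions"}


def choose_provider(needs: set[str], requested: str = "auto") -> str:
    """Pick a provider from the requested capabilities.

    An explicit provider always wins; "auto" falls back to the heuristic.
    """
    if requested != "auto":
        return requested
    # Any AI-only capability forces the slower, deeper provider.
    return "llama-parse" if needs & LLAMA_PARSE_ONLY else "pymupdf"


print(choose_provider({"text"}))
print(choose_provider({"text", "tables"}))
```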
Maximum Data Extraction
Get EVERYTHING with one prompt in Claude:
# Ask Claude to analyze your document:
"Xray document.pdf with provider llama-parse and extract:
1) Complete text content preserving exact formatting
2) All tables with complete data and structure
3) All images with descriptions and metadata
4) Complete document metadata
5) Full document structure with all sections
6) All form fields and values
7) All hyperlinks and cross-references
8) All mathematical equations
9) Page-by-page layout information
10) All recognized entities (people, organizations, dates, amounts)"
# Returns EVERYTHING in result['full_extraction']
Five Powerful Tools
Peek
Quick overview, metadata, available formats
Map
Complete document structure and hierarchy
Xray
Deep AI analysis, entities, comprehensive extraction
Extract
Get content in markdown, JSON, or text
Seek
Navigate to specific pages or sections
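Claude normally calls these tools for you, but it can help to see roughly what an MCP client sends. In the Model Context Protocol, a tool is invoked with a JSON-RPC 2.0 `tools/call` request. The sketch below builds such a request. The tool name `xray` and the argument names `document_url` and `provider` are assumptions for illustration, so check the server's tool listing for the real ones.

```python
import json
from itertools import count

_request_ids = count(1)  # JSON-RPC requests need unique ids


def tool_call(tool: str, arguments: dict) -> dict:
    """Build a JSON-RPC 2.0 request for the MCP tools/call method."""
    return {
        "jsonrpc": "2.0",
        "id": next(_request_ids),
        "method": "tools/call",
        "params": {"name": tool, "arguments": arguments},
    }


# Hypothetical tool and argument names; only the envelope follows the MCP spec.
request = tool_call("xray", {"document_url": "document.pdf", "provider": "llama-parse"})
print(json.dumps(request, indent=2))
```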