Introduction
Welcome to Docsray MCP - the most comprehensive document perception server for Claude and the MCP ecosystem.
What is Docsray?β
Docsray is an advanced document processing MCP server that enables Claude to extract everything from any document. With support for multiple providers and intelligent caching, it's the ultimate tool for document analysis, extraction, and understanding.
Key Featuresβ
π― Comprehensive Extractionβ
Extract ALL data from documents with a single command:
- Complete text with exact formatting
- Tables with full structure
- Images with descriptions
- Entities (people, organizations, dates, amounts)
- Document hierarchy and layout
- Metadata and properties
π Multi-Provider Architectureβ
- LlamaParse: AI-powered deep analysis (5-30s)
- PyMuPDF: Lightning-fast extraction (<1s)
- Auto-selection: Intelligent provider choice
β‘ Intelligent Cachingβ
- Process once, access forever
- Instant retrieval of cached results
- Smart invalidation on document changes
Quick Exampleβ
# In Claude Desktop or Cursor, ask:
"Xray document.pdf with provider llama-parse"
# Claude will use Docsray to extract:
- All entities (people, organizations, dates, amounts)
- All tables with complete structure
- All images with descriptions
- Complete document hierarchy
Five Powerful Toolsβ
- Peek π - Quick overview and metadata
- Map πΊοΈ - Complete document structure
- Xray π©» - Comprehensive AI analysis
- Extract π - Content in any format
- Seek π― - Navigate to specific locations
Why Docsray?β
- Production Ready: 52+ tests, comprehensive error handling
- Universal Support: PDF, DOCX, PPTX, HTML, and more
- Local & Remote: Works with files and URLs
- Natural Language: Designed for Claude's conversational interface
Getting Startedβ
Ready to extract everything from your documents? Head to our Quickstart Guide to begin!
Communityβ
Licenseβ
Docsray is open source and available under the Apache 2.0 license.