Turn PDFs into structured data at scale. Powered by frontier open-weight OCR models with a type-safe TypeScript SDK.
- Best-in-class OCR - PaddleOCR-VL-0.9B for accurate text extraction
- Structured extraction - Define schemas, get JSON back
- Built for scale - Queue-based processing for thousands of documents
- Type-safe SDK - Full TypeScript support with React hooks
- Real-time updates - WebSocket notifications for job progress
- Self-hostable - Run on your own infrastructure
import { createOCRBaseClient } from "@ocrbase/sdk";
const client = createOCRBaseClient({ baseUrl: "https://your-instance.com" });
// Process a document
const job = await client.jobs.create({ file: document, type: "parse" });
const result = await client.jobs.get(job.id);
console.log(result.markdownResult);See SDK documentation for React hooks and advanced usage.
See Self-Hosting Guide for deployment instructions.
Requirements: Docker, Bun, CUDA GPU with 12GB+ VRAM
MIT - See LICENSE for details.
For API access, on-premise deployment, or questions: [email protected]