Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
Updated Apr 2, 2026 - Python
Transforms complex documents like PDFs into LLM-ready Markdown/JSON for your agentic workflows.
A diffusion-based framework for document OCR that replaces autoregressive decoding with block-level parallel diffusion decoding.
MinerU all-in-one package: no-install deployment with one-click launch.
🎨 Display system theme colors and their references easily with this Lua Plugin for GrandMA3, simplifying your color selection process.
PDF table extraction for RAG — convert to clean HTML. Fast, local, no GPU.
A small web app that finds relevant documents and produces query-focused summaries using Gemini. Supports PDF upload with one-time multimodal preprocessing into per-page Markdown + metadata.
🔄 Optimize model loading in ComfyUI with flexible node connections and controlled sequences for better performance and memory management.
Extract tables precisely from PDFs and convert them to clean HTML for RAG pipelines, running fast on CPU without external dependencies.
🎨 Enhance video generation by syncing audio to visuals with ComfyUI-PainterAI2V. Create precise lip-syncing and seamless transitions using dual model workflows.
🖼️ Segment characters in images with ComfyUI using a Vision LLM agent, enhancing your projects with detailed and high-quality masks.
A high-quality tool for converting PDF to Markdown and JSON. A one-stop, open-source, high-quality data extraction tool that converts PDFs into Markdown and JSON formats.
🎨 Build interactive Blazor applications with A2UI, a secure and portable protocol for rich UI rendered natively across platforms without code execution risks.
💡 Control 3D lighting angles effortlessly with ComfyUI - adjust direction, intensity, and color while relighting images in real-time.
🎶 Generate multilingual AI music with lyrics in English, Chinese, Japanese, Korean, and Spanish using ComfyUI's HeartMuLa model.
Parse JSON quickly using a fast, recursive-descent parser designed for lightweight integration in C++ projects.
Implements Unreal Engine 5 network protocol in Python to connect, authenticate, and replicate actors with UE5 Lyra Starter Game servers.
UE5 Server Emulator 2026 🎮 | Python Lyra Client & Replication Tools
📝 Manage your projects and notes locally with Ironpad, a file-based system that keeps your data safe in Markdown format without cloud reliance.
🤖 Process SCAIL-pose data with ComfyUI nodes, utilizing VitPose for accurate face and hand detection in an efficient, streamlined setup.