document-understanding

Here are 35 public repositories matching this topic...

infiniflow / ragflow

RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs

Updated Mar 4, 2026
Python

deepdoctection / deepdoctection

Star

A Repo For Document AI

python nlp ocr tensorflow pytorch document-parser document-layout-analysis table-recognition table-detection document-understanding publaynet layoutlm document-ai document-image-analysis pubtabnet

Updated Mar 4, 2026
Python

X-PLUG / mPLUG-DocOwl

Star

mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding

multimodal table-understanding document-understanding mllm multimodal-large-language-models chart-understanding

Updated May 30, 2025
Python

OpenBMB / VisRAG

Star

Parsing-free RAG supported by VLMs

retrieval multi-modal document-retrieval rag multi-modality document-understanding vision-language-model retrieval-augmented-generation

Updated Dec 7, 2025
Python

wenwenyu / PICK-pytorch

Star

Code for the paper "PICK: Processing Key Information Extraction from Documents using Improved Graph Learning-Convolutional Networks" (ICPR 2020)

document-analysis graph-convolutional-network graph-learning graph-neural-networks document-understanding key-information-extraction

Updated Jul 25, 2024
Python

jpWang / LiLT

Star

Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding (ACL 2022)

nlp information-extraction document-analysis document-understanding multilingual-models document-ai multimodal-pre-trained-model

Updated Oct 31, 2022
Python

huggingface / chug

Star

Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.

computer-vision pdf-document datasets distributed-training dataloading document-understanding multi-modal-learning webdataset

Updated Apr 3, 2024
Python

athrael-soju / Snappy

Star

🐊 Snappy's unique approach unifies vision-language late interaction with structured OCR for region-level knowledge retrieval. Like the project? Drop a star! ⭐

python docker typescript computer-vision nextjs document-retrieval rag fastapi vector-search document-understanding pdf-search vector-database vision-ai qdrant colpali multimodal-ai multivector-search deepseek-ocr visual-retrieval

Updated Feb 9, 2026
Python

microsoft / CompHRDoc

Star

Datasets and Evaluation Scripts for CompHRDoc

document-understanding document-structure-analysis rag-related

Updated Feb 25, 2025
Python

ZeningLin / PEneo

Star

[MM'2024] PEneo, an effective algorithm for key-value pair extraction from form-like documents, designed for real-world applications.

ocr document-understanding key-information-extraction document-ai visual-information-extraction

Updated Apr 7, 2025
Python

athrael-soju / little-scripts

Star

A monorepo containing various utility scripts, tools, and applications for development, automation, and AI-powered tasks.

text-to-speech ocr computer-vision cuda speech-to-text gradio fastapi vector-search document-understanding qdrant paddle-ocr flash-attention rag-chatbot colpali deepseek-ocr

Updated Nov 30, 2025
Python

jacobmarks / pytesseract-ocr-plugin

Star

Run optical character recognition with PyTesseract from the FiftyOne App!

python plugin nlp ocr computer-vision tesseract tesseract-ocr document-understanding fiftyone

Updated Apr 5, 2024
Python

Haruhiyuki / yuque-rag

Star

将语雀知识库接入大语言模型，实现基于 RAG（检索增强生成）的智能问答系统，支持FastAPI，兼容OpenAI API与本地Ollama模型。

ai-search rag document-understanding

Updated Jun 12, 2025
Python

VLR-CVC / DocVQA2026

Star

Official evaluation scripts and baseline prompts for the DocVQA 2026 (ICDAR 2026) Competition on Multimodal Reasoning over Documents.

competition vqa-dataset multimodal-datasets document-understanding

Updated Feb 20, 2026
Python

yuvaraj3855 / preocr

Star

Fast document classification and OCR detection. Analyzes any file type to determine if OCR is needed, saving time and money on unnecessary processing.

Updated Feb 26, 2026
Python

marcel-lamott / SlimDoc

Star

Official implementation for "SlimDoc: Lightweight Distillation of Document Transformer Models," published in the International Journal on Document Analysis and Recognition (IJDAR), 2025

distillation document-understanding

Updated Jun 22, 2025
Python

irgroup / labelstudio-to-fonduer

Star

This small module connects Label Studio with Fonduer by creating a fonduer labeling function for gold labels from a label studio export. Documentation: https://irgroup.github.io/labelstudio-to-fonduer/

data-annotation knowledge-base-construction document-understanding label-studio fonduer

Updated Feb 14, 2023
Python

PAIR-Systems-Inc / little-dorrit-editor

Star

Multimodal benchmark for evaluating handwritten editorial correction in printed text.

benchmark ocr multimodal-deep-learning document-understanding llm-evaluation

Updated Feb 23, 2026
Python

ponpaku / GLM-OCR-server

Star

GLM-OCRを使ったローカルOCRサーバー（FastAPI + Web UI / 画像・PDF対応）

python pdf ocr web-ui self-hosted python3 information-extraction webui glm local-server image2text table-recognition fastapi document-understanding fast-api document-ai formula-recognition glm-ocr

Updated Feb 6, 2026
Python

AI4WA / Docs2Synth

Star

A Synthetic Data Tuned Retriever Framework for Documents Understanding

ai document-understanding

Updated Nov 17, 2025
Python

Improve this page

Add a description, image, and links to the document-understanding topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the document-understanding topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

document-understanding

Here are 35 public repositories matching this topic...

infiniflow / ragflow

deepdoctection / deepdoctection

X-PLUG / mPLUG-DocOwl

OpenBMB / VisRAG

wenwenyu / PICK-pytorch

jpWang / LiLT

huggingface / chug

athrael-soju / Snappy

microsoft / CompHRDoc

ZeningLin / PEneo

athrael-soju / little-scripts

jacobmarks / pytesseract-ocr-plugin

Haruhiyuki / yuque-rag

VLR-CVC / DocVQA2026

yuvaraj3855 / preocr

marcel-lamott / SlimDoc

irgroup / labelstudio-to-fonduer

PAIR-Systems-Inc / little-dorrit-editor

ponpaku / GLM-OCR-server

AI4WA / Docs2Synth

Improve this page

Add this topic to your repo