- LMDX: Language Model-based Document Information Extraction and Localization
  Paper • 2309.10952 • Published • 65
- Attention Where It Matters: Rethinking Visual Document Understanding with Selective Region Concentration
  Paper • 2309.01131 • Published • 1
- On the Hidden Mystery of OCR in Large Multimodal Models
  Paper • 2305.07895 • Published
- DocLLM: A layout-aware generative language model for multimodal document understanding
  Paper • 2401.00908 • Published • 181
Onur Savas PRO
onursavas
AI & ML interests: None yet
Recent Activity
liked a Space 18 days ago: webml-community/kokoro-web
liked a Space 6 months ago: MaziyarPanahi/Phi-3.5-Vision
liked a Space 12 months ago: echo840/ocrbench-leaderboard
Organizations
Collections: 3
Spaces: 9
Models: None public yet
Datasets: None public yet