Aligning Large Multi-Modal Model with Robust Instruction Tuning Paper • 2306.14565 • Published Jun 26, 2023 • 6
DocLLM: A layout-aware generative language model for multimodal document understanding Paper • 2401.00908 • Published Dec 31, 2023 • 181