IMAGINE-E: Image Generation Intelligence Evaluation of State-of-the-art Text-to-Image Models Paper • 2501.13920 • Published 8 days ago • 13
RealCritic: Towards Effectiveness-Driven Evaluation of Language Model Critiques Paper • 2501.14492 • Published 7 days ago • 29
Segmenting Text and Learning Their Rewards for Improved RLHF in Language Model Paper • 2501.02790 • Published 25 days ago • 9
Graph-Aware Isomorphic Attention for Adaptive Dynamics in Transformers Paper • 2501.02393 • Published 27 days ago • 8
Generalizable Origin Identification for Text-Guided Image-to-Image Diffusion Models Paper • 2501.02376 • Published 27 days ago • 3
MagicFace: High-Fidelity Facial Expression Editing with Action-Unit Control Paper • 2501.02260 • Published 27 days ago • 5
Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control Paper • 2501.03847 • Published 24 days ago • 23
Demystifying Domain-adaptive Post-training for Financial LLMs Paper • 2501.04961 • Published 23 days ago • 11
Explanatory Instructions: Towards Unified Vision Tasks Understanding and Zero-shot Generalization Paper • 2412.18525 • Published Dec 24, 2024 • 72
MMFactory: A Universal Solution Search Engine for Vision-Language Tasks Paper • 2412.18072 • Published Dec 24, 2024 • 17
Molar: Multimodal LLMs with Collaborative Filtering Alignment for Enhanced Sequential Recommendation Paper • 2412.18176 • Published Dec 24, 2024 • 15