Surveying the Effects of Quality, Diversity, and Complexity in Synthetic Data From Large Language Models Paper • 2412.02980 • Published 22 days ago • 12
CBT-Bench: Evaluating Large Language Models on Assisting Cognitive Behavior Therapy Paper • 2410.13218 • Published Oct 17 • 4
T2V-Turbo-v2: Enhancing Video Generation Model Post-Training through Data, Reward, and Conditional Guidance Design Paper • 2410.05677 • Published Oct 8 • 14
Automatically Correcting Large Language Models: Surveying the landscape of diverse self-correction strategies Paper • 2308.03188 • Published Aug 6, 2023 • 2
Let's Think Frame by Frame: Evaluating Video Chain of Thought with Video Infilling and Prediction Paper • 2305.13903 • Published May 23, 2023
Visual Chain of Thought: Bridging Logical Gaps with Multimodal Infillings Paper • 2305.02317 • Published May 3, 2023
WikiWhy: Answering and Explaining Cause-and-Effect Questions Paper • 2210.12152 • Published Oct 21, 2022 • 1
Not All Errors are Equal: Learning Text Generation Metrics using Stratified Error Synthesis Paper • 2210.05035 • Published Oct 10, 2022
TC-Bench: Benchmarking Temporal Compositionality in Text-to-Video and Image-to-Video Generation Paper • 2406.08656 • Published Jun 12 • 7
Losing Visual Needles in Image Haystacks: Vision Language Models are Easily Distracted in Short and Long Contexts Paper • 2406.16851 • Published Jun 24
Logic-LM: Empowering Large Language Models with Symbolic Solvers for Faithful Logical Reasoning Paper • 2305.12295 • Published May 20, 2023
The Responsible Foundation Model Development Cheatsheet: A Review of Tools & Resources Paper • 2406.16746 • Published Jun 24
BPO: Supercharging Online Preference Learning by Adhering to the Proximity of Behavior LLM Paper • 2406.12168 • Published Jun 18 • 7
DataComp-LM: In search of the next generation of training sets for language models Paper • 2406.11794 • Published Jun 17 • 50
Mastering Robot Manipulation with Multimodal Prompts through Pretraining and Multi-task Fine-tuning Paper • 2310.09676 • Published Oct 14, 2023
MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos Paper • 2406.08407 • Published Jun 12 • 24
T2V-Turbo: Breaking the Quality Bottleneck of Video Consistency Model with Mixed Reward Feedback Paper • 2405.18750 • Published May 29 • 21