BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature Paper • 2501.07171 • Published 11 days ago • 48
Open World Object Detection in the Era of Foundation Models Paper • 2312.05745 • Published Dec 10, 2023 • 1
Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision Paper • 2407.06189 • Published Jul 8, 2024 • 26
μ-Bench: A Vision-Language Benchmark for Microscopy Understanding Paper • 2407.01791 • Published Jul 1, 2024 • 5
MedAlign: A Clinician-Generated Dataset for Instruction Following with Electronic Medical Records Paper • 2308.14089 • Published Aug 27, 2023 • 30