Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization Paper • 2411.10442 • Published Nov 15, 2024 • 73
Article AIGS: Generating Science from AI-Powered Automated Falsification By mikelabs • Nov 22, 2024 • 2
Article Procedural Knowledge in Pretraining Drives Reasoning in Large Language Models By mikelabs • Nov 21, 2024 • 2
Article Robust ASR Error Correction with Conservative Data Filtering By mikelabs • Nov 20, 2024 • 2
Article That Chip Has Sailed: A Critique of Unfounded Skepticism Around AI for Chip Design By mikelabs • Nov 19, 2024 • 1
Article Bridging the Visual Gap: Fine-Tuning Multimodal Models with Knowledge-Adapted Captions By mikelabs • Nov 19, 2024 • 3
Article The Dawn of GUI Agent: A Preliminary Case Study with Claude 3.5 Computer Use By mikelabs • Nov 19, 2024 • 2
Article Modeling AdaGrad, RMSProp, and Adam with Integro-Differential Equations By mikelabs • Nov 19, 2024 • 1
Article StableV2V: Stablizing Shape Consistency in Video-to-Video Editing By mikelabs • Nov 19, 2024 • 2
Article GPTree: Towards Explainable Decision-Making via LLM-powered Decision Trees By mikelabs • Nov 18, 2024 • 1
Article Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model By mikelabs • Nov 18, 2024 • 1
Part123: Part-aware 3D Reconstruction from a Single-view Image Paper • 2405.16888 • Published May 27, 2024 • 11
STT: Stateful Tracking with Transformers for Autonomous Driving Paper • 2405.00236 • Published Apr 30, 2024 • 9
Revisiting Text-to-Image Evaluation with Gecko: On Metrics, Prompts, and Human Ratings Paper • 2404.16820 • Published Apr 25, 2024 • 16
LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding Paper • 2404.16710 • Published Apr 25, 2024 • 77
MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training Paper • 2403.09611 • Published Mar 14, 2024 • 126
HyperFields: Towards Zero-Shot Generation of NeRFs from Text Paper • 2310.17075 • Published Oct 26, 2023 • 15
3D-GPT: Procedural 3D Modeling with Large Language Models Paper • 2310.12945 • Published Oct 19, 2023 • 58
Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection Paper • 2310.11511 • Published Oct 17, 2023 • 76