Submitted by razzant 132 LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers · 7 authors 3
Submitted by DongkiKim 41 Mol-LLaMA: Towards General Understanding of Molecules in Large Molecular Language Model · 3 authors 2
Submitted by wzhgba 36 MaskGWM: A Generalizable Driving World Model with Video Mask Reconstruction · 6 authors 2
Submitted by yiren98 33 PhotoDoodle: Learning Artistic Image Editing from Few-Shot Pairwise Data · 7 authors 6
Submitted by Sterzhang 27 VLM$^2$-Bench: A Closer Look at How Well VLMs Implicitly Link Explicit Matching Visual Cues · 6 authors 2
Submitted by Yeonjun 14 Is Safety Standard Same for Everyone? User-Specific Safety Evaluation of Large Language Models · 7 authors 2
Submitted by WhiteCatY 13 StructFlowBench: A Structured Flow Benchmark for Multi-turn Instruction Following · 5 authors 2
Submitted by sangheeeee 9 Evaluating Multimodal Generative AI with Korean Educational Standards · 2 authors 3
Submitted by heegyu 8 Towards Fully-Automated Materials Discovery via Large-Scale Synthesis Dataset and Expert-Level LLM-as-a-Judge · 15 authors 1
Submitted by SP2001 8 MedHallu: A Comprehensive Benchmark for Detecting Medical Hallucinations in Large Language Models · 7 authors 2
Submitted by akhaliq 8 Think Inside the JSON: Reinforcement Strategy for Strict LLM Schema Adherence · 3 authors 2
Submitted by BestWishYsh 7 FantasyID: Face Knowledge Enhanced ID-Preserving Video Generation · 6 authors 2
Submitted by ahmedheakl 6 KITAB-Bench: A Comprehensive Multi-Domain Benchmark for Arabic OCR and Document Understanding · 10 authors 2
Submitted by AngxiaoYue 6 ReQFlow: Rectified Quaternion Flow for Efficient and High-Quality Protein Backbone Generation · 3 authors 3
Submitted by akhaliq 6 InterFeedback: Unveiling Interactive Intelligence of Large Multimodal Models via Human Feedback · 5 authors 2
Submitted by akhaliq 6 The Relationship Between Reasoning and Performance in Large Language Models -- o3 (mini) Thinks Harder, Not Longer · 3 authors 2
Submitted by pkargupta 4 Tree-of-Debate: Multi-Persona Debate Trees Elicit Critical Thinking for Scientific Comparative Analysis · 4 authors 2
Submitted by taesiri 4 Superintelligent Agents Pose Catastrophic Risks: Can Scientist AI Offer a Safer Path? · 13 authors 2
Submitted by akhaliq 4 One-step Diffusion Models with $f$-Divergence Distribution Matching · 3 authors 2
Submitted by AjayP13 3 mStyleDistance: Multilingual Style Embeddings and their Evaluation · 5 authors 2
Submitted by kjunh 3 EgoSpeak: Learning When to Speak for Egocentric Conversational Agents in the Wild · 8 authors 2
Submitted by fromthesky 1 PLDR-LLMs Learn A Generalizable Tensor Operator That Can Replace Its Own Deep Neural Net At Inference · 1 authors 1
Submitted by oceanusity 1 Learning to Discover Regulatory Elements for Gene Expression Prediction · 4 authors 2
Submitted by elliotschu 1 Rare Disease Differential Diagnosis with Large Language Models at Scale: From Abdominal Actinomycosis to Wilson's Disease · 3 authors 2
Submitted by yueqingliang 1 Benchmarking LLMs for Political Science: A United Nations Perspective · 9 authors 2
Submitted by circleLZY 1 JL1-CD: A New Benchmark for Remote Sensing Change Detection and a Robust Multi-Teacher Knowledge Distillation Framework · 6 authors 2
Submitted by vaidehi99 1 UPCORE: Utility-Preserving Coreset Selection for Balanced Unlearning · 3 authors 2
Submitted by dnoever - Beyond No: Quantifying AI Over-Refusal and Emotional Attachment Boundaries · 2 authors 3