CHBench: A Chinese Dataset for Evaluating Health in Large Language Models Paper • 2409.15766 • Published Sep 24, 2024
NLoRA: Nyström-Initiated Low-Rank Adaptation for Large Language Models Paper • 2502.14482 • Published 6 days ago
StructFlowBench: A Structured Flow Benchmark for Multi-turn Instruction Following Paper • 2502.14494 • Published 6 days ago
Large Language Model Evaluation via Matrix Nuclear-Norm Paper • 2410.10672 • Published Oct 14, 2024