DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published 2 days ago • 161
Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets Generation Paper • 2501.12202 • Published 3 days ago • 25
Towards Best Practices for Open Datasets for LLM Training Paper • 2501.08365 • Published 10 days ago • 47
Scaling Laws for Floating Point Quantization Training Paper • 2501.02423 • Published 20 days ago • 25
PhD: A Prompted Visual Hallucination Evaluation Dataset Paper • 2403.11116 • Published Mar 17, 2024 • 1
Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent Paper • 2411.02265 • Published Nov 4, 2024 • 24
Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence Paper • 2407.07061 • Published Jul 9, 2024 • 27
AlignGPT: Multi-modal Large Language Models with Adaptive Alignment Capability Paper • 2405.14129 • Published May 23, 2024 • 12
Reducing Transformer Key-Value Cache Size with Cross-Layer Attention Paper • 2405.12981 • Published May 21, 2024 • 29