SciCode: A Research Coding Benchmark Curated by Scientists Paper • 2407.13168 • Published Jul 18 • 13
If LLM Is the Wizard, Then Code Is the Wand: A Survey on How Code Empowers Large Language Models to Serve as Intelligent Agents Paper • 2401.00812 • Published Jan 1 • 3
What is the Visual Cognition Gap between Humans and Multimodal LLMs? Paper • 2406.10424 • Published Jun 14
Mitigating Transformer Overconfidence via Lipschitz Regularization Paper • 2306.06849 • Published Jun 12, 2023
MACP: Efficient Model Adaptation for Cooperative Perception Paper • 2310.16870 • Published Oct 25, 2023
LaMPilot: An Open Benchmark Dataset for Autonomous Driving with Language Model Programs Paper • 2312.04372 • Published Dec 7, 2023
Open-NeRF: Towards Open Vocabulary NeRF Decomposition Paper • 2310.16383 • Published Oct 25, 2023 • 2
Learning Implicit Representation for Reconstructing Articulated Objects Paper • 2401.08809 • Published Jan 16
MagicPose4D: Crafting Articulated Models with Appearance and Motion Control Paper • 2405.14017 • Published May 22 • 3
Decoding Compressed Trust: Scrutinizing the Trustworthiness of Efficient LLMs Under Compression Paper • 2403.15447 • Published Mar 18 • 16
Good at captioning, bad at counting: Benchmarking GPT-4V on Earth observation data Paper • 2401.17600 • Published Jan 31
DP-OPT: Make Large Language Model Your Privacy-Preserving Prompt Engineer Paper • 2312.03724 • Published Nov 27, 2023 • 1
LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models Paper • 2308.16137 • Published Aug 30, 2023 • 39
DecodingTrust: A Comprehensive Assessment of Trustworthiness in GPT Models Paper • 2306.11698 • Published Jun 20, 2023 • 12
Video Pre-trained Transformer: A Multimodal Mixture of Pre-trained Experts Paper • 2304.10505 • Published Mar 24, 2023 • 1