The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding Paper • 2502.08946 • Published 24 days ago • 182
Controllable Dynamic Appearance for Neural 3D Portraits Paper • 2309.11009 • Published Sep 20, 2023 • 3
End-to-End Speech Recognition Contextualization with Large Language Models Paper • 2309.10917 • Published Sep 19, 2023 • 10
The Languini Kitchen: Enabling Language Modelling Research at Different Scales of Compute Paper • 2309.11197 • Published Sep 20, 2023 • 5
A Large-scale Dataset for Audio-Language Representation Learning Paper • 2309.11500 • Published Sep 20, 2023 • 10
Chain-of-Verification Reduces Hallucination in Large Language Models Paper • 2309.11495 • Published Sep 20, 2023 • 38
Q-Transformer: Scalable Offline Reinforcement Learning via Autoregressive Q-Functions Paper • 2309.10150 • Published Sep 18, 2023 • 25
OpenBA: An Open-sourced 15B Bilingual Asymmetric seq2seq Model Pre-trained from Scratch Paper • 2309.10706 • Published Sep 19, 2023 • 17
SlimPajama-DC: Understanding Data Combinations for LLM Training Paper • 2309.10818 • Published Sep 19, 2023 • 11
360^circ Reconstruction From a Single Image Using Space Carved Outpainting Paper • 2309.10279 • Published Sep 19, 2023 • 6
Augmenting text for spoken language understanding with Large Language Models Paper • 2309.09390 • Published Sep 17, 2023 • 3
Enhance audio generation controllability through representation similarity regularization Paper • 2309.08773 • Published Sep 15, 2023 • 4
Stack-and-Delay: a new codebook pattern for music generation Paper • 2309.08804 • Published Sep 15, 2023 • 5
S3-DST: Structured Open-Domain Dialogue Segmentation and State Tracking in the Era of LLMs Paper • 2309.08827 • Published Sep 16, 2023 • 5
Recovering from Privacy-Preserving Masking with Large Language Models Paper • 2309.08628 • Published Sep 12, 2023 • 5
A Distributed Data-Parallel PyTorch Implementation of the Distributed Shampoo Optimizer for Training Neural Networks At-Scale Paper • 2309.06497 • Published Sep 12, 2023 • 6
TextBind: Multi-turn Interleaved Multimodal Instruction-following Paper • 2309.08637 • Published Sep 14, 2023 • 8
Struc-Bench: Are Large Language Models Really Good at Generating Complex Structured Data? Paper • 2309.08963 • Published Sep 16, 2023 • 11