TangoDJ's Collections
Papers
Text-to-3D using Gaussian Splatting
Paper
•
2309.16585
•
Published
•
31
FP8-LM: Training FP8 Large Language Models
Paper
•
2310.18313
•
Published
•
33
Zephyr: Direct Distillation of LM Alignment
Paper
•
2310.16944
•
Published
•
123
Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models
Paper
•
2312.06585
•
Published
•
28
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
Paper
•
2402.03300
•
Published
•
77
A Human-Inspired Reading Agent with Gist Memory of Very Long Contexts
Paper
•
2402.09727
•
Published
•
36
Chain-of-Thought Reasoning Without Prompting
Paper
•
2402.10200
•
Published
•
104
Paper
•
2402.13144
•
Published
•
95
Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping
Paper
•
2402.14083
•
Published
•
47
Watermarking Makes Language Models Radioactive
Paper
•
2402.14904
•
Published
•
23
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
Paper
•
2403.05530
•
Published
•
61
Jamba: A Hybrid Transformer-Mamba Language Model
Paper
•
2403.19887
•
Published
•
104
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
Paper
•
2404.14219
•
Published
•
254
Executable Code Actions Elicit Better LLM Agents
Paper
•
2402.01030
•
Published
•
40
Scaling Laws for Pre-training Agents and World Models
Paper
•
2411.04434
•
Published
How Abilities in Large Language Models are Affected by Supervised Fine-tuning Data Composition
Paper
•
2310.05492
•
Published
•
2
DSPy: Compiling Declarative Language Model Calls into Self-Improving Pipelines
Paper
•
2310.03714
•
Published
•
33
Training Language Models on the Knowledge Graph: Insights on Hallucinations and Their Detectability
Paper
•
2408.07852
•
Published
•
16
Paper
•
2410.01201
•
Published
•
51
Towards Understanding Sycophancy in Language Models
Paper
•
2310.13548
•
Published
•
4
Sycophancy to Subterfuge: Investigating Reward-Tampering in Large Language Models
Paper
•
2406.10162
•
Published
RULER: What's the Real Context Size of Your Long-Context Language Models?
Paper
•
2404.06654
•
Published
•
34
Scalable MatMul-free Language Modeling
Paper
•
2406.02528
•
Published
•
11
Training language models to follow instructions with human feedback
Paper
•
2203.02155
•
Published
•
16
Adding Error Bars to Evals: A Statistical Approach to Language Model Evaluations
Paper
•
2411.00640
•
Published
•
3
Discovering Preference Optimization Algorithms with and for Large Language Models
Paper
•
2406.08414
•
Published
•
14
Flow Matching Guide and Code
Paper
•
2412.06264
•
Published
•
1
Training Large Language Models to Reason in a Continuous Latent Space
Paper
•
2412.06769
•
Published
•
71
An Evolved Universal Transformer Memory
Paper
•
2410.13166
•
Published
•
3
A Survey on Uncertainty Quantification of Large Language Models: Taxonomy, Open Research Challenges, and Future Directions
Paper
•
2412.05563
•
Published