Papers - a TangoDJ Collection

TangoDJ 's Collections

Paper - Alignment

Paper - Application

Paper - LLM Laws

Papers - Intro, Review, Survey

Papers

models

models multimodal

Papers - LLM on Data

Pappers - Robots

Papers

updated 25 days ago

Text-to-3D using Gaussian Splatting

Paper • 2309.16585 • Published Sep 28, 2023 • 31
FP8-LM: Training FP8 Large Language Models

Paper • 2310.18313 • Published Oct 27, 2023 • 33
Zephyr: Direct Distillation of LM Alignment

Paper • 2310.16944 • Published Oct 25, 2023 • 123
Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models

Paper • 2312.06585 • Published Dec 11, 2023 • 28
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Paper • 2402.03300 • Published Feb 5, 2024 • 77
A Human-Inspired Reading Agent with Gist Memory of Very Long Contexts

Paper • 2402.09727 • Published Feb 15, 2024 • 36
Chain-of-Thought Reasoning Without Prompting

Paper • 2402.10200 • Published Feb 15, 2024 • 104
Neural Network Diffusion

Paper • 2402.13144 • Published Feb 20, 2024 • 95
Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping

Paper • 2402.14083 • Published Feb 21, 2024 • 47
Watermarking Makes Language Models Radioactive

Paper • 2402.14904 • Published Feb 22, 2024 • 23
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Paper • 2403.05530 • Published Mar 8, 2024 • 61
Jamba: A Hybrid Transformer-Mamba Language Model

Paper • 2403.19887 • Published Mar 28, 2024 • 104
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Paper • 2404.14219 • Published Apr 22, 2024 • 254
Executable Code Actions Elicit Better LLM Agents

Paper • 2402.01030 • Published Feb 1, 2024 • 40
Scaling Laws for Pre-training Agents and World Models

Paper • 2411.04434 • Published Nov 7, 2024
How Abilities in Large Language Models are Affected by Supervised Fine-tuning Data Composition

Paper • 2310.05492 • Published Oct 9, 2023 • 2
DSPy: Compiling Declarative Language Model Calls into Self-Improving Pipelines

Paper • 2310.03714 • Published Oct 5, 2023 • 33
Training Language Models on the Knowledge Graph: Insights on Hallucinations and Their Detectability

Paper • 2408.07852 • Published Aug 14, 2024 • 16
Were RNNs All We Needed?

Paper • 2410.01201 • Published Oct 2, 2024 • 51
Towards Understanding Sycophancy in Language Models

Paper • 2310.13548 • Published Oct 20, 2023 • 4
Sycophancy to Subterfuge: Investigating Reward-Tampering in Large Language Models

Paper • 2406.10162 • Published Jun 14, 2024
RULER: What's the Real Context Size of Your Long-Context Language Models?

Paper • 2404.06654 • Published Apr 9, 2024 • 34
Scalable MatMul-free Language Modeling

Paper • 2406.02528 • Published Jun 4, 2024 • 11
Training language models to follow instructions with human feedback

Paper • 2203.02155 • Published Mar 4, 2022 • 16
Adding Error Bars to Evals: A Statistical Approach to Language Model Evaluations

Paper • 2411.00640 • Published Nov 1, 2024 • 3
Discovering Preference Optimization Algorithms with and for Large Language Models

Paper • 2406.08414 • Published Jun 12, 2024 • 14
Flow Matching Guide and Code

Paper • 2412.06264 • Published Dec 9, 2024 • 1
Training Large Language Models to Reason in a Continuous Latent Space

Paper • 2412.06769 • Published Dec 9, 2024 • 71
An Evolved Universal Transformer Memory

Paper • 2410.13166 • Published Oct 17, 2024 • 3
A Survey on Uncertainty Quantification of Large Language Models: Taxonomy, Open Research Challenges, and Future Directions

Paper • 2412.05563 • Published Dec 7, 2024