Models
Datasets
Spaces
Posts
Docs
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2312.15685

Candidate papers to read in the H4 journal club

The Goldilocks of Pragmatic Understanding: Fine-Tuning Strategy Matters for Implicature Resolution by LLMs

Paper • 2210.14986 • Published Oct 26, 2022 • 4
Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2

Paper • 2311.10702 • Published Nov 17, 2023 • 18
Large Language Models as Optimizers

Paper • 2309.03409 • Published Sep 7, 2023 • 75
From Sparse to Dense: GPT-4 Summarization with Chain of Density Prompting

Paper • 2309.04269 • Published Sep 8, 2023 • 32

Instruction Tuning Datasets

For both SFT and DPO

tatsu-lab/alpaca

Viewer • Updated May 22, 2023 • 52k • 89.1k • 678
elichen3051/alpaca52k-alignment-handbook

Viewer • Updated Jun 7 • 52k • 2 • 1
yahma/alpaca-cleaned

Viewer • Updated Apr 10, 2023 • 51.8k • 44.5k • 546
HuggingFaceH4/ultrachat_200k

Viewer • Updated Feb 22 • 515k • 29k • 442

lshort-transformers

Papers useful when writing the paper: "The Not So Short Transfromers"

ShortGPT: Layers in Large Language Models are More Redundant Than You Expect

Paper • 2403.03853 • Published Mar 6 • 63
SliceGPT: Compress Large Language Models by Deleting Rows and Columns

Paper • 2401.15024 • Published Jan 26 • 67
Your Transformer is Secretly Linear

Paper • 2405.12250 • Published May 19 • 149
Yi: Open Foundation Models by 01.AI

Paper • 2403.04652 • Published Mar 7 • 61

ibm/AttaQ

Viewer • Updated Jan 26 • 1.4k • 1.2k • 10
snorkelai/snorkel-curated-instruction-tuning

Preview • Updated Mar 11 • 2 • 9
corbyrosset/researchy_questions

Viewer • Updated Feb 29 • 96.4k • 56 • 24
argilla/ultrafeedback-binarized-preferences

Viewer • Updated Nov 30, 2023 • 63.6k • 647 • 64

INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning

Paper • 2401.06532 • Published Jan 12 • 10
Textbooks Are All You Need II: phi-1.5 technical report

Paper • 2309.05463 • Published Sep 11, 2023 • 86
RLAIF: Scaling Reinforcement Learning from Human Feedback with AI Feedback

Paper • 2309.00267 • Published Sep 1, 2023 • 47
What Makes Good Data for Alignment? A Comprehensive Study of Automatic Data Selection in Instruction Tuning

Paper • 2312.15685 • Published Dec 25, 2023 • 17

Self-Rewarding Language Models

Paper • 2401.10020 • Published Jan 18 • 140
ReFT: Reasoning with Reinforced Fine-Tuning

Paper • 2401.08967 • Published Jan 17 • 27
Tuning Language Models by Proxy

Paper • 2401.08565 • Published Jan 16 • 20
TrustLLM: Trustworthiness in Large Language Models

Paper • 2401.05561 • Published Jan 10 • 63

LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models

Paper • 2309.12307 • Published Sep 21, 2023 • 86
NEFTune: Noisy Embeddings Improve Instruction Finetuning

Paper • 2310.05914 • Published Oct 9, 2023 • 14
SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-Scaling

Paper • 2312.15166 • Published Dec 23, 2023 • 56
Soaring from 4K to 400K: Extending LLM's Context with Activation Beacon

Paper • 2401.03462 • Published Jan 7 • 26

What Makes Good Data for Alignment? A Comprehensive Study of Automatic Data Selection in Instruction Tuning

Paper • 2312.15685 • Published Dec 25, 2023 • 17

Understanding LLMs: A Comprehensive Overview from Training to Inference

Paper • 2401.02038 • Published Jan 4 • 61
What Makes Good Data for Alignment? A Comprehensive Study of Automatic Data Selection in Instruction Tuning

Paper • 2312.15685 • Published Dec 25, 2023 • 17

AugmentedLearning

What Makes Good Data for Alignment? A Comprehensive Study of Automatic Data Selection in Instruction Tuning

Paper • 2312.15685 • Published Dec 25, 2023 • 17
mistralai/Mixtral-8x7B-Instruct-v0.1

Text Generation • Updated Aug 19 • 562k • • 4.11k
microsoft/phi-2

Text Generation • Updated Apr 29 • 335k • 3.23k
TinyLlama/TinyLlama-1.1B-Chat-v1.0

Text Generation • Updated Mar 17 • 622k • 1.06k

Previous
1
2
Next

Company

© Hugging Face

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs