-
What Makes Good Data for Alignment? A Comprehensive Study of Automatic Data Selection in Instruction Tuning
Paper • 2312.15685 • Published • 17 -
mistralai/Mixtral-8x7B-Instruct-v0.1
Text Generation • Updated • 706k • • 4.19k -
microsoft/phi-2
Text Generation • Updated • 235k • 3.24k -
TinyLlama/TinyLlama-1.1B-Chat-v1.0
Text Generation • Updated • 1.25M • 1.09k
Collections
Discover the best community collections!
Collections including paper arxiv:2307.01201
-
A technical note on bilinear layers for interpretability
Paper • 2305.03452 • Published • 1 -
Interpreting Transformer's Attention Dynamic Memory and Visualizing the Semantic Information Flow of GPT
Paper • 2305.13417 • Published • 1 -
Explainable AI for Pre-Trained Code Models: What Do They Learn? When They Do Not Work?
Paper • 2211.12821 • Published • 1 -
The Linear Representation Hypothesis and the Geometry of Large Language Models
Paper • 2311.03658 • Published • 1
-
Struc-Bench: Are Large Language Models Really Good at Generating Complex Structured Data?
Paper • 2309.08963 • Published • 9 -
DSG: An End-to-End Document Structure Generator
Paper • 2310.09118 • Published • 2 -
Integrating Graphs with Large Language Models: Methods and Prospects
Paper • 2310.05499 • Published • 1 -
Schema-learning and rebinding as mechanisms of in-context learning and emergence
Paper • 2307.01201 • Published • 2
-
Knowledge Solver: Teaching LLMs to Search for Domain Knowledge from Knowledge Graphs
Paper • 2309.03118 • Published • 2 -
Head-to-Tail: How Knowledgeable are Large Language Models (LLM)? A.K.A. Will LLMs Replace Knowledge Graphs?
Paper • 2308.10168 • Published • 2 -
MindMap: Knowledge Graph Prompting Sparks Graph of Thoughts in Large Language Models
Paper • 2308.09729 • Published • 5 -
Symbolic Knowledge Distillation: from General Language Models to Commonsense Models
Paper • 2110.07178 • Published • 1
-
Are Emergent Abilities in Large Language Models just In-Context Learning?
Paper • 2309.01809 • Published • 3 -
Commonsense Knowledge Transfer for Pre-trained Language Models
Paper • 2306.02388 • Published • 1 -
Finding Neurons in a Haystack: Case Studies with Sparse Probing
Paper • 2305.01610 • Published • 2 -
Schema-learning and rebinding as mechanisms of in-context learning and emergence
Paper • 2307.01201 • Published • 2
-
Dissecting In-Context Learning of Translations in GPTs
Paper • 2310.15987 • Published • 5 -
In-Context Learning Creates Task Vectors
Paper • 2310.15916 • Published • 41 -
ZeroGen: Efficient Zero-shot Learning via Dataset Generation
Paper • 2202.07922 • Published • 1 -
Promptor: A Conversational and Autonomous Prompt Generation Agent for Intelligent Text Entry Techniques
Paper • 2310.08101 • Published • 1