Models
Datasets
Spaces
Posts
Docs
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2307.01201

AugmentedLearning

What Makes Good Data for Alignment? A Comprehensive Study of Automatic Data Selection in Instruction Tuning

Paper • 2312.15685 • Published Dec 25, 2023 • 17
mistralai/Mixtral-8x7B-Instruct-v0.1

Text Generation • Updated Aug 19 • 706k • • 4.19k
microsoft/phi-2

Text Generation • Updated Apr 29 • 235k • 3.24k
TinyLlama/TinyLlama-1.1B-Chat-v1.0

Text Generation • Updated Mar 17 • 1.25M • 1.09k

Interpretability

A technical note on bilinear layers for interpretability

Paper • 2305.03452 • Published May 5, 2023 • 1
Interpreting Transformer's Attention Dynamic Memory and Visualizing the Semantic Information Flow of GPT

Paper • 2305.13417 • Published May 22, 2023 • 1
Explainable AI for Pre-Trained Code Models: What Do They Learn? When They Do Not Work?

Paper • 2211.12821 • Published Nov 23, 2022 • 1
The Linear Representation Hypothesis and the Geometry of Large Language Models

Paper • 2311.03658 • Published Nov 7, 2023 • 1

Structured data

Struc-Bench: Are Large Language Models Really Good at Generating Complex Structured Data?

Paper • 2309.08963 • Published Sep 16, 2023 • 9
DSG: An End-to-End Document Structure Generator

Paper • 2310.09118 • Published Oct 13, 2023 • 2
Integrating Graphs with Large Language Models: Methods and Prospects

Paper • 2310.05499 • Published Oct 9, 2023 • 1
Schema-learning and rebinding as mechanisms of in-context learning and emergence

Paper • 2307.01201 • Published Jun 16, 2023 • 2

Knowledge graph

Knowledge Solver: Teaching LLMs to Search for Domain Knowledge from Knowledge Graphs

Paper • 2309.03118 • Published Sep 6, 2023 • 2
Head-to-Tail: How Knowledgeable are Large Language Models (LLM)? A.K.A. Will LLMs Replace Knowledge Graphs?

Paper • 2308.10168 • Published Aug 20, 2023 • 2
MindMap: Knowledge Graph Prompting Sparks Graph of Thoughts in Large Language Models

Paper • 2308.09729 • Published Aug 17, 2023 • 5
Symbolic Knowledge Distillation: from General Language Models to Commonsense Models

Paper • 2110.07178 • Published Oct 14, 2021 • 1

Are Emergent Abilities in Large Language Models just In-Context Learning?

Paper • 2309.01809 • Published Sep 4, 2023 • 3
Commonsense Knowledge Transfer for Pre-trained Language Models

Paper • 2306.02388 • Published Jun 4, 2023 • 1
Finding Neurons in a Haystack: Case Studies with Sparse Probing

Paper • 2305.01610 • Published May 2, 2023 • 2
Schema-learning and rebinding as mechanisms of in-context learning and emergence

Paper • 2307.01201 • Published Jun 16, 2023 • 2

Dissecting In-Context Learning of Translations in GPTs

Paper • 2310.15987 • Published Oct 24, 2023 • 5
In-Context Learning Creates Task Vectors

Paper • 2310.15916 • Published Oct 24, 2023 • 41
ZeroGen: Efficient Zero-shot Learning via Dataset Generation

Paper • 2202.07922 • Published Feb 16, 2022 • 1
Promptor: A Conversational and Autonomous Prompt Generation Agent for Intelligent Text Entry Techniques

Paper • 2310.08101 • Published Oct 12, 2023 • 1

Company

© Hugging Face

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs