AI learning - a kalinkrustev Collection

Embers of Autoregression: Understanding Large Language Models Through the Problem They are Trained to Solve

Paper • 2309.13638 • Published Sep 24, 2023

Reasoning or Reciting? Exploring the Capabilities and Limitations of Language Models Through Counterfactual Tasks

Paper • 2307.02477 • Published Jul 5, 2023

openbmb/UltraFeedback

Viewer • Updated Dec 29, 2023 • 64k • 1.7k • 333

stingning/ultrachat

Viewer • Updated Feb 22 • 774k • 1.37k • 418

Textbooks Are All You Need II: phi-1.5 technical report

Paper • 2309.05463 • Published Sep 11, 2023 • 87

LLaMA: Open and Efficient Foundation Language Models

Paper • 2302.13971 • Published Feb 27, 2023 • 13

Llama 2: Open Foundation and Fine-Tuned Chat Models

Paper • 2307.09288 • Published Jul 18, 2023 • 242

Code Llama: Open Foundation Models for Code

Paper • 2308.12950 • Published Aug 24, 2023 • 22

DocPrompting: Generating Code by Retrieving the Docs

Paper • 2207.05987 • Published Jul 13, 2022 • 1

SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <0.5MB model size

Paper • 1602.07360 • Published Feb 24, 2016 • 1

Physics of Language Models: Part 3.2, Knowledge Manipulation

Paper • 2309.14402 • Published Sep 25, 2023 • 5

Wuerstchen: Efficient Pretraining of Text-to-Image Models

Paper • 2306.00637 • Published Jun 1, 2023 • 12

microsoft/phi-1_5

Text Generation • Updated Apr 29 • 136k • 1.31k

replit/replit-code-v1_5-3b

Text Generation • Updated Oct 20, 2023 • 15.2k • 286

ybelkada/segment-anything

Updated Dec 26, 2023 • 95

HuggingFaceH4/zephyr-7b-alpha

Text Generation • Updated 25 days ago • 34.4k • • 1.1k

Intel/whisper-large-v2-onnx-int4-inc

Automatic Speech Recognition • Updated Oct 16, 2023 • 34 • 26

fondant-ai/fondant-cc-25m

Viewer • Updated Nov 21, 2023 • 25.9M • 2.09k • 52

Mistral 7B

Paper • 2310.06825 • Published Oct 10, 2023 • 47

ostris/ikea-instructions-lora-sdxl

Text-to-Image • Updated Sep 29, 2023 • 2.16k • • 219

facebook/belebele

Viewer • Updated Aug 12 • 110k • 8.74k • 98

DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation

Paper • 2208.12242 • Published Aug 25, 2022 • 10

Adding Conditional Control to Text-to-Image Diffusion Models

Paper • 2302.05543 • Published Feb 10, 2023 • 40

MusicLM: Generating Music From Text

Paper • 2301.11325 • Published Jan 26, 2023 • 1

AudioLM: a Language Modeling Approach to Audio Generation

Paper • 2209.03143 • Published Sep 7, 2022

UniAudio: An Audio Foundation Model Toward Universal Audio Generation

Paper • 2310.00704 • Published Oct 1, 2023 • 19

TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models

Paper • 2109.10282 • Published Sep 21, 2021 • 6

LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report

Paper • 2405.00732 • Published Apr 29 • 118