Instruction Pre-Training: Language Models are Supervised Multitask Learners Paper • 2406.14491 • Published Jun 20, 2024 • 90
Article Tutorial: Quantizing Llama 3+ Models for Efficient Deployment By theeseus-ai • Dec 15, 2024 • 4
Article Unleashing the Power of Unsloth and QLora:Redefining Language Model Fine-Tuning By Andyrasika • Jan 19, 2024 • 13
Article Selene 1 Mini: the best small language model-as-a-judge By AtlaAI and 10 others • 28 days ago • 12
Article Train 400x faster Static Embedding Models with Sentence Transformers Jan 15 • 149
GLiNER multi-task: Generalist Lightweight Model for Various Information Extraction Tasks Paper • 2406.12925 • Published Jun 14, 2024 • 24
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated 6 days ago • 240
HelpSteer2-Preference: Complementing Ratings with Preferences Paper • 2410.01257 • Published Oct 2, 2024 • 23
Article How to build a custom text classifier without days of human labeling By sdiazlor and 4 others • Oct 17, 2024 • 55
Article Taxonomy Completion with Embedding Quantization and an LLM-based Pipeline: A Case Study in Computational Linguistics By dcarpintero • Jul 22, 2024 • 5
Article Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval Mar 22, 2024 • 74
Article Llama 3.1 - 405B, 70B & 8B with multilinguality and long context Jul 23, 2024 • 228
Article Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth By mlabonne • Jul 29, 2024 • 285
Zeroshot Classifiers Collection These are my current best zeroshot classifiers. Some of my older models are downloaded more often, but the models in this collection are newer/better. • 12 items • Updated Jan 6 • 127
ReLiK: Retrieve, Read and LinK Collection A blazing fast and lightweight Information Extraction model for Entity Linking and Relation Extraction. • 20 items • Updated Dec 4, 2024 • 24
Nomic Embed: Training a Reproducible Long Context Text Embedder Paper • 2402.01613 • Published Feb 2, 2024 • 15