GLiREL -- Generalist Model for Zero-Shot Relation Extraction Paper • 2501.03172 • Published 6 days ago • 1
Trans-Tokenization and Cross-lingual Vocabulary Transfers: Language Adaptation of LLMs for Low-Resource NLP Paper • 2408.04303 • Published Aug 8, 2024 • 14
Granite 3.1 Language Models Collection A series of language models with 128K context length, trained by IBM and licensed under Apache 2.0. • 8 items • Updated 25 days ago • 48
Article BM25 for Python: Achieving high performance while simplifying dependencies with *BM25S*⚡ By xhluca • Jul 9, 2024 • 42
Positions Datasets Collection Datasets where each row is a chess position • 4 items • Updated 3 days ago • 6
Common Models Collection The first generation of models pretrained on Common Corpus. • 5 items • Updated Dec 5, 2024 • 28
Tucano Collection Tucano is a series of decoder-transformers based on the Llama 2 architecture, natively pre-trained in Portuguese. • 17 items • Updated Nov 13, 2024 • 1
Addition is All You Need for Energy-efficient Language Models Paper • 2410.00907 • Published Oct 1, 2024 • 145
LLM2Encoder Collection Collection of initial models and models that use decoders converted to encoders as backbones • 11 items • Updated Sep 10, 2024 • 6
GLiNER bi-encoders Collection Bi-encoder and poly-encoder architectures of GLiNER • 5 items • Updated Sep 10, 2024 • 13
Llama 3.1 GPTQ, AWQ, and BNB Quants Collection Optimised Quants for high-throughput deployments! Compatible with Transformers, TGI & VLLM 🤗 • 9 items • Updated Sep 26, 2024 • 56