NanoBEIR 🍺 • Collection • A collection of smaller versions of BEIR datasets with 50 queries and up to 10K documents each. • 13 items • Updated Sep 11, 2024 • 12
KV Caching Explained: Optimizing Transformer Inference Efficiency • Article • By not-lain • 5 days ago • 22
Towards General Text Embeddings with Multi-stage Contrastive Learning • Paper • 2308.03281 • Published Aug 7, 2023 • 2
Jasper and Stella: distillation of SOTA embedding models • Paper • 2412.19048 • Published Dec 26, 2024 • 1
Facilitating Large Language Model Russian Adaptation with Learned Embedding Propagation • Paper • 2412.21140 • Published Dec 30, 2024 • 16
Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks • Paper • 1908.10084 • Published Aug 27, 2019 • 5
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference • Paper • 2412.13663 • Published Dec 18, 2024 • 125
Tulu 3 Models • Collection • All models released with Tulu 3: state-of-the-art open post-training recipes. • 10 items • Updated 5 days ago • 84
On the Power of Decision Trees in Auto-Regressive Language Modeling • Paper • 2409.19150 • Published Sep 27, 2024 • 4
AutoTrain: No-code Training for State-of-the-Art Models • Paper • 2410.15735 • Published Oct 21, 2024 • 59
PingPong: A Benchmark for Role-Playing Language Models with User Emulation and Multi-Model Evaluation • Paper • 2409.06820 • Published Sep 10, 2024 • 64
Building and Better Understanding Vision-Language Models: Insights and Future Directions • Paper • 2408.12637 • Published Aug 22, 2024 • 124
LLM Pruning and Distillation in Practice: The Minitron Approach • Paper • 2408.11796 • Published Aug 21, 2024 • 58
To Code, or Not To Code? Exploring Impact of Code in Pre-training • Paper • 2408.10914 • Published Aug 20, 2024 • 42
The Russian-focused embedders' exploration: ruMTEB benchmark and Russian embedding model design • Paper • 2408.12503 • Published Aug 22, 2024 • 24