4 50 24

Alexey G

grib0ed0v

AI & ML interests

LLM / RLHF / AI4Everything.

Recent Activity

upvoted an article about 6 hours ago

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

upvoted an article about 6 hours ago

Train 400x faster Static Embedding Models with Sentence Transformers

upvoted an article about 6 hours ago

Introducing smolagents: simple agents that write actions in code.

View all activity

Organizations

grib0ed0v's activity

upvoted 3 articles about 6 hours ago

Article

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

5 days ago

• 84

Article

Train 400x faster Static Embedding Models with Sentence Transformers

13 days ago

• 125

Article

Introducing smolagents: simple agents that write actions in code.

28 days ago

• 529

upvoted an article about 20 hours ago

Article

We now support VLMs in smolagents!

4 days ago

• 49

upvoted an article 4 days ago

Article

Finally, a Replacement for BERT: Introducing ModernBERT

Dec 19, 2024

• 510

upvoted 2 articles 5 days ago

Article

Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference

12 days ago

• 60

Article

Timm ❤️ Transformers: Use any timm model with transformers

12 days ago

• 35

upvoted a collection about 2 months ago

SigLIP

Collection

Contrastive (sigmoid) image-text models from https://arxiv.org/abs/2303.15343 • 10 items • Updated Dec 13, 2024 • 50

upvoted a collection 2 months ago

Cultura-Ru-Edu

Collection

Our dataset for enhancing LLM training with educational content in the Russian language. • 2 items • Updated Nov 26, 2024 • 5

upvoted 2 papers 2 months ago

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Paper • 1810.04805 • Published Oct 11, 2018 • 16

Attention Is All You Need

Paper • 1706.03762 • Published Jun 12, 2017 • 50

upvoted an article 2 months ago

Article

Let’s make a generation of amazing image generation models

•

Nov 26, 2024

• 34

upvoted 2 papers 2 months ago

SageAttention2 Technical Report: Accurate 4 Bit Attention for Plug-and-play Inference Acceleration

Paper • 2411.10958 • Published Nov 17, 2024 • 52

Large Language Models Can Self-Improve in Long-context Reasoning

Paper • 2411.08147 • Published Nov 12, 2024 • 63

upvoted 6 papers 3 months ago

RLEF: Grounding Code LLMs in Execution Feedback with Reinforcement Learning

Paper • 2410.02089 • Published Oct 2, 2024 • 12

OpenWebVoyager: Building Multimodal Web Agents via Iterative Real-World Exploration, Feedback and Optimization

Paper • 2410.19609 • Published Oct 25, 2024 • 17

MMAU: A Massive Multi-Task Audio Understanding and Reasoning Benchmark

Paper • 2410.19168 • Published Oct 24, 2024 • 19

LOGO -- Long cOntext aliGnment via efficient preference Optimization

Paper • 2410.18533 • Published Oct 24, 2024 • 42

MIA-DPO: Multi-Image Augmented Direct Preference Optimization For Large Vision-Language Models

Paper • 2410.17637 • Published Oct 23, 2024 • 34

Addition is All You Need for Energy-efficient Language Models

Paper • 2410.00907 • Published Oct 1, 2024 • 145