Yandex

https://yandex.com/company/

yandexcom

yandex

Recent Activity

vorobyov01 new activity about 3 hours ago

yandex/YandexGPT-5-Lite-8B-pretrain: Will there be a chat_template?

vorobyov01 updated a model about 6 hours ago

yandex/YandexGPT-5-Lite-8B-pretrain

vorobyov01 new activity about 6 hours ago

yandex/YandexGPT-5-Lite-8B-pretrain: Instruct model?

yandex's activity

vorobyov01

in yandex/YandexGPT-5-Lite-8B-pretrain about 3 hours ago

Will there be a chat_template?

#7 opened about 4 hours ago by

vorobyov01

updated a model about 6 hours ago

yandex/YandexGPT-5-Lite-8B-pretrain

Updated about 6 hours ago • 22 • 89

vorobyov01

in yandex/YandexGPT-5-Lite-8B-pretrain about 6 hours ago

Instruct model?

#3 opened about 13 hours ago by

Thank you very much. Will there be a follow-up?

#5 opened about 10 hours ago by

tokenizer error

#4 opened about 12 hours ago by

kukutz

in yandex/YandexGPT-5-Lite-8B-pretrain about 13 hours ago

Where is the GGUF?

#2 opened about 13 hours ago by

vorobyov01

published a model about 16 hours ago

yandex/YandexGPT-5-Lite-8B-pretrain

Updated about 6 hours ago • 22 • 89

vorobyov01

in yandex/YandexGPT-5-Lite-8B-pretrain 1 day ago

Fine-tuning the model with torchtune

#1 opened 1 day ago by

mryab

authored a paper about 1 month ago

Towards Best Practices for Open Datasets for LLM Training

Paper • 2501.08365 • Published Jan 14 • 55

mryab

authored a paper 3 months ago

RedPajama: an Open Dataset for Training Large Language Models

Paper • 2411.12372 • Published Nov 19, 2024 • 51

puhsu

authored a paper 8 months ago

TabReD: A Benchmark of Tabular Machine Learning in-the-Wild

Paper • 2406.19380 • Published Jun 27, 2024 • 47

mryab

authored a paper 8 months ago

Distributed Methods with Compressed Communication for Solving Variational Inequalities, with Theoretical Guarantees

Paper • 2110.03313 • Published Oct 7, 2021 • 1

mryab

authored 5 papers 10 months ago

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Paper • 2211.05100 • Published Nov 9, 2022 • 29

RuCoLA: Russian Corpus of Linguistic Acceptability

Paper • 2210.12814 • Published Oct 23, 2022 • 1

Petals: Collaborative Inference and Fine-tuning of Large Models

Paper • 2209.01188 • Published Sep 2, 2022 • 1

Sequoia: Scalable, Robust, and Hardware-aware Speculative Decoding

Paper • 2402.12374 • Published Feb 19, 2024 • 3

The Hallucinations Leaderboard -- An Open Effort to Measure Hallucinations in Large Language Models

Paper • 2404.05904 • Published Apr 8, 2024 • 8

mryab

authored 2 papers about 1 year ago

Mind Your Format: Towards Consistent Evaluation of In-Context Learning Improvements

Paper • 2401.06766 • Published Jan 12, 2024 • 2

Distributed Inference and Fine-tuning of Large Language Models Over The Internet

Paper • 2312.08361 • Published Dec 13, 2023 • 28

mryab

authored a paper over 1 year ago

Training Transformers Together

Paper • 2207.03481 • Published Jul 7, 2022 • 5