43 464 597

Sugato Ray PRO

sugatoray

https://linkedin.com/in/sugatoray

AI & ML interests

None yet

Recent Activity

updated a collection about 2 hours ago

Papers

upvoted a paper about 2 hours ago

Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment

updated a collection about 16 hours ago

LLM Tools

View all activity

Organizations

sugatoray's activity

upvoted a paper about 2 hours ago

Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment

Paper • 2502.16894 • Published 1 day ago • 16

upvoted 3 papers 1 day ago

upvoted a paper 2 days ago

S*: Test Time Scaling for Code Generation

Paper • 2502.14382 • Published 6 days ago • 52

upvoted 3 papers 3 days ago

S^2R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning

Paper • 2502.12853 • Published 7 days ago • 23

Matrix Calculus (for Machine Learning and Beyond)

Paper • 2501.14787 • Published Jan 7 • 1

Multilingual Machine Translation with Open Large Language Models at Practical Scale: An Empirical Study

Paper • 2502.02481 • Published 21 days ago • 8

upvoted a collection 3 days ago

GemmaX2

Collection

GemmaX2 language models, including pretrained and instruction-tuned models of 2 sizes, including 2B, 9B. • 7 items • Updated 19 days ago • 18

upvoted a paper 4 days ago

Revisiting the Test-Time Scaling of o1-like Models: Do they Truly Possess Test-Time Scaling Capabilities?

Paper • 2502.12215 • Published 9 days ago • 15

upvoted an article 4 days ago

Article

SmolVLM2: Bringing Video Understanding to Every Device

6 days ago

• 164

upvoted a paper 5 days ago

Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model

Paper • 2502.10248 • Published 11 days ago • 50

upvoted an article 5 days ago

Article

Open Preference Dataset for Text-to-Image Generation by the 🤗 Community

Dec 9, 2024

• 54

upvoted a paper 5 days ago

Soundwave: Less is More for Speech-Text Alignment in LLMs

Paper • 2502.12900 • Published 7 days ago • 75

upvoted an article 5 days ago

Article

Introducing Three New Serverless Inference Providers: Hyperbolic, Nebius AI Studio, and Novita 🔥

8 days ago

• 89

upvoted an article 6 days ago

Article

PaliGemma 2 Mix - New Instruction Vision Language Models by Google

7 days ago

• 59

upvoted a collection 6 days ago

PaliGemma 2 Mix

Collection

13 items • Updated 6 days ago • 59

upvoted 2 papers 6 days ago

Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

Paper • 2502.06703 • Published 15 days ago • 136

One Example Shown, Many Concepts Known! Counterexample-Driven Conceptual Reasoning in Mathematical LLMs

Paper • 2502.10454 • Published 14 days ago • 7

upvoted a paper 7 days ago

HermesFlow: Seamlessly Closing the Gap in Multimodal Understanding and Generation

Paper • 2502.12148 • Published 8 days ago • 16