Sayantan Das's picture

Sayantan Das

ucalyptus

·

https://ucalyptus.me/

AI & ML interests

Generative Modeling

Recent Activity

liked a Space 1 day ago

HyperCluster/manimator

updated a Space 1 day ago

ucalyptus/ygk-crime

upvoted a paper 2 days ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

View all activity

Organizations

ucalyptus's activity

upvoted a paper 2 days ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published 4 days ago • 185

upvoted a collection 13 days ago

Sana

⚡️Sana: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer • 19 items • Updated 4 days ago • 71

upvoted a paper 18 days ago

LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis

Paper • 2412.15214 • Published 24 days ago • 15

upvoted a paper 20 days ago

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

Paper • 2412.05271 • Published Dec 6, 2024 • 124

upvoted a collection 20 days ago

InternVL2.5

Better than InternVL 2.0 • 18 items • Updated 2 days ago • 79

upvoted an article 27 days ago

Article

Bridging the Gap Between Physical Numerical Simulations and Machine Learning: Introducing The Well

By

•

Dec 2, 2024

• 17

upvoted an article 3 months ago

Article

Preference Optimization for Vision Language Models

Jul 10, 2024

• 55

upvoted 2 papers 3 months ago

Law of the Weakest Link: Cross Capabilities of Large Language Models

Paper • 2409.19951 • Published Sep 30, 2024 • 54

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19, 2024 • 136

upvoted an article 4 months ago

Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Sep 18, 2024

• 215

upvoted a paper 4 months ago

Let's Verify Step by Step

Paper • 2305.20050 • Published May 31, 2023 • 10

upvoted 2 papers 5 months ago

Text2SQL is Not Enough: Unifying AI and Databases with TAG

Paper • 2408.14717 • Published Aug 27, 2024 • 26

Diffusion Models Are Real-Time Game Engines

Paper • 2408.14837 • Published Aug 27, 2024 • 122

upvoted an article 5 months ago

Article

Welcome FalconMamba: The first strong attention-free 7B model

Aug 12, 2024

• 108

upvoted a paper 5 months ago

Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers

Paper • 2408.06195 • Published Aug 12, 2024 • 68

upvoted a collection 5 months ago

Gemma 2 2B Release

The 2.6B parameter version of Gemma 2. • 6 items • Updated 30 days ago • 78

upvoted an article 5 months ago

Article

Google releases Gemma 2 2B, ShieldGemma and Gemma Scope

Jul 31, 2024

• 59

upvoted an article 6 months ago

Article

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

Jul 23, 2024

• 226

upvoted a collection 6 months ago

xLAM models

xLAM: A Family of Large Action Models to Empower AI Agent Systems: https://github.com/SalesforceAIResearch/xLAM • 11 items • Updated 24 days ago • 45

upvoted an article 6 months ago

Article

SmolLM - blazingly fast and remarkably powerful

Jul 16, 2024

• 298