1 308 458

Habibullah Akbar

ChavyvAkvar

https://chavyv.vercel.app

AI & ML interests

AGI, Ethical-Driven AI, Open-source AI

Recent Activity

liked a model about 19 hours ago

FuseAI/FuseO1-DeepSeekR1-QwQ-SkyT1-32B-Preview

liked a model about 20 hours ago

HuggingFaceTB/SmolVLM-256M-Instruct

liked a model 1 day ago

takara-ai/SwarmFormer-Sentiment-Small

View all activity

Organizations

ChavyvAkvar's activity

upvoted a paper 2 days ago

AutoMathText: Autonomous Data Selection with Language Models for Mathematical Texts

Paper • 2402.07625 • Published Feb 12, 2024 • 14

upvoted 2 papers 3 days ago

Kimi k1.5: Scaling Reinforcement Learning with LLMs

Paper • 2501.12599 • Published 4 days ago • 55

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 4 days ago • 186

upvoted 5 papers 4 days ago

VideoWorld: Exploring Knowledge Learning from Unlabeled Videos

Paper • 2501.09781 • Published 10 days ago • 21

Go-with-the-Flow: Motion-Controllable Video Diffusion Models Using Real-Time Warped Noise

Paper • 2501.08331 • Published 12 days ago • 17

Reasoning Language Models: A Blueprint

Paper • 2501.11223 • Published 6 days ago • 22

Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training

Paper • 2501.11425 • Published 6 days ago • 74

Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models

Paper • 2501.11873 • Published 5 days ago • 59

upvoted a paper 6 days ago

Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published 9 days ago • 97

upvoted a paper 7 days ago

URSA: Understanding and Verifying Chain-of-thought Reasoning in Multimodal Mathematics

Paper • 2501.04686 • Published 18 days ago • 50

upvoted a paper 8 days ago

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published Dec 18, 2024 • 125

upvoted a paper 10 days ago

Towards Best Practices for Open Datasets for LLM Training

Paper • 2501.08365 • Published 12 days ago • 47

upvoted a paper 11 days ago

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published 12 days ago • 268

upvoted 4 papers 12 days ago

upvoted 2 papers 14 days ago

AdaLomo: Low-memory Optimization with Adaptive Learning Rate

Paper • 2310.10195 • Published Oct 16, 2023 • 2

Full Parameter Fine-tuning for Large Language Models with Limited Resources

Paper • 2306.09782 • Published Jun 16, 2023 • 30

upvoted a paper 15 days ago

Entropy-Guided Attention for Private LLMs

Paper • 2501.03489 • Published 19 days ago • 14