5 33 7

Misaki Wang

MisakiWang

AI & ML interests

None yet

Recent Activity

liked a Space about 19 hours ago

yanze/PuLID-FLUX

upvoted a paper 11 days ago

MiniMax-01: Scaling Foundation Models with Lightning Attention

upvoted a paper 17 days ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

View all activity

Organizations

None yet

MisakiWang's activity

upvoted a paper 11 days ago

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published 12 days ago • 268

upvoted a paper 17 days ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published 18 days ago • 248

upvoted a paper about 2 months ago

Perception Tokens Enhance Visual Reasoning in Multimodal Language Models

Paper • 2412.03548 • Published Dec 4, 2024 • 17

upvoted a paper 3 months ago

ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery

Paper • 2410.05080 • Published Oct 7, 2024 • 21

upvoted a paper 4 months ago

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19, 2024 • 136

upvoted 8 papers 5 months ago

Ferret: Faster and Effective Automated Red Teaming with Reward-Based Scoring Technique

Paper • 2408.10701 • Published Aug 20, 2024 • 12

Out-of-Distribution Detection with Attention Head Masking for Multimodal Document Classification

Paper • 2408.11237 • Published Aug 20, 2024 • 6

TrackGo: A Flexible and Efficient Method for Controllable Video Generation

Paper • 2408.11475 • Published Aug 21, 2024 • 18

Evidence-backed Fact Checking using RAG and Few-Shot In-Context Learning with LLMs

Paper • 2408.12060 • Published Aug 22, 2024 • 5

Foundation Models for Music: A Survey

Paper • 2408.14340 • Published Aug 26, 2024 • 44

upvoted a paper 6 months ago

Text-Driven Neural Collaborative Filtering Model for Paper Source Tracing

Paper • 2407.17722 • Published Jul 25, 2024 • 8

upvoted a collection 6 months ago

"Physics of Language Models" series

Collection

6 items • Updated Aug 30, 2024 • 40

upvoted 2 papers 8 months ago

Offline Regularised Reinforcement Learning for Large Language Models Alignment

Paper • 2405.19107 • Published May 29, 2024 • 14

LLMs achieve adult human performance on higher-order theory of mind tasks

Paper • 2405.18870 • Published May 29, 2024 • 17

upvoted 3 papers 10 months ago

Mora: Enabling Generalist Video Generation via A Multi-Agent Framework

Paper • 2403.13248 • Published Mar 20, 2024 • 78

VideoAgent: A Memory-augmented Multimodal Agent for Video Understanding

Paper • 2403.11481 • Published Mar 18, 2024 • 13

DiPaCo: Distributed Path Composition

Paper • 2403.10616 • Published Mar 15, 2024 • 13