Xi's picture

Xi

xi0v

·

AI & ML interests

Reinforcement learning, Diffusion Model Merging, LLM Merging, Model Editing and Vision/Multimodal Model Fine-tuning.

Recent Activity

upvoted a paper about 6 hours ago

Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment

liked a model about 9 hours ago

John6666/noobai-xl-hd-eps10version-sdxl

upvoted a paper about 9 hours ago

LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers

View all activity

Organizations

xi0v's activity

upvoted a paper about 6 hours ago

Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment

Paper • 2502.16894 • Published 1 day ago • 16

upvoted a paper about 9 hours ago

LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers

Paper • 2502.15007 • Published 5 days ago • 133

upvoted 2 papers 1 day ago

MoM: Linear Sequence Modeling with Mixture-of-Memories

Paper • 2502.13685 • Published 6 days ago • 31

How Much Knowledge Can You Pack into a LoRA Adapter without Harming LLM?

Paper • 2502.14502 • Published 5 days ago • 74

upvoted 3 papers 3 days ago

S*: Test Time Scaling for Code Generation

Paper • 2502.14382 • Published 6 days ago • 52

LongWriter-V: Enabling Ultra-Long and High-Fidelity Generation in Vision-Language Models

Paper • 2502.14834 • Published 5 days ago • 23

S^2R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning

Paper • 2502.12853 • Published 7 days ago • 23

upvoted a paper 4 days ago

Thinking Preference Optimization

Paper • 2502.13173 • Published 8 days ago • 15

upvoted a paper 5 days ago

Skrr: Skip and Re-use Text Encoder Layers for Memory Efficient Text-to-Image Generation

Paper • 2502.08690 • Published 13 days ago • 39

upvoted an article 5 days ago

Article

PaliGemma 2 Mix - New Instruction Vision Language Models by Google

7 days ago

• 59

upvoted 2 papers 7 days ago

One Example Shown, Many Concepts Known! Counterexample-Driven Conceptual Reasoning in Mathematical LLMs

Paper • 2502.10454 • Published 14 days ago • 7

I Think, Therefore I Diffuse: Enabling Multimodal In-Context Reasoning in Diffusion Models

Paper • 2502.10458 • Published 14 days ago • 27

upvoted a paper 12 days ago

Hephaestus: Improving Fundamental Agent Capabilities of Large Language Models through Continual Pre-Training

Paper • 2502.06589 • Published 15 days ago • 17

upvoted 4 papers 14 days ago

Training Language Models for Social Deduction with Multi-Agent Reinforcement Learning

Paper • 2502.06060 • Published 16 days ago • 32

Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

Paper • 2502.06703 • Published 15 days ago • 136

CMoE: Fast Carving of Mixture-of-Experts for Efficient LLM Inference

Paper • 2502.04416 • Published 19 days ago • 11

CodeSteer: Symbolic-Augmented Language Models via Code/Text Guidance

Paper • 2502.04350 • Published 21 days ago • 11

upvoted an article 19 days ago

Article

π0 and π0-FAST: Vision-Language-Action Models for General Robot Control

22 days ago

• 107

upvoted an article 21 days ago

Article

Open-source DeepResearch – Freeing our search agents

22 days ago

• 1.1k

upvoted a collection 21 days ago

Tulu 3 Models

All models released with Tulu 3 -- state of the art open post-training recipes. • 11 items • Updated 13 days ago • 91