Tom LUCAS

C0casio45

AI & ML interests

None yet

C0casio45's activity

upvoted 2 papers about 13 hours ago

Baichuan-Omni-1.5 Technical Report

Paper • 2501.15368 • Published 3 days ago • 35

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 6 days ago • 245

upvoted a paper 11 days ago

MMDocIR: Benchmarking Multi-Modal Retrieval for Long Documents

Paper • 2501.08828 • Published 13 days ago • 29

upvoted a paper 13 days ago

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published 14 days ago • 268

upvoted a paper 19 days ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published 20 days ago • 249

upvoted a paper 21 days ago

OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis

Paper • 2412.19723 • Published Dec 27, 2024 • 81

upvoted 2 papers 26 days ago

MLLM-as-a-Judge for Image Safety without Human Labeling

Paper • 2501.00192 • Published 29 days ago • 25

Explanatory Instructions: Towards Unified Vision Tasks Understanding and Zero-shot Generalization

Paper • 2412.18525 • Published Dec 24, 2024 • 72

upvoted 2 papers 28 days ago

Token-Budget-Aware LLM Reasoning

Paper • 2412.18547 • Published Dec 24, 2024 • 45

Self-Discover: Large Language Models Self-Compose Reasoning Structures

Paper • 2402.03620 • Published Feb 6, 2024 • 115

upvoted 3 papers about 1 month ago

upvoted 2 papers about 2 months ago

VisionZip: Longer is Better but Not Necessary in Vision Language Models

Paper • 2412.04467 • Published Dec 5, 2024 • 105

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

Paper • 2412.05271 • Published Dec 6, 2024 • 129

upvoted 2 papers 2 months ago

HtmlRAG: HTML is Better Than Plain Text for Modeling Retrieved Knowledge in RAG Systems

Paper • 2411.02959 • Published Nov 5, 2024 • 66

OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models

Paper • 2411.04905 • Published Nov 7, 2024 • 114

upvoted a paper 3 months ago

Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss

Paper • 2410.17243 • Published Oct 22, 2024 • 89

upvoted 2 papers 4 months ago

Differential Transformer

Paper • 2410.05258 • Published Oct 7, 2024 • 169

Pixtral 12B

Paper • 2410.07073 • Published Oct 9, 2024 • 64