3 403 313

r PRO

oceansweep

AI & ML interests

None yet

Recent Activity

liked a model about 1 hour ago

deepseek-ai/DeepSeek-V3-Base

upvoted a paper about 12 hours ago

SKETCH: Structured Knowledge Enhanced Text Comprehension for Holistic Retrieval

liked a model about 23 hours ago

Qwen/QVQ-72B-Preview

View all activity

Organizations

None yet

oceansweep's activity

liked a model about 1 hour ago

deepseek-ai/DeepSeek-V3-Base

Updated about 2 hours ago • 77

upvoted a paper about 12 hours ago

SKETCH: Structured Knowledge Enhanced Text Comprehension for Holistic Retrieval

Paper • 2412.15443 • Published 6 days ago • 4

liked a model about 23 hours ago

Qwen/QVQ-72B-Preview

Image-Text-to-Text • Updated about 17 hours ago • 1.07k • 223

liked a model about 24 hours ago

allenai/Molmo-7B-D-0924

Image-Text-to-Text • Updated Oct 10 • 240k • 477

upvoted 2 papers 1 day ago

PC Agent: While You Sleep, AI Works -- A Cognitive Journey into Digital World

Paper • 2412.17589 • Published 2 days ago • 8

Agent-SafetyBench: Evaluating the Safety of LLM Agents

Paper • 2412.14470 • Published 7 days ago • 7

upvoted a collection 1 day ago

InternVL2.5-MPO

Collection

Enhancing the Reasoning Ability of MLLMs via Mixed Preference Optimization • 16 items • Updated 1 day ago • 21

liked a model 2 days ago

ymcki/Llama-3_1-Nemotron-51B-Instruct-GGUF

Text Generation • Updated 2 days ago • 2.18k • 11

upvoted 2 papers 5 days ago

DateLogicQA: Benchmarking Temporal Biases in Large Language Models

Paper • 2412.13377 • Published 8 days ago • 2

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published 7 days ago • 103

upvoted 3 papers 6 days ago

How to Synthesize Text Data without Model Collapse?

Paper • 2412.14689 • Published 6 days ago • 45

LongBench v2: Towards Deeper Understanding and Reasoning on Realistic Long-context Multitasks

Paper • 2412.15204 • Published 6 days ago • 30

Qwen2.5 Technical Report

Paper • 2412.15115 • Published 6 days ago • 327

upvoted 5 papers 7 days ago

Compressed Chain of Thought: Efficient Reasoning Through Dense Representations

Paper • 2412.13171 • Published 8 days ago • 30

Multi-Dimensional Insights: Benchmarking Real-World Personalization in Large Multimodal Models

Paper • 2412.12606 • Published 8 days ago • 41

OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain

Paper • 2412.13018 • Published 8 days ago • 40

Are Your LLMs Capable of Stable Reasoning?

Paper • 2412.13147 • Published 8 days ago • 87

TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks

Paper • 2412.14161 • Published 7 days ago • 43

liked 2 models 7 days ago

PleIAs/Pleias-Nano

Updated 20 days ago • 489 • 29

GoodiesHere/Apollo-LMMs-Apollo-3B-t32

Text Generation • Updated 7 days ago • 124 • 15