Shao qi

OmniTN

AI & ML interests

None yet

Organizations

None yet

OmniTN's activity

upvoted 8 papers 2 days ago

Temporal Preference Optimization for Long-Form Video Understanding

Paper • 2501.13919 • Published 7 days ago • 21

RealCritic: Towards Effectiveness-Driven Evaluation of Language Model Critiques

Paper • 2501.14492 • Published 6 days ago • 29

Parameters vs FLOPs: Scaling Laws for Optimal Sparsity for Mixture-of-Experts Language Models

Paper • 2501.12370 • Published 9 days ago • 8

iFormer: Integrating ConvNet and Transformer for Mobile Application

Paper • 2501.15369 • Published 5 days ago • 9

ARWKV: Pretrain is not what we need, an RNN-Attention-Based Language Model Born from Transformer

Paper • 2501.15570 • Published 4 days ago • 17

Emilia: A Large-Scale, Extensive, Multilingual, and Diverse Dataset for Speech Generation

Paper • 2501.15907 • Published 4 days ago • 14

Towards General-Purpose Model-Free Reinforcement Learning

Paper • 2501.16142 • Published 3 days ago • 19

Qwen2.5-1M Technical Report

Paper • 2501.15383 • Published 5 days ago • 40

liked 2 models 2 days ago

Qwen/Qwen2.5-VL-7B-Instruct

Image-Text-to-Text • Updated 3 days ago • 36.2k • 214

black-forest-labs/FLUX.1-dev

Text-to-Image • Updated Aug 16, 2024 • 1.54M • 8.36k

liked 2 datasets 2 days ago

openai/gsm8k

Viewer • Updated Jan 4, 2024 • 17.6k • 212k • 510

wikimedia/wikipedia

Viewer • Updated Jan 9, 2024 • 61.6M • 122k • 710