Tim Wu

changtimwu

AI & ML interests

DL, IoT, DevOps

changtimwu's activity

upvoted a paper 10 days ago

LLaVA-o1: Let Vision Language Models Reason Step-by-Step

Paper • 2411.10440 • Published Nov 15, 2024 • 114

liked a model 10 days ago

QuantFactory/Llama-3.2-Taiwan-Legal-3B-Instruct-GGUF

Text Generation • Updated Nov 2, 2024 • 1.57k • 10

upvoted a paper 21 days ago

Search-o1: Agentic Search-Enhanced Large Reasoning Models

Paper • 2501.05366 • Published Jan 9 • 95

upvoted a paper 23 days ago

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published 28 days ago • 107

liked a Space 6 months ago

Llama3.1 S V0.2 Checkpoint 2024 08 20

Space • Convert text to audio and vice versa • 113

liked a model 7 months ago

shenzhi-wang/Llama3.1-8B-Chinese-Chat

Text Generation • Updated Jul 29, 2024 • 9.32k • 261

liked a model 8 months ago

openbmb/MiniCPM-Llama3-V-2_5-gguf

Updated 26 days ago • 9.18k • 211

liked a Space 9 months ago

Microsoft Phi-3-Vision-128k

Space • Generate image descriptions • 211

liked a model 10 months ago

google/paligemma-3b-pt-224

Image-Text-to-Text • Updated Sep 21, 2024 • 44.7k • 301

updated a model 10 months ago

changtimwu/speaker-segmentation-fine-tuned-callhome-jpn

Updated May 2, 2024 • 53

liked 2 models 10 months ago

crusoeai/Llama-3-8B-Instruct-262k-GGUF

Updated May 5, 2024 • 1.12k • 48

bullerwins/gradientai_Llama-3-8B-Instruct-262k_exl2_8.0bpw

Text Generation • Updated Apr 26, 2024 • 9 • 3

upvoted an article 10 months ago

Fine-tune Llama 3 with ORPO

Article • Apr 22, 2024 • 233

upvoted a paper 10 months ago

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Paper • 2404.14219 • Published Apr 22, 2024 • 256

New activity in Lin-Chen/ShareGPT4V 11 months ago

I wonder how the data was collected

#11 opened 11 months ago by changtimwu

upvoted a paper 11 months ago

DocLLM: A layout-aware generative language model for multimodal document understanding

Paper • 2401.00908 • Published Dec 31, 2023 • 180

liked a Space about 1 year ago

Depth Anything

Space • Generate depth map from image • 510

upvoted a paper about 1 year ago

Gaussian Head Avatar: Ultra High-fidelity Head Avatar via Dynamic Gaussians

Paper • 2312.03029 • Published Dec 5, 2023 • 25

New activity in distil-whisper/distil-large-v2 about 1 year ago

Are models for all languages going to be accessible?

#19 opened about 1 year ago by changtimwu

upvoted a paper about 1 year ago

Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo Labelling

Paper • 2311.00430 • Published Nov 1, 2023 • 58