zhanghang

hangzhang-nlp

hangzhang-nlp's activity

upvoted 2 papers 9 days ago

VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM

Paper • 2501.00599 • Published 12 days ago • 40

2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining

Paper • 2501.00958 • Published 11 days ago • 91

upvoted a paper 23 days ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published 24 days ago • 339

liked a dataset 3 months ago

BAAI/Infinity-Instruct

Viewer • Updated 5 days ago • 20.4M • 4.55k • 581

upvoted 2 papers 3 months ago

Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss

Paper • 2410.17243 • Published Oct 22, 2024 • 89

The Curse of Multi-Modalities: Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio

Paper • 2410.12787 • Published Oct 16, 2024 • 31

liked a Space 5 months ago

Running on CPU Upgrade

Open LLM Leaderboard

Track, rank and evaluate open LLMs and chatbots

upvoted a paper 6 months ago

SeaLLMs 3: Open Foundation and Chat Multilingual Large Language Models for Southeast Asian Languages

Paper • 2407.19672 • Published Jul 29, 2024 • 56

liked a model 6 months ago

openvla/openvla-7b

Image-Text-to-Text • Updated Sep 16, 2024 • 38.3k • 88

liked a Space 6 months ago

Running on Zero

VideoLLaMA2

Media understanding

reacted to stas's post with 👍 8 months ago

Post

If you're trying to run MoE Mixtral-8x7b under DeepSpeed w/ HF Transformers it's likely to hang on the first forward.

The solution is here https://github.com/microsoft/DeepSpeed/pull/4966#issuecomment-1989671378

and you need deepspeed>=0.13.0

Thanks to Masahiro Tanaka for the fix.
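
The version requirement in the post can be checked before launching a job. The following is a minimal sketch, not part of the original post: the helper names (`release_tuple`, `meets_minimum`, `installed_deepspeed_version`) are hypothetical, and the comparison looks only at the numeric release segment, so pre-release and local suffixes such as `rc1` or `+cu118` are ignored. It confirms only the `deepspeed>=0.13.0` condition and does not apply the fix described in the linked comment.

```python
import re
from importlib.metadata import PackageNotFoundError, version

# Minimum DeepSpeed release named in the post.
MIN_DEEPSPEED = "0.13.0"


def release_tuple(version_string):
    """Return the leading numeric release segment as a tuple of ints.

    Assumption: anything after the dotted numbers (rc tags, local build
    labels) is ignored, which is a simplification of full version ordering.
    """
    match = re.match(r"\d+(?:\.\d+)*", version_string)
    if match is None:
        raise ValueError(f"no numeric release segment in {version_string!r}")
    return tuple(int(part) for part in match.group().split("."))


def meets_minimum(installed, minimum=MIN_DEEPSPEED):
    """True if the installed release is at least the minimum release."""
    have, need = release_tuple(installed), release_tuple(minimum)
    # Pad the shorter tuple with zeros so "0.13" compares equal to "0.13.0".
    width = max(len(have), len(need))
    have += (0,) * (width - len(have))
    need += (0,) * (width - len(need))
    return have >= need


def installed_deepspeed_version():
    """Return the installed deepspeed version string, or None if absent."""
    try:
        return version("deepspeed")
    except PackageNotFoundError:
        return None


if __name__ == "__main__":
    found = installed_deepspeed_version()
    if found is None:
        print("deepspeed is not installed")
    elif meets_minimum(found):
        print(f"deepspeed {found} satisfies >= {MIN_DEEPSPEED}")
    else:
        print(f"deepspeed {found} is older than {MIN_DEEPSPEED}")
```

Reading the version through `importlib.metadata` avoids importing `deepspeed` itself, so the check stays cheap and can run on a machine without a GPU.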

New activity in HuggingFaceM4/the_cauldron 9 months ago

Where is the GSD dataset?

#6 opened 9 months ago by

upvoted an article 9 months ago

Article

Introducing Idefics2: A Powerful 8B Vision-Language Model for the community

Apr 15, 2024

• 171

upvoted a collection 9 months ago

WizardLM

0 items • Updated 4 days ago • 103

upvoted a paper 9 months ago

Audio Dialogues: Dialogues dataset for audio and music understanding

Paper • 2404.07616 • Published Apr 11, 2024 • 15

liked a dataset 10 months ago

MMInstruction/M3IT

Updated Nov 24, 2023 • 3.83k • 123

authored a paper about 1 year ago

SeaLLMs -- Large Language Models for Southeast Asia

Paper • 2312.00738 • Published Dec 1, 2023 • 23

liked a model about 1 year ago

SeaLLMs/SeaLLM-13B-Chat

Updated Feb 2, 2024 • 60

updated a Space over 1 year ago

Running on A10G

Video LLaMA

upvoted a paper over 1 year ago

Contrastive Decoding Improves Reasoning in Large Language Models

Paper • 2309.09117 • Published Sep 17, 2023 • 37