Ismael's picture

Ismael

IsmaelMousa

AI & ML interests

NLP

Recent Activity

updated a model 5 days ago
IsmaelMousa/modernbert-ner-conll2003
liked a model 9 days ago
Qwen/QwQ-32B-Preview
liked a model 9 days ago
answerdotai/ModernBERT-base
View all activity

Organizations

Open-Source AI Meetup's profile picture OpenOrca's profile picture Ai4Privacy's profile picture MLX Community's profile picture INNOVA AI's profile picture Narra's profile picture llmc's profile picture

IsmaelMousa's activity

New activity in IsmaelMousa/movies 4 months ago
New activity in IsmaelMousa/books 4 months ago
updated a Space 5 months ago
reacted to DmitryRyumin's post with ๐Ÿ”ฅ 7 months ago
view post
Post
3661
๐Ÿš€๐ŸŽญ๐ŸŒŸ New Research Alert - Portrait4D-v2 (Avatars Collection)! ๐ŸŒŸ๐ŸŽญ๐Ÿš€
๐Ÿ“„ Title: Portrait4D-v2: Pseudo Multi-View Data Creates Better 4D Head Synthesizer ๐Ÿ”

๐Ÿ“ Description: Portrait4D-v2 is a novel method for one-shot 4D head avatar synthesis using pseudo multi-view videos and a vision transformer backbone, achieving superior performance without relying on 3DMM reconstruction.

๐Ÿ‘ฅ Authors: Yu Deng, Duomin Wang, and Baoyuan Wang

๐Ÿ“„ Paper: Portrait4D-v2: Pseudo Multi-View Data Creates Better 4D Head Synthesizer (2403.13570)

๐ŸŒ GitHub Page: https://yudeng.github.io/Portrait4D-v2/
๐Ÿ“ Repository: https://github.com/YuDeng/Portrait-4D

๐Ÿ“บ Video: https://www.youtube.com/watch?v=5YJY6-wcOJo

๐Ÿš€ CVPR-2023-24-Papers: https://github.com/DmitryRyumin/CVPR-2023-24-Papers

๐Ÿ“š More Papers: more cutting-edge research presented at other conferences in the DmitryRyumin/NewEraAI-Papers curated by @DmitryRyumin

๐Ÿš€ Added to the Avatars Collection: DmitryRyumin/avatars-65df37cdf81fec13d4dbac36

๐Ÿ” Keywords: Portrait4D #4DAvatar #HeadSynthesis #3DModeling #TechInnovation #DeepLearning #ComputerGraphics #ComputerVision #Innovation
  • 1 reply
ยท
reacted to merve's post with ๐Ÿค— 7 months ago
view post
Post
6061
Fine-tune Florence-2 on any task ๐Ÿ”ฅ

Today we release a notebook and a walkthrough blog on fine-tuning Florence-2 on DocVQA dataset @andito @SkalskiP

Blog: https://huggingface.co/blog ๐Ÿ“•
Notebook: https://colab.research.google.com/drive/1hKDrJ5AH_o7I95PtZ9__VlCTNAo1Gjpf?usp=sharing ๐Ÿ“–
Florence-2 is a great vision-language model thanks to it's massive dataset and small size!

This model requires conditioning through task prefixes and it's not as generalist, requiring fine-tuning on a new task, such as DocVQA ๐Ÿ“

We have fine-tuned the model on A100 (and one can also use a smaller GPU with smaller batch size) and saw that model picks up new tasks ๐Ÿฅน

See below how it looks like before and after FT ๐Ÿคฉ
Play with the demo here andito/Florence-2-DocVQA ๐Ÿ„โ€โ™€๏ธ