3 5 7

Ismael

IsmaelMousa

AI & ML interests

NLP

Recent Activity

updated a model 5 days ago

IsmaelMousa/modernbert-ner-conll2003

liked a model 9 days ago

Qwen/QwQ-32B-Preview

liked a model 9 days ago

answerdotai/ModernBERT-base

View all activity

Organizations

IsmaelMousa's activity

updated a model 5 days ago

IsmaelMousa/modernbert-ner-conll2003

Token Classification • Updated 5 days ago • 11

liked 2 models 9 days ago

Qwen/QwQ-32B-Preview

Text Generation • Updated about 10 hours ago • 134k • 1.53k

answerdotai/ModernBERT-base

Fill-Mask • Updated 1 day ago • 3.57M • 649

liked a model 10 days ago

deepseek-ai/DeepSeek-V3

Updated 13 days ago • 112k • 1.69k

liked a model about 1 month ago

Qwen/Qwen2.5-32B

Text Generation • Updated Sep 20, 2024 • 23k • 60

liked a model about 2 months ago

FacebookAI/roberta-base

Fill-Mask • Updated Feb 19, 2024 • 18.4M • 460

updated a model 3 months ago

IsmaelMousa/arab-bart-base-174M

Summarization • Updated Oct 3, 2024 • 15

updated a model 4 months ago

IsmaelMousa/arabic-bpe-tokenizer

Summarization • Updated Sep 14, 2024 • 1

New activity in IsmaelMousa/movies 4 months ago

[bot] Conversion to Parquet

#1 opened 4 months ago by

parquet-converter

updated a dataset 4 months ago

IsmaelMousa/movies

Viewer • Updated Sep 3, 2024 • 1.17k • 43

New activity in IsmaelMousa/books 4 months ago

[bot] Conversion to Parquet

#1 opened 5 months ago by

parquet-converter

updated a Space 5 months ago

Sleeping

🚀

GPT2 L Docker

updated 2 datasets 5 months ago

IsmaelMousa/libri-in-italiano

Viewer • Updated Aug 18, 2024 • 18 • 36

IsmaelMousa/books

Viewer • Updated Aug 11, 2024 • 49 • 67

reacted to DmitryRyumin's post with 🔥 7 months ago

Post

3661

🚀🎭🌟 New Research Alert - Portrait4D-v2 (Avatars Collection)! 🌟🎭🚀
📄 Title: Portrait4D-v2: Pseudo Multi-View Data Creates Better 4D Head Synthesizer 🔝

📝 Description: Portrait4D-v2 is a novel method for one-shot 4D head avatar synthesis using pseudo multi-view videos and a vision transformer backbone, achieving superior performance without relying on 3DMM reconstruction.

👥 Authors: Yu Deng, Duomin Wang, and Baoyuan Wang

📄 Paper: Portrait4D-v2: Pseudo Multi-View Data Creates Better 4D Head Synthesizer (2403.13570)

🌐 GitHub Page: https://yudeng.github.io/Portrait4D-v2/
📁 Repository: https://github.com/YuDeng/Portrait-4D

📺 Video: https://www.youtube.com/watch?v=5YJY6-wcOJo

🚀 CVPR-2023-24-Papers: https://github.com/DmitryRyumin/CVPR-2023-24-Papers

📚 More Papers: more cutting-edge research presented at other conferences in the DmitryRyumin/NewEraAI-Papers curated by @DmitryRyumin

🚀 Added to the Avatars Collection: DmitryRyumin/avatars-65df37cdf81fec13d4dbac36

🔍 Keywords: Portrait4D #4DAvatar #HeadSynthesis #3DModeling #TechInnovation #DeepLearning #ComputerGraphics #ComputerVision #Innovation

1 reply

commented a paper 7 months ago

Simulating Classroom Education with LLM-Empowered Agents

Paper • 2406.19226 • Published Jun 27, 2024 • 30 •

upvoted a paper 7 months ago

Simulating Classroom Education with LLM-Empowered Agents

Paper • 2406.19226 • Published Jun 27, 2024 • 30

reacted to merve's post with 🤗 7 months ago

Post

6061

Fine-tune Florence-2 on any task 🔥

Today we release a notebook and a walkthrough blog on fine-tuning Florence-2 on DocVQA dataset @andito @SkalskiP

Blog: https://huggingface.co/blog 📕
Notebook: https://colab.research.google.com/drive/1hKDrJ5AH_o7I95PtZ9__VlCTNAo1Gjpf?usp=sharing 📖
Florence-2 is a great vision-language model thanks to it's massive dataset and small size!

This model requires conditioning through task prefixes and it's not as generalist, requiring fine-tuning on a new task, such as DocVQA 📝

We have fine-tuned the model on A100 (and one can also use a smaller GPU with smaller batch size) and saw that model picks up new tasks 🥹

See below how it looks like before and after FT 🤩
Play with the demo here andito/Florence-2-DocVQA 🏄‍♀️

liked a model 7 months ago

nvidia/Nemotron-4-340B-Base

Updated Jun 28, 2024 • 128 • 145

upvoted a paper 7 months ago

LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs

Paper • 2406.15319 • Published Jun 21, 2024 • 62