Adina Yakefu

AdinaY

AI & ML interests

None yet

Recent Activity

Articles

Organizations

Hugging Face's profile picture Hugging Face Chinese Localization's profile picture Huggingface Projects's profile picture Blog-explorers's profile picture ICCV2023's profile picture Open LLM Leaderboard's profile picture huggingPartyParis's profile picture Qwen's profile picture Journalists on Hugging Face's profile picture Women on Hugging Face's profile picture Social Post Explorers's profile picture Chinese LLMs on Hugging Face's profile picture Hugging Face for Legal's profile picture

AdinaY's activity

posted an update about 12 hours ago
view post
Post
394
QvQ-72B-Preview๐ŸŽ„ an open weight model for visual reasoning just released by Alibaba_Qwen team
Qwen/qvq-676448c820912236342b9888
โœจ Combines visual understanding & language reasoning.
โœจ Scores 70.3 on MMMU
โœจ Outperforms Qwen2-VL-72B-Instruct in complex problem-solving
posted an update 9 days ago
view post
Post
484
Megrez-3B-Omni ๐Ÿ”ฅ an on-device multimodal LLM by Infinigence AI, another startup emerging from the Tsinghua University ecosystem.
Model: Infinigence/Megrez-3B-Omni
Demo: Infinigence/Megrez-3B-Omni
โœจSupports analysis of image, text, and audio modalities
โœจLeads in bilingual speech ( English & Chinese ) input, multi-turn conversations, and voice-based queries
โœจOutperforms in scene understanding and OCR across major benchmarks
reacted to qq8933's post with ๐Ÿ”ฅ๐Ÿ‘€ 13 days ago
view post
Post
2520
LLaMA-O1-PRM and LLaMA-O1-Reinforcement will release in this weekend.
We have implemented a novel Reinforcement finetune(RFT) pipeline that taught models learning reasoning and reward labeling without human annotation.
ยท
reacted to julien-c's post with โค๏ธ๐Ÿ”ฅ 14 days ago
view post
Post
7567
After some heated discussion ๐Ÿ”ฅ, we clarify our intent re. storage limits on the Hub

TL;DR:
- public storage is free, and (unless blatant abuse) unlimited. We do ask that you consider upgrading to PRO and/or Enterprise Hub if possible
- private storage is paid above a significant free tier (1TB if you have a paid account, 100GB otherwise)

docs: https://huggingface.co/docs/hub/storage-limits

We optimize our infrastructure continuously to scale our storage for the coming years of growth in Machine learning, to the benefit of the community ๐Ÿ”ฅ

cc: @reach-vb @pierric @victor and the HF team
ยท
reacted to m-ric's post with ๐Ÿ”ฅ 16 days ago
view post
Post
2220
Last week was crazy in OS AI, with important models and datasets releases every day.

Here are the most important ones I've pinned:

๐ŸŒŽ Cohere relased GLobal-MMLU, a multilingual version of MMLU, to evaluate AI models' world knowledge in many languages!

๐Ÿฆ™ Meta released Llama-3.3-70B-Instruct, a 70B model that's on par with Llama-3.1-405B-Instruct, GPT-4o and Claude. Probably my new go-to for agentic workflows.

๐Ÿ”‰ FishAudio released fish-speech-1.5, multilingual text to speech model

๐ŸŽจ Microsoft Research released TRELLIS, an extremely impressive image-to-3D model, which you can try here: JeffreyXiang/TRELLIS

๐Ÿ“š Yesterday, Hugging Face release FineWeb 2, a new version that extends the previous FineWeb to over 1000 languages, including extended coverage in Russina, Mandarin, German, Japanese, Spanish, French, so a huge, high-quality dataset of > 3 trillion words! HuggingFaceFW/fineweb-2

Now let's go build to make this week as productive as last one!
reacted to davidberenstein1957's post with ๐Ÿ”ฅ 16 days ago
view post
Post
2046
Open Preference Dataset for Text-to-Image Generation by the ๐Ÿค— Community

Open Image Preferences is an Apache 2.0 licensed dataset for text-to-image generation. This dataset contains 10K text-to-image preference pairs across common image generation categories, while using different model families and varying prompt complexities.

https://huggingface.co/blog/image-preferences
reacted to thomwolf's post with ๐Ÿš€ 16 days ago
view post
Post
4324
We are proud to announce HuggingFaceFW/fineweb-2: A sparkling update to HuggingFaceFW/fineweb with 1000s of ๐Ÿ—ฃ๏ธlanguages.

We applied the same data-driven approach that led to SOTA English performance in๐Ÿท FineWeb to thousands of languages.

๐Ÿฅ‚ FineWeb2 has 8TB of compressed text data and outperforms other multilingual datasets in our experiments.

The dataset is released under the permissive ๐Ÿ“œ ODC-By 1.0 license, and the ๐Ÿ’ป code to reproduce it and our evaluations is public.

We will very soon announce a big community project, and are working on a ๐Ÿ“ blogpost walking you through the entire dataset creation process. Stay tuned!

In the mean time come ask us question on our chat place: HuggingFaceFW/discussion

H/t @guipenedo @hynky @lvwerra as well as @vsabolcec Bettina Messmer @negar-foroutan and @mjaggi
  • 2 replies
ยท
posted an update 16 days ago
view post
Post
868
Updates from the Chinese community last week ๐Ÿ”ฅ

LLM:
โœจ Sailor 2 , multilingual model supporting 10+ South Asian languages by Sea AI Lab. https://huggingface.co/sailor2

MLLM:
โœจInternVL 2.5 , new open multimodal LLM by OpenGVLab
https://huggingface.co/collections/OpenGVLab/internvl-25-673e1019b66e2218f68d7c1c
โœจQwen2-VL 2B/7B/72B base model, the latest iteration of our Qwen-VL model by Alibaba Qwen
Qwen/qwen2-vl-66cee7455501d7126940800d

Video model:
โœจHunyuanVideo , 13B open video model by Tencent
tencent/HunyuanVideo

Reasoning model:
โœจ LLaMA-O1 ๐Ÿฆ™ base & supervised model; pretrain & finetune datasets and demo all released
zh-ai-community/reasoning-models-67409fb3aa1ed78f10087cd7

Audio model:
โœจFish Speech 1.5, Text-to-speech in 13 languages, trained on 1M+ hours of audio by FishAudio
fishaudio/fish-speech-1.5
โœจClearVoice, An advanced voice processing framework by Alibaba Tongyi SpeechAI https://huggingface.co/alibabasglab

More details ๐Ÿ‘‰ https://huggingface.co/zh-ai-community
posted an update 21 days ago
view post
Post
1571
Sailor 2 ๐Ÿšข open multilingual model for Southeast Asia by Sea AI Lab๐Ÿ”ฅ
https://huggingface.co/sailor2
sail/Sailor2-20B-Chat

โœจ Fully open code & ALL datasets ๐Ÿ™Œ
โœจ 1B/ 8B/20B base & chat expanded on Qwen2.5
โœจ Apache 2.0
โœจ Supports 15 languages including English, Chinese, Burmese, Cebuano, Ilocano, Indonesian, Javanese, Khmer, Lao, Malay, Sundanese, Tagalog, Thai, Vietnamese, and Waray๐Ÿ‡ฌ๐Ÿ‡ง๐Ÿ‡จ๐Ÿ‡ณ๐Ÿ‡ฑ๐Ÿ‡ฆ๐Ÿ‡ฒ๐Ÿ‡พ๐Ÿ‡ฒ๐Ÿ‡ฒ๐Ÿ‡ป๐Ÿ‡ณ๐Ÿ‡น๐Ÿ‡ญ
posted an update 22 days ago
view post
Post
1465
2023 & 2024 Top Downloaded (all time) Open Models on the hub are both from the Chinese community ๐Ÿ‘€

2023 ๐Ÿ‘‰ Bge base by BAAI
BAAI/bge-base-en-v1.5
2024 ๐Ÿ‘‰ Qwen 2.5 by Alibaba Qwen
Qwen/Qwen2.5-1.5B-Instruct

Canโ€™t wait to see what incredible models the Chinese community will bring in 2025๐Ÿš€

โœจ Follow https://huggingface.co/zh-ai-community to get the latest updates from the Chinese community
โœจ Explore the 2024 Year in Review huggingface/open-source-ai-year-in-review-2024
replied to qq8933's post 22 days ago
view reply

Congrats on the release ๐ŸŽ‰๐Ÿ”ฅ

reacted to qq8933's post with ๐Ÿค—๐Ÿš€ 22 days ago
view post
Post
3037
  • 3 replies
ยท
posted an update 22 days ago
view post
Post
1330
HunyuanVideo ๐Ÿ“น The new open video generation model by Tencent!
๐Ÿ‘‰ tencent/HunyuanVideo
zh-ai-community/video-models-666afd86cfa4e4dd1473b64c
โœจ 13B parameters: Probably the largest open video model to date
โœจ Unified architecture for image & video generation
โœจ Powered by advanced features: MLLM Text Encoder, 3D VAE, and Prompt Rewrite
โœจ Delivers stunning visuals, diverse motion, and unparalleled stability
๐Ÿ”“ Fully open with code & weights
posted an update 26 days ago
view post
Post
1104
Zhipu AI, the Chinese generative AI startup behind CogVideo, just launched their first productized AI Agent - AutoGLM ๐Ÿ”ฅ
๐Ÿ‘‰ https://agent.aminer.cn

With simple text or voice commands, it:
โœจ Simulates phone operations effortlessly
โœจ Autonomously handles 50+ step tasks
โœจ Seamlessly operates across apps

Powered by Zhipu's "Decoupled Interface" and "Self-Evolving Learning Framework" to achieve major performance gains in Phone Use and Web Browser Use!

Meanwhile, GLM4-Edge is now on Hugging Face hub๐Ÿš€
๐Ÿ‘‰ THUDM/glm-edge-6743283c5809de4a7b9e0b8b
Packed with advanced dialogue + multimodal models:
๐Ÿ“ฑ 1.5B / 2B models: Built for mobile & in-car systems
๐Ÿ’ป 4B / 5B models: Optimized for PCs
replied to qq8933's post 26 days ago
reacted to qq8933's post with ๐Ÿš€ 26 days ago
view post
Post
1343
LLaMA-O1 Base and SFT model will be uploaded to HF today.
RLHF pipeline already ready, still waiting for data sampling.
  • 1 reply
ยท
posted an update 27 days ago
view post
Post
1585
๐ŸŒŠ The wave of reasoning models from the Chinese community has arrived!

๐Ÿš€ Marco-o1 by AIDC, Alibaba
๐Ÿ‘‰ AIDC-AI/Marco-o1

โœจ QwQ by Qwen, Alibaba
๐Ÿ‘‰ Qwen/qwq-674762b79b75eac01735070a

๐ŸŒŸ Skywork-o1 by Kunlun Tech
๐Ÿ‘‰ Skywork/skywork-o1-open-67453df58e12f6c3934738d0

๐Ÿ”ฅ Xkev/Llama-3.2V-11B-cot by PKU Yuan group
๐Ÿ‘‰ Xkev/Llama-3.2V-11B-cot

๐Ÿ’ก DeepSeek-R1-Lite-Preview by DeepSeek AI
๐Ÿ‘‰ https://chat.deepseek.com/

๐Ÿ” InternThinker Preview by Shanghai AI Lab
๐Ÿ‘‰ https://sso.openxlab.org.cn/login?redirect=https://internlm-chat.intern-ai.org.cn/&clientId=ebmrvod6yo0nlzaek1yp

๐Ÿ“˜ k0-math by Moonshot AI
๐Ÿš€ https://kimi.moonshot.cn/ ( coming soon! )

Who's next? ๐Ÿ‘€
zh-ai-community/reasoning-models-67409fb3aa1ed78f10087cd7