Recent Activity

HuggingFace-CN-community's activity

AdinaY 
posted an update about 23 hours ago
QvQ-72B-Preview🎄 an open-weight model for visual reasoning, just released by the Alibaba_Qwen team
Qwen/qvq-676448c820912236342b9888
✨ Combines visual understanding & language reasoning.
✨ Scores 70.3 on MMMU
✨ Outperforms Qwen2-VL-72B-Instruct in complex problem-solving
AdinaY 
posted an update 9 days ago
Megrez-3B-Omni 🔥 an on-device multimodal LLM by Infinigence AI, another startup emerging from the Tsinghua University ecosystem.
Model: Infinigence/Megrez-3B-Omni
Demo: Infinigence/Megrez-3B-Omni
✨Supports analysis of image, text, and audio modalities
✨Leads in bilingual speech (English & Chinese) input, multi-turn conversations, and voice-based queries
✨Outperforms in scene understanding and OCR across major benchmarks
AdinaY 
posted an update 16 days ago
Updates from the Chinese community last week 🔥

LLM:
✨ Sailor 2, multilingual model supporting 10+ Southeast Asian languages by Sea AI Lab. https://huggingface.co/sailor2

MLLM:
✨InternVL 2.5, new open multimodal LLM by OpenGVLab
https://huggingface.co/collections/OpenGVLab/internvl-25-673e1019b66e2218f68d7c1c
✨Qwen2-VL 2B/7B/72B base models, the latest iteration of the Qwen-VL model by Alibaba Qwen
Qwen/qwen2-vl-66cee7455501d7126940800d

Video model:
✨HunyuanVideo, 13B open video model by Tencent
tencent/HunyuanVideo

Reasoning model:
✨ LLaMA-O1 🦙 base & supervised model; pretrain & finetune datasets and demo all released
zh-ai-community/reasoning-models-67409fb3aa1ed78f10087cd7

Audio model:
✨Fish Speech 1.5, Text-to-speech in 13 languages, trained on 1M+ hours of audio by FishAudio
fishaudio/fish-speech-1.5
✨ClearVoice, An advanced voice processing framework by Alibaba Tongyi SpeechAI https://huggingface.co/alibabasglab

More details 👉 https://huggingface.co/zh-ai-community
AdinaY 
posted an update 21 days ago
Sailor 2 🚢 open multilingual model for Southeast Asia by Sea AI Lab🔥
https://huggingface.co/sailor2
sail/Sailor2-20B-Chat

✨ Fully open code & ALL datasets 🙌
✨ 1B/8B/20B base & chat models expanded from Qwen2.5
✨ Apache 2.0
✨ Supports 15 languages including English, Chinese, Burmese, Cebuano, Ilocano, Indonesian, Javanese, Khmer, Lao, Malay, Sundanese, Tagalog, Thai, Vietnamese, and Waray🇬🇧🇨🇳🇱🇦🇲🇾🇲🇲🇻🇳🇹🇭
AdinaY 
posted an update 22 days ago
2023 & 2024 Top Downloaded (all time) Open Models on the hub are both from the Chinese community 👀

2023 👉 Bge base by BAAI
BAAI/bge-base-en-v1.5
2024 👉 Qwen 2.5 by Alibaba Qwen
Qwen/Qwen2.5-1.5B-Instruct

Can’t wait to see what incredible models the Chinese community will bring in 2025🚀

✨ Follow https://huggingface.co/zh-ai-community to get the latest updates from the Chinese community
✨ Explore the 2024 Year in Review huggingface/open-source-ai-year-in-review-2024
AdinaY 
posted an update 22 days ago
HunyuanVideo 📹 The new open video generation model by Tencent!
👉 tencent/HunyuanVideo
zh-ai-community/video-models-666afd86cfa4e4dd1473b64c
✨ 13B parameters: Probably the largest open video model to date
✨ Unified architecture for image & video generation
✨ Powered by advanced features: MLLM Text Encoder, 3D VAE, and Prompt Rewrite
✨ Delivers stunning visuals, diverse motion, and unparalleled stability
🔓 Fully open with code & weights
AdinaY 
posted an update 26 days ago
Zhipu AI, the Chinese generative AI startup behind CogVideo, just launched their first productized AI Agent - AutoGLM 🔥
👉 https://agent.aminer.cn

With simple text or voice commands, it:
✨ Simulates phone operations effortlessly
✨ Autonomously handles 50+ step tasks
✨ Seamlessly operates across apps

Powered by Zhipu's "Decoupled Interface" and "Self-Evolving Learning Framework" to achieve major performance gains in Phone Use and Web Browser Use!

Meanwhile, GLM4-Edge is now on Hugging Face hub🚀
👉 THUDM/glm-edge-6743283c5809de4a7b9e0b8b
Packed with advanced dialogue + multimodal models:
📱 1.5B / 2B models: Built for mobile & in-car systems
💻 4B / 5B models: Optimized for PCs
AdinaY 
posted an update 28 days ago
🌊 The wave of reasoning models from the Chinese community has arrived!

🚀 Marco-o1 by AIDC, Alibaba
👉 AIDC-AI/Marco-o1

✨ QwQ by Qwen, Alibaba
👉 Qwen/qwq-674762b79b75eac01735070a

🌟 Skywork-o1 by Kunlun Tech
👉 Skywork/skywork-o1-open-67453df58e12f6c3934738d0

🔥 Xkev/Llama-3.2V-11B-cot by PKU Yuan group
👉 Xkev/Llama-3.2V-11B-cot

💡 DeepSeek-R1-Lite-Preview by DeepSeek AI
👉 https://chat.deepseek.com/

🔍 InternThinker Preview by Shanghai AI Lab
👉 https://sso.openxlab.org.cn/login?redirect=https://internlm-chat.intern-ai.org.cn/&clientId=ebmrvod6yo0nlzaek1yp

📘 k0-math by Moonshot AI
🚀 https://kimi.moonshot.cn/ ( coming soon! )

Who's next? 👀
zh-ai-community/reasoning-models-67409fb3aa1ed78f10087cd7
AdinaY 
posted an update about 1 month ago
China launched an algorithm governance campaign to ensure algorithms are more positive, transparent, controllable, fair, and accountable🇨🇳📑
zh-ai-community/china-ai-policy-research

Highlights:
✨ Combat "echo chambers" and addictive content: ban forced tags, data misuse, and excessive collection.
✨ Make rankings transparent: explain algorithms, keep logs, and detect fake accounts.
✨ Protect workers: disclose delivery algorithms and provide appeal channels.
✨ Ban unfair pricing: ensure promo transparency and honest explanations for failures.
✨ Support users: improve recommendations for minors and seniors, promote good content, and detect fakes.
✨ Ensure safety: audit algorithms, secure data, fix flaws, and regularly evaluate models.

⏰ Timeline:
Company Self-Checks: before Dec 31, 2024
Verification: before Jan 31, 2025
Effectiveness Review: before Feb 14, 2025

Open reporting channels for algorithm issues during the campaign, monitor complaints, enforce corrections, and provide feedback to users.
AdinaY 
posted an update about 1 month ago
Let’s dive into the exciting releases from the Chinese community last week 🔥🚀
More details 👉 https://huggingface.co/zh-ai-community

Code model:
✨Qwen 2.5 coder by Alibaba Qwen
Qwen/qwen25-coder-66eaa22e6f99801bf65b0c2f
✨OpenCoder by InflyAI - Fully open code model🙌
infly/opencoder-672cec44bbb86c39910fb55e

Image model:
✨Hunyuan3D-1.0 by Tencent
tencent/Hunyuan3D-1

MLLM:
✨JanusFlow by DeepSeek
deepseek-ai/JanusFlow-1.3B
✨Mono-InternVL-2B by OpenGVLab
OpenGVLab/Mono-InternVL-2B

Video model:
✨CogVideoX 1.5 by ChatGLM
THUDM/CogVideoX1.5-5B-SAT

Audio model:
✨Fish Agent by FishAudio
fishaudio/fish-agent-v0.1-3b

Dataset:
✨OPI dataset by BAAIBeijing
BAAI/OPI
AdinaY 
posted an update 2 months ago
China is advancing rapidly in AI technology while maintaining a strong focus on governance 🇨🇳📑
We've collected key AI governance documents released since 2017 and will continue updating them in this organization on the hub 👉China LLMs on Hugging Face
zh-ai-community/china-ai-policy-research
Any feedback is welcome🤗
xianbao 
posted an update 4 months ago
With the open-weight release of CogVideoX-5B from THUDM, i.e. the GLM team, the Video Generation Model (how about calling it VGM) field has officially become the next booming "LLM".

What does the landscape look like? What are the other video generation models? The collection below is all you need.

xianbao/video-generation-models-66c350163c74f60f5c412af6

The above video is generated by @a-r-r-o-w with CogVideoX-5B, taken from a nice lookout for the field!
xianbao 
posted an update 7 months ago
Why Apache 2.0 Matters for LLMs 🤔

@01AI_Yi recently switched from a permissive & commercially friendly license to Apache 2.0. And the community loved it! 🚀

@JustinLin610 also ran a poll on model licenses, and the majority voted for Apache 2.0.

Why is it a Big Deal? ⬇️

📚 Legal Simplicity: Custom licenses need costly & time-consuming legal review. Apache 2.0 is well-known & easier for legal teams to handle.

👩‍💻 Developer-Friendly: Legal docs are a pain for devs! Apache 2.0 is well-known and tech-friendly, making it easier for developers who are not native English speakers to understand the implications too.

🔗 Easier Integration: Apache 2.0 is compatible with many other licenses, simplifying tasks like model merging with models of different licensing requirements.

🚫 No Permission Needed: Custom licenses often require explicit permission and additional documentation work of filling forms, creating barriers. Apache 2.0 removes this hurdle, letting devs focus on innovation.

There are a lot of interesting discussions in @JustinLin610's poll, which inspired this thread: https://x.com/JustinLin610/status/1793559737482764375

Any other thoughts? Let me know ^^
xianbao 
posted an update 7 months ago
DeepSeek V2 is a big deal. Not only because of its significant improvements to both key components of the Transformer: the attention layer and the FFN layer.

It has also completely disrupted the Chinese LLM market, forcing competitors to drop their prices to 1% of the original price.

---

There are two key components in the Transformer architecture: the self-attention layer, which captures relationships between tokens in context, and the Feed-Forward Network (FFN) layer, which stores knowledge.

DeepSeek V2 introduces optimizations to both:

The attention layer normally uses a KV cache to avoid repeated computation, but the cache consumes significant GPU RAM, limiting concurrent requests. DeepSeek V2 introduces Multi-head Latent Attention (MLA), which stores only a small latent representation, resulting in substantial RAM savings.
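To get a feel for why caching a small latent helps, here is a rough back-of-the-envelope sketch in Python. All dimensions below (layer count, head count, head size, latent size) are hypothetical values picked for illustration, not DeepSeek V2's actual configuration, and the estimate ignores any extra per-token state a real implementation may keep.

```python
def kv_cache_bytes_per_token(n_layers, n_heads, head_dim, bytes_per_value=2):
    """Standard multi-head attention: cache one key and one value
    vector per head, per layer, for every token in the context."""
    return 2 * n_layers * n_heads * head_dim * bytes_per_value


def latent_cache_bytes_per_token(n_layers, latent_dim, bytes_per_value=2):
    """Latent-style cache: store a single compressed vector per layer
    instead of full per-head keys and values."""
    return n_layers * latent_dim * bytes_per_value


# Hypothetical model shape, chosen only to make the arithmetic concrete.
N_LAYERS = 32
N_HEADS = 32
HEAD_DIM = 128
LATENT_DIM = 512  # assumed size of the compressed representation

full_bytes = kv_cache_bytes_per_token(N_LAYERS, N_HEADS, HEAD_DIM)
latent_bytes = latent_cache_bytes_per_token(N_LAYERS, LATENT_DIM)
savings_ratio = full_bytes // latent_bytes

print(f"full KV cache:   {full_bytes} bytes per token")
print(f"latent cache:    {latent_bytes} bytes per token")
print(f"reduction:       {savings_ratio}x")
```

Under these assumed sizes the per-token cache shrinks by the ratio of 2 * heads * head_dim to latent_dim, which is what frees GPU RAM for more concurrent requests.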

In the FFN layer, DeepSeek V2 utilizes 162 experts instead of the usual 8 as in Mixtral. This approach segments experts into finer granularity for higher specialization and more accurate knowledge acquisition. Activating only a small subset of experts for each token leads to efficient processing.
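The idea of activating only a few experts per token can be sketched with generic top-k routing. This is a minimal illustration, not DeepSeek V2's actual router: the toy experts, the gate logits, and the choice of `top_k = 2` are all assumptions made for the example; only the expert count of 162 comes from the post.

```python
import heapq
import math


def route_token(gate_logits, top_k):
    """Softmax over gate logits, keep the top_k experts, and
    renormalize their weights so they sum to 1."""
    peak = max(gate_logits)
    exps = [math.exp(x - peak) for x in gate_logits]  # numerically stable softmax
    total = sum(exps)
    probs = [e / total for e in exps]
    chosen = heapq.nlargest(top_k, range(len(probs)), key=probs.__getitem__)
    norm = sum(probs[i] for i in chosen)
    return [(i, probs[i] / norm) for i in chosen]


def moe_forward(x, experts, gate_logits, top_k):
    """Run only the selected experts and mix their outputs by gate weight."""
    routed = route_token(gate_logits, top_k)
    return sum(weight * experts[i](x) for i, weight in routed)


N_EXPERTS = 162  # expert count quoted in the post
TOP_K = 2        # hypothetical number of active experts per token

# Toy experts: expert i simply scales its input by i.
experts = [lambda x, scale=i: x * scale for i in range(N_EXPERTS)]

# Hypothetical gate output for one token: experts 5 and 100 score highest.
gate_logits = [0.0] * N_EXPERTS
gate_logits[5] = 2.0
gate_logits[100] = 1.0

routed = route_token(gate_logits, TOP_K)
output = moe_forward(1.0, experts, gate_logits, TOP_K)
print(routed)
print(output)
```

Only 2 of the 162 toy experts are evaluated for this token, so compute per token stays small even though total capacity is large.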

It disrupted the market by dropping API prices to $0.14 per 1M tokens. This dramatic reduction forced competitors like GLM, Ernie, and Qwen to follow suit, lowering their prices to 1% of their original offerings. Now, users can access these APIs at 1/35th the cost of GPT-4o.
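For a sense of scale, the arithmetic behind those figures can be worked through in a few lines of Python. The $0.14 price and the 1/35 ratio come from the post; the 50M-token workload is a hypothetical example, and the "implied reference price" is simply what the ratio implies, not a quoted price.

```python
PRICE_PER_MILLION_TOKENS_USD = 0.14  # price quoted in the post
COST_RATIO = 35                      # "1/35th the cost" from the post


def api_cost_usd(num_tokens, price_per_million=PRICE_PER_MILLION_TOKENS_USD):
    """Cost of processing num_tokens at a flat per-million-token price."""
    return num_tokens / 1_000_000 * price_per_million


# Price per 1M tokens that the 1/35 ratio implies for the comparison model.
implied_reference_price = PRICE_PER_MILLION_TOKENS_USD * COST_RATIO

# Hypothetical workload of 50M tokens.
workload_tokens = 50_000_000
cheap_cost = api_cost_usd(workload_tokens)
reference_cost = api_cost_usd(workload_tokens, implied_reference_price)

print(f"implied reference price: ${implied_reference_price:.2f} per 1M tokens")
print(f"50M tokens at $0.14/1M:  ${cheap_cost:.2f}")
print(f"50M tokens at reference: ${reference_cost:.2f}")
```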