ONEKQ AI

company

AI & ML interests

Benchmark, Code Generation, LLM

Recent Activity

onekq  updated a model about 15 hours ago
onekq-ai/Qwen2.5-14B-Instruct-1M-bnb-4bit
onekq  updated a model about 15 hours ago
onekq-ai/Qwen2.5-7B-Instruct-1M-bnb-4bit
onekq  published a model about 16 hours ago
onekq-ai/Qwen2.5-14B-Instruct-1M-bnb-4bit
View all activity

onekq-ai's activity

onekq 
posted an update 3 days ago
view post
Post
1924
So 🐋DeepSeek🐋 hits the mainstream media. But it has been a star in our little cult for at least 6 months. Its meteoric success is not overnight, but two years in the making.

To learn their history, just look at their 🤗 repo https://huggingface.co/deepseek-ai

* End of 2023, they launched the first model (pretrained by themselves) following Llama 2 architecture
* June 2024, v2 (MoE architecture) surpassed Gemini 1.5, but behind Mistral
* September, v2.5 surpassed GPT 4o mini
* December, v3 surpassed GPT 4o
* Now R1 surpassed o1

Most importantly, if you think DeepSeek success is singular and unrivaled, that's WRONG. The following models are also near or equal the o1 bar.

* Minimax-01
* Kimi k1.5
* Doubao 1.5 pro
onekq 
posted an update 6 days ago
onekq 
posted an update 7 days ago
view post
Post
4575
🐋DeepSeek 🐋 is the real OpenAI 😯
·
onekq 
posted an update 13 days ago