jiakai

real-jiakai

https://blog.gujiakai.top

AI & ML interests

LLM && Smart QA

Recent Activity

liked a model about 10 hours ago

mistralai/Mistral-Small-24B-Instruct-2501

liked a model about 10 hours ago

mistralai/Mistral-Small-24B-Base-2501

liked a model about 11 hours ago

m-a-p/YuE-s1-7B-anneal-zh-icl

View all activity

Organizations

real-jiakai's activity

upvoted an article about 11 hours ago

Article

Model2Vec: Distill a Small Fast Model from any Sentence Transformer

•

Oct 14, 2024

• 65

upvoted a collection 3 days ago

Tulu 3 Models

Collection

All models released with Tulu 3 -- state of the art open post-training recipes. • 10 items • Updated 5 days ago • 84

upvoted a paper 4 days ago

Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate

Paper • 2501.17703 • Published 5 days ago • 45

upvoted a paper 7 days ago

Chain-of-Retrieval Augmented Generation

Paper • 2501.14342 • Published 11 days ago • 41

upvoted an article 7 days ago

Article

Train 400x faster Static Embedding Models with Sentence Transformers

20 days ago

• 132

upvoted a collection 7 days ago

Qwen2.5-VL

Collection

Vision-language model series based on Qwen2.5 • 3 items • Updated 8 days ago • 311

upvoted a paper 11 days ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 12 days ago • 284

upvoted 2 papers 20 days ago

Critical Tokens Matter: Token-Level Contrastive Estimation Enhence LLM's Reasoning Capability

Paper • 2411.19943 • Published Nov 29, 2024 • 57

OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation

Paper • 2412.02592 • Published Dec 3, 2024 • 22

upvoted a paper 24 days ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published 26 days ago • 252

upvoted 2 papers 25 days ago

LLM4SR: A Survey on Large Language Models for Scientific Research

Paper • 2501.04306 • Published 27 days ago • 33

Agent Laboratory: Using LLM Agents as Research Assistants

Paper • 2501.04227 • Published 27 days ago • 84

upvoted a paper 26 days ago

LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token

Paper • 2501.03895 • Published 27 days ago • 48

upvoted a paper 27 days ago

Personalized Graph-Based Retrieval for Large Language Models

Paper • 2501.02157 • Published about 1 month ago • 28

upvoted a paper 28 days ago

OneKE: A Dockerized Schema-Guided LLM Agent-based Knowledge Extraction System

Paper • 2412.20005 • Published Dec 28, 2024 • 17

upvoted a collection 29 days ago

🪐 SmolLM

Collection

A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated Dec 22, 2024 • 213

upvoted an article about 1 month ago

Article

✴️ ScreenSpot-Pro: GUI Grounding for Professional High-Resolution Computer Use

•

Jan 3

• 13

upvoted a paper about 1 month ago

CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings

Paper • 2501.01257 • Published Jan 2 • 48

upvoted 2 articles about 1 month ago

Article

Introducing Observers: AI Observability with Hugging Face datasets through a lightweight SDK

•

Nov 21, 2024

• 35

Article

🐺🐦‍⬛ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark

•

Jan 2

• 39