Chao-Chun (Joe) Hsu

joe32140

https://chaochunhsu.github.io

AI & ML interests

Hi, I am Joe!

joe32140's activity

upvoted a collection about 23 hours ago

Qwen2.5-1M

Collection

The long-context version of Qwen2.5, supporting 1M-token context lengths • 2 items • Updated 1 day ago • 72

updated 2 models 1 day ago

joe32140/ModernBERT-base-msmarco

joe32140/ModernBERT-large-msmarco

liked a model 4 days ago

jinaai/ReaderLM-v2

Text Generation • Updated 6 days ago • 19.6k • 435

liked 2 models 5 days ago

Alibaba-NLP/gte-modernbert-base

Alibaba-NLP/gte-reranker-modernbert-base

liked 2 models 6 days ago

deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B

Text Generation • Updated 1 day ago • 124k • 453

deepseek-ai/DeepSeek-R1

Text Generation • Updated 1 day ago • 149k • 3.27k

liked a model 11 days ago

mixedbread-ai/mxbai-rerank-base-v1

Text Classification • Updated Jul 22, 2024 • 394k • 37

liked 3 models 12 days ago

internlm/internlm3-8b-instruct

Text Generation • Updated 11 days ago • 16.8k • 188

lightblue/lb-reranker-0.5B-v1.0

Text Generation • Updated 7 days ago • 2.01k • 60

sentence-transformers/static-retrieval-mrl-en-v1

reacted to tomaarsen's post with 🔥 12 days ago

🏎️ Today I'm introducing a method to train static embedding models that run 100x to 400x faster on CPU than common embedding models, while retaining 85%+ of the quality! Including 2 fully open models: training scripts, datasets, metrics.

We apply our recipe to train 2 Static Embedding models that we release today! We release:
2️⃣ an English Retrieval model and a general-purpose Multilingual similarity model (e.g. classification, clustering, etc.), both Apache 2.0
🧠 my modern training strategy: ideation -> dataset choice -> implementation -> evaluation
📜 my training scripts, using the Sentence Transformers library
📊 my Weights & Biases reports with losses & metrics
📕 my list of 30 training and 13 evaluation datasets

The 2 Static Embedding models have the following properties:
🏎️ Extremely fast, e.g. 107500 sentences per second on a consumer CPU, compared to 270 for 'all-mpnet-base-v2' and 56 for 'gte-large-en-v1.5'
0️⃣ Zero active parameters: No Transformer blocks, no attention, not even a matrix multiplication. Super speed!
📏 No maximum sequence length! Embed texts at any length (note: longer texts may embed worse)
📐 Linear instead of superlinear complexity: 2x longer text takes 2x longer, instead of 2.5x or more.
🪆 Matryoshka support: allows you to truncate embeddings with minimal performance loss (e.g. 4x smaller with a 0.56% perf. decrease for English Similarity tasks)
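
To make the properties above concrete, here is a toy sketch of the general idea: a static embedding is a per-token table lookup followed by mean pooling (one pass over the tokens, so the cost grows linearly with text length), and Matryoshka truncation keeps only the leading dimensions and re-normalizes. Everything in it is an assumption for illustration: the 4-dimensional `EMBEDDINGS` table, the whitespace tokenizer, and the helper names `embed`, `truncate`, and `cosine` are hypothetical and are not the Sentence Transformers implementation or the released models' weights.

```python
import math

# Hypothetical toy table: one fixed vector per token (illustrative values only).
EMBEDDINGS = {
    "fast": [0.9, 0.1, 0.3, 0.0],
    "cpu": [0.2, 0.8, 0.1, 0.1],
    "model": [0.4, 0.4, 0.6, 0.2],
}
UNKNOWN = [0.0, 0.0, 0.0, 0.0]  # fallback vector for out-of-vocabulary tokens


def embed(text):
    """Mean-pool the per-token vectors: a single linear pass, no attention."""
    tokens = text.lower().split()  # stand-in for a real tokenizer
    if not tokens:
        return list(UNKNOWN)
    total = [0.0] * len(UNKNOWN)
    for token in tokens:
        vector = EMBEDDINGS.get(token, UNKNOWN)
        for i, value in enumerate(vector):
            total[i] += value
    return [value / len(tokens) for value in total]


def truncate(vector, dim):
    """Matryoshka-style truncation: keep the first `dim` values, then re-normalize."""
    head = vector[:dim]
    norm = math.sqrt(sum(value * value for value in head))
    return [value / norm for value in head] if norm > 0 else head


def cosine(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a > 0 and norm_b > 0 else 0.0


if __name__ == "__main__":
    full = embed("fast cpu model")
    small = truncate(full, 2)  # half the dimensions
    print(len(full), len(small))
    print(cosine(embed("fast cpu"), embed("cpu model")))
```

Truncation only preserves quality when the model was trained so that the leading dimensions carry most of the signal, which is what Matryoshka-style training aims for; slicing an arbitrary embedding this way gives no such guarantee.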

Check out the full blogpost if you'd like to 1) use these lightning-fast models or 2) learn how to train them with consumer-level hardware: https://huggingface.co/blog/static-embeddings

The blogpost contains a lengthy list of possible advancements; I'm very confident that our 2 models are only the tip of the iceberg, and we may be able to get even better performance.

Alternatively, check out the models:
* sentence-transformers/static-retrieval-mrl-en-v1
* sentence-transformers/static-similarity-mrl-multilingual-v1