Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
3
Sorokin Evgeny
DeathGodlike
Follow
21world's profile picture
1 follower
Β·
25 following
AI & ML interests
None yet
Recent Activity
reacted
to
onekq
's
post
with π₯
5 days ago
πDeepSeek π is the real OpenAI π―
reacted
to
alibabasglab
's
post
with π
5 days ago
ClearerVoice-Studio: your one-step speech processing platform for speech enhancement, speech separation, speech super-resolution, and audio-visual target speaker extraction. Say goodbye to noise and hello to clarity! Online demo: https://huggingface.co/spaces/alibabasglab/ClearVoice . Github repo: https://github.com/modelscope/ClearerVoice-Studio
reacted
to
tomaarsen
's
post
with π₯
11 days ago
ποΈ Today I'm introducing a method to train static embedding models that run 100x to 400x faster on CPU than common embedding models, while retaining 85%+ of the quality! Including 2 fully open models: training scripts, datasets, metrics. We apply our recipe to train 2 Static Embedding models that we release today! We release: 2οΈβ£ an English Retrieval model and a general-purpose Multilingual similarity model (e.g. classification, clustering, etc.), both Apache 2.0 π§ my modern training strategy: ideation -> dataset choice -> implementation -> evaluation π my training scripts, using the Sentence Transformers library π my Weights & Biases reports with losses & metrics π my list of 30 training and 13 evaluation datasets The 2 Static Embedding models have the following properties: ποΈ Extremely fast, e.g. 107500 sentences per second on a consumer CPU, compared to 270 for 'all-mpnet-base-v2' and 56 for 'gte-large-en-v1.5' 0οΈβ£ Zero active parameters: No Transformer blocks, no attention, not even a matrix multiplication. Super speed! π No maximum sequence length! Embed texts at any length (note: longer texts may embed worse) π Linear instead of exponential complexity: 2x longer text takes 2x longer, instead of 2.5x or more. πͺ Matryoshka support: allow you to truncate embeddings with minimal performance loss (e.g. 4x smaller with a 0.56% perf. decrease for English Similarity tasks) Check out the full blogpost if you'd like to 1) use these lightning-fast models or 2) learn how to train them with consumer-level hardware: https://huggingface.co/blog/static-embeddings The blogpost contains a lengthy list of possible advancements; I'm very confident that our 2 models are only the tip of the iceberg, and we may be able to get even better performance. Alternatively, check out the models: * https://huggingface.co/sentence-transformers/static-retrieval-mrl-en-v1 * https://huggingface.co/sentence-transformers/static-similarity-mrl-multilingual-v1
View all activity
Organizations
None yet
DeathGodlike
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
liked
2 models
5 months ago
arcee-ai/Llama-3.1-SuperNova-Lite
Text Generation
β’
Updated
10 days ago
β’
9.64k
β’
188
bartowski/Llama-3.1-SuperNova-Lite-exl2
Text Generation
β’
Updated
Sep 11, 2024
β’
7
β’
2
liked
a model
6 months ago
Annuvin/Lumimaid-v0.2-12B-5.0bpw-exl2
Text Generation
β’
Updated
10 days ago
β’
9
β’
1