Olivier Dehaene's picture

Olivier Dehaene

olivierdehaene

AI & ML interests

None yet

Recent Activity

Articles

Organizations

Hugging Face's profile picture BigScience Workshop's profile picture Hugging Face Internal Testing Organization's profile picture Text Generation Inference's profile picture OpenAssistant's profile picture HuggingFaceM4's profile picture TRL's profile picture BigCode's profile picture Hugging Face H4's profile picture Hugging Face OSS Metrics's profile picture CarperAI's profile picture gg-hf's profile picture mx-test's profile picture hsramall's profile picture yorg's profile picture LLHF's profile picture SLLHF's profile picture Hugging Quants's profile picture blhf's profile picture kmhf's profile picture s0409's profile picture kernels-community's profile picture

olivierdehaene's activity

New activity in Alibaba-NLP/gte-Qwen2-1.5B-instruct 3 months ago
New activity in sanchit-gandhi/whisper-jax 3 months ago

Fix Dockerfile

1
#127 opened 3 months ago by
olivierdehaene
New activity in mistralai/Mistral-Nemo-Instruct-2407 4 months ago

model is not working

1
#74 opened 4 months ago by
lowpex
replied to mayank-mishra's post 10 months ago
view reply

Nice blog!
@osanseviero we have been doing this in TGI and TEI for a while ;)
Padding free implementations also make dynamic batching easier to implement and more predictable in memory.

reacted to loubnabnl's post with β€οΈπŸ€―πŸ€— 10 months ago
view post
Post
⭐ Today we’re releasing The Stack v2 & StarCoder2: a series of 3B, 7B & 15B code generation models trained on 3.3 to 4.5 trillion tokens of code:

- StarCoder2-15B matches or outperforms CodeLlama 34B, and approaches DeepSeek-33B on multiple benchmarks.
- StarCoder2-3B outperforms StarCoderBase-15B and similar sized models.
- The Stack v2 a 4x larger dataset than the Stack v1, resulting in 900B unique code tokens πŸš€
As always, we released everything from models and datasets to curation code. Enjoy!

πŸ”— StarCoder2 collection: bigcode/starcoder2-65de6da6e87db3383572be1a
πŸ”— Paper: https://drive.google.com/file/d/17iGn3c-sYNiLyRSY-A85QOzgzGnGiVI3/view
πŸ”— BlogPost: https://huggingface.co/blog/starcoder2
πŸ”— Code Leaderboard: bigcode/bigcode-models-leaderboard
New activity in BAAI/bge-reranker-large about 1 year ago

Add fast tokenizer

1
#4 opened about 1 year ago by
olivierdehaene
New activity in BAAI/bge-reranker-base about 1 year ago

Add fast tokenizer

1
#5 opened about 1 year ago by
olivierdehaene
New activity in HuggingFaceH4/zephyr-chat about 1 year ago
New activity in thenlper/gte-base about 1 year ago
New activity in llmrails/ember-v1 about 1 year ago
New activity in BAAI/bge-large-en-v1.5 about 1 year ago
New activity in BAAI/bge-base-en-v1.5 over 1 year ago
New activity in tiiuae/falcon-7b-instruct over 1 year ago

Add hf endpoint handler.py

#24 opened over 1 year ago by
olivierdehaene
New activity in tiiuae/falcon-40b-instruct over 1 year ago

Add hf endpoint handler.py

#30 opened over 1 year ago by
olivierdehaene