Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference 9 days ago • 59
Train 400x faster Static Embedding Models with Sentence Transformers 10 days ago • 120
Releasing Outlines-core 0.1.0: structured generation in Rust and Python Oct 22, 2024 • 44