sentencepiece accelerate evaluate datasets sacrebleu websockets omegaconf scikit-learn jiwer tenacity pynvml ninja ray psutil fastapi geomloss einops langdetect transformers transformers_stream_generator plotly vllm langchain langchain-community langchain-core sentence-transformers faiss-cpu pypdf sentencepiece docx2txt