streamlit transformers sentencepiece PyPDF2 python-docx torch langchain_huggingface huggingface_hub