requests gradio datasets scikit-learn huggingface_hub==0.25.2 fpdf torch transformers spacy nltk transformers tiktoken blobfile sentencepiece tokenizer libgen-api