datasets huggingface_hub imbalanced-learn numpy pandas matplotlib openai scikit-learn spacy streamlit tenacity