torch==2.4.0 transformers==4.39.2 spaces gradio==4.44.1 gradio_client==1.3.0 pillow accelerate bitsandbytes tqdm https://github.com/Dao-AILab/flash-attention/releases/download/v2.7.0.post2/flash_attn-2.7.0.post2+cu12torch2.4cxx11abiFALSE-cp310-cp310-linux_x86_64.whl