|
--- |
|
tags: |
|
- panda70m |
|
- video2text |
|
--- |
|
|
|
# How to use? |
|
|
|
```bash |
|
huggingface-cli download Ligeng-Zhu/panda70m \ |
|
--local-dir panda70m --repo-type dataset --local-dir-use-symlinks False |
|
``` |
|
|
|
Then install dependencies |
|
|
|
```bash |
|
pip install fire yt_dlp pandas |
|
``` |
|
|
|
Next pull the videos |
|
|
|
```bash |
|
python main.py --csv=<your csv files> |
|
``` |
|
|
|
or split by shards to accelerate downloading |
|
|
|
```bash |
|
python main.py --csv=<your csv files> --shards=0 --total=10 |
|
python main.py --csv=<your csv files> --shards=1 --total=10 |
|
... |
|
python main.py --csv=<your csv files> --shards=9 --total=10 |
|
``` |