NYU VisionX

university

Recent Activity

xcpan  updated a dataset about 20 hours ago
nyu-visionx/oro_optical_results
xcpan  published a dataset about 20 hours ago
nyu-visionx/oro_optical_results
xcpan  updated a dataset 5 days ago
nyu-visionx/oro_dino_results

nyu-visionx's activity

sayakpaul 
posted an update 4 days ago
We have been cooking a couple of fine-tuning runs on CogVideoX with finetrainers, smol datasets, and LoRA to generate cool video effects like crushing, dissolving, etc.

We are also releasing a LoRA extraction utility from a fully fine-tuned checkpoint. I know that kind of stuff has existed since eternity, but the quality on video models was nothing short of spectacular. Below are some links:

* Models and datasets: https://huggingface.co/finetrainers
* finetrainers: https://github.com/a-r-r-o-w/finetrainers
* LoRA extraction: https://github.com/huggingface/diffusers/blob/main/scripts/extract_lora_from_model.py
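
For readers wondering what "extracting a LoRA from a fully fine-tuned checkpoint" can look like, here is a minimal sketch of one common approach: take the difference between the fine-tuned and base weights of a layer and keep only its top singular directions. This is an illustration under stated assumptions, not the code of the script linked above. The function name `extract_lora`, the `rank` parameter, and the toy matrices are all hypothetical, and NumPy stands in for a real checkpoint loader.

```python
import numpy as np


def extract_lora(w_base, w_tuned, rank):
    """Approximate (w_tuned - w_base) with a low-rank product lora_b @ lora_a.

    Hypothetical helper for illustration; a real utility would loop over
    every matching layer in a pair of checkpoints.
    """
    delta = w_tuned - w_base
    # Truncated SVD gives the best rank-`rank` approximation of the update.
    u, s, vt = np.linalg.svd(delta, full_matrices=False)
    sqrt_s = np.sqrt(s[:rank])
    lora_b = u[:, :rank] * sqrt_s          # shape: (out_features, rank)
    lora_a = sqrt_s[:, None] * vt[:rank]   # shape: (rank, in_features)
    return lora_a, lora_b


# Toy layer: the "fine-tuned" weight differs from the base by a rank-4 update.
rng = np.random.default_rng(0)
w_base = rng.standard_normal((64, 32))
true_update = rng.standard_normal((64, 4)) @ rng.standard_normal((4, 32))
w_tuned = w_base + true_update

lora_a, lora_b = extract_lora(w_base, w_tuned, rank=4)

# Relative error between the recovered low-rank update and the true one.
rel_err = np.linalg.norm(lora_b @ lora_a - true_update) / np.linalg.norm(true_update)
```

In this toy case the update is exactly rank 4, so a rank-4 extraction recovers it almost perfectly. With a real fine-tuned model the update is generally not low rank, and the chosen rank trades adapter size against fidelity.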
sayakpaul 
posted an update 7 days ago
We have authored a post to go over the state of video generation in the Diffusers ecosystem 🧨

We cover the supported models, the optimization knobs users can turn, fine-tuning, and more 🔥

5-6 GB for HunyuanVideo, the sky is the limit 🌌 🤗
https://huggingface.co/blog/video_gen
sayakpaul 
posted an update about 1 month ago