Steering Rectified Flow Models in the Vector Field for Controlled Image Generation Paper • 2412.00100 • Published Nov 27, 2024 • 16
TripletCLIP: Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives Paper • 2411.02545 • Published Nov 4, 2024 • 1
Intel Labs at Ego4D Challenge 2022: A Better Baseline for Audio-Visual Diarization Paper • 2210.07764 • Published Oct 14, 2022
R.A.C.E.: Robust Adversarial Concept Erasure for Secure Text-to-Image Diffusion Model Paper • 2405.16341 • Published May 25, 2024
Deep Geometric Moments Promote Shape Consistency in Text-to-3D Generation Paper • 2408.05938 • Published Aug 12, 2024
STHG: Spatial-Temporal Heterogeneous Graph Learning for Advanced Audio-Visual Diarization Paper • 2306.10608 • Published Jun 18, 2023
Super-NaturalInstructions: Generalization via Declarative Instructions on 1600+ NLP Tasks Paper • 2204.07705 • Published Apr 16, 2022 • 1
$λ$-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latent Space Paper • 2402.05195 • Published Feb 7, 2024 • 19
ECLIPSE: A Resource-Efficient Text-to-Image Prior for Image Generations Paper • 2312.04655 • Published Dec 7, 2023 • 21
WOUAF: Weight Modulation for User Attribution and Fingerprinting in Text-to-Image Diffusion Models Paper • 2306.04744 • Published Jun 7, 2023 • 1
CRIPP-VQA: Counterfactual Reasoning about Implicit Physical Properties via Video Question Answering Paper • 2211.03779 • Published Nov 7, 2022 • 1
ConceptBed: Evaluating Concept Learning Abilities of Text-to-Image Diffusion Models Paper • 2306.04695 • Published Jun 7, 2023 • 1