University of Washington

Verified

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

CrystalMo updated a Space 25 days ago

UW/Snake

CrystalMo published a Space 25 days ago

UW/Snake

anshuln authored a paper about 1 year ago

PEEKABOO: Interactive Video Generation via Masked-Diffusion

View all activity

UW's activity

CrystalMo

updated a Space 25 days ago

Snake

Text Similarity Analysis

CrystalMo

published a Space 25 days ago

Snake

Text Similarity Analysis

Nitre

authored a paper 3 months ago

SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory

Paper • 2411.11922 • Published Nov 18, 2024 • 19

kevinlin311tw

authored a paper 4 months ago

GenXD: Generating Any 3D and 4D Scenes

Paper • 2411.02319 • Published Nov 4, 2024 • 20

kellycyy

authored a paper 4 months ago

CulturalBench: a Robust, Diverse and Challenging Benchmark on Measuring the (Lack of) Cultural Knowledge of LLMs

Paper • 2410.02677 • Published Oct 3, 2024

kevinlin311tw

authored a paper 7 months ago

MM-Vet v2: A Challenging Benchmark to Evaluate Large Multimodal Models for Integrated Capabilities

Paper • 2408.00765 • Published Aug 1, 2024 • 13

kevinlin311tw

authored a paper 8 months ago

MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos

Paper • 2406.08407 • Published Jun 12, 2024 • 27

nzolman

authored a paper 9 months ago

SINDy-RL: Interpretable and Efficient Model-Based Reinforcement Learning

Paper • 2403.09110 • Published Mar 14, 2024

kevinlin311tw

authored a paper 10 months ago

List Items One by One: A New Data Source and Learning Paradigm for Multimodal LLMs

Paper • 2404.16375 • Published Apr 25, 2024 • 17

kevinlin311tw

authored 4 papers over 1 year ago

GPT-4V in Wonderland: Large Multimodal Models for Zero-Shot Smartphone GUI Navigation

Paper • 2311.07562 • Published Nov 13, 2023 • 14

MM-VID: Advancing Video Understanding with GPT-4V(ision)

Paper • 2310.19773 • Published Oct 30, 2023 • 20

DEsignBench: Exploring and Benchmarking DALL-E 3 for Imagining Visual Design

Paper • 2310.15144 • Published Oct 23, 2023 • 14

OpenLEAF: Open-Domain Interleaved Image-Text Generation and Evaluation

Paper • 2310.07749 • Published Oct 11, 2023 • 5

zeqiuwu1

authored a paper over 1 year ago

Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection

Paper • 2310.11511 • Published Oct 17, 2023 • 76

kevinlin311tw

authored 2 papers over 1 year ago

Idea2Img: Iterative Self-Refinement with GPT-4V(ision) for Automatic Image Design and Generation

Paper • 2310.08541 • Published Oct 12, 2023 • 18

MM-Vet: Evaluating Large Multimodal Models for Integrated Capabilities

Paper • 2308.02490 • Published Aug 4, 2023 • 17

zeqiuwu1

authored a paper over 1 year ago

DIALGEN: Collaborative Human-LM Generated Dialogues for Improved Understanding of Human-Human Conversations

Paper • 2307.07047 • Published Jul 13, 2023 • 16

kevinlin311tw

authored 2 papers over 1 year ago

Aligning Large Multi-Modal Model with Robust Instruction Tuning

Paper • 2306.14565 • Published Jun 26, 2023 • 6

DisCo: Disentangled Control for Referring Human Dance Generation in Real World

Paper • 2307.00040 • Published Jun 30, 2023 • 25

kevinlin311tw

authored a paper almost 2 years ago

GIT: A Generative Image-to-text Transformer for Vision and Language

Paper • 2205.14100 • Published May 27, 2022 • 1