metadata
title: VLM Demo
sdk: docker
license: mit

This demo illustrates the work published in the paper "Prismatic VLMs: Investigating the Design Space of Visually-Conditioned Language Models".

Source code

For more information, please refer to this repository:

VLM Demo: Lightweight repo for chatting with VLMs supported by our VLM Evaluation Suite.

Hugging Face Space architecture

The Hugging Face Space builds a container image based on the Dockerfile. In this file, we start from an NVIDIA base image and install additional packages and external repositories.

The Hugging Face Space starts the container and executes startup.sh. The script loads each model on a separate GPU of the 4xA10G hardware, then launches several processes: one for each model, plus the Gradio API controller and frontend.
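
The one-process-per-GPU layout described above can be sketched in Python. This is a minimal illustration, not the contents of startup.sh: the model names, the `model_worker`, `controller`, and `frontend` module names, and the `--model` flag are all hypothetical placeholders, and the order of the entries is arbitrary. The only real mechanism it relies on is the `CUDA_VISIBLE_DEVICES` environment variable, which restricts a process to the listed GPU.

```python
import os
import subprocess
import sys

# Hypothetical model identifiers; the real names are defined in startup.sh.
MODELS = ["model-a", "model-b", "model-c", "model-d"]


def build_launch_plan(models, num_gpus=4):
    """Return one launch spec per process: a worker per model, a controller, a frontend."""
    if len(models) > num_gpus:
        raise ValueError("more models than available GPUs")
    plan = []
    for gpu_index, model in enumerate(models):
        plan.append({
            "name": f"worker:{model}",
            # Hypothetical module name and flag, for illustration only.
            "cmd": [sys.executable, "-m", "model_worker", "--model", model],
            # Pin this worker to a single GPU.
            "env": {"CUDA_VISIBLE_DEVICES": str(gpu_index)},
        })
    # Hypothetical module names for the Gradio API controller and the frontend.
    plan.append({"name": "controller", "cmd": [sys.executable, "-m", "controller"], "env": {}})
    plan.append({"name": "frontend", "cmd": [sys.executable, "-m", "frontend"], "env": {}})
    return plan


def launch(plan):
    """Start every process in the plan and return the Popen handles."""
    procs = []
    for spec in plan:
        # Merge the per-process overrides into the inherited environment.
        env = {**os.environ, **spec["env"]}
        procs.append(subprocess.Popen(spec["cmd"], env=env))
    return procs


if __name__ == "__main__":
    # Print the plan instead of launching, since the modules above are placeholders.
    for spec in build_launch_plan(MODELS):
        print(spec["name"], spec["env"])
```

Keeping plan construction separate from `launch` makes it possible to inspect the GPU assignment without starting any process.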