FoundationVision/groma-7b-finetune



---
datasets:
- FoundationVision/groma_instruct
language:
- en
pipeline_tag: image-text-to-text
library_name: transformers
---

This repository contains the model from the paper [Groma: Localized Visual Tokenization for Grounding Multimodal Large Language Models](https://huggingface.co/papers/2404.13013).
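
As a minimal sketch of how the files in this repository could be fetched locally, the snippet below uses `snapshot_download` from the `huggingface_hub` package. The helper name `download_checkpoint` is hypothetical and chosen for illustration only. This card does not say how the checkpoint is loaded for inference, so the sketch stops at downloading and makes no assumption about which model class to use afterwards.

```python
# Repository id as shown on this model page.
REPO_ID = "FoundationVision/groma-7b-finetune"


def download_checkpoint(local_dir=None):
    """Download all files of the model repository and return the local path.

    `download_checkpoint` is a hypothetical helper, not part of any library.
    """
    # Imported lazily so this module can be imported without huggingface_hub.
    from huggingface_hub import snapshot_download

    # With local_dir=None the files go into the default Hugging Face cache.
    return snapshot_download(repo_id=REPO_ID, local_dir=local_dir)


# Example (downloads several GB of weights, so it is left commented out):
# path = download_checkpoint()
# print(f"Checkpoint files are in: {path}")
```

Passing `local_dir` places the files in a directory of your choice instead of the shared cache, which can be convenient when the inference code expects a plain checkpoint folder.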