Qingyun's picture
Update README.md
a36eb5c verified
|
raw
history blame
2.23 kB
metadata
license: mit
datasets:
  - Qingyun/lmmrotate-sft-data
language:
  - en
base_model:
  - microsoft/Florence-2-large
pipeline_tag: image-text-to-text
tags:
  - aerial
  - geoscience
  - remotesensing

LMMRotate ๐ŸŽฎ: A Simple Aerial Detection Baseline of Multimodal Language Models

Qingyun Liโ€ƒ Yushi Chenโ€ƒ Xinya Shuโ€ƒ Dong Chenโ€ƒ Xin Heโ€ƒ Yi Yuโ€ƒ Xue Yangโ€ƒ

If you find our work helpful, please consider giving us a โญ!

This repo hosts all the available checkpoints of Florence-2 trained for aerial detection with LMMRotate in our paper.

LMMRotate is a technical practice to fine-tune Large Multimodal language Models for oriented object detection as in MMRotate and hosts the official implementation of the paper: A Simple Aerial Detection Baseline of Multimodal Language Models.

framework

Detection Performance