---
license: mit
---

# ICDAR 2025 Competition on End-to-end Document Image Machine Translation (OCR-free Track)

This is the official repository for the [***ICDAR 2025 Competition on End-to-end Document Image Machine Translation (OCR-free Track)***](https://cip-documentai.github.io/).

## For Participants  
Participants are required to translate all images in **competition_testset_images.zip** into Simplified Chinese (zh-CN) using an end-to-end OCR-free method, and to fill in the **answer.json** file with the results.
The **answer.json** file should then be zipped and submitted to Codalab.  
In **answer.json**, each **key** is an image file name, and each **value** is the translation of that image as a single string (str) after word segmentation with [**jieba**](https://github.com/fxsjy/jieba) (`jieba.cut`).
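
The submission format described above can be sketched roughly as follows. This is a minimal, unofficial example and makes several assumptions that are not stated in this README: that the segmented tokens are joined with single spaces, that keys include the image file extension, and that the archive is named `answer.zip`. The function names `jieba_segment`, `build_answers`, and `write_submission` are hypothetical helpers introduced only for illustration.

```python
import json
import zipfile
from pathlib import Path
from typing import Callable, Dict


def jieba_segment(text: str) -> str:
    """Segment text with jieba and join the tokens into one string.

    Assumption: tokens are separated by single spaces; the README only
    says the value is a single string "after jieba cut".
    """
    import jieba  # third-party dependency: pip install jieba

    tokens = (tok.strip() for tok in jieba.cut(text))
    return " ".join(tok for tok in tokens if tok)


def build_answers(
    translations: Dict[str, str],
    segment: Callable[[str], str] = jieba_segment,
) -> Dict[str, str]:
    """Map each image file name to its segmented translation string."""
    return {name: segment(text) for name, text in translations.items()}


def write_submission(answers: Dict[str, str], out_dir: str) -> Path:
    """Write answer.json and zip it; returns the path of the archive."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)

    json_path = out / "answer.json"
    with json_path.open("w", encoding="utf-8") as f:
        # ensure_ascii=False keeps the Chinese text human-readable.
        json.dump(answers, f, ensure_ascii=False, indent=2)

    zip_path = out / "answer.zip"  # archive name is an assumption
    with zipfile.ZipFile(zip_path, "w", zipfile.ZIP_DEFLATED) as zf:
        zf.write(json_path, arcname="answer.json")
    return zip_path
```

With jieba installed, a call such as `write_submission(build_answers({"0001.png": model_output}), "submission")` would produce `submission/answer.zip`, where `0001.png` and `model_output` are placeholders for a real image name and your model's translation. Check the competition page for the exact key and archive conventions before submitting.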

## Training Dataset Download
The training dataset can be downloaded from this [Hugging Face link](https://huggingface.co/datasets/liangyupu/DoTA_dataset).
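
As an alternative to downloading through the browser, the dataset repository can probably also be fetched programmatically. The sketch below assumes the `huggingface_hub` package is installed and uses its `snapshot_download` function; the helper name `download_training_data` and the default target directory are hypothetical.

```python
from pathlib import Path

# Repository id taken from the dataset URL above.
REPO_ID = "liangyupu/DoTA_dataset"


def download_training_data(local_dir: str = "DoTA_dataset") -> Path:
    """Download a snapshot of the dataset repository into local_dir."""
    # Imported lazily so this module loads even without the dependency.
    from huggingface_hub import snapshot_download  # pip install huggingface_hub

    path = snapshot_download(
        repo_id=REPO_ID,
        repo_type="dataset",  # the URL points to a dataset repo, not a model
        local_dir=local_dir,
    )
    return Path(path)
```

Calling `download_training_data()` requires network access and enough disk space for the full dataset.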

## Baseline Implementation
This is an implementation of a simple end-to-end document image machine translation model with an image encoder and a translation decoder.
Details can be found in [***Document Image Machine Translation with Dynamic Multi-pre-trained Models Assembling (NAACL 2024 Main)***](https://aclanthology.org/2024.naacl-long.392/) Section 5.3 Base.