FiD model trained on NQ
-- This is the model checkpoint of FiD [2], based on the T5 (with 3B parameters) and trained on the NQ dataset [1].
-- Hyperparameters: 8 x 40GB A100 GPUs; batch size 8; AdamW; LR 3e-5; 30000 steps
References:
[1] Natural Questions: A Benchmark for Question Answering Research. TACL 2019.
[2] Leveraging Passage Retrieval with Generative Models for Open Domain Question Answering. EACL 2021.
Model performance
We evaluate it on the NQ dataset, the EM score is 50.6 on the test set.
- Downloads last month
- 9