Can I use a pre-trained RoBERTa checkpoint as the starting point for pre-training a Longformer model, using the Hugging Face implementations?