Vid2Seq: Large-Scale Pretraining of a Visual Language Model for Dense Video Captioning • Paper • 2302.14115 • Published Feb 27, 2023