Collection: PDF Document / OCR Datasets. Document datasets with .pdf files that are usable with pixparse libraries and tools. • 2 items • Updated Mar 30
Article: Extending the Massive Text Embedding Benchmark to French: the datasets. By lyon-nlp-group • Jan 12