Tom Aarsen

tomaarsen

AI & ML interests

NLP: text embeddings, information retrieval, named entity recognition, few-shot text classification

Articles

Organizations

tomaarsen's activity

upvoted an article 8 days ago
view article
Article

Releasing Common Corpus: the largest public domain dataset for training LLMs

17
upvoted an article 14 days ago
upvoted an article 16 days ago
view article
Article

Visually Multilingual: Introducing mcdse-2b

By marco
37
upvoted 2 articles 21 days ago
view article
Article

Releasing Outlines-core 0.1.0: structured generation in Rust and Python

39
view article
Article

Transformers.js v3: WebGPU support, new models & tasks, and more…

58
upvoted an article 22 days ago
upvoted an article 23 days ago
view article
Article

MedEmbed: Fine-Tuned Embedding Models for Medical / Clinical IR

By abhinand
30
upvoted an article 25 days ago
upvoted 2 articles 26 days ago
view article
Article

How to build a custom text classifier without days of human labeling

By sdiazlor
54
view article
Article

How to optimize your data labelling project with custom interfaces

18