Subword models struggle with word learning, but surprisal hides it Paper • 2502.12835 • Published 19 days ago
Small Language Models Also Work With Small Vocabularies: Probing the Linguistic Abilities of Grapheme- and Phoneme-Based Baby Llamas Paper • 2410.01487 • Published Oct 2, 2024