SindBERT, the Sailor: Charting the Seas of Turkish NLP Paper • 2510.21364 • Published Oct 24, 2025 • 1
lang-uk/ukr-clip-vit-h-14-frozen-xlm-roberta-large-laion5B-s13B-b90k Zero-Shot Image Classification • Updated Oct 17, 2025 • 2
lang-uk/ukr-clip-vit-h-14-frozen-xlm-roberta-large-laion5B-s13B-b90k Zero-Shot Image Classification • Updated Oct 17, 2025 • 2
The German Commons - 154 Billion Tokens of Openly Licensed Text for German Language Models Paper • 2510.13996 • Published Oct 15, 2025 • 8
lang-uk/ukr-clip-vit-h-14-frozen-xlm-roberta-large-laion5B-s13B-b90k Zero-Shot Image Classification • Updated Oct 17, 2025 • 2
Introducing OmniGEC: A Silver Multilingual Dataset for Grammatical Error Correction Paper • 2509.14504 • Published Sep 18, 2025
OmniGEC Collection This is a collection of multilingual silver-standard datasets and models for the task of Grammatical Error Correction (GEC). • 9 items • Updated Sep 19, 2025 • 8
Llama-GENBA-10B: A Trilingual Large Language Model for German, English and Bavarian Paper • 2509.05668 • Published Sep 6, 2025 • 5