Three Things to Help Improve Low-Resource Language AI, According to Stanford – slator.com

A new white paper on large language model (LLM) development, the work of Stanford University with The Asia Foundation and the University of Pretoria, discusses best practices for improving the LLM landscape for low-resource languages. In an April 22, 2025 press release, Stanford’s Human-Centered Artificial Intelligence (HAI) explains the “digital divide” of LLM development, in […]
Lessons from AI Translation to Improve Multilingual LLM Evaluation – slator.com

As large language models (LLMs) continue to scale across languages, their evaluation frameworks are struggling to keep pace. Two recent studies — one from Alibaba and academic partners, the other from a collaboration between Cohere and Google — highlight critical challenges in multilingual LLM evaluation. “As large language models continue to advance in linguistic capabilities, […]
Multi-Agent AI Framework Aims to Improve Cultural Adaptation in AI Translation – slator.com

In a March 5, 2025 paper, researchers from Shahjalal University of Science and Technology and the University of Oklahoma proposed a multi-agent AI framework for culturally adaptive AI translation, particularly for low-resource languages. This multi-agent approach comes as the translation industry increasingly explores the limitless opportunities that agents can offer, along with the huge potential […]
How France’s Inria Aims to Improve AI Translation for Low-Resource Languages – slator.com

Large language models (LLMs) have significantly improved AI translation for high-resource languages, but performance remains uneven for low-resource languages (LRLs). In a March 6, 2025 paper, researchers Armel Zebaze, Benoît Sagot, and Rachel Bawden from Inria, the French National Institute for Research in Digital Science and Technology, introduced Compositional Translation (CompTra), an LLM-based approach designed […]
Google Expands Low-Resource AI Translation with SMOL Dataset – slator.com

On February 17, 2025, Google released SMOL (Set of Maximal Overall Leverage), a dataset translated by professional translators aimed at improving machine translation (MT) for 115 low-resource languages (LRLs). SMOL consists of two components: SMOLSENT, a collection of 863 English sentences translated into 81 languages, and SMOLDOC, a dataset of 584 English documents translated into […]