Instruction-Tuned Large Language Models Outperform Baselines in Medical Translation

September 10, 2024


In an August 29, 2024 paper, Miguel Rios from the University of Vienna explored how instruction-tuned large language models (LLMs) can improve machine translation (MT) in specialized fields, particularly in the medical domain.

Rios noted that while state-of-the-art LLMs have shown promising results for high-resource language pairs and domains, they often struggle with accuracy and consistency in specialized, low-resource domains. “In specialized domains (e.g. medical) LLMs have shown lower performance compared to standard neural machine translation models,” Rios said.

He also explained that the limitations of LLMs in low-resource domains stem from their training data, which may not adequately cover the specific terminology and contextual nuances required for effective translation.

To address this challenge, Rios proposed improving LLMs’ performance by incorporating specialized terminology through instruction tuning — a technique where models are fine-tuned using datasets from various tasks formatted as instructions. “Our goal is to incorporate terminology, syntax information, and document structure constraints into a LLM for the medical domain,” he said.

Specifically, Rios suggested including medical terms as part of the instructions given to the LLM. When translating a segment, the model is provided with relevant medical terms that should be used in the translation.
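The article does not reproduce the paper's exact prompt wording, but an instruction of this kind might look roughly like the following sketch (the template text and the term pair are illustrative assumptions, not the paper's actual template):

```python
# Illustrative terminology-aware prompt; wording and the term pair shown
# here are assumptions, not the template used in the paper.
prompt = (
    "Translate the following English text into Spanish.\n"
    "Use the following medical terms in the translation:\n"
    '"myocardial infarction" -> "infarto de miocardio"\n\n'
    "Text: Patients with a prior myocardial infarction were excluded.\n"
    "Translation:"
)
print(prompt)
```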


Additionally, the approach involves identifying pairs of terms — source and corresponding target terms — that are relevant to the text being translated, ensuring the correct medical terminology is applied to these segments during translation.

If one or more candidate terms are successfully matched in a segment, they are incorporated into the instruction template provided to the LLM. This means the model receives a prompt that not only instructs it to translate the text but also specifies which medical terms to use.

If no matching candidate terms are found, the model is given a basic translation task prompt, instructing it to translate the text without any specific medical terminology guidance.
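A minimal sketch of that conditional logic might look like the following, assuming a simple bilingual term list and exact-match lookup (the terminology resource, matching method, and prompt wording are assumptions for illustration, not details confirmed by the article):

```python
import re

# Hypothetical bilingual term list (source term -> target term); the actual
# medical terminology resource used in the paper is not named in the article.
TERM_PAIRS = {
    "myocardial infarction": "infarto de miocardio",
    "adverse reaction": "reacción adversa",
}

def build_prompt(source_text: str, src_lang: str = "English", tgt_lang: str = "Spanish") -> str:
    """Return a terminology-aware prompt if any candidate term matches the
    segment, otherwise a basic translation prompt (a sketch of the logic
    described above, not the paper's exact template)."""
    matched = {
        src: tgt
        for src, tgt in TERM_PAIRS.items()
        if re.search(rf"\b{re.escape(src)}\b", source_text, flags=re.IGNORECASE)
    }
    if matched:
        term_lines = "\n".join(f'"{s}" -> "{t}"' for s, t in matched.items())
        return (
            f"Translate the following {src_lang} text into {tgt_lang}.\n"
            f"Use the following medical terms in the translation:\n{term_lines}\n\n"
            f"Text: {source_text}\nTranslation:"
        )
    return (
        f"Translate the following {src_lang} text into {tgt_lang}.\n\n"
        f"Text: {source_text}\nTranslation:"
    )

print(build_prompt("Patients with a prior myocardial infarction were excluded."))
```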

Unbabel’s Tower Takes the Lead

For the experiments, Rios utilized Google’s FLAN-T5, Meta’s LLaMA-3-8B, and Unbabel’s Tower-7B as baseline models, applying QLoRA for parameter-efficient fine-tuning, and tested them across English-Spanish, English-German, and English-Romanian language pairs.
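The article does not give the training configuration, but QLoRA fine-tuning of a causal LLM typically looks like the following sketch with Hugging Face transformers and peft (the checkpoint name and hyperparameters are assumptions, not values reported in the paper; FLAN-T5, being an encoder-decoder model, would use AutoModelForSeq2SeqLM instead):

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

model_id = "Unbabel/TowerBase-7B-v0.1"  # assumed Hub checkpoint name; verify before use

# Load the base model in 4-bit (the "Q" in QLoRA) to reduce memory use.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, quantization_config=bnb_config, device_map="auto"
)
model = prepare_model_for_kbit_training(model)

# Attach small trainable LoRA adapters; the frozen 4-bit base weights stay fixed.
lora_config = LoraConfig(
    r=16,                         # illustrative rank, not the paper's setting
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()

# The instruction-formatted translation data would then be passed to a
# standard supervised fine-tuning loop (e.g. a Trainer); omitted here.
```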

The results revealed that the instruction-tuned models “significantly” outperformed the baselines in terms of automatic metrics such as BLEU, chrF, and COMET scores. Specifically, the Tower-7B model showed the best performance in English-Spanish and English-German translations, followed by LLaMA-3-8B, which demonstrated strong performance in English-Romanian translations.
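As a rough illustration of how those automatic metrics are commonly computed, the sketch below uses the sacrebleu and unbabel-comet packages; the example sentences and the COMET checkpoint named here are assumptions (wmt22-comet-da is a widely used default), not necessarily what the paper used:

```python
import sacrebleu
from comet import download_model, load_from_checkpoint

sources = ["Patients with a prior myocardial infarction were excluded."]
hypotheses = ["Se excluyó a los pacientes con un infarto de miocardio previo."]
references = ["Se excluyeron los pacientes con infarto de miocardio previo."]

# Corpus-level BLEU and chrF via sacrebleu.
bleu = sacrebleu.corpus_bleu(hypotheses, [references])
chrf = sacrebleu.corpus_chrf(hypotheses, [references])
print(f"BLEU: {bleu.score:.2f}  chrF: {chrf.score:.2f}")

# COMET is a learned metric scored by a pretrained checkpoint.
model = load_from_checkpoint(download_model("Unbabel/wmt22-comet-da"))
data = [{"src": s, "mt": h, "ref": r} for s, h, r in zip(sources, hypotheses, references)]
print("COMET:", model.predict(data, batch_size=8, gpus=0).system_score)
```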

Talking to Slator, Rios expressed his intention to perform a manual evaluation with professional translators in the future, as automated metrics alone may not fully capture how well the models generate the correct medical terms in their translations.


