# Lius：基于持续指令微调的古邦马来语翻译模型

- 来源：HuggingFace Daily Papers（社区热门论文）
- 发布时间：2026-06-10 08:00
- AIHOT 分数：54
- AIHOT 链接：https://aihot.virxact.com/items/cmq8ws2lk06ioslld40b2rxam
- 原文链接：https://arxiv.org/abs/2606.11786

## AI 摘要

大语言模型在处理低资源语言翻译时性能常下降。研究团队针对古邦马来语提出一种微调方法：利用双语词典的显式词汇与语义特征设计指令集，并引入持续指令微调（CIT）范式。实验结果表明，模型Lius在多项评测指标上比标准指令微调模型提升4–6个百分点，超越神经机器翻译（NMT）和多语言LLM模型10–13个百分点，展现出减少对大规模平行数据依赖的潜力。

## 正文

Large Language Models (LLMs) offer new potential for translation tasks but often experience performance degradation when handling low-resource languages. To address this limitation, we propose an approach for fine-tuning LLMs on a low-resource language, Kupang Malay. Our approach involves designing a set of instructions by leveraging explicit lexical and semantic features from a bilingual dictionary, and introducing Continual Instruction Tuning (CIT), a training paradigm that enables iterative instruction-based training. Experimental results demonstrate that our model, named Lius, yields notable improvements over standard instruction-tuned models by outperforming 4-6 points, and surpassing both Neural Machine Translation (NMT) and Multilingual LLM models by 10-13 points on several evaluation metrics. These findings highlight the potential of our approach to mitigate the reliance on large-scale parallel data in low-resource language translation.
