Multi-domain machine translation system with training data clustering and dynamic domain adaptation
Abstract
A machine translation system capable of clustering training data and performing dynamic domain adaptation is disclosed. An unsupervised domain clustering process is utilized to identify domains in general training data that can include in-domain training data and out-of-domain training data. Segments in the general training data are then assigned to the domains in order to create domain-specific training data. The domain-specific training data is then utilized to create domain-specific language models, domain-specific translation models, and domain-specific model weights for the domains. An input segment to be translated can be assigned to a domain at translation time. The domain-specific model weights for the assigned domain can be utilized to translate the input segment.
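The flow described in the abstract (cluster without labels, assign segments to domains, then pick domain-specific weights at translation time) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration and is not taken from the patent: the bag-of-words features, the cosine-similarity k-means-style clustering, the function names, and the weight values are all hypothetical. The actual translation step is out of scope; the sketch stops at selecting the weights.

```python
import math
from collections import Counter


def vectorize(segment):
    """Bag-of-words term counts (an assumed, deliberately simple featurization)."""
    return Counter(segment.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(count * b[term] for term, count in a.items() if term in b)
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def assign_domain(segment, centroids):
    """Return the index of the centroid most similar to the segment."""
    vec = vectorize(segment)
    return max(range(len(centroids)), key=lambda d: cosine(vec, centroids[d]))


def cluster_segments(segments, seed_indices, iterations=5):
    """Toy k-means-style clustering; no domain labels are supplied."""
    centroids = [vectorize(segments[i]) for i in seed_indices]
    for _ in range(iterations):
        labels = [assign_domain(s, centroids) for s in segments]
        for d in range(len(centroids)):
            members = [vectorize(s) for s, lab in zip(segments, labels) if lab == d]
            if members:
                # Summed counts suffice: cosine similarity ignores vector length.
                centroids[d] = sum(members, Counter())
    labels = [assign_domain(s, centroids) for s in segments]
    return labels, centroids


def select_weights(segment, centroids, domain_weights):
    """Assign an input segment to a domain and look up that domain's weights."""
    domain = assign_domain(segment, centroids)
    return domain, domain_weights[domain]


# Hypothetical general training data mixing two unlabeled domains.
training = [
    "the patient received a dose of the drug",
    "the drug reduced the patient fever",
    "the court dismissed the appeal",
    "the judge heard the appeal in court",
]
labels, centroids = cluster_segments(training, seed_indices=[0, 2])

# Hypothetical per-domain model weights; real values would come from tuning.
domain_weights = {
    0: {"language_model": 0.6, "translation_model": 0.4},
    1: {"language_model": 0.3, "translation_model": 0.7},
}
domain, weights = select_weights("the judge dismissed the appeal", centroids, domain_weights)
print(labels, domain, weights)
```

In a fuller system, each cluster's segments would also be used to train the domain-specific language and translation models that the abstract mentions; the sketch only models the clustering and the translation-time weight lookup.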