AppTek Company Overview

Recent Academic Research and Publications

Improving Language Model Integration for Neural Machine Translation

June 2023

Christian Herold, Yingbo Gao, Mohammad Zeineldeen, Hermann Ney

The integration of language models for neural machine translation has been extensively studied in the past. It has been shown that an external language model, trained on additional target-side monolingual data, can help improve translation quality. However, there has always been the assumption that the translation model also learns an implicit target-side language model during training, which interferes with the external language model at decoding time. Recently, some works on automatic speech recognition have demonstrated that, if the implicit language model is neutralized in decoding, further improvements can be gained when integrating an external language model. In this work, we transfer this concept to the task of machine translation and compare with the most prominent way of including additional monolingual data, namely back-translation. We find that accounting for the implicit language model significantly boosts the performance of language model fusion, although this approach is still outperformed by back-translation.
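As a rough illustration of the idea in the abstract, one common way to combine these models at decoding time is a log-linear combination: the external language model's log-probability is added with a scale, and an estimate of the implicit language model's log-probability is subtracted with a second scale. The sketch below assumes this form; the abstract does not state the exact formulation used in the paper. The function names, the toy probabilities, and the scale values are all hypothetical.

```python
import math


def fused_score(log_p_tm, log_p_lm, log_p_ilm, lm_scale, ilm_scale):
    """Log-linear fusion of one token's log-probabilities.

    log_p_tm  : translation model log-probability (given source and prefix)
    log_p_lm  : external language model log-probability (given prefix)
    log_p_ilm : estimated implicit language model log-probability (given prefix)
    The scales are tunable weights; the defaults used below are made up.
    """
    return log_p_tm + lm_scale * log_p_lm - ilm_scale * log_p_ilm


def pick_next_token(tm_probs, lm_probs, ilm_probs, lm_scale=0.5, ilm_scale=0.3):
    """Return the best-scoring candidate token and all fused scores."""
    scores = {
        tok: fused_score(
            math.log(tm_probs[tok]),
            math.log(lm_probs[tok]),
            math.log(ilm_probs[tok]),
            lm_scale,
            ilm_scale,
        )
        for tok in tm_probs
    }
    best = max(scores, key=scores.get)
    return best, scores


if __name__ == "__main__":
    # Toy next-token distributions over three candidates (invented numbers).
    tm_probs = {"house": 0.5, "home": 0.4, "hut": 0.1}
    lm_probs = {"house": 0.2, "home": 0.7, "hut": 0.1}
    ilm_probs = {"house": 0.6, "home": 0.3, "hut": 0.1}

    # Scales of zero reduce the fusion to the translation model alone.
    print(pick_next_token(tm_probs, lm_probs, ilm_probs, 0.0, 0.0)[0])
    # With fusion and implicit-LM subtraction the ranking can change.
    print(pick_next_token(tm_probs, lm_probs, ilm_probs)[0])
```

In a real decoder this scoring would be applied to every hypothesis extension inside beam search, and both scales would be tuned on held-out data. Back-translation, the baseline the abstract compares against, instead changes the training data and needs no extra model at decoding time.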

Recent Academic Research and Publications

Improving Language Model Integration for Neural Machine Translation

Take the Hint: Improving Diacritization with Partially-Diacritized Text

Improving And Analyzing Neural Speaker Embeddings for ASR

Self-Normalized Importance Sampling for Neural Language Modeling

Efficient Training of Neural Transducer for Speech Recognition

Automatic Learning of Subword Dependent Model Scales

Improving the Training Recipe for a Robust Conformer-based Hybrid Model

Equivalence of Segmental and Neural Transducer Modeling: A Proof of Concept

On Sampling-Based Training Criteria for Neural Language Modeling

The Impact of ASR on the Automatic Analysis of Linguistic Complexity and Sophistication in Spontaneous L2 Speech

Comparing the Benefit of Synthetic Training Data for Various Automatic Speech Recognition Architectures

Acoustic Data-Driven Subword Modeling for End-to-End Speech Recognition

Librispeech Transducer Model with Internal Language Model Prior Correction

A New Training Pipeline for an Improved Neural Transducer

Early Stage LM Integration Using Local and Global Log-Linear Combination

Robust Beam Search for Encoder-Decoder Attention Based Speech Recognition without Length Bias

LSTM Language Models for LVCSR in First-Pass Decoding and Lattice-Rescoring

Effective Cross-lingual Transfer of Neural Machine Translation Models without Shared Vocabularies

Learning Bilingual Sentence Embeddings via Autoencoding and Computing Similarities with a Multilayer Perceptron

Language Modeling with Deep Transformers

Analysis of Deep Clustering as Preprocessing for Automatic Speech Recognition of Sparsely Overlapping Speech

Weakly Supervised Learning with Multi-Stream CNN-LSTM-HMMs to Discover Sequential Parallelism in Sign Language Videos
