machine-learning

Contrastive Clustering to Mine Pseudo Parallel Data for Unsupervised Translation

Fully unsupervised mining method that can built synthetic parallel data for unsupervised machine translation.

Cross-model Back-translated Distillation for Unsupervised Machine Translation

A novel strategy to improve unsupervised MT by using back-translation with multiple models.

Data Diversification: A Simple Strategy For Neural Machine Translation

A simple way to boost many NMT tasks by using multiple backward and forward models.

Tree-Structured Attention with Hierarchical Accumulation

A novel attention mechanism that aggregates hierarchical structures to encode constituency trees for downstream tasks.