Releases: gorkemozkaya/nmt-en-tr
Pretrained en->tr and tr->en NMT models, trained with TF2
The models were re-trained using TensorFlow 2 with a larger training corpus than the prior release. Please see this notebook for loading these checkpoints; a minimal sketch is also given below.
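As a rough illustration of the TF2 checkpoint-restore mechanism, here is a minimal sketch. The stand-in model and the checkpoint path are placeholders, not the actual architecture; the linked notebook remains the authoritative reference.

```python
import tensorflow as tf

# Stand-in model: the real architecture is the Transformer defined in the
# notebook linked above; any tf.keras.Model restores the same way.
model = tf.keras.Sequential([tf.keras.layers.Dense(8)])

# Restore the latest TF2 checkpoint from a directory of checkpoint files.
# "checkpoints/en-tr" is a hypothetical path; point it at the directory
# where the downloaded release assets were extracted.
ckpt = tf.train.Checkpoint(model=model)
latest = tf.train.latest_checkpoint("checkpoints/en-tr")
if latest is not None:
    ckpt.restore(latest).expect_partial()
```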
blended_dataset
Releasing a blended dataset created from a combination of four different parallel corpora. The preparation code is available here. The most dominant source is the OpenSubtitles en/tr corpus, which is downsampled to 10% of its original size; the other corpora are used in full. A sketch of this kind of blending follows.
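For illustration, a minimal version of this blending in plain Python might look like the following. The file names are hypothetical stand-ins; the linked preparation code is the actual recipe.

```python
import random

random.seed(0)  # make the downsampling reproducible

def read_lines(path):
    """Read a parallel corpus file as a list of lines."""
    with open(path, encoding="utf-8") as f:
        return f.read().splitlines()

# Hypothetical file names for the four parallel corpora.
open_subtitles = read_lines("opensubtitles.en-tr.tsv")
other_corpora = [read_lines(p)
                 for p in ("bianet.en-tr.tsv", "ted.en-tr.tsv",
                           "fourth_corpus.en-tr.tsv")]

# Keep ~10% of OpenSubtitles, 100% of the other corpora, then shuffle.
blended = [line for line in open_subtitles if random.random() < 0.10]
for corpus in other_corpora:
    blended.extend(corpus)
random.shuffle(blended)
```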
bianet_and_ted_corpora
Adding two additional corpora (Bianet and TED) that are used in the latest version of the neural machine translation model.
pretrained_models
This release contains the pre-trained model weights for the English->Turkish and Turkish->English translation models. Documentation for loading these models will follow soon.
Raw data
The raw data used for training these NMT models. These files were downloaded from http://opus.nlpl.eu and are included here so the models can be reproduced.
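For reference, OPUS corpora can also be fetched programmatically. The URL below follows OPUS's common download pattern but is an assumption; check http://opus.nlpl.eu for the exact corpus name, version, and path.

```python
import urllib.request

# Hypothetical URL in the usual OPUS layout (corpus/version/format/pair);
# verify the exact path on the OPUS website before downloading.
url = ("https://object.pouta.csc.fi/OPUS-OpenSubtitles/v2018/"
       "moses/en-tr.txt.zip")
urllib.request.urlretrieve(url, "opensubtitles-en-tr.zip")
```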