Dataset Card for "bashkir-russian-parallel-corpora"
 
 
  How the dataset was assembled.
 
 
  find the text in two languages. it can be a translated book or an internet page (wikipedia, news site)
 
 
  our algorithm tries to match Bashkir sentences with their translation in Russian
 
 
  We give these pairs to people to check
 
 @inproceedings{
title={Bashkir-Russian parallel corpora},
author={Iskander Shakirov, Aigiz Kunafin},
year={2023}
}