Autshumato English-Sepedi Parallel Corpora
Aligned parallel corpora for the language pair English-Sepedi. The data is provided as two separate UTF-8 text files, one per language, with each aligned segment on its own line, so that corresponding lines in the two files form a translation pair. The data was specifically selected and formatted for training machine translation systems. Further clean-up and processing might be required, depending on the task the data is reused for.
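As a rough illustration of the format described above, the following Python sketch reads two line-aligned files into segment pairs. The function name `read_parallel` and the file paths passed to it are hypothetical and not part of the resource. Skipping pairs with an empty side is only one assumed example of the further clean-up that may be needed.

```python
from itertools import zip_longest


def read_parallel(english_path, sepedi_path):
    """Return (english, sepedi) segment pairs from two line-aligned UTF-8 files."""
    pairs = []
    with open(english_path, encoding="utf-8") as english_file, \
         open(sepedi_path, encoding="utf-8") as sepedi_file:
        # zip_longest pads the shorter file with None, which exposes a
        # length mismatch instead of silently truncating the longer file.
        lines = zip_longest(english_file, sepedi_file)
        for line_no, (english, sepedi) in enumerate(lines, start=1):
            if english is None or sepedi is None:
                raise ValueError(f"files differ in length at line {line_no}")
            english, sepedi = english.strip(), sepedi.strip()
            # Assumed clean-up step: drop pairs where either side is empty.
            if english and sepedi:
                pairs.append((english, sepedi))
    return pairs
```

Calling `read_parallel("english.txt", "sepedi.txt")` with the actual file names from the download would return a list of tuples that can be passed to a tokenizer or a training pipeline. Raising an error on a length mismatch is a deliberate choice here, since a single missing line would otherwise shift every later pair out of alignment.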
Contact person: Sunny Gent
Contact person's e-mail: firstname.lastname@example.org
CTexT® (Centre for Text Technology, North-West University)