Project: Autshumato II

Type: Frequency word list
Language: Xitsonga (ts_ZA, tso_ZA)
Date: 2014-10-21
Version: 1.0.5

Description: 
A list of the most frequent Xitsonga words as deliverable of the Autshumato project.
The data is given in two separate files:
- List.XitsongaFrequencyWordList.1.0.5.2014-10-21.csv
  Comma separated values files which contain all of the word with their frequencies, each on a new line;
  ordered from most to least frequent and then alphabetically.
  
- List.XitsongaFrequencyWordList.1.0.5.2014-10-21.txt
  Plain text file containing all of the most frequent words, each on a new line; 
  ordered from most to least frequent and then alphabetically.

All exact duplicates have been removed.
The data might contain Named Entities.

Content:
17 907 Xitsonga words.

Source(s):
Existing NCHLT Text data and Sourced translation of Government domain data.

_________________________________________________________________________________
Licence: Creative Commons Attribution Non-Commercial ShareAlike 2.5 South Africa
 
URL: http://creativecommons.org/licenses/by-nc-sa/2.5/za/
 
Attribute work to: 
	CTexT (Centre for Text Technology, North-West University), South Africa; 
	Department of Arts and Culture, South Africa.
Attribute work to URL:	
	http://autshumato.sourceforge.net/ and 
	http://www.nwu.ac.za/ctext
_________________________________________________________________________________