DSI LogoSADiLaR Logo
Clarin-ZA Logo
Search 
  •   SADiLaR
  • Language Resource Management Agency
  • Search
  •   SADiLaR
  • Language Resource Management Agency
  • Search
    • Login
    JavaScript is disabled for your browser. Some features of this site may not work without it.

    Browse

    All of SADiLaR

    Communities & CollectionsTitleProjectMedia type

    This Community

    TitleProjectMedia type

    Filter

    Language

    Afrikaans (2)English (2)isiNdebele (2)isiXhosa (2)isiZulu (2)Sesotho (2)Sesotho sa Leboa (Sepedi) (2)Setswana (2)
    Siswati (13)
    Tshivenda (2)Xitsonga (2)

    Collection

    Resource Catalogue (13)

    Media type

    Speech (6)Text (7)

    Project

    Autshumato (1)Lwazi (3)Lwazi II (1)NCHLT Speech (2)NCHLT Text (2)NCHLT Text II (2)

    Resource type

    Data (13)

    Search

    Show Advanced FiltersHide Advanced Filters

    Filters

    Use filters to refine the search results.

    Now showing items 1-10 of 13

    Filter options

    • Sort Options:
    • Relevance
    • Title Asc
    • Title Desc
    • Language Asc
    • Language Desc
    • Media type Asc
    • Media type Desc
    • Resource type Asc
    • Resource type Desc
    • Results Per Page:
    • 5
    • 10
    • 20
    • 40
    • 60
    • 80
    • 100
    Thumbnail

    NCHLT Siswati Speech Corpus 

    Charl van Heerden; Etienne Barnard; Jaco Badenhorst; Marelie Davel; Alta de Waal (Meraka Institute, CSIR; North-West University, 2014-07-08) ~ Resource Catalogue
    Orthographically transcribed broadband speech corpus of approximately 56 hours, including a test suite of 8 speakers.
    Thumbnail

    Lwazi Siswati TTS corpus 

    Daniel van Niekerk; Etienne Barnard; Marelie Davel; Aby Louw; Alta de Waal (Meraka Institute, CSIR, 2013-03-27) ~ Resource Catalogue
    Orthographic and phonemically aligned transcriptions
    Thumbnail

    Lwazi Siswati Pronunciation Dictionary 

    Marelie Davel (Meraka Institute, CSIR, 2013-04-01) ~ Resource Catalogue
    General phonemic pronunciations for frequently occurring words in SA languages. Dictionaries were developed to be practically usable for speech technology ...
    Thumbnail

    NCHLT Siswati Phrase Chunk Annotated Corpus 

    B.B. Malangwane; M.N. Kekana; S.S. Sedibe; B.C. Ndhlovu; Roald Eiselen (North-West University; Centre for Text Technology (CTexT), 2016-04-29) ~ Resource Catalogue
    Phrase chunk annotated data for the NCHLT Text Resource Development: Phase II Project. The phrase chunk annotated data is a subset of the 50,000 tokens ...
    Thumbnail

    NCHLT Siswati Named Entity Annotated Corpus 

    B.B. Malangwane; M.N. Kekana; S.S. Sedibe; B.C. Ndhlovu; Roald Eiselen (North-West University; Centre for Text Technology (CTexT), 2016-04-29) ~ Resource Catalogue
    Named entity annotated data from the NCHLT Text Resource Development: Phase II Project, annotated with PERSON, LOCATION, ORGANISATION and MISCELLANEOUS tags.
    Thumbnail

    NCHLT Siswati Annotated Text Corpora 

    Martin Puttkammer; Martin Schlemmer; Ruan Bekker (North-West University; Centre for Text Technology (CTexT), 2014-05-30) ~ Resource Catalogue
    Lemmatised, part of speech tagged and morphologically analysed corpora developed during the NCHLT Text project.
    Thumbnail

    NCHLT Siswati Text Corpora 

    Martin Puttkammer; Martin Schlemmer; Wikus Pienaar; Ruan Bekker (North-West University; Centre for Text Technology (CTexT), 2014-05-30) ~ Resource Catalogue
    Collection of source text documents, genre classified text documents, raw corpus, clean corpus, lexicon, frequency list and named-entity lists developed ...
    Thumbnail

    NCHLT-inlang Pronunciation Dictionaries 

    Marelie Davel (Meraka Institute, CSIR; North-West University, 2014-07-04) ~ Resource Catalogue
    Broad phonemic transcriptions for 15,000 generic words in each of 11 languages. Each dictionary has an associated rule set for generating pronunciations ...
    Thumbnail

    Siswati Custom Dictionary for Government Domain 

    Martin Puttkammer; Nico Oosthuizen; Wikus Pienaar (North-West University; Centre for Text Technology (CTexT), 2013-02-22) ~ Resource Catalogue
    Word list developed as a custom dictionary for use in the spelling checkers as part of the spelling checker project for the Department of Arts and ...
    Thumbnail

    Siswati Genre Classification Corpus 

    Gerhard van Huyssteen; D.P. Snyman (Trifonius, 2013-06-19) ~ Resource Catalogue
    Contains training and testing data for Genre Classification for Siswati.
    • View previous page

    • 1
    • 2
    • View next page


    Copyright © 2018  SADiLaR. All Rights Reserved.
    Contact Us | Send Feedback
     

     


    Copyright © 2018  SADiLaR. All Rights Reserved.
    Contact Us | Send Feedback