DSI LogoSADiLaR Logo
Clarin-ZA Logo
Search 
  •   SADiLaR
  • Language Resource Management Agency
  • Search
  •   SADiLaR
  • Language Resource Management Agency
  • Search
    • Login
    JavaScript is disabled for your browser. Some features of this site may not work without it.

    Browse

    All of SADiLaR

    Communities & CollectionsTitleProjectMedia type

    This Community

    TitleProjectMedia type

    Filter

    Language

    Afrikaans (22)Dutch (2)English (19)isiNdebele (22)isiXhosa (22)isiZulu (22)Sesotho (22)Sesotho sa Leboa (Sepedi) (22)Setswana (22)
    Siswati (36)
    Tshivenda (22)Xitsonga (22)Yoruba (2)

    Collection

    Resource Catalogue (36)

    Media type

    Speech (13)Text (23)

    Project

    Autshumato (5)Autshumato IV (1)Lwazi (6)Lwazi II (1)NCHLT Speech (4)NCHLT Text (7)NCHLT Text II (4)NCHLT Text III (2)

    Resource type

    Applications (1)Data (13)Modules (3)Tools (17)

    Search

    Show Advanced FiltersHide Advanced Filters

    Filters

    Use filters to refine the search results.

    Now showing items 31-36 of 36

    Filter options

    • Sort Options:
    • Relevance
    • Title Asc
    • Title Desc
    • Language Asc
    • Language Desc
    • Media type Asc
    • Media type Desc
    • Resource type Asc
    • Resource type Desc
    • Results Per Page:
    • 5
    • 10
    • 20
    • 40
    • 60
    • 80
    • 100
    Thumbnail

    Lwazi Siswati ASR corpus 

    Charl van Heerden; Etienne Barnard; Jaco Badenhorst; Marelie Davel (Meraka Institute, CSIR, 2013-04-02) ~ Resource Catalogue
    Complete audio recordings and orthographic transcriptions used for Lwazi speech recognition systems.
    Thumbnail

    Autshumato Machine Translation Evaluation Set 

    McKellar, Cindy Arlene (North-West University; Centre for Text Technology (CTexT); Department of Arts and Culture, South Africa, 2017-12-15) ~ Resource Catalogue
    Comparable evaluation data for use in automatic machine translation evaluations. The evaluation set consists of 500 sentences translated separately by ...
    Thumbnail

    NCHLT Siswati Auxiliary Speech Corpus 

    Febe de Wet; Laura Martinus; Jaco Badenhorst (CSIR Meraka Institute; North-West University, 2019-06-01) ~ Resource Catalogue
    The corpus contains orthographically transcribed broadband speech in each of South Africa's eleven official languages. Transcriptions are provided in ...
    Thumbnail

    CTexTools 2 

    Eiselen, Roald; Puttkammer, Martin; Hocking, Justin; Kruger, Albertus (North-West University, Centre for Text Technology (CTexT); South African Department of Arts and Culture, 2018-05-24) ~ Resource Catalogue
    CTexTools is a corpus query and manipulation tool primarily for the official South African languages. The tool supports the creation of frequency and ...
    Thumbnail

    NCHLT Tagger 

    Roald Eiselen (North-West University; Centre for Text Technology (CTexT), 2016-04-29) ~ Resource Catalogue
    A graphical user interface and command line tool to automatically annotate running text with one or more linguistic tags:\n* Part of Speech\n* Named ...
    Thumbnail

    NCHLT Siswati Lemmatiser 

    Martin Puttkammer; Martin Schlemmer; Ruan Bekker (North-West University; Centre for Text Technology (CTexT), 2014-05-30) ~ Resource Catalogue
    Lemmatiser developed during the NCHLT Text project. \n\n Available in the Readme.txt - Input format: Text data (encoding: UTF8 without BOM), one ...
    • View previous page

    • 1
    • 2
    • 3
    • 4

    Copyright © 2018  SADiLaR. All Rights Reserved.
    Contact Us | Send Feedback
     

     


    Copyright © 2018  SADiLaR. All Rights Reserved.
    Contact Us | Send Feedback