DSI LogoSADiLaR Logo
Clarin-ZA Logo
Search 
  •   SADiLaR
  • Language Resource Management Agency
  • Resource Catalogue
  • Search
  •   SADiLaR
  • Language Resource Management Agency
  • Resource Catalogue
  • Search
    • Login
    JavaScript is disabled for your browser. Some features of this site may not work without it.

    Browse

    All of SADiLaR

    Communities & CollectionsTitleProjectMedia type

    This Collection

    TitleProjectMedia type

    Filter

    Language

    Afrikaans (42)Dutch (2)English (35)isiNdebele (25)isiXhosa (30)isiZulu (32)Sepedi (2)Sesotho (26)Sesotho sa Leboa (Sepedi) (28)Setswana (29)Siswati (25)Tshivenda (26)Xitsonga (29)

    Collection

    Resource Catalogue (134)

    Media type

    Text (141)

    Project

    African Speech Technology (4)African Wordnet Project (5)Autshumato (21)Autshumato IV (1)Human Language Technology Audit 2017/18 (1)NCHLT Text (44)NCHLT Text II (22)NCHLT Text III (2)Parallel corpora for English into isiXhosa (2)SADiLaR Specialisation project: Multilingual wordlists in an academic context by the ICELDA node (1)

    Resource type

    Applications (2)Data (92)Modules (21)Tools (11)

    Database

    Monolingual Text Corpora: Annotated (21)Multilingual Text Corpora: Aligned (7)

    Search

    Show Advanced FiltersHide Advanced Filters

    Filters

    Use filters to refine the search results.

    Now showing items 1-10 of 141

    Filter options

    • Sort Options:
    • Relevance
    • Title Asc
    • Title Desc
    • Language Asc
    • Language Desc
    • Collection Asc
    • Collection Desc
    • Media type Asc
    • Media type Desc
    • Project Asc
    • Project Desc
    • Resource type Asc
    • Resource type Desc
    • Database Asc
    • Database Desc
    • Results Per Page:
    • 5
    • 10
    • 20
    • 40
    • 60
    • 80
    • 100
    Thumbnail

    Afribooms Afrikaans Dependency Treebank 

    Daniel van Niekerk (North-West University; Centre for Text Technology (CTexT); Katholieke Universiteit Leuven (Belgium), 2015-02-10) ~ Resource Catalogue
    This is the annotated corpus developed for Afrikaans for the Afribooms project. The corpus includes annotations for lemma, part-of-speech (POS) and ...
    Thumbnail

    NCHLT Afrikaans Annotated Text Corpora 

    Martin Puttkammer; Martin Schlemmer; Ruan Bekker (North-West University; Centre for Text Technology (CTexT), 2014-05-30) ~ Resource Catalogue
    Lemmatised, part of speech tagged and morphologically analysed corpora developed during the NCHLT Text project.
    Thumbnail

    NCHLT Afrikaans Text Corpora 

    Martin Puttkammer; Martin Schlemmer; Wikus Pienaar; Ruan Bekker (North-West University; Centre for Text Technology (CTexT), 2014-05-30) ~ Resource Catalogue
    Collection of source text documents, genre classified text documents, raw corpus, clean corpus, lexicon, frequency list and named-entity lists developed ...
    Thumbnail

    NCHLT Afrikaans Morphological Decomposer 

    Martin Puttkammer; Martin Schlemmer (North-West University; Centre for Text Technology (CTexT), 2014-05-30) ~ Resource Catalogue
    Morphological decomposer developed during the NCHLT Text project.
    Thumbnail

    NCHLT Siswati Morphological Decomposer 

    Martin Puttkammer; Martin Schlemmer (North-West University; Centre for Text Technology (CTexT), 2014-05-30) ~ Resource Catalogue
    Morphological decomposer developed during the NCHLT Text project.
    Thumbnail

    NCHLT Afrikaans Phrase Chunk Annotated Corpus 

    Gerhard van Huyssteen; Martin Puttkammer; E.B. Trollip; J.C. Liversage; Roald Eiselen (North-West University; Centre for Text Technology (CTexT), 2016-04-29) ~ Resource Catalogue
    Phrase chunk annotated data for the NCHLT Text Resource Development: Phase II Project. The phrase chunk annotated data is a subset of the 50,000 tokens ...
    Thumbnail

    NCHLT isiNdebele Phrase Chunk Annotated Corpus 

    K.S. Mahlangu; Roald Eiselen (North-West University; Centre for Text Technology (CTexT), 2016-04-29) ~ Resource Catalogue
    Phrase chunk annotated data for the NCHLT Text Resource Development: Phase II Project. The phrase chunk annotated data is a subset of the 50,000 tokens ...
    Thumbnail

    NCHLT Afrikaans Named Entity Annotated Corpus 

    Gerhard van Huyssteen; Martin Puttkammer; E.B. Trollip; J.C. Liversage; Roald Eiselen (North-West University; Centre for Text Technology (CTexT), 2016-04-29) ~ Resource Catalogue
    Named entity annotated data from the NCHLT Text Resource Development: Phase II Project, annotated with PERSON, LOCATION, ORGANISATION and MISCELLANEOUS tags.
    Thumbnail

    NCHLT isiNdebele Named Entity Annotated Corpus 

    K.S. Mahlangu; Roald Eiselen (North-West University; Centre for Text Technology (CTexT), 2016-04-29) ~ Resource Catalogue
    Named entity annotated data from the NCHLT Text Resource Development: Phase II Project, annotated with PERSON, LOCATION, ORGANISATION and MISCELLANEOUS tags.
    Thumbnail

    NCHLT isiNdebele Annotated Text Corpora 

    Martin Puttkammer; Martin Schlemmer; Ruan Bekker (North-West University; Centre for Text Technology (CTexT), 2014-05-30) ~ Resource Catalogue
    Lemmatised, part of speech tagged and morphologically analysed corpora developed during the NCHLT Text project.
    • View previous page

    • 1
    • 2
    • 3
    • 4
    • . . .
    • 15
    • View next page


    Copyright © 2018  SADiLaR. All Rights Reserved.
    Contact Us | Send Feedback
     

     


    Copyright © 2018  SADiLaR. All Rights Reserved.
    Contact Us | Send Feedback