Department of Science, Technology and Innovation
CLARIN in South Africa

Annotated Short Mystery Novels Data Set

dc.contact.email: menno.vanzaanen@nwu.ac.za
dc.contact.name: Menno van Zaanen
dc.contributor.author: Heyns, Nuette
dc.contributor.author: van Zaanen, Menno
dc.date.accessioned: 2025-08-29T11:34:46Z
dc.date.available: 2025-08-29T11:34:46Z
dc.date.issued: 2025-08-20
dc.description: This data set consists of ten annotated short mystery novels (whodunits). The novels, written in English between 1891 and 1924, range from 2,000 to 10,000 words each. At this length they are long enough to capture the full narrative structure of a whodunit while remaining feasible to annotate manually. Unlike data sets such as those developed in the SANTA project, which annotated shorter texts (under 2,000 words) for narrative levels and scene segmentation (Reiter, 2019), this data set annotates full texts in order to uncover complete narrative structures.
dc.format: XML, text
dc.format.extent: 10 narratives
dc.format.medium: N/A
dc.format.size: 1.4 MB
dc.identifier.uri: https://hdl.handle.net/20.500.12185/693
dc.languages: English
dc.media.category: annotated monolingual text corpus
dc.media.type: Text
dc.publisher: Menno van Zaanen
dc.rights.license: Creative Commons Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
dc.subject: Mystery novels
dc.subject: Narrative
dc.subject: Narrative levels
dc.subject: Scene annotations
dc.title: Annotated Short Mystery Novels Data Set
dc.version: 1

Files

Original bundle

Name: mystery.zip
Size: 1.34 MB
Format: ZIP, an archive file format that supports lossless data compression. A ZIP file may contain one or more files or directories that may have been compressed.

License bundle

Name: license.txt
Size: 3.22 KB
Format: Item-specific license agreed to upon submission