Department of Science, Technology and InnovationCLARIN in South Africa

Annotated Short Mystery Novels Data Set

Loading...
Thumbnail Image

Date

2025-08-20

Authors

Heyns, Nuette
van Zaanen, Menno

Journal Title

Journal ISSN

Volume Title

Publisher

Menno van Zaanen

Abstract

Description

This data set consists of ten annotated short mystery novels (whodunits). The novels, written in English between 1891 and 1924, range from 2,000 to 10,000 words each. This length ensures they are long enough to capture the full narrative structure of whodunits while remaining feasible for manual annotation. Unlike data sets like those developed in the SANTA project, which annotated shorter texts (under 2,000 words) for narrative levels and scene segmentation (Reiter, 2019), this data set contains annotated full texts to uncover complete narrative structures.

Citation

License

Creative Commons License Attribution-ShareAlike 4.0 International CC BY-SA 4.0

Verification status

Level 0