DH2022 Dataset Releases: Project Dialogism Novel Corpus (PDNC) and The Birth of the Modern Detective Story (BMDS)

Today, at the 2022 Digital Humanities conference, I and my colleagues (Krishnapriya Vishnubhotla and Graeme Hirst, PDNC; and Simon Stern, BMDS) are releasing two new datasets for humanities research: the Project Dialogism Novel Corpus (PDNC), a corpus of 22 novels annotated for character speech; and the Birth of the Modern Detective Story (BMDS) Dataset, a set of 440 detective stories provided in full text and annotated for features related to types of crimes, clues, and evidence provided.

Learn more about each dataset by clicking on the links above.