Author | Hammond, Adam; Vishnubhotla, Krishnapriya; Hirst, Graeme; Mohammad, Saif M.¹ |
---|
Affiliation | ¹ National Research Council of Canada. Digital Technologies |
---|
Format | Text, Article |
---|
Conference | Digital Humanities 2022: Responding to Asian Diversity, July 25-29, 2022, Tokyo, Japan and Fully Online (Zoom) |
---|
Abstract | We introduce a new dataset for the computational analysis of novels: the Project Dialogism Novel Corpus (PDNC). The PDNC currently consists of 22 novels in which all quotations are identified and annotated for speaker, addressee(s), and characters mentioned. PDNC is by an order of magnitude the largest corpus of its kind. Each novel is annotated manually by a pair of annotators using customized software we developed. In addition to releasing the dataset itself alongside this paper, we are also releasing the custom annotation software we developed (including the source code) along with our annotation guidelines. In the discussion section, we present two applications of the PDNC from our own research: quote attribution and emotion dynamics. We argue that the PDNC will promote a more nuanced and accurate view of novelistic discourse; whereas much research currently envisions the novel as expressing the voice of the author, the PDNC presents novels as a polyphonic fabric of characters’ voices. |
---|
Publication date | 2022-07-25 |
---|
Publisher | The Alliance of Digital Humanities Organizations |
---|
Language | English |
---|
Peer reviewed | Yes |
---|
Record identifier | 797b3a7e-b385-47fd-b1e2-a47356f7df94 |
---|
Record created | 2022-07-11 |
---|
Record modified | 2024-02-01 |
---|