Concept-based explanations to test for false causal relationships learned by abusive language classifiers

From National Research Council Canada

Download
  1. (PDF, 792 KiB)
DOIResolve DOI: https://doi.org/10.18653/v1/2023.woah-1.14
AuthorSearch for: 1; Search for: 1; Search for: 1; Search for: 1
Affiliation
  1. National Research Council of Canada. Digital Technologies
FormatText, Article
ConferenceThe 7th Workshop on Online Abuse and Harms (WOAH), July 13, 2023, Toronto, Ontario
Abstract
Publication date
PublisherAssociation for Computational Linguistics
Licence
In
LanguageEnglish
Peer reviewedYes
Export citationExport as RIS
Report a correctionReport a correction (opens in a new tab)
Record identifierb9e78f14-9675-41a6-b8ae-c205d6ba8b98
Record created2023-07-17
Record modified2023-11-03
Date modified: