National Research Council of Canada. Information and Communication Technologies
25th International Conference on Computational Linguistics, August 23-29 2014, Dublin, Ireland
We present a cross-lingual discourse relation analysis based on a parallel corpus with discourse information available only for one language. First, we conduct a corpus study to explore differences in discourse organization between Chinese and English, including differences in information packaging, implicit/explicit discourse expression divergence, and discourse connective
ambiguities. Second, we introduce a novel approach to learning to recognize discourse relations,
using the parallel corpus instead of discourse annotation in the language of interest. Our resulting
semi-supervised system reaches state-of-art performance on the task of discourse relation
detection, and outperforms a supervised system on discourse relation classification.
Association for Computational Linguistics
25th International Conference on Computational Linguistics1: 577–586.