Spurious Correlations in Cross-Topic Argument Mining

Research output: Chapter in Book/Report/Conference proceedingArticle in proceedingsResearchpeer-review


  • Fulltext

    Final published version, 1.55 MB, PDF document

Recent work in cross-topic argument mining attempts to learn models that generalise across topics rather than merely relying on within-topic spurious correlations. We examine the effectiveness of this approach by analysing the output of single-task and multi-task models for cross-topic argument mining, through a combination of linear approximations of their decision boundaries, manual feature grouping, challenge examples, and ablations across the input vocabulary. Surprisingly, we show that cross-topic models still rely mostly on spurious correlations and only generalise within closely related topics, e.g., a model trained only on closed-class words and a few common open-class words outperforms a state-of-the-art cross-topic model on distant target topics.
Original languageEnglish
Title of host publicationProceedings of *SEM 2021: The Tenth Joint Conference on Lexical and Computational Semantics
PublisherAssociation for Computational Linguistics
Publication date2021
Publication statusPublished - 2021
EventTenth Joint Conference on Lexical and Computational Semantics - SEM 2021 - Online
Duration: 5 Aug 20216 Aug 2021


ConferenceTenth Joint Conference on Lexical and Computational Semantics - SEM 2021

Number of downloads are based on statistics from Google Scholar and www.ku.dk

No data available

ID: 300082790