Abstract
Stance detection is critical for understanding
the underlying position or attitude expressed
toward a topic. Large language models (LLMs)
have demonstrated significant advancements
across various natural language processing
tasks including stance detection, however, their
performance in stance detection is limited by
biases and spurious correlations inherent due
to their data-driven nature. Our statistical experiment
reveals that LLMs are prone to generate
biased stances due to sentiment-stance
spurious correlations and preference towards
certain individuals and topics. Furthermore,
the results demonstrate a strong negative correlation
between stance bias and stance detection
performance, underscoring the importance
of mitigating bias to enhance the utility of
LLMs in stance detection. Therefore, in this paper,
we propose a Counterfactual Augmented
Calibration Network (FACTUAL), which a novel
calibration network is devised to calibrate potential
bias in the stance prediction of LLMs.
Further, to address the challenge of effectively
learning bias representations and the difficulty
in the generalizability of debiasing, we construct
counterfactual augmented data. This approach
enhances the calibration network, facilitating
the debiasing and out-of-domain generalization.
Experimental results on in-target
and zero-shot stance detection tasks show that
the proposed FACTUAL can effectively mitigate
biases of LLMs, achieving state-of-the-art results.
the underlying position or attitude expressed
toward a topic. Large language models (LLMs)
have demonstrated significant advancements
across various natural language processing
tasks including stance detection, however, their
performance in stance detection is limited by
biases and spurious correlations inherent due
to their data-driven nature. Our statistical experiment
reveals that LLMs are prone to generate
biased stances due to sentiment-stance
spurious correlations and preference towards
certain individuals and topics. Furthermore,
the results demonstrate a strong negative correlation
between stance bias and stance detection
performance, underscoring the importance
of mitigating bias to enhance the utility of
LLMs in stance detection. Therefore, in this paper,
we propose a Counterfactual Augmented
Calibration Network (FACTUAL), which a novel
calibration network is devised to calibrate potential
bias in the stance prediction of LLMs.
Further, to address the challenge of effectively
learning bias representations and the difficulty
in the generalizability of debiasing, we construct
counterfactual augmented data. This approach
enhances the calibration network, facilitating
the debiasing and out-of-domain generalization.
Experimental results on in-target
and zero-shot stance detection tasks show that
the proposed FACTUAL can effectively mitigate
biases of LLMs, achieving state-of-the-art results.
| Original language | English |
|---|---|
| Title of host publication | 2025 Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics |
| Publication status | Published - 29 Apr 2025 |
Fingerprint
Dive into the research topics of 'Mitigating Biases of Large Language Models in Stance Detection with Counterfactual Augmented Calibration'. Together they form a unique fingerprint.Projects
- 2 Finished
-
DineUP: Developing an Interactive Narrative Understanding Platform
Gui, L. (Primary Investigator) & He, Y. (Co-Investigator)
EPSRC Engineering and Physical Sciences Research Council
15/08/2024 → 13/03/2025
Project: Research
-
A Lebesgue Integral based Approximation for Language Modelling
Gui, L. (Primary Investigator) & He, Y. (Co-Investigator)
EPSRC Engineering and Physical Sciences Research Council
10/02/2023 → 9/02/2025
Project: Research
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver