Simple Framework for Interpretable Fine-grained Text Classification

Munkhtulga Battogtokh*, Michael Luck, Cosmin Davidescu, Rita Borgo

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference paperpeer-review

1 Citation (Scopus)
94 Downloads (Pure)

Abstract

Fine-grained text classification with similar and many labels is a challenge in practical applications. Interpreting predictions in this context is particularly difficult. To address this, we propose a simple framework that disentangles feature importance into more fine-grained links. We demonstrate our framework on the task of intent recognition, which is widely used in real-life applications where trustworthiness is important, for state-of-the-art Transformer language models using their attention mechanism. Our human and semi-automated evaluations show that our approach better explains fine-grained input-label relations than popular feature importance estimation methods LIME and Integrated Gradient and that our approach allows faithful interpretations through simple rules, especially when model confidence is high.
Original languageEnglish
Title of host publicationECAI 2023
Subtitle of host publication3rd International Workshop on Explainable and Interpretable Machine Learning (XI-ML)
DOIs
Publication statusPublished - 2023

Fingerprint

Dive into the research topics of 'Simple Framework for Interpretable Fine-grained Text Classification'. Together they form a unique fingerprint.

Cite this