Automatic Detection of Cross-language Verbal Deception
- Pasquale Capuozzo, Department of General Psychology, Università degli Studi di Padova, Padova, PD, Italy
- Ivano Lauriola, Department of Mathematics, University of Padova, Padova, Italy
- Carlo Strapparava, FBK-Irst, Trento, Italy
- Fabio Aiolli, Department of Mathematics, University of Padova, Padova, Italy
- Giuseppe Sartori, Department of General Psychology, University of Padova, Padova, Italy
Abstract
How deceptive messages are produced in different languages has received little attention, with the majority of studies focusing on English. Moreover, there is no agreement about the stability of linguistic clues of deceit across languages. In this paper, we address this issue by analysing both theory-driven linguistic markers of deception (cognitive load hypothesis) and standard text categorisation features. After compiling a multilingual corpus of honest and deceitful first-person opinions on five different topics, we assessed the cross-language applicability of four different feature sets in within-topic, cross-topic and cross-language binary classification experiments. Results showed promising classification performance in all three experiments, with a few exceptions. Interestingly, linguistic markers of deceit linked to the cognitive load hypothesis exhibited the same trend in the two languages under investigation, and the cross-language evaluation highlighted their usefulness in spotting deceit across languages.