The presence of social stereotypes in NLP resources is an emerging topic that challenges traditionally used approaches for the creation of corpora and resources. An increasing number of scholars proposed strategies for considering annotators’ subjectivity in order to reduce such bias both in computational resources and in NLP models. In this paper, we present Open-Stereotype, an annotated corpus of Italian tweets and news headlines regarding immigration in Italy developed through an experimental procedure for the annotation of stereotypes aimed to investigate their different interpretation. The annotation is the result of a six-step process, where annotators identify text-spans expressing stereotypes, generate rationales about these spans and group them in a more comprehensive set of labels. Results show that humans exhibit high subjectivity in conceptualizing this phenomenon, and that the prior knowledge of an Italian LLM leads to more consistent classifications of specific labels that do not depend on annotators’ background.

Subjectivity in Stereotypes Against Migrants in Italian: An Experimental Annotation Procedure

Jezek E.;
2025-01-01

Abstract

The presence of social stereotypes in NLP resources is an emerging topic that challenges traditionally used approaches for the creation of corpora and resources. An increasing number of scholars proposed strategies for considering annotators’ subjectivity in order to reduce such bias both in computational resources and in NLP models. In this paper, we present Open-Stereotype, an annotated corpus of Italian tweets and news headlines regarding immigration in Italy developed through an experimental procedure for the annotation of stereotypes aimed to investigate their different interpretation. The annotation is the result of a six-step process, where annotators identify text-spans expressing stereotypes, generate rationales about these spans and group them in a more comprehensive set of labels. Results show that humans exhibit high subjectivity in conceptualizing this phenomenon, and that the prior knowledge of an Italian LLM leads to more consistent classifications of specific labels that do not depend on annotators’ background.
2025
Proceedings of the Eleventh Italian Conference on Computational Linguistics (CLiC-it 2025)
Bosco Cristina, Jezek Elisabetta, Polignano Marco, Sanguinetti Manuela
Language & Linguistics covers resources concerned with the theoretical, descriptive, and historical aspects of linguistics.
The AI, Robotics & Automatic Control category is concerned with resources on the research and techniques of artificial intelligence; that is, the creation of machines that exhibit characteristics of human intelligence (e.g., efficient representation of knowledge, reasoning, deduction, problem solving, heuristics, and analysis of contradictory or ambiguous information). Related AI technologies include expert systems, fuzzy systems, natural language processing, speech and pattern recognition, computer vision, decision-support systems, knowledge-bases, and neural networks. Robotics resources are concerned with the design, construction, and operation of robots. Automatic Control resources cover the design and development of regulating processes and systems that replace the necessity of human intervention. Topics include adaptive control, robust control, discrete-event control, dynamic control, fuzzy control, and optimal control. Cybernetics resources are concerned with the control and communication within and between artificial (machine) systems and living or natural systems.
Esperti anonimi
Inglese
contributo
Eleventh Italian Conference on Computational Linguistics (CLiC-it 2025)
September, 24-26 2025
Cagliari
Internazionale
ELETTRONICO
Proceedings of the Italian Conference on Computational Linguistics (CLiC-it 2025)
603
612
10
9791224305873
Associazione Italiana di Linguistica Computazionale
Subjectivity, Annotation, Italian, Stereotypes, Social Bias
https://aclanthology.org/2025.clicit-1.0/
no
none
Lo, S. M.; Stranisci, M. A.; Cignarella, A. T.; Frenda, S.; Basile, V.; Jezek, E.; Patti, V.
273
info:eu-repo/semantics/conferenceObject
7
4 Contributo in Atti di Convegno (Proceeding)::4.1 Contributo in Atti di convegno
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11571/1542855
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 1
  • ???jsp.display-item.citation.isi??? ND
social impact