The study of institutional communication related to the pandemic, and to the population's response to it, is of great relevance today. The Italian spokesperson for communication regarding the pandemic has been, during the year 2020, the former Prime Minister Giuseppe Conte. We retrieved 4,860,395 comments from his Facebook official page and built the ConteCorpus, a new Italian resource annotated in CoNLL-U format. A first aim of the research was to evaluate the performance of the model used to annotate the corpus. Models trained on social media texts are usually not very generalizable. Nevertheless, the results of the evaluation were good, especially in parsing metrics, and showed that a parser trained on Twitter data can be successfully applied to Facebook data. A second aim of the research was to provide an overall view of the content of such a large corpus; for this purpose, topic modeling was conducted, training an LDA model. The model generated 5 topics that cover different aspects linked to the pandemic emergency, from economic to political issues. Through the topic modeling we investigated which topics are prevalent on particular days.

ConteCorpus: An Analysis of People Response to Institutional Communications During the Pandemic

Ventura V.;Jezek E.
2021-01-01

Abstract

The study of institutional communication related to the pandemic, and to the population's response to it, is of great relevance today. The Italian spokesperson for communication regarding the pandemic has been, during the year 2020, the former Prime Minister Giuseppe Conte. We retrieved 4,860,395 comments from his Facebook official page and built the ConteCorpus, a new Italian resource annotated in CoNLL-U format. A first aim of the research was to evaluate the performance of the model used to annotate the corpus. Models trained on social media texts are usually not very generalizable. Nevertheless, the results of the evaluation were good, especially in parsing metrics, and showed that a parser trained on Twitter data can be successfully applied to Facebook data. A second aim of the research was to provide an overall view of the content of such a large corpus; for this purpose, topic modeling was conducted, training an LDA model. The model generated 5 topics that cover different aspects linked to the pandemic emergency, from economic to political issues. Through the topic modeling we investigated which topics are prevalent on particular days.
2021
Proceedings of the Eight Italian Conference on Computational Linguistics Clic-it 2021
Fersini, E.,Passarotti, M., Patti, V.
Computer Science & Engineering
Language & Linguistics
Esperti anonimi
Inglese
contributo
Clic-it 2021 Italian Conference on Computational Linguistics
26-28 January 2022
Milan
Internazionale
ELETTRONICO
Collana dell'Associazione Italiana di Linguistica Computazionale
344
351
8
9791280136824
Accademia University Press
La collana pubblica gli atti del convegno annuale di Linguistica Computazionale (CLiC-it), che ha lo scopo di costituire un luogo di discussione di riferimento nel campo delle ricerce sulla linguistica computazionale. Gli atti includono interventi sul trattamento automatico della lingua, comprendenti le riflessioni teoriche e metodologiche sul tema, e forniscono un contributo importante per questo campo di ricerca. Le altre tematiche principali sono: la linguistica computazionale, la linguistica, le scienze cognitive, l'apprendimento automatico, l'informatica, la rappresentazione della conoscenza, l'information retrieval e l'umanistica digitale. L'organizzazione del convegno è il risultato dello sforzo dell'Associazione Italiana di Linguistica Computazionale (AILC http://www.ai-lc.it/), rappresentata ogni anno da alcuni dei membri organizzatori, che sono affiliati anche ad altre organizzazioni che operano nell'ambito della linguistica computazionale. La Collana è presente sul portale internazionale OpenEdition Books.
Topic Modeling with LDA model Evaluation of a model Pandemic Responde to institutional communications
https://www.aaccademia.it/ita/scheda-libro?aaref=1513
no
none
Ventura, V.; Jezek, E.
273
info:eu-repo/semantics/conferenceObject
2
4 Contributo in Atti di Convegno (Proceeding)::4.1 Contributo in Atti di convegno
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11571/1449869
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact