
Semantic based DCM models for text classification

CERCHIELLO, PAOLA
2012-01-01

Abstract

This contribution deals with the problem of text classification. The proposed approach is probabilistic and is based on a mixture of Dirichlet and Multinomial distributions. Our aim is to build a classifier able to take into account not only word frequencies but also the latent topics contained within the available corpora. This new model, called sbDCM, allows us to insert directly the number of topics (known or unknown) that compose the document, without losing the 'burstiness' phenomenon or the classification performance.
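As an illustration of the Dirichlet compound Multinomial (DCM) building block named in the abstract, the following is a minimal sketch of the standard DCM (Polya) log-likelihood and a maximum-likelihood class assignment. It does not reproduce the sbDCM itself, whose topic-level parameterisation is not given in this record. The function names, the uniform class prior, and the toy parameter values are assumptions made for the example.

```python
import math


def dcm_log_likelihood(counts, alpha):
    """Log-probability of a word-count vector under a standard DCM.

    counts: word counts for one document (one entry per vocabulary word).
    alpha:  positive Dirichlet parameters, one per vocabulary word.
    """
    if len(counts) != len(alpha):
        raise ValueError("counts and alpha must have the same length")
    n = sum(counts)
    a = sum(alpha)
    # Multinomial coefficient: n! / prod_w x_w!
    log_p = math.lgamma(n + 1) - sum(math.lgamma(x + 1) for x in counts)
    # Normalising ratio: Gamma(A) / Gamma(n + A), with A = sum_w alpha_w
    log_p += math.lgamma(a) - math.lgamma(n + a)
    # Per-word terms: Gamma(x_w + alpha_w) / Gamma(alpha_w)
    log_p += sum(
        math.lgamma(x + al) - math.lgamma(al) for x, al in zip(counts, alpha)
    )
    return log_p


def classify(counts, class_alphas):
    """Return the class whose DCM gives the document the highest likelihood.

    class_alphas maps a class label to its alpha vector. A uniform prior
    over classes is assumed here, so the prior term is omitted.
    """
    return max(
        class_alphas,
        key=lambda label: dcm_log_likelihood(counts, class_alphas[label]),
    )


# Hypothetical two-word vocabulary and two classes, for illustration only.
toy_alphas = {"a": [5.0, 1.0], "b": [1.0, 5.0]}
predicted = classify([3, 0], toy_alphas)
```

With small alpha values the DCM assigns more probability to a word repeated within a document than to two different words each appearing once, which is the 'burstiness' behaviour the abstract refers to.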
2012
Advanced Statistical Methods for the Analysis of Large Data-Sets
Di Ciaccio, Coli, Angulo Ibanez
The Mathematics category includes resources dealing with mathematics, applied mathematics, statistics and probability.
Yes, but type not specified
English
International
PRINT
375
384
10
9783642210365
Springer-Verlag
Text classification; mixture models; Dirichlet compound Multinomial model
2 Contribution in Volume::2.1 Contribution in volume (Chapter or Essay)
1
268
none
Cerchiello, Paola
info:eu-repo/semantics/bookPart

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11571/452701