The bivariate dependence analysis is strongly supported in literature by a wide set of measures, including the Pearson’s r, the Kendall’s τb and the Spearman’s rS correlation coefficients among others. Currently, we are assisting to an explosion in the availability of ordinal data due to widespread attitudinal surveys. In many cases, survey scales are also built on responses that are observed to belong to certain groups on a continuous scale (grouped variable). Given h groups, the measurement problem may be addressed by encoding each group through a label (from 1 to h) and, subsequently, by assigning rank one to all the units included in the first ordered group and finally rank h to those included in the h-th ordered group. In such a way, the assessment of the direct or inverse dependence relationship may be carried out through Spearman’s rS (e.g. Spearman[3]) or Kendall’s τb (e.g. Kendall[2]) coefficients which are based on the correlation between the ranks of two variables and on the pairs of concordant and discordant values of two variables, respectively. This results in neglecting the original continuous nature of the grouped variable, since the information from the grouped variable has to be reduced to its ordinal information, too. A crucial issue is then related to dependence relationship studies when one variable is ordinal and the other variable is grouped. The “Monotonic Dependence Coefficient” (MDC), recently proposed by Ferrari and Raffinetti[1], is here re-formalized for the case of grouped and ordinal variables. Through a Monte Carlo simulation study, some basic hints about the new MDC coefficient performance in specific scenarios are given even in comparison with Spearman’s and Kendall’s coefficients. The contribution ends with an application to drug-expenditure data incurred by the Italian system for public health assistance, whose aim is to illustrate the role of age differences in the allocation of drug expenditure both by considering overall patients and single sub-groups, differing in terms of gender.
Latest frontiers in grouped-ordinal data dependence analysis
E. Raffinetti;
2019-01-01
Abstract
The bivariate dependence analysis is strongly supported in literature by a wide set of measures, including the Pearson’s r, the Kendall’s τb and the Spearman’s rS correlation coefficients among others. Currently, we are assisting to an explosion in the availability of ordinal data due to widespread attitudinal surveys. In many cases, survey scales are also built on responses that are observed to belong to certain groups on a continuous scale (grouped variable). Given h groups, the measurement problem may be addressed by encoding each group through a label (from 1 to h) and, subsequently, by assigning rank one to all the units included in the first ordered group and finally rank h to those included in the h-th ordered group. In such a way, the assessment of the direct or inverse dependence relationship may be carried out through Spearman’s rS (e.g. Spearman[3]) or Kendall’s τb (e.g. Kendall[2]) coefficients which are based on the correlation between the ranks of two variables and on the pairs of concordant and discordant values of two variables, respectively. This results in neglecting the original continuous nature of the grouped variable, since the information from the grouped variable has to be reduced to its ordinal information, too. A crucial issue is then related to dependence relationship studies when one variable is ordinal and the other variable is grouped. The “Monotonic Dependence Coefficient” (MDC), recently proposed by Ferrari and Raffinetti[1], is here re-formalized for the case of grouped and ordinal variables. Through a Monte Carlo simulation study, some basic hints about the new MDC coefficient performance in specific scenarios are given even in comparison with Spearman’s and Kendall’s coefficients. The contribution ends with an application to drug-expenditure data incurred by the Italian system for public health assistance, whose aim is to illustrate the role of age differences in the allocation of drug expenditure both by considering overall patients and single sub-groups, differing in terms of gender.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.