This paper describes a tool implemented to automatically reconstruct the pedigree of an isolated population of Northern Italy with the aim of supporting genetic studies. The goal of such studies is to analyze genealogic, clinical and genetic data for genetic dissection of complex diseases. In this context the reconstruction of the population pedigree is fundamental to verify that such population is a genetic isolate and obtain the parental relationships among the individuals participating to the study. The algorithm presented in the paper, from heterogeneous data sources (demographic municipal and parish archives and other data sources), derives the pedigree applying several heuristic rules in a predefined order One of the main difficulties in performing such task stands in the "record linkage" process that requires the definition of a sufficiently general strategy for managing the ambiguities caused by missing or imprecise/erroneous input data. The paper, finally, presents and discusses the preliminary results obtained by reconstructing the pedigree of four villages from the data collected during the first eighteen months of project.

Automatic pedigree reconstruction for genetic studies in isolated populations. Stud Health Technol Inform.

LARIZZA, CRISTIANA;BELLAZZI, RICCARDO
2007-01-01

Abstract

This paper describes a tool implemented to automatically reconstruct the pedigree of an isolated population of Northern Italy with the aim of supporting genetic studies. The goal of such studies is to analyze genealogic, clinical and genetic data for genetic dissection of complex diseases. In this context the reconstruction of the population pedigree is fundamental to verify that such population is a genetic isolate and obtain the parental relationships among the individuals participating to the study. The algorithm presented in the paper, from heterogeneous data sources (demographic municipal and parish archives and other data sources), derives the pedigree applying several heuristic rules in a predefined order One of the main difficulties in performing such task stands in the "record linkage" process that requires the definition of a sufficiently general strategy for managing the ambiguities caused by missing or imprecise/erroneous input data. The paper, finally, presents and discusses the preliminary results obtained by reconstructing the pedigree of four villages from the data collected during the first eighteen months of project.
2007
978-1-58603-774-1
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11571/125889
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? 0
social impact