Skip to Main Content (Press Enter)

Logo UNIPV
  • ×
  • Home
  • Corsi
  • Insegnamenti
  • Professioni
  • Persone
  • Pubblicazioni
  • Strutture

UNIFIND
Logo UNIPV

|

UNIFIND

unipv.it
  • ×
  • Home
  • Corsi
  • Insegnamenti
  • Professioni
  • Persone
  • Pubblicazioni
  • Strutture
  1. Pubblicazioni

Running genome wide data analysis using a parallel approach on a cloud platform

Contributo in Atti di convegno
Data di Pubblicazione:
2015
Abstract:
Hierarchical Naive Bayes (HNB) is a multivariate classification algorithm that can be used to forecast the probability of a specific disease by analysing a set of Single Nucleotide Polymorphisms (SNPs). In this paper we present the implementation of HNB using a parallel approach based on the Map-Reduce paradigm built natively on the Hadoop framework, relying on the Amazon Cloud Infrastructure. We tested our approach on two GWAS datasets aimed at identifying the genetic bases of Type 1 (T1D) and Type 2 Diabetes (T2D). Both datasets include individual level data of 1,900 cases and 1,500 controls with similar to 420,000 SNPs. For T2D the best results were obtained using the complete set of SNPs, whereas for T1D the best performances were reached using few SNPs selected through standard univariate association tests. Our cloud-based implementation allows running genome wide simulations cutting down computational time and overall infrastructure costs.
Tipologia CRIS:
4.1 Contributo in Atti di convegno
Keywords:
Cloud computing; Data mining algorithm; Genome-wide association studies; Map reduce; Computer Science (all); Theoretical Computer Science
Elenco autori:
Demartini, Andrea; Capozzi, Davide; Malovini, ALBERTO LUIGI; Bellazzi, Riccardo
Autori di Ateneo:
BELLAZZI RICCARDO
MALOVINI ALBERTO LUIGI
Link alla scheda completa:
https://iris.unipv.it/handle/11571/1127091
Titolo del libro:
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Pubblicato in:
LECTURE NOTES IN COMPUTER SCIENCE
Journal
LECTURE NOTES IN COMPUTER SCIENCE
Series
  • Dati Generali

Dati Generali

URL

http://springerlink.com/content/0302-9743/copyright/2005/
  • Utilizzo dei cookie

Realizzato con VIVO | Designed by Cineca | 26.6.1.0