Repository of Research and Investigative Information

Repository of Research and Investigative Information

Baqiyatallah University of Medical Sciences

Novel directions in data pre-processing and genome-wide association study (GWAS) methodologies to overcome ongoing challenges

(2021) Novel directions in data pre-processing and genome-wide association study (GWAS) methodologies to overcome ongoing challenges. Informatics in Medicine Unlocked. ISSN 23529148 (ISSN)

[img] Text
Novel directions in data pre-processing and genome-wide association study (GWAS) methodologies to overcome ongoing challenges.pdf

Download (1MB)

Official URL: https://www.scopus.com/inward/record.uri?eid=2-s2....

Abstract

A genome-wide association study (GWAS) is a standard population-based technique for identifying the heritable genetic basis of complex diseases by discovering correlations between trait variations and allele frequencies of genetic markers. This article aims to help fill gaps in data pre-processing and GWAS methodologies by reviewing novel techniques and methodologies. Data pre-processing performed prior to a GWAS presents challenges in Hardy-Weinberg (H–W) estimation, genotyping and accounting for factors such as sample structure. Recent developments towards overcoming these challenges are presented: the likelihood ratio test for H–W estimation, sequencing for genotyping, and techniques for dealing with sample structure. Traditional statistical methods cannot provide a way to insightfully interpret the data generated from high-throughput techniques; therefore, novel directions in GWAS methodologies are reviewed using efficient statistical methods, which are flexible techniques for performing genetic association analysis when factors such as non-random sampling or population structure occur. Despite the development of these methods, genotyping costs and an increased capacity for large dataset analysis have motivated researchers to examine tissue-specific signals. This review discusses how prospective and retrospective association analyses can be used to consider binary traits, address non-random ascertainment, and increase the capacity for large dataset analysis. Importantly, for disease susceptibility, rare variants can represent a large portion of genetic markers, and this article reviews some association methods for rare variant detection. In conclusion, the recent developments in GWAS data preparation and methodologies reviewed in this article can overcome most current challenges in the field and will also address future challenges. © 2021 The Author(s)

Item Type: Article
Keywords: Genome-wide association study (GWAS) Machine learning Rare variants Retrospective association analysis Sequencing Tissue signals disease predisposition gene frequency gene mapping genetic marker genetic variation genome-wide association study genotype heritability high throughput technology human linkage analysis population structure prospective study Review tissue specificity
Journal or Publication Title: Informatics in Medicine Unlocked
Journal Index: Scopus
Volume: 24
Identification Number: https://doi.org/10.1016/j.imu.2021.100586
ISSN: 23529148 (ISSN)
Depositing User: مهندس مهدی شریفی
URI: http://eprints.bmsu.ac.ir/id/eprint/9334

Actions (login required)

View Item View Item