Phylogenetic Tree-based Pipeline for Uncovering Mutational Patterns during Influenza Virus Evolution (doi:10.21979/N9/PDYCUD)

View:

Part 1: Document Description
Part 2: Study Description
Part 5: Other Study-Related Materials
Entire Codebook

(external link) (external link)

Document Description

Citation

Title:

Phylogenetic Tree-based Pipeline for Uncovering Mutational Patterns during Influenza Virus Evolution

Identification Number:

doi:10.21979/N9/PDYCUD

Distributor:

DR-NTU (Data)

Date of Distribution:

2019-04-26

Version:

2

Bibliographic Citation:

Ivan, Fransiskus Xaverius; Deshpande, Akhila; Lim, Chun Wei; Kwoh, Chee Keong, 2019, "Phylogenetic Tree-based Pipeline for Uncovering Mutational Patterns during Influenza Virus Evolution", https://doi.org/10.21979/N9/PDYCUD, DR-NTU (Data), V2

Study Description

Citation

Title:

Phylogenetic Tree-based Pipeline for Uncovering Mutational Patterns during Influenza Virus Evolution

Identification Number:

doi:10.21979/N9/PDYCUD

Authoring Entity:

Ivan, Fransiskus Xaverius (Nanyang Technological University)

Deshpande, Akhila (Nanyang Technological University)

Lim, Chun Wei (Nanyang Technological University)

Kwoh, Chee Keong (Nanyang Technological University)

Software used in Production:

Python

Grant Number:

AcRF Tier 2 grant MOE2014-T2-2-023

Distributor:

DR-NTU (Data)

Access Authority:

Ivan, Fransiskus Xaverius

Depositor:

Ivan, Fransiskus Xaverius

Date of Deposit:

2019-04-25

Holdings Information:

https://doi.org/10.21979/N9/PDYCUD

Study Scope

Keywords:

"Medicine, Health and Life Sciences", Influenza evolution, mutational patterns

Abstract:

Various computational and statistical approaches have been proposed to uncover the mutational patterns of rapidly evolving influenza viral genes. A problem that draws much attention is to identify pairs of sites that potentially co-mutate to contribute to the overall fitness of the virus. Unlike previous methods that extract the mutations from sequence alignments, here we endeavor a novel method that relies on identifying mutations in the phylogenetic trees that are reconstructed using resampled sequence data. Since the method takes into account the evolutionary structure presents in the sequence data, spurious mutations obtained by comparing sequences from different clades could be removed.

Kind of Data:

Python/R codes (.py and .ipynb files), compressed data, and documentations for the codes, input/output data and methods

Notes:

This is an extension of the works for the ACM-BCB 2017 conference paper: Ivan FX, et al. (2017). Phylogenetic Tree based Method for Uncovering Co-mutational Site-pairs in Influenza Viruses. Proceeding of the 8th ACM Internation Conference on Bioinformatics, Computational Biology, and Health Informatics. pp. 21-26. Boston, Massachusetts, USA. August 20-23, 2017.

Methodology and Processing

Sources Statement

Data Access

Other Study Description Materials

Related Publications

Citation

Identification Number:

10.1101/708420

Bibliographic Citation:

Ivan FX, et al. (2019). Phylogenetic Tree-based Pipeline for Uncovering Mutational Patterns during Influenza Virus Evolution. bioRxiv. July 19, 2019. doi: 10.1101/708420.

Citation

Identification Number:

10.1145/3107411.3107479

Bibliographic Citation:

Ivan FX, et al. (2017). Phylogenetic Tree based Method for Uncovering Co-mutational Site-pairs in Influenza Viruses. Proceeding of the 8th ACM Internation Conference on Bioinformatics, Computational Biology, and Health Informatics. pp. 21-26. Boston, Massachusetts, USA. August 20-23, 2017.

Other Study-Related Materials

Label:

codes.tar.gz

Text:

Codes for executing the analysis pipeline.

Notes:

application/x-gzip

Other Study-Related Materials

Label:

Documentation - Codes (Tool) for Uncovering Mutational Patterns of Influenza Virus Evolution.pdf

Text:

Documentation of the codes (tool), which describe the input and output of the codes, environment for running the codes, and the steps for running the codes or analysis pipeline.

Notes:

application/pdf

Other Study-Related Materials

Label:

indata.tar.gz

Text:

Examples of metadata and sequence data (contain records obtained from the NCBI Influenza Virus Resource) for analysis pipeline.

Notes:

application/x-gzip

Other Study-Related Materials

Label:

Methods.pdf

Text:

Description of methods for the analysis pipeline.

Notes:

application/pdf