Connect with us

Phylogenetics

Molecular Evolutionary Genetic Analysis

Published

on

MEGA: Molecular Evolutionary Genetic Analysis
It is important to know the basic molecular relationship between two living organisms as one begins performing comparative studies for knowing the evolutionary aspects and for contributing to knowledge base. Several tools and soft ware have been introduced for meeting the task of such analysis. Each tool has different algorithm and method to perform molecular phylogeny. Examples include; ClustalW, Dendroscope, Hyphy, PAUP and Phylip etc. Among them is the most efficient tool, MEGA, Molecular evolutionary phylogenetic analysis which performs both sequence analysis and phylogenetic analysis in a very sophisticated manner.

MEGA’s functionality include the creation and exploration of sequence alignments, the enumeration of sequence divergence, the construction and visualization of phylogenetic trees, and the testing of molecular evolutionary hypotheses. Previously, many versions of MEGA had been released which integrate Web-based sequence data acquisition and their alignment capabilities with the evolutionary analyses. It makes comparative analyses much easier to conduct in a single computing environment. Over the period of time, this tool has come to boost up the classroom learning experience as its use by educators, researcher and students in different disciplines has expanded. This tool is contended with three distinct functionalities, along with some other features, which is why it is exercised for performing fine quality phylogenies by a large number of researchers and professionals as outlined below.

First, Caption Expert software module; generates descriptions for every result obtained by MEGA4. This enunciation informs the user about all of the options used in the analysis, including the data subset, the selected option for the handling of sites with gaps and missing data, the evolutionary model of substitution (e.g., nucleic acid substitution pattern, uniformity of evolutionary convergence or divergence and its rate among sites, and homogeneity or heterogeneity assumption among descendents, and the algorithms applied for estimating pair wise distances and for inferring and testing phylogenetic trees. The caption is also included with specific citations for any algorithm, method and software used in analysis. The availability of these descriptions is to promote a better understanding of the assumptions used in analyses, and of the results produced. This is needed because MEGA’s instinctive graphical interface makes it easy for both new and expert users to perform a variety of computational and statistical analyses. Sometime users don’t realize the underlying assumptions and data-handling options intricate in each analysis. Even expert population and molecular geneticists may not recognize all of the assumptions for immediate. Generally, a description of algorithm or method and results is useful for researchers and beginners when preparing tables and figures for presentation and publication.

Multiple sequence alignment

Multiple sequence alignment

Second, Maximum Composite Likelihood (MCL) method is included for estimating evolutionary distances between nucleic acid sequences, which can be frequently employed by users for divergence times, inferring phylogenetic trees, and average sequence divergences between and within groups of sequences. In this approach, score is obtained as the sum of log likelihood for all sequence pairs in an alignment, and then is maximized by the common parameters for nucleotide substitution pattern to every sequence pair. This method was previously referred to as the ‘‘Simultaneous Estimation’’ (SE) method, because all distances are simultaneously estimated. This approach is different from current approaches for evolutionary distance estimation. In current approach, each distance is estimated independently of others, either by statistical formulas or by likelihood methods. The Maximum Composite Likelihood method has many advantages over the Independent Estimation (IE) approach. The IE method for estimating evolutionary distance for each pair of sequences often causes rather large errors unless very sequences are not estimated. One the hand, MCL reduces these errors, as a single set of parameter is applied to ever distance estimation. Inference of Phylogenetic trees by distance-based method is considered more accurate when error is low for estimation. This is in fact the case for the Neighbor-Joining method. The use of the MCL distances leads to a much higher accuracy with higher bootstrap values and even equal same topology of tree is expected to obtain. In addition, for pair wise distance calculation, IE method is not reliably applicable, because analytical formulas may become negative by chance due to algorithm’s arguments.

Distance-based method

Distance-based method

Such cases may increase with increase in number of sequence data, evolutionary distances become larger and substitution within sequences become more complex. The MCL method overcome these problems effectively and generates sophisticated models for inferring phylogenies from a larger number of diverse sequences. MEGA implicates the use of MCL method for evaluating average distances between and within groups, pair wise distances and average pairs, with their variances calculated by a bootstrap approach. The implementation of the MCL approach allows consideration of substitution rate variation from site to site, by an approximation of the gamma distribution divergence/convergence rates, and the assimilation of heterogeneity of nucleotide base composition in different sequences for species. We also have the leniency to determine the numbers of mutation per site separately. Intrinsically, the use of MCL method for inferring phylogenetic trees by distance-based methods, along with the bootstrap tests proves worth doing.

Professionally a teacher and passionately a researcher, Fozail is a Bioinformatician. He has worked on Molecular Evolution as a UGC project fellow in Dyal Singh College, University of Delhi. His area of research include Systems Biology, Biological Networking, Mathematical Modelling etc.

Advertisement
Click to comment

You must be logged in to post a comment Login

Leave a Reply

Phylogenetics

How to find a best fit model using IQ-TREE?

Dr. Muniba Faiza

Published

on

How to find a best fit model using IQ-TREE?

Previously, we have provided an installation tutorial for IQ-TREE on Ubuntu. In this article, we are going to perform model selection for a dataset using the standalone tool of IQ-TREE. (more…)

Continue Reading

Phylogenetics

Installing TREE-PUZZLE on Ubuntu

Dr. Muniba Faiza

Published

on

Installing tree-puzzle on Ubuntu

TREE-PUZZLE is a software to reconstruct phylogenetic trees using the maximum likelihood method [1,2]. It requires sequence data as input and implements a fast search algorithm and quartet puzzling. It can process large datasets easily. In this article, we will install TREE-PUZZLE on Ubuntu. (more…)

Continue Reading

Phylogenetics

Tutorial: Constructing phylogenetic tree using MEGA7

Published

on

mega7

MEGAX is a bioinformatics software/tool used for phylogenetic tree construction. In this article, we will construct a maximum likelihood (ML) tree for a number of protein sequences using MEGA7 [1]. (more…)

Continue Reading

Phylogenetics

Update: A multi-epitope in silico vaccine candidate designed for Covid-19

Dr. Muniba Faiza

Published

on

covid19-vaccine

Covid19 has created a great threat to human health. As you are aware, in this coronavirus outbreak, Bioinformatics Review has created a group, BiR-nCov19 Drug Development Team, to work on finding prevention to this disease. This research group consists of researchers from all over the world. (more…)

Continue Reading

Phylogenetics

Phylogenetics analysis of SARS-CoV-2 spike glycoproteins

Dr. Muniba Faiza

Published

on

Phylogenetic Tree of Covid Corona Virus with other Species

A novel coronavirus (CoV), named Severe Acute Respiratory Syndrome-CoV-2 (SARS-CoV-2) or nCoV-2019, has emerged since December 2019 from Wuhan city of Hubei province in China [1]. This virus belongs to the coronavirus family from which previous outbreaks have emerged (SARS and MERS). They have been a great threat to public health causing many deaths including  SARS-CoV-2. There is no proper treatment available to cure this coronavirus disease (covid19). Scientists and researchers are trying really hard to develop a drug or a vaccine or a proper way to cure covid19. (more…)

Continue Reading

Phylogenetics

Installing and executing ProtTest3 on Ubuntu

Dr. Muniba Faiza

Published

on

Prottest3 is a software which is used to select a best-fit amino acid replacement model for a set of protein sequences [1]. ProtTest3 finds a best-fit model on the basis of the smallest value of one of three criteria: Akaike Information Criterion (AIC), Corrected Akaike Information Criterion, Bayesian Information Criterion (BIC) score or Decision Theory Criterion (DT) selected by the user. In this article, we will learn how to download and install the command-line version of ProtTest3 on Ubuntu. (more…)

Continue Reading

Phylogenetics

How to calculate dN, dS, and dN/dS ratio on a set of genes using MEGA?

Dr. Muniba Faiza

Published

on

If you want to get a quick idea about the non-synonymous vs synonymous (dN/dS) substitutions, you can easily use MEGA software [1]. Although HYPHY/Datamonkey provides the best results regarding selection pressure analyses. MEGA also uses HYPHY program [2] to calculate the dN/dS substitutions rate. Here is how you can do it. (more…)

Continue Reading

Phylogenetics

Most widely used tools for phylogenetic tree customization

Dr. Muniba Faiza

Published

on

Most of the times, it is a very tedious job to convert file formats in bioinformatics, especially when we are dealing with phylogeny. Most of the available online servers mess your file and the output format is also not supported by the other programs. Additionally, it is quite difficult to perform other customizations on the phylogeny tree. (more…)

Continue Reading

Phylogenetics

Why To Study Evolution?

Published

on

Understanding evolution is critical for understanding biology. As the preeminent scientist Theodosius Dobzhansky stated, “Nothing in biology makes sense except in the light of evolution.” Evolution is the only scientific explanation for the diversity of life. It explains the striking similarities among vastly different forms of life, the changes that occur within populations, and the development of new life forms. Excluding evolution from the science curricula or compromising its treatment deprives students of this fundamental and unifying scientific concept to explain the natural world. (more…)

Continue Reading

Phylogenetics

Monogenea: An Easy Portfolio for Molecular Phylogenetics

Published

on

Estimation of present day diversity of organism and understanding their diversity forms new cornerstones of conservation biology, evolutionary biology and ecology. Maintaining the totality of all texa from past to present and classifying into groups reflect how they have changed over the period of times. Necessarily, phylogenetics and evolutionary study of a particular group help out in knowing pattern of occurrence and relationships between two distantly related class or families and their descendents. (more…)

Continue Reading

LATEST ISSUE

ADVERT