Prottest3 is a software which is used to select a best-fit amino acid replacement model for a set of protein sequences . ProtTest3 finds a best-fit model on the basis of the smallest value of one of three criteria: Akaike Information Criterion (AIC), Corrected Akaike Information Criterion, Bayesian Information Criterion (BIC) score or Decision Theory Criterion (DT) selected by the user. In this article, we will learn how to download and install the command-line version of ProtTest3 on Ubuntu.
Before installing the software, update and upgrade your system by typing the following the commands in a terminal:
$ sudo apt-get update
$ sudo apt-get upgrade
A ProtTest3 package compatible with your system can be downloaded from here. We are downloading the tar file for Ubuntu. Open the terminal and change to the directory where you want to download the software, let’s say ‘Downloads’.
$ cd Downloads
$ wget https://bitbucket.org/diegodl/prottest3/downloads/prottest-3.2-20120316.tar.gz
$ tar xvzf prottest-3.2-20120316.tar.gz
If you wish to run ProtTest3 on a cluster computer, then you need to download the MPJ Express from here.
Make sure you have the latest version of Java installed on your system or you can download it from here.
After that, you will see a new directory named with ‘prottest’, change to that directory.
$ cd prottest3
$ set PROTTEST_HOME=/home/user/Downloads/prottest3
In this directory, you will find the .jar file of ProtTest3, which you can directly use with the command line arguments on a terminal.
Installing MPJ Express
After downloading the tar package, type the following commands in a terminal:
$ cd Downloads
$ tar xvzf mpj.tar.gz
$ export MPJ_HOME=$PROTTEST_HOME/mpj
$ export PATH=$MPJ_HOME/bin:$PATH
To run ProtTest3, you need to run the .jar file located in the prottest3 directory.
$ java -jar prottest3.jar -i <alignment_file> -t <user_defined_tree_file> -o <output_file> -[matrix] -<models_to_evaluate> -<selection_criteria>
There are many other command-line options that you can see in its manual.
If you want to use multiple threads then you can do it with an additional thread option using which you can define the number of cores to be used.
$ java -jar prottest3.jar -i <alignment_file> -t <user_defined_tree_file> -o <output_file> -[matrix] -<models_to_evaluate> -<selection_criteria> -threads 4
Executing ProtTest3 on a cluster computer
If you have a large number of protein sequences in your dataset, then you can always go for cluster execution.
At first, start MPJ, and then run the HPC shell scripts provided in the same directory.
$ cd Downloads/prottest3
$ mpjboot machines
For the HPC script, the basic syntax is as follows:
$ ./runProtTestHPC.sh <no_of_processors> <parameters>
$ ./runProtTestHPC.sh 2 -i <alignment_file> -t <user_defined_tree_file> -o <output_file> -[matrix] -<models_to_evaluate> -<selection_criteria>
Hope this article helps!
If you have any queries, then comment below or post a question here.
- Darriba, D., Taboada, G. L., Doallo, R., & Posada, D. (2011). ProtTest 3: fast selection of best-fit models of protein evolution. Bioinformatics, 27(8), 1164-1165.
How to find a best fit model using IQ-TREE?
Installing TREE-PUZZLE on Ubuntu
TREE-PUZZLE is a software to reconstruct phylogenetic trees using the maximum likelihood method [1,2]. It requires sequence data as input and implements a fast search algorithm and quartet puzzling. It can process large datasets easily. In this article, we will install TREE-PUZZLE on Ubuntu. (more…)
Tutorial: Constructing phylogenetic tree using MEGA7
MEGAX is a bioinformatics software/tool used for phylogenetic tree construction. In this article, we will construct a maximum likelihood (ML) tree for a number of protein sequences using MEGA7 . (more…)
Update: A multi-epitope in silico vaccine candidate designed for Covid-19
Covid19 has created a great threat to human health. As you are aware, in this coronavirus outbreak, Bioinformatics Review has created a group, BiR-nCov19 Drug Development Team, to work on finding prevention to this disease. This research group consists of researchers from all over the world. (more…)
Phylogenetics analysis of SARS-CoV-2 spike glycoproteins
A novel coronavirus (CoV), named Severe Acute Respiratory Syndrome-CoV-2 (SARS-CoV-2) or nCoV-2019, has emerged since December 2019 from Wuhan city of Hubei province in China . This virus belongs to the coronavirus family from which previous outbreaks have emerged (SARS and MERS). They have been a great threat to public health causing many deaths including SARS-CoV-2. There is no proper treatment available to cure this coronavirus disease (covid19). Scientists and researchers are trying really hard to develop a drug or a vaccine or a proper way to cure covid19. (more…)
How to calculate dN, dS, and dN/dS ratio on a set of genes using MEGA?
If you want to get a quick idea about the non-synonymous vs synonymous (dN/dS) substitutions, you can easily use MEGA software . Although HYPHY/Datamonkey provides the best results regarding selection pressure analyses. MEGA also uses HYPHY program  to calculate the dN/dS substitutions rate. Here is how you can do it. (more…)
Most widely used tools for phylogenetic tree customization
Most of the times, it is a very tedious job to convert file formats in bioinformatics, especially when we are dealing with phylogeny. Most of the available online servers mess your file and the output format is also not supported by the other programs. Additionally, it is quite difficult to perform other customizations on the phylogeny tree. (more…)
Why To Study Evolution?
Understanding evolution is critical for understanding biology. As the preeminent scientist Theodosius Dobzhansky stated, “Nothing in biology makes sense except in the light of evolution.” Evolution is the only scientific explanation for the diversity of life. It explains the striking similarities among vastly different forms of life, the changes that occur within populations, and the development of new life forms. Excluding evolution from the science curricula or compromising its treatment deprives students of this fundamental and unifying scientific concept to explain the natural world. (more…)
Molecular Evolutionary Genetic Analysis
MEGA: Molecular Evolutionary Genetic Analysis
It is important to know the basic molecular relationship between two living organisms as one begins performing comparative studies for knowing the evolutionary aspects and for contributing to knowledge base. Several tools and soft ware have been introduced for meeting the task of such analysis. Each tool has different algorithm and method to perform molecular phylogeny. Examples include; ClustalW, Dendroscope, Hyphy, PAUP and Phylip etc. Among them is the most efficient tool, MEGA, Molecular evolutionary phylogenetic analysis which performs both sequence analysis and phylogenetic analysis in a very sophisticated manner.
Monogenea: An Easy Portfolio for Molecular Phylogenetics
Estimation of present day diversity of organism and understanding their diversity forms new cornerstones of conservation biology, evolutionary biology and ecology. Maintaining the totality of all texa from past to present and classifying into groups reflect how they have changed over the period of times. Necessarily, phylogenetics and evolutionary study of a particular group help out in knowing pattern of occurrence and relationships between two distantly related class or families and their descendents. (more…)