Bioinformatics ReviewBioinformatics Review
Notification Show More
Font ResizerAa
  •  Home
  • Docking
  • MD Simulation
  • Tools
  • More Topics
    • Softwares
    • Sequence Analysis
    • Algorithms
    • Bioinformatics Programming
    • Bioinformatics Research Updates
    • Drug Discovery
    • Phylogenetics
    • Structural Bioinformatics
    • Editorials
    • Tips & Tricks
    • Bioinformatics News
    • Featured
    • Genomics
    • Bioinformatics Infographics
  • Community
    • BiR-Research Group
    • Community Q&A
    • Ask a question
    • Join Telegram Channel
    • Join Facebook Group
    • Join Reddit Group
    • Subscription Options
    • Become a Patron
    • Write for us
  • About Us
    • About BiR
    • BiR Scope
    • The Team
    • Guidelines for Research Collaboration
    • Feedback
    • Contact Us
    • Recent @ BiR
  • Subscription
  • Account
    • Visit Dashboard
    • Login
Font ResizerAa
Bioinformatics ReviewBioinformatics Review
Search
Have an existing account? Sign In
Follow US
ProteomicsSequence AnalysisSoftware

Sequence search against a set of local sequences (local database) using phmmer

Tariq Abdullah
Last updated: May 20, 2020 5:47 pm
Tariq Abdullah
Share
2 Min Read
SHARE

PHMMER is a sequence analysis tool used for protein sequences (http://hmmer.org; version 3.1 b2). It is available online as a web server and as well as a part of the HMMER stand-alone package (http://hmmer.org; version 3.1 b2). HMMER offers various useful features such as multiple sequence alignment including the file format conversion. 

In this article, a sequence search against a set of local sequences is explained using PHMMER stand-alone tool including the output in FASTA format. To do this, we will first obtain the primary output in Stockholm (.sto) format and then convert it into the FASTA format.

1. Make a local database

The local database consists of protein sequences in FASTA format. Let’s say, our local dataset file is ‘sequences.fasta’.

2. Search for protein sequences according to the input in the local database

Make a query sequence file, we will name it as ‘query.fasta’. This file consists of FASTA sequences to be searched within the local database. Open a terminal and type the following command:

$ /path/to/phmmer -A phmmer.sto query.fasta sequences.fasta

where -A is used to define a filename to save the multiple alignments of all significant hits in Stockholm format.

You can also adjust the inclusion thresholds of different e-values by using different arguments. For example,

–incE, default value is 0.01 which means that ~1 false positive in every 100 searches with different query sequences.

–incT, instead of using e-value, use a bit score of >=<value>.

There are several other arguments that you can find in the user guide of HMMER.

Now, we have output in Stockholm format. If you want it in FASTA format, then proceed to the next step.

3. Output in FASTA format

For this, we will be using the ‘esl-reformat’ binary of HMMER

$ /path/to/esl-reformat fasta phmmer.sto > phmmerout.fasta

here, you can convert it into other formats such as a2m, just replace ‘fasta’ with ‘a2m’ in the command line.

This output file will consist of FASTA sequences of significant hits.

Share This Article
Facebook Copy Link Print
ByTariq Abdullah
Tariq is founder of Bioinformatics Review and Lead Developer at IQL Technologies. His areas of expertise include algorithm design, phylogenetics, MicroArray, Plant Systematics, and genome data analysis. If you have questions, reach out to him via his homepage.
Leave a Comment

Leave a Reply Cancel reply

You must be logged in to post a comment.

ai tools vs traditional tools in bioinformatics
AI Tools vs Traditional Tools in Bioinformatics- Which one to select?
Algorithms Artificial Intelligence Machine Learning Software Tools
AI vs Physics in Molecular Docking
AI vs Physics in Molecular Docking: Towards Faster and More Accurate Pose Prediction
Artificial Intelligence Drug Discovery Machine Learning
10 years of Bioinformatics Review: From a Blog to a Bioinformatics Knowledge Hub!
Editorial
Starting in Bioinformatics? Do This First!
Starting in Bioinformatics? Do This First!
Tips & Tricks

You Might Also Like

Molecular dynamicsSoftwareTools

Tutorial: Molecular dynamics (MD) simulation using Gromacs

April 10, 2023
Installing pyrx on ubuntu
DockingSoftwareVirtual Screening

Installing Pyrx on Ubuntu

April 13, 2023
How to save high resolution images in Pymol using command line?
SoftwareTools

How to save high resolution images in Pymol using command line?

January 7, 2022
How to execute matlab from terminal in Ubuntu (Linux)?
Software

How to execute matlab from terminal in Ubuntu (Linux)?

June 11, 2022
Copyright 2024 IQL Technologies
  • Journal
  • Customer Support
  • Contact Us
  • FAQs
  • Terms of Use
  • Privacy Policy
  • Cookie Policy
  • Sitemap
Welcome Back!

Sign in to your account

Username or Email Address
Password

Lost your password?

Not a member? Sign Up