Bioinformatics ReviewBioinformatics Review
Notification Show More
Font ResizerAa
  •  Home
  • Docking
  • MD Simulation
  • Tools
  • More Topics
    • Softwares
    • Sequence Analysis
    • Algorithms
    • Bioinformatics Programming
    • Bioinformatics Research Updates
    • Drug Discovery
    • Phylogenetics
    • Structural Bioinformatics
    • Editorials
    • Tips & Tricks
    • Bioinformatics News
    • Featured
    • Genomics
    • Bioinformatics Infographics
  • Community
    • BiR-Research Group
    • Community Q&A
    • Ask a question
    • Join Telegram Channel
    • Join Facebook Group
    • Join Reddit Group
    • Subscription Options
    • Become a Patron
    • Write for us
  • About Us
    • About BiR
    • BiR Scope
    • The Team
    • Guidelines for Research Collaboration
    • Feedback
    • Contact Us
    • Recent @ BiR
  • Subscription
  • Account
    • Visit Dashboard
    • Login
Font ResizerAa
Bioinformatics ReviewBioinformatics Review
Search
Have an existing account? Sign In
Follow US
ProteomicsSequence AnalysisSoftware

Sequence search against a set of local sequences (local database) using phmmer

Tariq Abdullah
Last updated: May 20, 2020 5:47 pm
Tariq Abdullah
Share
2 Min Read
SHARE

PHMMER is a sequence analysis tool used for protein sequences (http://hmmer.org; version 3.1 b2). It is available online as a web server and as well as a part of the HMMER stand-alone package (http://hmmer.org; version 3.1 b2). HMMER offers various useful features such as multiple sequence alignment including the file format conversion. 

In this article, a sequence search against a set of local sequences is explained using PHMMER stand-alone tool including the output in FASTA format. To do this, we will first obtain the primary output in Stockholm (.sto) format and then convert it into the FASTA format.

1. Make a local database

The local database consists of protein sequences in FASTA format. Let’s say, our local dataset file is ‘sequences.fasta’.

2. Search for protein sequences according to the input in the local database

Make a query sequence file, we will name it as ‘query.fasta’. This file consists of FASTA sequences to be searched within the local database. Open a terminal and type the following command:

$ /path/to/phmmer -A phmmer.sto query.fasta sequences.fasta

where -A is used to define a filename to save the multiple alignments of all significant hits in Stockholm format.

You can also adjust the inclusion thresholds of different e-values by using different arguments. For example,

–incE, default value is 0.01 which means that ~1 false positive in every 100 searches with different query sequences.

–incT, instead of using e-value, use a bit score of >=<value>.

There are several other arguments that you can find in the user guide of HMMER.

Now, we have output in Stockholm format. If you want it in FASTA format, then proceed to the next step.

3. Output in FASTA format

For this, we will be using the ‘esl-reformat’ binary of HMMER

$ /path/to/esl-reformat fasta phmmer.sto > phmmerout.fasta

here, you can convert it into other formats such as a2m, just replace ‘fasta’ with ‘a2m’ in the command line.

This output file will consist of FASTA sequences of significant hits.

Share This Article
Facebook Copy Link Print
ByTariq Abdullah
Tariq is founder of Bioinformatics Review and Lead Developer at IQL Technologies. His areas of expertise include algorithm design, phylogenetics, MicroArray, Plant Systematics, and genome data analysis. If you have questions, reach out to him via his homepage.
Leave a Comment

Leave a Reply Cancel reply

You must be logged in to post a comment.

Starting in Bioinformatics? Do This First!
Starting in Bioinformatics? Do This First!
Tips & Tricks
[Editorial] Is it ethical to change the order of authors’ names in a manuscript?
Editorial Opinion
Installing bbtools on Ubuntu
[Tutorial] Installing BBTools on Ubuntu (Linux).
Sequence Analysis Software Tools
wes_data_analysis Whole Exome Sequencing (WES) Data visualization Toolkit
wes_data_analysis: Whole Exome Sequencing (WES) Data visualization Toolkit
Bioinformatics Programming GitHub Python

You Might Also Like

Meta AnalysisNGSSequence AnalysisTools

Predictive metagenomics profiling: why, what and how ?

May 20, 2020
CoolBox- An open-source toolkit for genomic data visualization
GenomicsSoftwareTools

CoolBox- An open-source toolkit for genomic data visualization

October 25, 2021
CHARMM installation on Ubuntu
MD SimulationMolecular dynamicsSoftwareTools

Installing CHARMM on Ubuntu

July 12, 2020
Installing 3Dmapper on Ubuntu (Linux).
SoftwareTools

[Tutorial] How to install 3Dmapper on Ubuntu (Linux)?

March 13, 2025
Copyright 2024 IQL Technologies
  • Journal
  • Customer Support
  • Contact Us
  • FAQs
  • Terms of Use
  • Privacy Policy
  • Cookie Policy
  • Sitemap
Welcome Back!

Sign in to your account

Username or Email Address
Password

Lost your password?

Not a member? Sign Up