Bioinformatics ReviewBioinformatics Review
Notification Show More
Font ResizerAa
  •  Home
  • Docking
  • MD Simulation
  • Tools
  • More Topics
    • Softwares
    • Sequence Analysis
    • Algorithms
    • Bioinformatics Programming
    • Bioinformatics Research Updates
    • Drug Discovery
    • Phylogenetics
    • Structural Bioinformatics
    • Editorials
    • Tips & Tricks
    • Bioinformatics News
    • Featured
    • Genomics
    • Bioinformatics Infographics
  • Community
    • BiR-Research Group
    • Community Q&A
    • Ask a question
    • Join Telegram Channel
    • Join Facebook Group
    • Join Reddit Group
    • Subscription Options
    • Become a Patron
    • Write for us
  • About Us
    • About BiR
    • BiR Scope
    • The Team
    • Guidelines for Research Collaboration
    • Feedback
    • Contact Us
    • Recent @ BiR
  • Subscription
  • Account
    • Visit Dashboard
    • Login
Font ResizerAa
Bioinformatics ReviewBioinformatics Review
Search
Have an existing account? Sign In
Follow US
GenomicsSequence AnalysisSoftwareTools

Roary: Analysis of Prokaryote Pan Genome on a large-scale

Dr. Muniba Faiza
Last updated: December 11, 2015 1:33 am
Dr. Muniba Faiza
Share
2 Min Read
SHARE

The Microbial Pan Genome is the union of genes shared by genomes of interest. This term was first used by Medini in 2005.

Since then, microbial genome data has been enormously increased, so to study processes such as selection and evolution, the construction of pan genome of species is required. But construction of pan genome from the real data available is very difficult and would not be accurate due to fragmented assemblies, poor annotation and also the contamination,i.e., microbial organisms can rapidly acquire genes from other organisms. Therefore, Andrew J. Page et al have developed

a new method to generate the pan genome of a set of related prokaryotic isolates and named the tool as ‘Roary’. It deals with thousands of isolates in a feasible time.

How Roary Works?

One annotated assembly per sample is input in the Roary from which coding regions are extracted and converted in to protein sequences, and all the partial sequences are removed and pre clustered using CD-HIT (a fast program for clustering and comparing). This produces a reduced set of protein sequences.These reduced sequences are compared all-against-all with the help of BLASTP with a user defined percentage sequence identity (default 95%). Now, by using conserved neighborhood genes, homologous groups are split in to true orthologs. Finally, a graph is constructed showing the  relationships of the clusters based on the order of occurrence in the input sequences.

Fig.1

Fig.1  Effect of dataset size on the wall time of multiple applications.

That’s how the orthologous genes of prokaryotes can be easily identified and the microbial evolution can be well studied. It is done on a large scale covering a large data set to analyse the pan genomes of prokaryotes. Other tools have also been made earlier than Roary for the same purpose,namely, PanOCT and PGAP, but Roary is more fast, heuristic and most feasible tool among them.

Note:

An exhaustive list of references for this article is available with the author and is available on personal request, for more details write to muniba@bioinformaticsreview.com.

Share This Article
Facebook Copy Link Print
ByDr. Muniba Faiza
Follow:
Dr. Muniba is a Bioinformatician based in New Delhi, India. She has completed her PhD in Bioinformatics from South China University of Technology, Guangzhou, China. She has cutting edge knowledge of bioinformatics tools, algorithms, and drug designing. When she is not reading she is found enjoying with the family. Know more about Muniba
10 years of Bioinformatics Review: From a Blog to a Bioinformatics Knowledge Hub!
Editorial
Starting in Bioinformatics? Do This First!
Starting in Bioinformatics? Do This First!
Tips & Tricks
[Editorial] Is it ethical to change the order of authors’ names in a manuscript?
Editorial Opinion
Installing bbtools on Ubuntu
[Tutorial] Installing BBTools on Ubuntu (Linux).
Sequence Analysis Software Tools

You Might Also Like

How to save high resolution images in Pymol using command line?
SoftwareTools

How to save high resolution images in Pymol using command line?

January 7, 2022
Most widely used tools for transcription factor binding site prediction
SoftwareToolsTranscription Factor Binding Site Prediction

Most widely used tools for transcription factor binding site prediction

April 22, 2021
alignet: ppi network aligner
SoftwareTools

AligNet- A New Protein-Protein Interaction Network Aligner

November 26, 2020
Meta AnalysisNGSSequence AnalysisTools

Predictive metagenomics profiling: why, what and how ?

May 20, 2020
Copyright 2024 IQL Technologies
  • Journal
  • Customer Support
  • Contact Us
  • FAQs
  • Terms of Use
  • Privacy Policy
  • Cookie Policy
  • Sitemap
Welcome Back!

Sign in to your account

Username or Email Address
Password

Lost your password?

Not a member? Sign Up