Connect with us

Genomics

Genozip- a new compression tool for VCF files

Published

on

vcf compression tool

Variant Call Format (VCF) is a text file format used to store thousands of genomic datasets. Since these files consist of a large number of gene sequences, their file size is quite large even after compression. Recently, a new compression tool has been introduced known as genozip [1].

genozip tool allows compression of VCF files without any loss. This tool utilizes a compression algorithm specific to genotypes that are only one data type represented in VCF files.

Features of genozip:

  • capable of storing data of any phasing structure, ploidy, and variant types with up to 99 alternate alleles per variant.
  • allows pipeline analyses along with lossless compression.
  • allows secure storage and distribution.
  • can be easily operated on major operating systems (Linux, Windows, and MAC).
  • allows seamless integration into analytical pipelines.
  • data can be encrypted with a password.
  • compression can be optimized according to the users’ needs.
  • consists of several other options.

genozip has been tested on a benchmark dataset that shows faster and higher compression ratios than the other tested tools [1]. For more details about this tool, click here.


References

  1. Lan, D., Tobler, R., Souilmi, Y., & Llamas, B. (2020). genozip: a fast and efficient compression tool for VCF files. Bioinformatics (Oxford, England).

Tariq is founder of Bioinformatics Review and CEO at IQL Technologies. His areas of expertise include algorithm design, phylogenetics, MicroArray, Plant Systematics, and genome data analysis. If you have questions, reach out to him via his homepage.

Genomics

CoolBox- An open-source toolkit for genomic data visualization

Published

on

CoolBox- An open-source toolkit for genomic data visualization

A new toolkit called CoolBox is developed for the visual analysis of genomic data [1]. It makes it easy to visualize patterns in a large-scale genomic dataset. (more…)

Continue Reading

Genomics

VISPR- A new tool to visualize CRISPR screening experiments

Published

on

VISPR- A new tool to visualize CRISPR screening experiments

As CRISPR/Cas9 is a well-known genome editing technology, it is important to explore and analyze CRISPR screening experiments. In this article, we discuss a new tool developed for better visualization of CRISPR screening experiments. (more…)

Continue Reading

Genomics

How to install Cortex on Ubuntu?

Published

on

Cortex - genome analysis framework

Cortex is a user-friendly framework for genome analysis [1]. It acquires less memory and is quite efficient in performance. It’s installation involves various steps. In this article, we will install Cortex on Ubuntu. (more…)

Continue Reading

LATEST ISSUE

ADVERT