| Original author(s) | Divon Lan |
| Platform | Linux, Mac, Windows |
Genozip is a proprietary universal compressor for genomic files. It is designed primarily to compress FASTQ, SAM/BAM/CRAM, VCF/BCF, FASTA, GVF, PHYLIP and 23andMe files, but it can also be used as a general-purpose file compressor.
Genozip works in three stages. It first segments a source file into its individual data contexts. It then applies context-specific algorithms that exploit correlations between values within the same context or between different contexts. Finally, it compresses each context with the codec best suited to it.
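The three stages can be illustrated with a toy sketch. This is not Genozip's actual implementation: the tab-separated input, the function names (`segment`, `delta_encode`, `compress_contexts`), the use of delta encoding as the context-specific step, and the choice of standard-library codecs are all assumptions made for illustration. Each column of the input is treated as one context, and inputs are assumed to be non-empty with the same number of fields on every line.

```python
import bz2
import lzma
import zlib

# Candidate codecs (illustrative choice; all from the Python standard library).
CODECS = {"zlib": zlib.compress, "bz2": bz2.compress, "lzma": lzma.compress}


def segment(text):
    """Split tab-separated records into one list of values per column (context)."""
    rows = [line.split("\t") for line in text.splitlines() if line]
    return [list(column) for column in zip(*rows)]


def delta_encode(values):
    """Replace each integer (after the first) with its difference from the previous one."""
    numbers = [int(v) for v in values]
    deltas = [numbers[0]] + [b - a for a, b in zip(numbers, numbers[1:])]
    return [str(d) for d in deltas]


def compress_contexts(text, delta_columns=(1,)):
    """Return one (codec_name, compressed_bytes) pair per context."""
    result = []
    for index, values in enumerate(segment(text)):
        # Context-specific step: sorted positions become small, repetitive deltas.
        if index in delta_columns:
            values = delta_encode(values)
        payload = "\n".join(values).encode()
        # Codec selection: try every candidate and keep the smallest output.
        name, blob = min(
            ((n, f(payload)) for n, f in CODECS.items()),
            key=lambda item: len(item[1]),
        )
        result.append((name, blob))
    return result


if __name__ == "__main__":
    sample = "chr1\t100\tA\nchr1\t105\tC\nchr1\t110\tG\n"
    for codec_name, blob in compress_contexts(sample):
        print(codec_name, len(blob))
```

Keeping each column separate means values of the same kind are compressed together, and transforming a column before compression (here, storing differences between sorted positions) tends to give the codec more repetitive input.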
Genozip is designed to be extensible: it can be extended by adding new segmenters,[lower-alpha 1] context-specific algorithms, and/or codecs. It is one of the first universal compressors of genomic file formats.[lower-alpha 2]
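One common way to make such a design extensible is a registry that maps file types to segmenters. The sketch below is only an assumption about how this could look, not Genozip's actual plug-in mechanism; `SEGMENTERS`, `register_segmenter`, `segment_tsv`, `segment_generic` and `segment_file` are hypothetical names.

```python
# Hypothetical registry mapping a file extension to a segmenter function.
SEGMENTERS = {}


def register_segmenter(extension):
    """Decorator that registers a segmenter for files ending in `extension`."""
    def decorator(func):
        SEGMENTERS[extension] = func
        return func
    return decorator


@register_segmenter(".tsv")
def segment_tsv(text):
    """Format-aware segmenter: one context per tab-separated column."""
    rows = [line.split("\t") for line in text.splitlines() if line]
    return {f"col{i}": list(col) for i, col in enumerate(zip(*rows))}


def segment_generic(text):
    """Fallback for unrecognized formats: the whole file is a single context."""
    return {"data": text.splitlines()}


def segment_file(filename, text):
    """Pick the registered segmenter for this file, or fall back to the generic one."""
    for extension, segmenter in SEGMENTERS.items():
        if filename.endswith(extension):
            return segmenter(text)
    return segment_generic(text)


if __name__ == "__main__":
    print(segment_file("calls.tsv", "a\t1\nb\t2\n"))
    print(segment_file("notes.txt", "hi\nthere"))
```

With this layout, supporting a new format means registering one more function, while unrecognized files still pass through the generic path.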
- Lan, D. et al. (2021). Genozip: a universal extensible genomic data compressor. Bioinformatics (Oxford University Press).
- Abdullah, T. (2020). Genozip - a new compression tool for VCF files. Bioinformatics Review.
- Lan, D. et al. (2020). genozip: a fast and efficient compression tool for VCF files. Bioinformatics (Oxford University Press), 36, 4091–4092.
- That is, to add support for compressing additional file formats.
- That is, able to compress all common genomic file formats while taking advantage of their unique characteristics to achieve better compression than general-purpose compressors.
This article "Genozip" is from Wikipedia. The list of its authors can be seen in its history and/or on the page Edithistory:Genozip.