Genozip
| File:Genozip logo.png | |
| Original author(s) | Divon Lan |
|---|---|
| Initial release | 2020 |
| Repository | https://github.com/divonlan/genozip |
| Written in | C |
| Engine | |
| Platform | Linux, Mac, Windows |
| Type | Bioinformatics |
| License | Non-commercial license |
| Website | genozip |
Search Genozip on Amazon.
Genozip[1][2] is a proprietary universal compressor for genomic files. Its primary function is compressing FASTQ, SAM/BAM/CRAM, VCF/BCF, FASTA, GVF, PHYLIP, and 23andMe files, but it can also serve as a general-purpose file compressor.
Genozip operates by segmenting a source file into its individual data contexts, applying context-specific algorithms to exploit correlations within and between contexts, and then applying the appropriate compression codec to each context.[2][3]
Genozip is designed to be extensible, allowing the addition of new segmenters, context-specific algorithms, and/or codecs.[lower-alpha 1] It is one of the first universal compressors of genomic file formats,[lower-alpha 2].
References
- ↑ Lan,D. et al. (2021) Genozip: a universal extensible genomic data compressor. Bioinformatics (Oxford University Press)
- ↑ 2.0 2.1 Abdullah,T (2020) Genozip- a new compression tool for VCF files. Bioinformatics Review
- ↑ Lan,D. et al. (2020) genozip: a fast and efficient compression tool for VCF files. Bioinformatics (Oxford University Press), 36, 4091–4092.
Footnotes
This article "Genozip" is from Wikipedia. The list of its authors can be seen in its historical and/or the page Edithistory:Genozip. Articles copied from Draft Namespace on Wikipedia could be seen on the Draft Namespace of Wikipedia and not main one.
