Powered by OpenAIRE graph
Found an issue? Give us feedback
image/svg+xml Jakob Voss, based on art designer at PLoS, modified by Wikipedia users Nina and Beao Closed Access logo, derived from PLoS Open Access logo. This version with transparent background. http://commons.wikimedia.org/wiki/File:Closed_Access_logo_transparent.svg Jakob Voss, based on art designer at PLoS, modified by Wikipedia users Nina and Beao Cytometry Part Aarrow_drop_down
image/svg+xml Jakob Voss, based on art designer at PLoS, modified by Wikipedia users Nina and Beao Closed Access logo, derived from PLoS Open Access logo. This version with transparent background. http://commons.wikimedia.org/wiki/File:Closed_Access_logo_transparent.svg Jakob Voss, based on art designer at PLoS, modified by Wikipedia users Nina and Beao
Cytometry Part A
Article . 2019 . Peer-reviewed
License: Wiley Online Library User Agreement
Data sources: Crossref
Cytometry Part A
Article . 2020
versions View all 3 versions
addClaim

Lossless Compression of Cytometric Data

Authors: Anne E. Bras; Vincent H. J. van der Velden;

Lossless Compression of Cytometric Data

Abstract

AbstractNowadays, most cytometrists apply lossless compression by storing their FCS files in ZIP archives. Unfortunately, ZIP only achieves modest space savings in cytometric data, due to DEFLATE being used as the underlying lossless compression algorithm (LCA). Presumably, other modern LCA can outperform DEFLATE, especially in terms of space savings. Twenty‐one codecs (programs implementing LCA) were evaluated in 167,131 publicly available FCS files. Within floating‐point data, as produced by modern instruments, most favorable compression ratios (CRs) were achieved by ZPAQ (median 0.469), BCM (median 0.523), and LZMA (median 0.545). In comparison, the DEFLATE‐based codecs only achieved median CR of 0.728 under the most optimal conditions. By default, ZIP offers nine compression level (CL) settings, where lower ZIP‐CL optimizes for time efficiency, while higher ZIP‐CL optimizes for space efficiency. Interestingly, the third ZIP‐CL already resulted in near optimal CR in 90% of the files with floating‐point data, as produced by digital cytometers. LZMA is well established, widely supported, and actively maintained (in sharp contrast to ZPAQ and BCM) and therefore arguably the most attractive alternative for ZIP. Within floating‐point data, by shifting from ZIP (under optimal conditions) to LZMA (at default settings), the median CR can be improved by 25%. Based on our results, cytometrists can benefit from state‐of‐the‐art compression by choosing the appropriate codec for their situation. Our results are likely to speed‐up the adaptation of modern codecs, as CR around 0.5 were beyond all expectations, and such space savings will benefit the field of cytometry. © 2019 International Society for Advancement of Cytometry

Country
Netherlands
Keywords

Time Factors, Humans, Data Compression, Flow Cytometry, EMC MM-02-72-01

  • BIP!
    Impact byBIP!
    selected citations
    These citations are derived from selected sources.
    This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    5
    popularity
    This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
    Top 10%
    influence
    This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    Average
    impulse
    This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
    Average
Powered by OpenAIRE graph
Found an issue? Give us feedback
selected citations
These citations are derived from selected sources.
This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Citations provided by BIP!
popularity
This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
BIP!Popularity provided by BIP!
influence
This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Influence provided by BIP!
impulse
This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
BIP!Impulse provided by BIP!
5
Top 10%
Average
Average
Upload OA version
Are you the author of this publication? Upload your Open Access version to Zenodo!
It’s fast and easy, just two clicks!