Alternative output types with better compression and cross platform : .csv.gz
& .parquet
#20
Labels
enhancement
New feature or request
.csv.gz
& .parquet
#20
our current plain text output in AusTraits 4.2.1 is 260Mb, with most 230Mb is
traits.csv
. We use an*.rds
filetype for smaller distribution, which compresses to just 12Mb. But this only works for R users.@cboettig pointed out two alternative types comprised types that are possibly better:
.csv.gz
and.parquet
Both are cross platform and offer comparable compression to
.rds
. E.g.traits.csv
which is 230Mb could be compressed to 10.8Mb astraits.csv.gz
, or 10.2Mb astraits.parquet
this will be particularly important when/if exporting wide format (see #19), which is
.csv
.csv.gz
.parquet
Apache
.parquet
format (see https://en.wikipedia.org/wiki/Apache_Parquet) is rapidly emerging as new standard, accessible via the arrow package.The text was updated successfully, but these errors were encountered: