How to export file from csv -> hdf5 without generating yaml file. ? #2256
-
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 1 reply
-
Hi, Do you understand the point of that file? In a sense.. that file contains metadata about the conversion process. So you can do something like df = vaex.open('file.csv', convert=True) The first time you run the line above, the CSV file will be converted to HDF5. The 2nd time you run it, Vaex will just read the HDF5 file directly, and knows exactly what to read thanks to that YAML file. YAML file also includes some metadata of the source file (the CSV) and the converted file (the HDF5). So if for some reason the source or the output file is modified (like due to an update or whatever), running the line above will re-trigger the conversion. If you do not want this, you can simply delete the yaml file. If you do not want it created, then convert data yourself like this df = vaex.open('file.csv', convert=True)
df.export('file.hdf5') I don't think the 2nd approach adds that much complexity :). In the past, using Does this help a bit? |
Beta Was this translation helpful? Give feedback.
Hi,
Do you understand the point of that file? In a sense.. that file contains metadata about the conversion process. So you can do something like
The first time you run the line above, the CSV file will be converted to HDF5. The 2nd time you run it, Vaex will just read the HDF5 file directly, and knows exactly what to read thanks to that YAML file. YAML file also includes some metadata of the source file (the CSV) and the converted file (the HDF5). So if for some reason the source or the output file is modified (like due to an update or whatever), running the line above will re-trigger the conversion.
If you do not want this, you can simply delete…