-
Maybe we could cache parquet/orc/json directly instead of building a general cache lib.
-
A question (I haven't read through the code yet): what is the unit of caching for parquet, the whole file or a page? 🤔
-
Hi, @Xuanwo. Could I give this a try with foyer? Is there a guide for doing it?
-
Regarding this issue, I've had some recent thoughts. With opendal we can easily access almost all storage media, which makes writing a multi-layer caching system quite straightforward. Users can assemble their caching systems like building blocks, for example: FIFO memory 1 GB, LRU memory 10 GB, FIFO SSD 1 GB, FIFO S3 Express One Zone 10 TB. The caching algorithms will actually become more complex (promote to the upper level, reinsert, move to the lower level, evict), but that's not the main point. The key question is how to abstract these layers.
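To make the layering question concrete, here is a minimal sketch of one possible abstraction, using only the Rust standard library. Everything in it (`CacheLayer`, `FifoLayer`, `TieredCache`, `with_layer`, `level_of`) is a hypothetical name made up for illustration, not an existing OpenDAL or foyer API. It assumes each layer is bounded by total bytes, that a hit in a lower layer promotes the entry to the top, and that evicted entries are demoted one level down until they fall off the last layer.

```rust
use std::collections::{HashMap, VecDeque};

/// One tier of the cache (hypothetical trait, not an OpenDAL API).
pub trait CacheLayer {
    /// Look up a key without removing it.
    fn get(&mut self, key: &str) -> Option<Vec<u8>>;
    /// Remove an entry, returning it if it was present.
    fn remove(&mut self, key: &str) -> Option<Vec<u8>>;
    /// Insert an entry and return everything evicted to make room.
    fn insert(&mut self, key: String, value: Vec<u8>) -> Vec<(String, Vec<u8>)>;
}

/// In-memory FIFO layer bounded by total value bytes.
pub struct FifoLayer {
    capacity: usize,
    used: usize,
    order: VecDeque<String>,
    data: HashMap<String, Vec<u8>>,
}

impl FifoLayer {
    pub fn new(capacity: usize) -> Self {
        Self { capacity, used: 0, order: VecDeque::new(), data: HashMap::new() }
    }
}

impl CacheLayer for FifoLayer {
    fn get(&mut self, key: &str) -> Option<Vec<u8>> {
        self.data.get(key).cloned()
    }

    fn remove(&mut self, key: &str) -> Option<Vec<u8>> {
        let value = self.data.remove(key)?;
        self.used -= value.len();
        self.order.retain(|k| k.as_str() != key);
        Some(value)
    }

    fn insert(&mut self, key: String, value: Vec<u8>) -> Vec<(String, Vec<u8>)> {
        self.remove(&key); // replace any existing copy
        if value.len() > self.capacity {
            return vec![(key, value)]; // too large for this layer: hand it down
        }
        let mut evicted = Vec::new();
        while self.used + value.len() > self.capacity {
            match self.order.pop_front() {
                Some(oldest) => {
                    if let Some(old) = self.data.remove(&oldest) {
                        self.used -= old.len();
                        evicted.push((oldest, old));
                    }
                }
                None => break,
            }
        }
        self.used += value.len();
        self.order.push_back(key.clone());
        self.data.insert(key, value);
        evicted
    }
}

/// Layers ordered from fastest (index 0) to slowest.
pub struct TieredCache {
    layers: Vec<Box<dyn CacheLayer>>,
}

impl TieredCache {
    pub fn new() -> Self {
        Self { layers: Vec::new() }
    }

    /// Append a slower layer below the existing ones.
    pub fn with_layer<L: CacheLayer + 'static>(mut self, layer: L) -> Self {
        self.layers.push(Box::new(layer));
        self
    }

    /// Insert at `level`; evicted entries move one level down, and
    /// entries evicted from the last layer are dropped.
    fn insert_at(&mut self, level: usize, key: String, value: Vec<u8>) {
        let mut pending = vec![(level, key, value)];
        while let Some((lvl, k, v)) = pending.pop() {
            if lvl >= self.layers.len() {
                continue;
            }
            for (ek, ev) in self.layers[lvl].insert(k, v) {
                pending.push((lvl + 1, ek, ev));
            }
        }
    }

    pub fn insert(&mut self, key: &str, value: Vec<u8>) {
        for layer in self.layers.iter_mut() {
            layer.remove(key); // drop stale copies in every layer
        }
        self.insert_at(0, key.to_string(), value);
    }

    /// On a hit below the top layer, promote the entry to the top.
    pub fn get(&mut self, key: &str) -> Option<Vec<u8>> {
        for i in 0..self.layers.len() {
            if let Some(value) = self.layers[i].get(key) {
                if i > 0 {
                    self.layers[i].remove(key);
                    self.insert_at(0, key.to_string(), value.clone());
                }
                return Some(value);
            }
        }
        None
    }

    /// Index of the first layer holding `key` (for inspection only).
    pub fn level_of(&mut self, key: &str) -> Option<usize> {
        for i in 0..self.layers.len() {
            if self.layers[i].get(key).is_some() {
                return Some(i);
            }
        }
        None
    }
}
```

With this shape, a stack like `TieredCache::new().with_layer(FifoLayer::new(4)).with_layer(FifoLayer::new(8))` is assembled block by block. An LRU layer, or a layer backed by an opendal operator for SSD or S3, would be just another `CacheLayer` implementation. The sketch is synchronous and keeps the promote/demote policy inside `TieredCache`; whether that policy belongs there or in the layers themselves is exactly the open design question.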
-
I think mimicking the design of CacheLib is a good idea. At a high level, we could cache files keyed by their URL. Supporting only parquet seems a bit odd to me, since we operate at file granularity all the time.
-
In the past, we attempted to implement a cache layer in OpenDAL, but it did not function effectively. Therefore, it is now necessary for us to revisit this area and identify potential improvements.
For example, can we port https://github.com/facebook/CacheLib to pure Rust on top of OpenDAL?