clp-core: Refactor FileReader to use RAII. #496

haiqi96 · 2024-07-26T20:49:18Z

Description

This PR refactors FileReader so that it no longer exposes the open()/close() API. Instead, a FileReader must be constructed with the path to open. This removes the risk of leaving a FileReader in an invalid state.
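For readers unfamiliar with the pattern, here is a minimal sketch of what constructor-based opening could look like. `SketchFileReader` and its members are hypothetical and are not the actual clp `FileReader` interface; the sketch only illustrates that an object which exists always holds an open file.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdio>
#include <stdexcept>
#include <string>

// Hypothetical stand-in for a RAII-style file reader (not clp's real class).
class SketchFileReader {
public:
    // The file is opened in the constructor; if that fails, no object exists.
    explicit SketchFileReader(std::string const& path)
            : m_file{std::fopen(path.c_str(), "rb")} {
        if (nullptr == m_file) {
            throw std::runtime_error("Failed to open " + path);
        }
    }

    // The file is closed in the destructor, so there is no close() to forget.
    ~SketchFileReader() { std::fclose(m_file); }

    // Non-copyable: exactly one object owns the underlying handle.
    SketchFileReader(SketchFileReader const&) = delete;
    auto operator=(SketchFileReader const&) -> SketchFileReader& = delete;

    // Reads up to `buf_len` bytes and returns the number of bytes actually read.
    auto read(char* buf, std::size_t buf_len) -> std::size_t {
        assert(nullptr != buf);
        return std::fread(buf, 1, buf_len, m_file);
    }

private:
    std::FILE* m_file;
};
```

Because there is no default constructor and no `open()`, a caller cannot end up holding a reader that was never opened or that was closed and then reused.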

The PR also updates how FileReader is used in multiple places. Originally, the code kept a single FileReader object and reopened and closed it whenever a new file needed to be read. With the new design, the code either creates a stack-allocated FileReader object with the target path and lets it go out of scope on its own, or uses a unique_ptr to manage the FileReader.
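The two usage patterns described above can be sketched as follows. `ScopedReader` and both functions are hypothetical illustrations built on `std::ifstream`, not code from this PR.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdio>
#include <fstream>
#include <memory>
#include <stdexcept>
#include <string>
#include <vector>

// Hypothetical reader that opens its file on construction (illustration only).
class ScopedReader {
public:
    explicit ScopedReader(std::string const& path) : m_stream{path, std::ios::binary} {
        if (false == m_stream.is_open()) {
            throw std::runtime_error("Failed to open " + path);
        }
    }

    // Counts the bytes between the current position and the end of the file.
    auto count_remaining_bytes() -> std::size_t {
        std::size_t num_bytes{0};
        char c{};
        while (m_stream.get(c)) {
            ++num_bytes;
        }
        return num_bytes;
    }

private:
    std::ifstream m_stream;
};

// Pattern 1: a local object that closes the file when it goes out of scope.
auto count_bytes(std::string const& path) -> std::size_t {
    ScopedReader reader{path};
    return reader.count_remaining_bytes();
}  // `reader` is destroyed here, which closes the file

// Pattern 2: a unique_ptr that is re-pointed at a new reader for each file.
auto count_bytes_in_all(std::vector<std::string> const& paths) -> std::size_t {
    std::unique_ptr<ScopedReader> reader;
    std::size_t total{0};
    for (auto const& path : paths) {
        // The new reader is constructed first; the assignment then destroys
        // the previous reader, which closes the previous file.
        reader = std::make_unique<ScopedReader>(path);
        total += reader->count_remaining_bytes();
    }
    return total;
}
```

The second pattern is mainly useful when the reader has to outlive the block in which it is created, for example when construction is wrapped in a try block.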

Validation performed

Verified that all unit tests still pass.
Compressed and decompressed a single 64MB log and confirmed that the decompressed log matches the original.
Ran simple search queries and ensured that clg doesn't return any errors.

haiqi96

Posting a few high-level questions.

haiqi96 · 2024-07-26T21:00:30Z

components/core/src/clp/DictionaryReader.hpp

-            m_segment_index_file_reader,
-            m_segment_index_decompressor
-    );
+    read_new_entries(dictionary_path, segment_index_path);


Made this a separate function so the code looks cleaner. Alternatively, we could put the code directly into open and remove the function?

components/core/src/clp/DictionaryReader.hpp

components/core/src/clp/FileReader.cpp

haiqi96 · 2024-07-26T21:19:03Z

components/core/src/clp/Utils.cpp

@@ -262,38 +265,29 @@ void load_lexer_from_file(
        }

        if (contains_delimiter) {
-            FileReader schema_reader;
-            ErrorCode error_code = schema_reader.try_open(schema_ast->m_file_path);
-            if (ErrorCode_Success != error_code) {


The exception thrown here doesn't make sense. It's hard to imagine that the file was opened once but then failed to open again, so I would rather let the default exception from FileReader propagate.

haiqi96 · 2024-07-26T21:24:44Z

components/core/src/clp/DictionaryReader.hpp


    // Read dictionary header
-    auto num_dictionary_entries = read_dictionary_header(m_dictionary_file_reader);
+    uint64_t num_dictionary_entries{};


Technically, we could turn this part into a helper function and put it in dictionary_utils.cpp, but given how simple the code is, I feel it's also OK to leave it as is.
Let me know if you think it's necessary.

haiqi96 · 2024-07-26T21:45:21Z

components/core/src/clp/DictionaryReader.hpp

+) {
+    constexpr size_t cDecompressorFileReadBufferCapacity = 64 * 1024;  // 64 KB
+
+    FileReader dictionary_file_reader{dictionary_path};


I put them here, instead of right before the code that uses them, so that if either file fails to open, we can skip the processing.
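A minimal sketch of that ordering, assuming a reader whose construction throws on failure. `open_or_throw` and `read_new_entries_sketch` are hypothetical names, and the line-by-line reading is a placeholder for the real dictionary parsing.

```cpp
#include <cassert>
#include <cstdio>
#include <fstream>
#include <stdexcept>
#include <string>
#include <vector>

// Hypothetical helper: opens `path` or throws, mimicking a constructor that
// opens the file.
auto open_or_throw(std::string const& path) -> std::ifstream {
    std::ifstream stream{path};
    if (false == stream.is_open()) {
        throw std::runtime_error("Failed to open " + path);
    }
    return stream;
}

// Opens both inputs before doing any work, so a failure on either file
// leaves `lines` untouched.
void read_new_entries_sketch(
        std::string const& dictionary_path,
        std::string const& segment_index_path,
        std::vector<std::string>& lines
) {
    auto dictionary_stream = open_or_throw(dictionary_path);
    auto segment_index_stream = open_or_throw(segment_index_path);

    // Only reached when both files were opened successfully.
    std::string line;
    while (std::getline(dictionary_stream, line)) {
        lines.push_back(line);
    }
    while (std::getline(segment_index_stream, line)) {
        lines.push_back(line);
    }
}
```

If the second file were opened only after the first had been processed, a failure would leave the caller with partially loaded state.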

components/core/src/clp/DictionaryReader.hpp

components/core/src/clp/FileReader.hpp

haiqi96 · 2024-08-13T14:26:29Z

components/core/src/clp/dictionary_utils.cpp

 uint64_t read_dictionary_header(FileReader& file_reader) {
    auto dictionary_file_reader_pos = file_reader.get_pos();
    file_reader.seek_from_begin(0);
-    uint64_t num_dictionary_entries;
+    uint64_t num_dictionary_entries{};


haiqi96 · 2024-08-13T14:26:36Z

components/core/src/clp/dictionary_utils.cpp

@@ -39,7 +16,7 @@ uint64_t read_segment_index_header(FileReader& file_reader) {
    // Read segment index header
    auto segment_index_file_reader_pos = file_reader.get_pos();
    file_reader.seek_from_begin(0);
-    uint64_t num_segments;
+    uint64_t num_segments{};


tbd same as above
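For context on the two `{}` changes above: empty braces value-initialize the variable to zero, so it never holds an indeterminate value even if the read that follows does not fill it in. The sketch below illustrates this with hypothetical helpers over `std::istream`; it is not the code in dictionary_utils.cpp, and how the real reader reports a failed read is not shown in this thread.

```cpp
#include <cassert>
#include <cstdint>
#include <cstring>
#include <istream>
#include <sstream>
#include <string>

// Hypothetical helper: reads a native-endian 64-bit value. `value` is only
// written when all 8 bytes were read successfully.
auto try_read_u64(std::istream& in, std::uint64_t& value) -> bool {
    char buf[sizeof(value)];
    if (false == static_cast<bool>(in.read(buf, sizeof(buf)))) {
        return false;
    }
    std::memcpy(&value, buf, sizeof(buf));
    return true;
}

// Returns the header value, or 0 if it could not be read.
auto read_header_or_zero(std::istream& in) -> std::uint64_t {
    // Value-initialized to 0. With `std::uint64_t num_entries;` a failed read
    // would leave an indeterminate value to be returned.
    std::uint64_t num_entries{};
    try_read_u64(in, num_entries);
    return num_entries;
}
```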

kirkrodrigues · 2024-08-13T19:58:10Z

components/core/src/clp/clp/compression.cpp

+    unique_ptr<FileReader> grouped_file_path_reader;
+    try {
+        grouped_file_path_reader = make_unique<FileReader>(list_path);
+    } catch (FileReader::OperationFailed const& err) {


This is minor, but we usually use e, ex, or exception for exceptions.

components/core/src/clp/FileReader.cpp

components/core/src/clp/DictionaryWriter.hpp

kirkrodrigues · 2024-08-13T21:00:52Z

components/core/src/clp/DictionaryWriter.hpp

-    FileReader dictionary_file_reader;
-    FileReader segment_index_file_reader;
+    FileReader dictionary_file_reader{dictionary_path};
+


Sorry, I didn't spot this earlier. This method was written as a way to reload the dictionary so that we can continue writing an archive, so although we don't use it right now, to maintain correctness, we'd need to also load the segment index, right?

I think an alternative solution is to get rid of open_and_preload entirely until we add support for opening and rewriting archives.

If I had to choose between the two, I'd lean toward removing it.

You also need to remove the method prototype.

components/core/src/clp/DictionaryReader.hpp

Co-authored-by: kirkrodrigues <2454684+kirkrodrigues@users.noreply.github.com>

kirkrodrigues

For the PR title, how about:

clp-core: Refactor FileReader to use RAII.

haiqi96 added 8 commits July 26, 2024 16:09

first batch of change

edbcc6d

Remove dictionary utils

21b4471

fix exception issues and make sure unit tests and clg can build

3a3c9ba

Linter

2c40e29

Fixes

d503436

Rearrange code

1ff5ea8

fix clo build

8a7ddc7

fix

70325a9

haiqi96 commented Jul 26, 2024


haiqi96 added 3 commits July 26, 2024 17:33

Fix make-dictionaries-readable

99d45a6

fix constructors

b086953

refactor

746834f

haiqi96 commented Jul 26, 2024


components/core/src/clp/DictionaryReader.hpp (outdated)

haiqi96 commented Jul 26, 2024


components/core/src/clp/FileReader.hpp

haiqi96 added 6 commits July 29, 2024 10:59

Update docstrings and comments. Also updated destructor

d2a6836

Merge branch 'main' into FileReaderRefactor

d30030a

Revert unintended changes

4dabc96

fix

a8d0747

Missing fix

ff0b9d8

Fixes

b3f2795

haiqi96 changed the title ~~File reader refactor~~ clp-core: Refactor FileReader to use constructor initialization. Aug 13, 2024

haiqi96 changed the title ~~clp-core: Refactor FileReader to use constructor initialization.~~ clp-core: Refactor FileReader to use constructor-style initialization. Aug 13, 2024

haiqi96 marked this pull request as ready for review August 13, 2024 13:47

Revert unintended changes

4d0d80b

haiqi96 commented Aug 13, 2024


haiqi96 added 2 commits August 13, 2024 11:08

Fix one weird issue

70c8342

small fix

83ee77e

kirkrodrigues requested changes Aug 13, 2024


haiqi96 and others added 4 commits August 13, 2024 18:02

Apply suggestions from code review

08b4137

Co-authored-by: kirkrodrigues <2454684+kirkrodrigues@users.noreply.github.com>

Apply code review comments

4fd303f

Linter

3538ec6

Missing change

45cdf64

kirkrodrigues approved these changes Aug 14, 2024


haiqi96 changed the title ~~clp-core: Refactor FileReader to use constructor-style initialization.~~ clp-core: Refactor FileReader to use RAII. Aug 14, 2024

haiqi96 merged commit a89ff14 into y-scope:main Aug 14, 2024
12 of 13 checks passed

jackluo923 pushed a commit to jackluo923/clp that referenced this pull request Dec 4, 2024

clp-core: Refactor FileReader to use RAII. (y-scope#496)

b7ad520
