Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support for Int RLE v1 encoding #24

Merged
merged 1 commit into from
Nov 5, 2023
Merged

Support for Int RLE v1 encoding #24

merged 1 commit into from
Nov 5, 2023

Conversation

Jefffrey
Copy link
Collaborator

@Jefffrey Jefffrey commented Nov 5, 2023

Closes #9

Support decoding V1 RLE

Copy link
Collaborator Author

@Jefffrey Jefffrey left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Initial support for v1 encoding

I'll need to do another pass after to do some cleanup and refactoring, but don't want to make the PR any larger

Comment on lines +269 to +277
pub fn v0_file_test() {
let path = basic_path("demo-11-zlib.orc");
let reader = new_arrow_reader_root(&path);
let _expected_row_count = reader.total_row_count();
let batches = reader.collect::<Result<Vec<_>, _>>().unwrap();
let _total_rows: usize = batches.iter().map(|b| b.num_rows()).sum();
// TODO: not reading entire file, debug
// assert_eq!(expected_row_count as usize, total_rows);
}
Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

When we do collect on ArrowReader it doesn't seem to be reading all rows in the file, will need to debug separately

@Jefffrey Jefffrey requested a review from WenyXu November 5, 2023 10:27
Copy link
Collaborator

@WenyXu WenyXu left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@Jefffrey Jefffrey merged commit 2b7a2b0 into main Nov 5, 2023
6 checks passed
@Jefffrey Jefffrey deleted the feature/int_v1_support branch November 5, 2023 10:49
waynexia pushed a commit that referenced this pull request Oct 24, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Integer RLE v1 decoding support
2 participants