Releases: NVIDIA/Megatron-Energon

4.0.0

19 Nov 12:37
26700bb

What's Changed

Breaking Changes

  • Dataset checkpoints saved with versions earlier than 4.0.0 are not compatible with 4.0.0 because of a structural simplification. Everything else (e.g. randomness and interface compatibility) should remain the same.
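
One way to avoid restoring an incompatible dataset checkpoint is to record the library version next to the checkpoint and check it before loading. The following is a minimal sketch under that assumption: the helper names `parse_version` and `checkpoint_is_loadable` are hypothetical and not part of Energon, and the release note does not say whether checkpoints carry a version field themselves.

```python
# Hypothetical guard, not an Energon API: assumes you stored the Energon
# version string yourself when the dataset checkpoint was written.
FIRST_COMPATIBLE = (4, 0, 0)  # per the release note, older checkpoints break


def parse_version(text: str) -> tuple[int, ...]:
    """Turn a plain dotted version such as '3.0.1' into (3, 0, 1)."""
    return tuple(int(part) for part in text.strip().split("."))


def checkpoint_is_loadable(saved_with: str) -> bool:
    """Return False for checkpoints written before 4.0.0.

    True only means the checkpoint is not ruled out by this release note.
    """
    return parse_version(saved_with) >= FIRST_COMPATIBLE


if __name__ == "__main__":
    for version in ("2.3.0", "3.0.1", "4.0.0"):
        status = "ok" if checkpoint_is_loadable(version) else "discard and restart"
        print(f"checkpoint from {version}: {status}")
```

Only plain dotted integer versions are handled here; pre-release suffixes would need a proper version parser.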

Full Changelog: 3.0.1...4.0.0

3.0.1

06 Nov 12:48
10c47c6

What's Changed

  • Fix AttributeError: module 'fsspec' has no attribute 'asyn' (see #26), by @philipp-fischer

Full Changelog: 3.0.0...3.0.1

3.0.0

24 Oct 13:10
62ea012

Full Changelog: 2.3.0...3.0.0

2.3.0

22 Aug 19:21
76256b1

What's Changed

  • Support loading datasets via dict() & update SimilarityInterleaved by @paul-gibbons in #4
  • Fix gc_init_worker for ShardedTensor by @philipp-fischer in #7
  • Fix dataloader being stuck iterating samples in a certain condition by @voegtlel in #6
  • Fix epath with relative usage by @voegtlel in #3

Full Changelog: 2.2.0...2.3.0

2.2.0

04 Jun 10:25

Full Changelog: 2.1.1...2.2.0

Version 2.2.0 introduces support for video files in your dataset and adds new sample and dataset classes for video question answering (VidQA).
The new classes are VidQASample and VidQAWebdataset.

If your dataset shards contain videos, they should now be decoded automatically and yield a VideoData object, which contains the frames and audio data as well as some metadata.

2.1.1 First public release

24 May 19:42