Update NeMo/Megatron #302
Open
sichu2023 wants to merge 38 commits into main from sichu/update-3rd-party
+120
−102
38 commits
0930452 bump commit hash (sichu2023)
a7e7b36 update ESM2TEDotProductAttention (sichu2023)
e44f931 bump te version (sichu2023)
db1b6e5 update 3rd party commit hash (sichu2023)
2907fc7 remove extra state from load_weights_sharded_inplace_nemo2_to_mcore (sichu2023)
ff5b98a mark test_mixin_strategy_contract_get_loss_reduction xfail (sichu2023)
de2e9c2 switch Megatron-LM commit hash to main (sichu2023)
ff12671 update 3rd party commit hash (sichu2023)
622a914 reuse TEDotProductAttention __init__ (sichu2023)
40540f1 fix moe_token_dispatcher_type when variable_seq_lengths (sichu2023)
5b0eca6 fix rotary_pos_emb get_rotary_seq_len call (sichu2023)
c826b86 revert te version (sichu2023)
279aa74 Revert "reuse TEDotProductAttention __init__" (sichu2023)
454271e support cp_comm_type in ESM2TEDotProductAttention __init__ (sichu2023)
184ccde pump NeMo/Megatron/TE commit hash (sichu2023)
b9615c5 fix test_tokenizer_serialization (sichu2023)
69e2899 update iomixin test - nemo only captures non-default arguments to __i… (sichu2023)
70f9707 increase limit_val_batches and val_check_interval to avoid duplicated… (sichu2023)
c7d0b6b add checkpoint callback to every mode in stop-and-go-test for nemo up… (sichu2023)
44189e4 Revert "update iomixin test - nemo only captures non-default argument… (sichu2023)
9a0c976 add notes on IOMixin behavior (sichu2023)
32136de Revert "increase limit_val_batches and val_check_interval to avoid du… (sichu2023)
cb8856f add step in checkpoint_dir to avoid name clashing (sichu2023)
2b0c87a fix test_main_runs for esm2 and geneformer (sichu2023)
4645768 update test_iomixin_utils.py (sichu2023)
b1ef6c6 revert ModelCheckpoint move in stopandgo (pstjohn)
522e849 disable ckpt_async_save (sichu2023)
7eb1e8a use trainer.should_stop to interrupt training, remove uneven checks i… (pstjohn)
35c9bb0 drop every_n_train_steps in ModelCheckpoint (sichu2023)
c9a421f update min_lr in esm2 scheduler (sichu2023)
9159a25 bump nemo version (sichu2023)
430c228 mark validation stop and go test xfail (sichu2023)
ced60e6 drop every_n_train_steps (sichu2023)
f1621e5 update geneformer output tolerance (sichu2023)
a7e4f39 ruff (sichu2023)
e53a5ac bump megatron version (sichu2023)
1fd15ba disable ckpt_async_save (sichu2023)
fb81158 revert to original loss thresholds from main (pstjohn)
Submodule Megatron-LM updated (576 files)
@@ -30,7 +30,6 @@
 from bionemo.testing import megatron_parallel_state_utils


-@pytest.mark.skip("duplicate unittest")
 @pytest.fixture
 def dummy_protein_dataset(tmp_path):
     """Create a mock protein dataset."""
@@ -62,7 +61,6 @@ def dummy_protein_dataset(tmp_path):
     return db_file


-@pytest.mark.skip("duplicate unittest")
 @pytest.fixture
 def dummy_parquet_train_val_inputs(tmp_path):
     """Create a mock protein train and val cluster parquet."""
@@ -104,7 +102,7 @@ def test_main_runs(monkeypatch, tmpdir, dummy_protein_dataset, dummy_parquet_tra
         result_dir=result_dir,
         wandb_project=None,
         wandb_offline=True,
-        num_steps=55,
+        num_steps=10,
         warmup_steps=5,
         limit_val_batches=1,
         val_check_interval=1,

Review comment on `def dummy_parquet_train_val_inputs(tmp_path):`
Now that you're at it, you may want to consider importing these fixtures from bionemo.testing.
Attempting to split this as a separate PR in #399