Actions: huggingface/nanotron

Secret Leaks

73 workflow runs

remove transpose in kernel
Secret Leaks #73: Commit e93cf55 pushed by xrsrke
November 4, 2024 10:57 16s xrsrke/fp8_for_nanotron
add dumb transpose in fp8_matmul_kernel
Secret Leaks #72: Commit edb1e87 pushed by xrsrke
November 3, 2024 14:48 19s xrsrke/fp8_for_nanotron
65% speed up in fwd+bwd pass with m=n=k=32768
Secret Leaks #71: Commit 4b26cf1 pushed by xrsrke
November 3, 2024 13:33 16s xrsrke/fp8_for_nanotron
add speed benchmark
Secret Leaks #70: Commit f3e3495 pushed by xrsrke
November 1, 2024 18:50 17s xrsrke/fp8_for_nanotron
add bencmark speed with 5% speed up
Secret Leaks #69: Commit c937375 pushed by xrsrke
November 1, 2024 15:34 22s xrsrke/fp8_for_nanotron
remove uncessary .contiguous() in fp8 backward
Secret Leaks #67: Commit 39a4960 pushed by xrsrke
November 1, 2024 13:50 17s xrsrke/fp8_for_nanotron
update profiling script
Secret Leaks #66: Commit b4156dc pushed by xrsrke
November 1, 2024 13:45 17s xrsrke/fp8_for_nanotron
add fp8 tp profiler
Secret Leaks #65: Commit dda00a4 pushed by xrsrke
October 30, 2024 14:38 18s xrsrke/fp8_for_nanotron
add fp8 tensor parallel
Secret Leaks #64: Commit 7dfe3ac pushed by xrsrke
October 29, 2024 12:46 19s xrsrke/fp8_for_nanotron
add fp8 linear
Secret Leaks #63: Commit 0f8f672 pushed by xrsrke
October 28, 2024 14:11 17s xrsrke/fp8_for_nanotron
add fp8 tensor
Secret Leaks #62: Commit 6c9a4d0 pushed by xrsrke
October 28, 2024 10:45 24s xrsrke/fp8_for_nanotron
move the basics of fp8 to this branch
Secret Leaks #61: Commit 44e6574 pushed by xrsrke
October 9, 2024 10:15 19s xrsrke/fp8_for_nanotron
Merge pull request #234 from huggingface/ci-move
Secret Leaks #60: Commit 51ca40b pushed by glegendre01
September 26, 2024 16:45 16s main
change runner cluster for 8-t4
Secret Leaks #59: Commit 676351b pushed by glegendre01
September 26, 2024 16:32 23s ci-move
change runner cluster for A10
Secret Leaks #58: Commit 565b3a4 pushed by glegendre01
September 26, 2024 16:31 17s ci-move
put the comma in the right place
Secret Leaks #57: Commit 11d60c8 pushed by eliebak
September 20, 2024 05:18 16s add-lighteval-after-ckpt
add logging lr
Secret Leaks #56: Commit 415c6b2 pushed by xrsrke
September 19, 2024 13:33 23s xrsrke/unit_mup_ref
add unit mup[
Secret Leaks #55: Commit 36131cf pushed by xrsrke
September 18, 2024 13:01 22s xrsrke/unit_mup_ref
add sync all reduce
Secret Leaks #54: Commit d4a5997 pushed by xrsrke
September 18, 2024 10:17 23s xrsrke/ref_main_for_fp8
initial prototype for unit mup, but don't get std across outputs
Secret Leaks #53: Commit f6a19cf pushed by xrsrke
September 17, 2024 14:30 19s xrsrke/unit_mup_ref
fix optim state logging for all params
Secret Leaks #52: Commit e255511 pushed by xrsrke
September 13, 2024 15:17 17s xrsrke/unit_mup_ref
Merge pull request #232 from huggingface/xrsrke/precommit-s3
Secret Leaks #51: Commit 97c13b0 pushed by zzhhjjj
September 9, 2024 17:34 17s main
only precommit S3 pr
Secret Leaks #50: Commit 1f04626 pushed by xrsrke
September 9, 2024 17:11 20s xrsrke/precommit-s3
precommit
Secret Leaks #49: Commit e4d48e3 pushed by xrsrke
September 9, 2024 17:01 22s xrsrke/precommit-s3