
Releases: lucidrains/Mega-pytorch

0.1.0

26 Aug 17:59
add sub-groupnorm in multi-head EMA, for use in audio modeling

0.0.15

19 Feb 04:14
fix laplace activation function, thanks to @boweny-cerebras

0.0.14

05 Oct 15:08
fix residual within mega layer, thanks to @VHellendoorn

0.0.12

24 Sep 22:06
prenorm requires a final layernorm

0.0.11

24 Sep 21:51
fix residual for prenorm

0.0.10

24 Sep 21:31
expose multi-headed learned EMA, for use outside of repo

0.0.9

24 Sep 18:44
offer prenorm architecture

0.0.8

24 Sep 17:59
handle bidirectional better

0.0.7

24 Sep 15:40

Full Changelog: 0.0.6...0.0.7

0.0.6

24 Sep 02:32
improvise on bidirectional for multi-head learned EMA