Releases: lucidrains/Mega-pytorch
0.0.5
fix multi-headed EMA expansion and reduction of dimension into heads …
0.0.4
throw in full mega architecture
0.0.3
only the Mega layer, not the Mega arch
0.0.2
the Mega authors kept the design similar to the GAU (gated attention unit) from the FLASH paper
0.0.1
first pass at the mega layer