Experiments with several transformer architectures

  • Vanilla Transformer
  • Encoder Classifier
  • GPT-2 Generator
  • Mixtral - Mixture of Experts Generator