yo just throw a multi-mae loss on an sd finetune
multimae pretrained with text prompts as an additional modality
- prompt generation basically for free
- opens up opportunity for step-by-step prompting and learning from own reasoning: https://twitter.com/shaneguML/status/1584668991372464129