- Setup the repo to work in it
- Turning the model into a diffuser model
- Turned it into a pipeline
- Loaded the diffusor model
- Trying these models:
- clarity_3
- dreamshaper_8
- realisticVision_6
- Wrote argparser for CLI
- Finish the path from the article
- Need to fix the argparse in main.py
- Need to add save location
- Add real-ESRGAN
- Error Handling
- Lora
Try these:
- negative embeddings lora's
- Make a model selector menu that reads models from the diffused folder
- Rehashing the code:
- Check if the pipeline setup inside a function is an issue, should not be an issue because the embdedding one works, will have to see about the global effect of using EADS in it or not
- Debug the masks and the emb's 117-122
- Remove the commented code, shift learnings to a ignored file
- The changes that I made:
- Took lines 167-170 from inside pipeCreate
- added .input_ids
- replaced getEmbdedding's main core logic, what a shame
- Fix the output screen
- Show the prompt, the neg prompt, the seed, cfg scale
- Load the above details into the pictures (check civit.ai)
- Show save location
- Make a save folder logic, checks for exist
- The name for the file saving
- Try adding a progress loading bar
- Check how to change the sampler
- For the main task:
- Need to make one model that gen's the BG and the other that gen's the FG
- Or I can use a singular model for this
- Use DeepLabv3+ for the mask
- Pass that to the BG generation model
- Assign the mask image to a variable
- Save the mask, name is temp_mask inside output
- Dump memory to clear space in the end off the VRAM after gen is done
- Add inpainting model option to argparse
- Add BG prompt to the CLI as well
- Make this a function that is called
- Have to push to main pipeline_setup file
- Or I can gen the FG, cut it out using UNet, and then gen a BG
- Add a method that upscales the image, realESRGAN
- Add a method that fixes the faces, GFPGAN
- Do the repo setup from the readme
- Do a demo run
- Add the models from the readme if needed
- Try to add controlNet inpainting
- Can add controlNet Aux for posture detection
- Add a method that saves the image generated with a pre-fix
- Add upscale to the CLI
- Make a interactive UI, which shows size of image and then
- Fix the size issue in the embeds
- Fix issue of single seed being used
- Make the readme.md with examples, code hashes, pictures
- Give credits to the projects used, and the citations needed
- make the commands better visible
- Add link to install civit.ai Models, and the ones that I have used