Skip to content

Latest commit

 

History

History
65 lines (64 loc) · 3.13 KB

path.md

File metadata and controls

65 lines (64 loc) · 3.13 KB

The path I took:

  1. Setup the repo to work in it
  2. Turning the model into a diffuser model
  3. Turned it into a pipeline
  4. Loaded the diffusor model
  1. Wrote argparser for CLI
  2. Finish the path from the article
  3. Need to fix the argparse in main.py
  4. Need to add save location
  5. Add real-ESRGAN
  6. Error Handling
  7. Lora

    Try these:

    • negative embeddings lora's
  8. Make a model selector menu that reads models from the diffused folder
  9. Rehashing the code:
    • Check if the pipeline setup inside a function is an issue, should not be an issue because the embdedding one works, will have to see about the global effect of using EADS in it or not
    • Debug the masks and the emb's 117-122
    • Remove the commented code, shift learnings to a ignored file
    • The changes that I made:
      • Took lines 167-170 from inside pipeCreate
      • added .input_ids
      • replaced getEmbdedding's main core logic, what a shame
  10. Fix the output screen
    • Show the prompt, the neg prompt, the seed, cfg scale
    • Load the above details into the pictures (check civit.ai)
  11. Show save location
    • Make a save folder logic, checks for exist
    • The name for the file saving
  12. Try adding a progress loading bar
  13. Check how to change the sampler
  14. For the main task:
    • Need to make one model that gen's the BG and the other that gen's the FG
    • Or I can use a singular model for this
      • Use DeepLabv3+ for the mask
      • Pass that to the BG generation model
      • Assign the mask image to a variable
        • Save the mask, name is temp_mask inside output
      • Dump memory to clear space in the end off the VRAM after gen is done
      • Add inpainting model option to argparse
        • Add BG prompt to the CLI as well
      • Make this a function that is called
      • Have to push to main pipeline_setup file
    • Or I can gen the FG, cut it out using UNet, and then gen a BG
  15. Add a method that upscales the image, realESRGAN
  16. Add a method that fixes the faces, GFPGAN
    • Do the repo setup from the readme
    • Do a demo run
    • Add the models from the readme if needed
  17. Try to add controlNet inpainting
    • Can add controlNet Aux for posture detection
  18. Add a method that saves the image generated with a pre-fix
  19. Add upscale to the CLI
    • Make a interactive UI, which shows size of image and then
  20. Fix the size issue in the embeds
  21. Fix issue of single seed being used
  22. Make the readme.md with examples, code hashes, pictures
    • Give credits to the projects used, and the citations needed
    • make the commands better visible
    • Add link to install civit.ai Models, and the ones that I have used