Exactly what the title says, but I didn't bother with naming this with a fancy name since it's basically a bunch of scripts I'm currently using for a multitude of projects and I thought someone who is desperate and lonely like me might like to use this because they feel unimportant (dw, you are, just like me fr).
Ehm, you basically will have your gene_presence_abscence.csv
in the same directory as these scripts and then you will execute the generateUpSet.sh
script like this:
bash generateUpSet.sh
And then you just go to grab something while this runs. It doesn't take that long unless you're dealing with stupid Pseudomonas, I guess (yes, that's what I'm doing right now).
Because I need to do stuff to feel relevant and I thought this was it. That being said, in a more serious tone, I really wanted to start being more diligent with my programming and what I normally do.
There's a great remix of Pokemon Gym Leader music on YT. Go and find it, you'll feel your soul will elevate from this realm.
This script basically works with a bunch of libraries because fuck good programming practices. These are the libraries:
pandas
matplotlib
upsetplot
- There's an error appearing when you execute really big .csv files. I'll need to check on that.
- For now, the minimum of incidences being plotted is determined by the script. I want to parse this through the bash command instead. So yeah.