CS466-Mini-Project

Intro to Bioinformatics Mini Project

Link to Google Docs: Paper

generate.py

Generates 7 sets of 10 datasets, where each set has different parameters.

default: ICPC = 2, ML = 8, SC = 10, SL = 500
ICPC_1: ICPC = 1, ML = 8, SC = 10, SL = 500
ICPC_1.5: ICPC = 1.5, ML = 8, SC = 10, SL = 500
ML_6: ICPC = 2, ML = 6, SC = 10, SL = 500
ML_7: ICPC = 2, ML = 7, SC = 10, SL = 500
SC_5: ICPC = 2, ML = 8, SC = 5, SL = 500
SC_20: ICPC = 2, ML = 8, SC = 20, SL = 500

To run generate.py, run the following command in the terminal: python generate.py

motif_finder.py

Use the Gibbs sampling algorithm on the benchmark dataset to get "predictedmotif.txt" and "predictedsites.txt" for every dataset

To run motif_finder.py on the benchmark in the "dataset" directory, run the following command in the terminal: python motif_finder.py

analyze.py

Analyze the performance of the motif finding algorithm implemented in motif_finder.py by comparing "predictedmotif.txt" and "predictedsites.txt" with data generated in generate.py. Outputs the numbers for our measures of performance.

To run analyze.py, run the following command in the terminal: python analyze.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CS466-Mini-Project

generate.py

motif_finder.py

analyze.py

About

Releases

Packages

Contributors 3

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 68 Commits
dataset		dataset
README.md		README.md
analyze.py		analyze.py
generate.py		generate.py
motif_finder.py		motif_finder.py

romerorisa/CS466-Mini-Project

Folders and files

Latest commit

History

Repository files navigation

CS466-Mini-Project

generate.py

motif_finder.py

analyze.py

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages