Skip to content

Latest commit

 

History

History
executable file
·
83 lines (56 loc) · 5.61 KB

README.md

File metadata and controls

executable file
·
83 lines (56 loc) · 5.61 KB

scripts

This directory contains various scripts used by the pipeline. However, you can use most of these scripts on their own, too. Some may even be helpful in day-to-day use.

All python scripts implement the --help argument. For R scripts, you can run head <script> to read about their usage.

A python script to analyze the output of the pipeline and calculate metrics that might be useful for downstream biological applications of our software. This script is not, in fact, part of the pipeline.

A python script for summarizing the runtime and memory usage of the pipeline based on its benchmark files. This script is not, in fact, part of the pipeline.

An R script for predicting variants using a trained classifier. It takes as input a model generated by classify_train.R.

An R script for creating a trained classifier. It takes as input a set of plant segments that have already been labeled by their species.

A python script that splits a set of pre-labeled segments into truth and training sets, for use by classify_test.R and classify_train.R.

A python script that extracts the elevation of each point in an orthomosaic from a Metashape project file. Elevation values are calculated by Metashape's digital elevation model. This script is not, in fact, part of the pipeline.

A python script that exports the orthomosaic from a Metashape project file to a standard image file.

A python script that converts orthomosaic pixel coordinates to geographic coordinates using the Metashape project file. It can also be used for extracting the center of each segment in a json segments file (also in geographic coordinates). This script is not, in fact, part of the pipeline.

A python script for extracting machine learning features for each segmented region in a json segments file.

A suite of python functions for calculating features. These functions are used primarily by extract_features.py.

A bash script that finds all of the original drone images that have a plant, provided the plant's segment ID. This script uses the output of a step in the experimental strategy. This script is not, in fact, part of the pipeline.

A python module for importing segments from a json labelme file. The functions in this module are used by many other scripts.

A python script for visualizing the random forest importance of each machine learning feature. This script uses the output of classify_train.R. This script is not, in fact, part of the pipeline.

A python script for visualizing the output of the pipeline via a map.

A python script to calculate scoring metrics to evaluate the performance of the classifier. This script uses the output of classify_test.R.

A python script for creating a precision-recall curve for the classified segments from classify_test.R. It uses the output of statistics.py.

A python script for resolving conflicting species labels assigned to the same segments.

A python script that transforms orthomosaic pixel coordinates to their coordinates in the original drone images.

A python script that uses computer vision algorithms to identify the location of plants in an image. The script outputs both regions that it is highly confident contain plants and regions that it is less confident about.

A python script that creates the points of a precision-recall curve. This script's output is used by prc.py.

A python script that uses Agisoft Metashape to create an orthomosaic from a collection of overlapping drone images. The output of this script is a special Metashape project file, not the orthomosaic as a standard image file.

A python script that can be useful for debugging the segmentation scripts: segment.py and watershed.py. This script is not, in fact, part of the pipeline.

A python script that transforms pixel coordinates in the original drone iamges to their coordinates in the orthomosaic.

A python script that uses the high and low confidence regions from segment.py in the watershed algorithm. It outputs its best guess for the location of each plant as a segments file.

Triggered by qsub run.bash -U out/SAMPLE/label_Images.txt. Can also be run with command line, for example: python3 extract_images.py /mnt/biology/donaldson/tom/flower_map/out/6217East/rev_transforms 512,517,510 /mnt/biology/donaldson/tom/flower_map/out/6217East/label_Images.txt. A python script that takes in a string list of target segment labels to retrace and find the names of the source images used to generate those segments.

Triggered by qsub run.bash -U out/SAMPLE/subsetImages/subsetImagesLog.txt. A python script that takes in the output of extract_images.py to retrace and find the source images used to generate those segments.