
Material Segmentation

The four semantic classes in the dataset are:

  • Background
  • Structural Concrete
  • Structural Steel
  • Metal Decking

Coming soon in November 🟢[Paper] 🟢[Dataset] 🟢[Trained models]

The structural material segmentation dataset can be used for auxiliary structural inspection tasks: to aid in localizing structural damage, to provide context for predictions, and for style-transfer applications such as SPADE and GAN / GAN-inversion.

Results

Our trained model achieved an F1 score of over 94%.

Requirements

The most important environment configurations are the following:

  • PyTorch >= 1.4
  • Python >= 3.6
  • tqdm
  • matplotlib
  • scikit-learn (imported as sklearn)
  • OpenCV (imported as cv2)
  • Pillow
  • pandas
  • shutil (Python standard library)
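
Note that several of the import names above differ from the pip package names. Assuming a pip-based setup (an assumption about your environment; versions are not pinned here), the third-party requirements can be installed along these lines:

pip install torch tqdm matplotlib scikit-learn opencv-python Pillow pandas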

Evaluating the Trained DeeplabV3+ Model

Customize the following in run_metrics_evaluation.py

data_dir = './material_detection_data/bridge_images/Test/'  # path to the test data
batchsize = 1  # keep at 1
model = torch.load(f'./stored_weights_plus/weights_XX.pt', map_location=torch.device('cuda'))  # load the model weights

This reports the F1 score, the Jaccard index, and the confusion matrix. We suggest running it in an IDE; a sketch of the metric calls is shown below.
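
For reference, all three metrics can be reproduced from flattened per-pixel class predictions with scikit-learn. This is an illustrative sketch, not the actual code in run_metrics_evaluation.py; the arrays stand in for real test-set labels.

# Illustrative sketch of the reported metrics using scikit-learn;
# y_true / y_pred stand in for flattened per-pixel class indices (0-3).
import numpy as np
from sklearn.metrics import f1_score, jaccard_score, confusion_matrix

y_true = np.array([0, 1, 2, 3, 1, 2])
y_pred = np.array([0, 1, 2, 3, 1, 1])

print(f1_score(y_true, y_pred, average='macro'))       # macro F1 over the classes
print(jaccard_score(y_true, y_pred, average='macro'))  # macro Jaccard index (mIoU)
print(confusion_matrix(y_true, y_pred))                # 4x4 confusion matrix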

Visualizing the results from the Trained DeeplabV3+ Model

Once training has converged or has stopped, we can use the best checkpoint, selected based on the validation data results. This checkpoint is loaded and our test data is evaluated.

run_show_results__.py

  • gets the predicted masks
  • gets a combined mask-and-image overlay
  • gets one-hot-encoded vector images of the predictions
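
A mask-and-image overlay of this kind can be produced with a simple alpha blend; the file names and blending weights below are assumptions for illustration, not values from run_show_results__.py.

# Illustrative mask/image overlay via alpha blending with OpenCV.
import cv2

image = cv2.imread('example_image.jpg')          # BGR test image
mask = cv2.imread('example_predicted_mask.png')  # BGR color-coded prediction, same size

overlay = cv2.addWeighted(image, 0.6, mask, 0.4, 0.0)  # blend mask onto image
cv2.imwrite('example_overlay.png', overlay)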

Training with the Structural Material dataset

  1. Clone the repository
  2. Download the 🟢dataset
  3. Go into the Training folder
  4. Create a DATA folder
  5. Copy and paste the Train and Test folders for 512x512 images from the dataset you downloaded into the DATA folder
  6. The DATA folder should have a folder called 'Train' and a folder called 'Test'. Inside each of those folders include the mask and image pairs in their respective folders (Masks, Images); a quick way to verify this layout is sketched after this list.
  7. If you have set this up correctly then you are now ready to begin.
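
A few lines of Python can confirm the layout described above; the DATA path is whatever you created in step 4.

# Sanity check for the expected DATA/{Train,Test}/{Images,Masks} layout.
from pathlib import Path

data = Path('./DATA')  # adjust to where you created the DATA folder
for split in ('Train', 'Test'):
    for sub in ('Images', 'Masks'):
        folder = data / split / sub
        assert folder.is_dir(), f'missing folder: {folder}'
        print(folder, '->', len(list(folder.iterdir())), 'files')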

Necessary and optional inputs to the main_plus.py file ('-' marks a necessary input, '--' marks an optional one):

 -data_directory = dataset directory path (expects there to be a 'Test' and a 'Train' folder, with folders 'Masks' and 'Images')
 -exp_directory = where the stored metrics and checkpoint weights will be stored
 --epochs = number of epochs
 --batchsize = batch size
 --output_stride = deeplab hyperparameter for output stride
 --channels = number of classes (we have four, so the default is set to four)
 --class_weights = weights for the cross-entropy loss function
 --folder_structure = 'sep' or 'single' (sep = separate Test and Train folders; single = only looks at one folder, Train). If you want validation results instead of results on your test dataset, use 'single'. If you want to test directly on the Test dataset, use 'sep'.
 --pretrained = if there is a pretrained model to start with, include the path to its weights here

Run the following command (an example shell script we used for training, 'var_8plus_40epoch_3-2021_ca.sh', is provided in the 'Training' folder):

python main_plus.py -data_directory '/PATH TO DATA DIRECTORY/' -exp_directory '/PATH TO SAVE CHECKPOINTS/' \
--epochs 40 --batch 2

During training, a model checkpoint is saved every epoch. At each checkpoint the model is evaluated against the test or validation data, and if the score improves on the best score so far, that checkpoint is saved as the best.
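
A minimal sketch of that best-checkpoint pattern, with stand-in stubs rather than the actual training loop from main_plus.py:

# "Save every epoch, keep the best" checkpointing pattern (illustrative).
import torch

def train_one_epoch(model):  # stand-in for the real training loop
    pass

def evaluate(model):  # stand-in returning the epoch's validation/test score
    return 0.0

model = torch.nn.Linear(1, 1)  # stand-in for the DeepLabV3+ model
best_score = 0.0
for epoch in range(3):
    train_one_epoch(model)
    score = evaluate(model)
    torch.save(model, f'./weights_{epoch}.pt')  # checkpoint every epoch
    if score > best_score:                      # improved on the best so far
        best_score = score
        torch.save(model, './weights_best.pt')  # keep the best weights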

Training with a custom dataset

  1. Clone the repository
  2. Ensure your image and mask data is 512x512 pixels (you can use rescale_image.py in Pre-processing).
  3. If you resized your masks to 512x512, ensure the interpolation did not introduce colors beyond the color classes you have. The expected format is BGR. (You can use rescale_segmentation.py in Pre-processing.)
  4. Go into the datahandler_plus.py file and edit the colors as necessary. For example, the Structural Materials dataset uses the following format, which is in the datahandler_plus.py in this repository (a sketch of how this mapping is applied follows this list):
# color mapping corresponding to classes
# ---------------------------------------------------------------------
# 0 = background (Black)
# 1 = Steel (Red)
# 2 = Concrete (Green)
# 3 = Metal Deck (Yellow)
# ---------------------------------------------------------------------
self.mapping = {(0,0,0): 0, (0,0,128): 1, (0,128,0): 2, (0,128,128): 3}
  5. Adjust the number of 'channels' in the training command to match the number of classes that you have.
  6. Ensure that your DATA folder has a folder called 'Train' and a folder called 'Test'. Inside each of those folders include the mask and image pairs in their respective folders (Masks, Images).
  7. If you have set this up correctly then you are now ready to begin.
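
As referenced in step 4, this is roughly how such a BGR color mapping converts a color-coded mask into per-pixel class indices. The mapping values come from above, but the surrounding code is an illustration, not datahandler_plus.py itself.

# Illustrative conversion of a BGR color-coded mask to class indices.
import numpy as np
import cv2

mapping = {(0, 0, 0): 0,      # background (black)
           (0, 0, 128): 1,    # steel (red)
           (0, 128, 0): 2,    # concrete (green)
           (0, 128, 128): 3}  # metal deck (yellow)

mask_bgr = cv2.imread('example_mask.png')  # 512x512 BGR mask image
labels = np.zeros(mask_bgr.shape[:2], dtype=np.int64)
for color, class_idx in mapping.items():
    labels[np.all(mask_bgr == color, axis=-1)] = class_idx  # pixels matching this color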

Building a Custom Dataset

(The images in the dataset were annotated using labelme. We suggest that you use this tool.)

  1. If you are planning to extend the material dataset, please read the annotation guidelines provided by the author in the 🟢 material dataset repository.

  2. Before beginning to annotate, we suggest using jpeg for the RGB image files. We advise against starting with images that have already been resized.

  3. We have put together a tutorial with tips and tricks for using the labelme software in this YouTube video. We also made a YouTube video showing how to set up labelme with the Anaconda prompt.

  4. After annotating, you will have matching JSON and jpeg files, representing the annotation and image pair respectively.

  5. Take these files and generate masks and one-hot-encoded vector files using the run_labelme2voc_.py file in Pre-processing. Then you can rescale the images and masks using the respective files in Pre-processing. You can also use the random-sort function we created to randomly split the data.

The labels_corrosion_segmentation.txt file contains the class labels needed by run_labelme2voc_.py. If your classes are different, they need to be reflected in this file.
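
If the file follows the usual labelme labels.txt convention (one class name per line, preceded by the __ignore__ and _background_ entries), a version for the four material classes would look roughly like the listing below; the exact names are assumptions and must match the labels used in your annotations.

__ignore__
_background_
steel
concrete
metal deck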

Citation

Structural Material Dataset:

Bianchi, Eric; Hebdon, Matthew (2021): Structural Material Semantic Segmentation Dataset. 
University Libraries, Virginia Tech. Dataset. https://doi.org/10.7294/16624648.v1 

Structural Material Model:

Bianchi, Eric; Hebdon, Matthew (2021): Trained Model for the Semantic Segmentation of Structural Material. 
University Libraries, Virginia Tech. Software. https://doi.org/10.7294/16628620.v1 

Paper:

@article{doi:10.1061/(ASCE)CP.1943-5487.0001045,
  author  = {Eric Bianchi and Matthew Hebdon},
  title   = {Development of Extendable Open-Source Structural Inspection Datasets},
  journal = {Journal of Computing in Civil Engineering},
  volume  = {36},
  number  = {6},
  pages   = {04022039},
  year    = {2022},
  doi     = {10.1061/(ASCE)CP.1943-5487.0001045}
}
