Detection and identification of macromolecular complexes in cryo-electron tomograms using support vector machines
Reproduction challenge for the paper Detection and identification of macromolecular complexes in cryo-electron tomograms using support vector machines.
The paper deals with 3D electron microscopy scans (tomograms) of biological structures (e.g. proteins), and detecting the components using a library of 'puzzle pieces'- biological macromolecules (templates), in order to determined the arrangement and composition of the structure under investigation. The process is based on finding points in the tomogram that have high correlation with library templates and using a Support Vector Machine (SVM) to choose the correct templates.
The following demonstrates a 2D flow for generated tomograms from geometric shapes as templates
Each such candidate such is translated to a vector of correlations with all the library templates and an SVM performs multi-class classification where each template in the library is a class. Post processing is performed to improve the location of the site, given the chosen template, and choose the optimal orientation of the template.
The SVM is trained using a generated data set, with added noise, and the training, prediction and evaluation flow are describes below
Sample results in synthetic 2D cases are presented below (w. noise and w/o noise)