Skip to content

Latest commit

 

History

History
93 lines (89 loc) · 5.51 KB

TODO.md

File metadata and controls

93 lines (89 loc) · 5.51 KB

In Progress

  • Tests
    • Workflow test with example data
    • Trivial examples for each function
    • Unit tests for SSI
    • Unit tests for density features

Plans

  • More example tcl scripts for VMD
  • Feature comparison of more than two ensembles
    • with respect to the joint ensemble (all metrics)
    • with respect to a reference ensemble (will not always work for KLD)
  • Use MDAnalysis instead of biotite for water featurization
  • Weighted PCA/tICA? (to account for varying simulation lengths or uncertainty)
  • Make file format (png/pdf?) for matplotlib optional.

Ideas

Done ✓

  • Include TICA in unit tests
  • Write "getting started" for documentation
  • Refactoring and fixes for release 0.2
    • Restructure modules to subpackages
    • Adapt README
    • Adapt API documentation
    • Include SSI to comparison example script
    • Numbering of principal component trajectories starts with 0, should start with 1
    • Axis labels and legend name for distance matrix plot
    • Function pca_features() does not have labels
    • Function compare_projections() does not have labels or legend
  • Slack channel for all developers and testers, and to provide support for the user community.
  • Implement clustering in principal component space
  • Option to write and load features as CSV file.
  • Implement nucleic acid torsions and pseudo-torsions, as reviewed Keating et al. and as used in x3DNA or Barnaba (Barnaba code on GitHub)
  • Hydrogen bods as features
  • Use MDAnalysis instead of PyEMMA to read features (to avoid mmshare dependency).
  • Use scikit-learn or Deeptime instead of PyEMMA for clustering.
  • Use scikit-learn or Deeptime instead of PyEMMA for dimensionality reduction.
  • exploratory analysis via correlation coefficients of the features
  • Unified tutorial in documentation.
    • preprocessing
    • featurization
    • comparison
    • dimensionality reduction and clusters
    • SSI

Abandoned

  • Colab Tutorial
  • Frame classification via CNN on features
    • Prototype to classify simulation frames --> Diffnets probably more powerful.
    • Interpret weights as relevance of features
    • Write module
    • Write unit tests
  • Integrate DiffNets.
    • Lay out module structure in separate branch.
    • Copy core network from DiffNets repo.
    • Try to use existing featurization.
    • Include existing DiffNets featurization and compare.