PFD-kit: finetune and distillation from pre-trained atomic models

PFD-kit automates the fine-tuning and distillation of pre-trained atomic models. It makes atomistic simulation practical with pre-trained models that are highly transferable but computationally expensive. PFD-kit is built on the dflow package and includes components of the DPGEN2 workflow. Currently, PFD-kit supports Deep Potential models.

1. Overview

Inspired by the DPGEN concurrent learning scheme, PFD-kit provides automated workflows for efficient model fine-tuning and distillation, so that pre-trained models can be applied in practical atomistic simulation. Figure 1 shows the schematic of the fine-tuning workflow. Given the initial structures of the fine-tuning systems, the workflow generates randomly perturbed structures and runs a series of short ab initio molecular dynamics (AIMD) simulations starting from them. The pre-trained model is first fine-tuned on the AIMD dataset. MD simulations with the fine-tuned model then search for new configurations, which are labeled by first-principles calculation software. If the fine-tuned model cannot predict the labeled dataset with sufficient accuracy, the collected dataset is added to the fine-tuning training set, and the train-search-label process iterates until convergence.

Figure 1. Fine-tune workflow.
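
The train-search-label iteration described above can be illustrated with a minimal Python sketch. This is not PFD-kit's implementation or API: `finetune_loop` and its callable arguments (`train`, `explore`, `label`, `error`) are hypothetical placeholders, and the toy demo replaces atomic configurations with plain numbers and the model with a nearest-neighbour lookup.

```python
from typing import Callable, List, Tuple

Sample = Tuple[float, float]        # (configuration, label); toy stand-in
Model = Callable[[float], float]    # maps a configuration to a prediction


def finetune_loop(
    init_data: List[Sample],
    train: Callable[[List[Sample]], Model],
    explore: Callable[[Model], List[float]],
    label: Callable[[List[float]], List[Sample]],
    error: Callable[[Model, List[Sample]], float],
    tol: float,
    max_iter: int = 10,
) -> Tuple[Model, List[Sample], bool]:
    """Hypothetical train-search-label loop (illustration only)."""
    dataset = list(init_data)
    model = train(dataset)                 # initial fine-tuning on the AIMD data
    for _ in range(max_iter):
        candidates = explore(model)        # MD search with the current model
        new_data = label(candidates)       # first-principles labeling
        if error(model, new_data) <= tol:  # accurate enough: converged
            return model, dataset, True
        dataset.extend(new_data)           # otherwise grow the training set
        model = train(dataset)             # and fine-tune again
    return model, dataset, False


# Toy demo: the "first-principles" truth is y = x**2.
def train_nearest(data: List[Sample]) -> Model:
    points = list(data)  # copy so later dataset growth does not leak in
    return lambda x: min(points, key=lambda s: abs(s[0] - x))[1]


def max_abs_error(model: Model, data: List[Sample]) -> float:
    return max(abs(model(x) - y) for x, y in data)


model, dataset, converged = finetune_loop(
    init_data=[(0.0, 0.0)],
    train=train_nearest,
    explore=lambda m: [0.0, 0.5, 1.0, 1.5, 2.0],
    label=lambda xs: [(x, x * x) for x in xs],
    error=max_abs_error,
    tol=1e-6,
)
```

In this toy run the first pass fails the accuracy check, the labeled samples are added to the training set, and the second pass converges. In the actual workflow each step is a dflow job rather than an in-process function call.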

A lightweight model can be generated from a fine-tuned model through distillation, which enables much faster simulation. The distilled model is trained on data labeled by the fine-tuned model. Figure 2 shows the schematic of the distillation workflow.

Figure 2. Distillation workflow.
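
The core idea of distillation, training a cheap model on labels produced by an expensive one, can be sketched as follows. This is a toy illustration under stated assumptions, not PFD-kit's API: `distill`, `fit_line`, and `teacher` are hypothetical names, the "fine-tuned model" is a simple function, and the "lightweight model" is a straight-line fit.

```python
from typing import Callable, List, Tuple

Model = Callable[[float], float]


def fit_line(samples: List[Tuple[float, float]]) -> Model:
    """Least-squares fit of y = a*x + b; stands in for training a small model."""
    n = len(samples)
    mean_x = sum(x for x, _ in samples) / n
    mean_y = sum(y for _, y in samples) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in samples)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in samples)
    a = sxy / sxx
    b = mean_y - a * mean_x
    return lambda x: a * x + b


def distill(
    teacher: Model,
    configurations: List[float],
    fit: Callable[[List[Tuple[float, float]]], Model],
) -> Model:
    """Label configurations with the teacher, then train the student on them."""
    labeled = [(x, teacher(x)) for x in configurations]  # teacher does the labeling
    return fit(labeled)                                  # student learns from labels


# Hypothetical "fine-tuned" teacher; in practice this is the expensive model.
def teacher(x: float) -> float:
    return 2.0 * x + 1.0


student = distill(teacher, [0.0, 1.0, 2.0, 3.0], fit_line)
```

No first-principles calculation appears in this step: all training labels come from the teacher, which is why the distilled model can only be as accurate as the fine-tuned model on the sampled configurations.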

2. Installation

PFD-kit can be built and installed from source.

git clone https://github.com/ruoyuwang1995nya/pfd-kit.git
cd pfd-kit && pip install .

3. Quick start

PFD-kit comes with a simple command-line interface (CLI). For instance, a fine-tuning workflow can be submitted using the following command:

pfd submit finetune.json -t finetune

The finetune.json file specifies the input parameters of the fine-tuning task; details can be found in the examples directory.

4. Userguide

Examples of JSON input files for model fine-tuning and distillation can be found in the examples directory. Complete documentation for the latest version of PFD-kit is available online.
