This is an implementation of BertSum trained on the CNN/DailyMail dataset. The currently available model is BertSum-Ext for extractive summarization; the abstractive model will be released in the near future.
Most summarization implementations still rely on nlpyang/PreSumm. In contrast, this repo is written nearly from scratch and contributes to the reproducibility of extractive summarization.
We used bert-base-uncased as the BERT checkpoint, and training was conducted on a single T4 GPU.
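For reference, the encoder checkpoint can be fetched as in the sketch below, assuming the Hugging Face transformers library; whether this repo loads the checkpoint this way is an assumption.

```python
from transformers import AutoModel, AutoTokenizer

# bert-base-uncased is the encoder checkpoint reported above.
# Using transformers for loading is an assumption, not the repo's confirmed method.
tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
encoder = AutoModel.from_pretrained("bert-base-uncased")
```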
The results of our model on the CNN/DailyMail test set are as follows:
| rouge1 | rouge2 | rougeLsum | rougeL |
|---|---|---|---|
| 43.03 | 20.16 | 39.46 | 27.69 |
The ROUGE scores were calculated with google-research's rouge-score library. This library provides two variants of ROUGE-L, `rougeL` and `rougeLsum`:
- `rougeL`: `\n` is ignored. It treats the summary as a single sentence.
- `rougeLsum`: `\n` is not ignored. It matches the sentence pairs with the largest LCS between the two summaries.
The ROUGE-L provided by pyrouge, which is used in nlpyang/PreSumm, is the same as `rougeLsum`.
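The difference between the two variants can be checked directly with the rouge-score library. Below is a minimal sketch with made-up example texts; for `rougeLsum`, sentences within a summary are expected to be separated by `\n`.

```python
from rouge_score import rouge_scorer

# rougeL treats each text as one sequence; rougeLsum splits on "\n"
# and computes a summary-level (sentence-wise) LCS.
scorer = rouge_scorer.RougeScorer(["rougeL", "rougeLsum"], use_stemmer=True)

reference = "the cat sat on the mat .\nit was a sunny day ."    # hypothetical texts
prediction = "a cat was on the mat .\nthe day was sunny ."

scores = scorer.score(reference, prediction)
print(scores["rougeL"].fmeasure)     # newlines ignored: single-sentence LCS
print(scores["rougeLsum"].fmeasure)  # newlines kept: sentence-level LCS
```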
To train the model, define the experiment in a YAML file and run the following command.
```
python train.py --config-name exp_0
```
To evaluate the model, set `test_checkpoint` in the YAML file to the path of the checkpoint file, and run the following command.
```
python test.py --config-name exp_0
```
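As a rough illustration, an experiment config might look like the sketch below. Only the config name `exp_0` and the `test_checkpoint` key come from this README; the remaining keys and values are hypothetical placeholders, so check the actual YAML files in the repo for the real schema.

```yaml
# exp_0.yaml -- hypothetical sketch, not the repo's actual schema
model_name: bert-base-uncased                  # encoder checkpoint (assumed key name)
batch_size: 8                                  # placeholder values
lr: 2.0e-3
max_steps: 50000
test_checkpoint: checkpoints/exp_0/best.ckpt   # checkpoint path read by test.py
```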
BSD 3-Clause License Copyright (c) 2022