Bộ dữ liệu tiếng việt sử dụng "A Large-scale Vietnamese News Text Classification Corpus"
- quick_start_data_NLP_Pho_Bert.ipynb : Xử lý dữ liệu
- text_classification_with_CNN.ipynb : Huấn luyện mô hình
precision | recall | f1-score | support | |
---|---|---|---|---|
Class 0 | 0.75 | 0.77 | 0.76 | 200 |
Class 1 | 0.82 | 0.89 | 0.86 | 200 |
Class 2 | 0.87 | 0.79 | 0.83 | 200 |
Class 3 | 0.85 | 0.83 | 0.84 | 200 |
Class 4 | 0.91 | 0.87 | 0.89 | 200 |
Class 5 | 0.91 | 0.87 | 0.89 | 200 |
Class 6 | 0.85 | 0.88 | 0.86 | 200 |
Class 7 | 0.99 | 0.96 | 0.97 | 200 |
Class 8 | 0.90 | 0.95 | 0.92 | 200 |
Class 9 | 0.90 | 0.92 | 0.91 | 200 |
accuracy | 0.87 | 2000 | ||
macro avg | 0.87 | 0.87 | 0.87 | 2000 |
weighted avg | 0.87 | 0.87 | 0.87 | 2000 |
precision | recall | f1-score | support | |
---|---|---|---|---|
Class 0 | 0.72 | 0.84 | 0.78 | 100 |
Class 1 | 0.88 | 0.91 | 0.90 | 100 |
Class 2 | 0.93 | 0.90 | 0.91 | 100 |
Class 3 | 0.85 | 0.86 | 0.86 | 100 |
Class 4 | 0.99 | 0.82 | 0.90 | 100 |
Class 5 | 0.97 | 0.91 | 0.94 | 100 |
Class 6 | 0.88 | 0.92 | 0.90 | 100 |
Class 7 | 0.95 | 0.97 | 0.96 | 100 |
Class 8 | 0.95 | 0.93 | 0.94 | 100 |
Class 9 | 0.92 | 0.93 | 0.93 | 100 |
accuracy | 0.90 | 1000 | ||
macro avg | 0.90 | 0.90 | 0.90 | 1000 |
weighted avg | 0.90 | 0.90 | 0.90 | 1000 |
STT | Score |
---|---|
ROC 1 | 0.9910 |
ROC 2 | 0.9933 |
ROC 3 | 0.9909 |
ROC 4 | 0.9920 |
ROC 5 | 0.9908 |
ROC 6 | 0.9912 |
ROC 7 | 0.9904 |
ROC 8 | 0.9927 |
ROC 9 | 0.9896 |
ROC 10 | 0.9941 |