GitHub - tupleblog/salim-classifier: Automatic Slim Classifier using WangchanBERTa (RoBERTa trained on Thai text)

Salim Classifier

วัตถุประสงค์: ทุกวันนี้หาเพื่อนที่รักชาติ ศาสนา พระมหากษัตริย์ รัฐบาลยากเหลือเกิน มีแต่พวกสามกีบ ควายแดงคอยจ้องจะทำร้าย ทางทีมของเราจึงสร้างโมเดลมาเพื่อช่วยหาเพื่อนสลิ่มจากคอมเม้น ที่นับวันจะหลงเหลืออยู่น้อยยิ่งนักในสังคมไทย ทั้งนี้เพื่อเป็นแนวทางในการสร้างสังคมสลิ่มที่แข็งแรงต่อไป

วิธีการใช้งาน

สามารถลง transfomers จาก Huggingface และใช้งานโมเดลได้ดังต่อไปนี้

from transformers import AutoTokenizer, AutoModelForSequenceClassification, pipeline

# download model
tokenizer = AutoTokenizer.from_pretrained("tupleblog/salim-classifier")
model = AutoModelForSequenceClassification.from_pretrained("tupleblog/salim-classifier")

# using pipeline
classifier = pipeline("sentiment-analysis", model=model, tokenizer=tokenizer)
text = "จิตไม่ปกติ วันๆคอยแต่ให้คนเสี้ยมทะเลาะกันด่ากัน คอยจ้องแต่จะเล่นงานรัฐบาล ความคดด้านลบ"
classifier(text)
# >> [{'label': 'HIGHLY LIKELY SALIM', 'score': 0.9989368915557861}] ยินดีด้วย น่าจะเป็นสลิ่ม!

การเก็บข้อมูล

สร้างข้อมูลตัวอย่างและทำการ annotate จากนั้นนำข้อมูลมาเทรนโมเดลด้วย WangchanBERTa โดยข้อมูลอาจมีความ bias เนื่องจากทางทีมงานเป็นผู้เก็บข้อมูลเอง

ทดลองใช้งานผ่าน HuggingFace

ท่านสามารถทดลองใช้งานผ่าน HuggingFace โดยใส่คอมเม้นจาก Facebook เข้าไปในช่องได้ในเว็บไซต์ huggingface.co/tupleblog/salim-classifier

ตัวอย่างประโยค

รัฐรับผิดชอบทุกชีวิตไม่ได้หรอกคนให้บริการต้องจัดการเองถ้าจะเปิดผับบาร์

ทางทีมงานกำลังเขียนแอพพลิเคชั่นเพื่อให้สามารถนำไปใช้งานได้ง่ายยิ่งขึ้น

Performance

We report performance on 20% evaluation set (accuracy, precision, recall, F1-score macro) as follows:

Accuracy	Precision	Recall	F1
86.15%	86.12%	86.13%	86.13%

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
images		images
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
api.py		api.py
train.ipynb		train.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Salim Classifier

วิธีการใช้งาน

การเก็บข้อมูล

ทดลองใช้งานผ่าน HuggingFace

Performance

About

Releases

Packages

Contributors 2

Languages

License

tupleblog/salim-classifier

Folders and files

Latest commit

History

Repository files navigation

Salim Classifier

วิธีการใช้งาน

การเก็บข้อมูล

ทดลองใช้งานผ่าน HuggingFace

Performance

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages