Natural-Language-Processing

This repository contains our participation in the SemEval 2022 task on Patronizing and Condescending Language (PCL) Detection. The task is based on the paper "Don't Patronize Me! An Annotated Dataset with Patronizing and Condescending Language towards Vulnerable Communities" (Perez-Almendros et al., 2020).

The aim of this task is to build a model that can differentiate patronizing from non-patronizing sentences.

evaluation.py: contains the metrics and the loss curve plot functions to evaluate the models.
main.py: main script.
PCLDataset.py: Map-style dataset class tailored to our problem.
preprocessing.py: contains the preprocessing functions (e.g. stop-word removal, tokenization).
Roberta_PCL.py: Instantiation of the model and definition of the forward pass.
Trainer_PCL.py: Trainer class tailored to our problem.
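
For readers unfamiliar with the term, a map-style dataset is a class that exposes `__len__` and `__getitem__`, so that a sample can be fetched by integer index. The sketch below illustrates the idea together with a toy stop-word-removal step. It is not the code in PCLDataset.py or preprocessing.py: the names `ToyPCLDataset`, `preprocess`, and `STOP_WORDS`, the example sentences, and the label convention (1 = patronizing, 0 = not) are all assumptions made for illustration. It is written in plain Python so it runs without dependencies; in a PyTorch project the class would typically subclass `torch.utils.data.Dataset`.

```python
import re
from typing import Callable, Dict, List, Sequence, Union

# Hypothetical, deliberately tiny stop-word list (illustration only).
STOP_WORDS = {"a", "an", "the", "of", "to", "in", "and", "is", "are"}


def preprocess(text: str) -> List[str]:
    """Lowercase the text, split it into word tokens, and drop stop words."""
    tokens = re.findall(r"[a-z0-9']+", text.lower())
    return [tok for tok in tokens if tok not in STOP_WORDS]


class ToyPCLDataset:
    """Map-style dataset: sized via __len__, indexable via __getitem__."""

    def __init__(
        self,
        texts: Sequence[str],
        labels: Sequence[int],
        transform: Callable[[str], List[str]] = preprocess,
    ) -> None:
        if len(texts) != len(labels):
            raise ValueError("texts and labels must have the same length")
        self.texts = list(texts)
        self.labels = list(labels)  # assumed convention: 1 = patronizing, 0 = not
        self.transform = transform

    def __len__(self) -> int:
        return len(self.texts)

    def __getitem__(self, idx: int) -> Dict[str, Union[List[str], int]]:
        # Preprocessing is applied lazily, one sample at a time.
        return {
            "tokens": self.transform(self.texts[idx]),
            "label": self.labels[idx],
        }


# Made-up example sentences, not taken from the task data.
dataset = ToyPCLDataset(
    ["These poor people are in need of our help.",
     "The council approved the budget."],
    [1, 0],
)
print(len(dataset))
print(dataset[0])
```

Applying the transform inside `__getitem__` keeps the raw text available and defers the work to the moment a sample is requested; precomputing the tokens in `__init__` would be an equally reasonable choice for a small dataset.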

AgatheZ/Natural-Language-Processing

Folders and files

Latest commit

History

Repository files navigation

Natural-Language-Processing

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages