Named entity recognition (NER) helps you easily identify the key elements in a text, like names of people, places, brands, monetary values, and more.Extracting the main entities in a text helps sort unstructured data and detect important information, which is crucial if you have to deal with large datasets. This project has been deployed using AWS ECR and AWS EC2 Instance.
XTREME is a benchmark for the evaluation of the cross-lingual generalization ability of pre-trained multilingual models that covers 40 typologically diverse languages and includes nine tasks.
- Get data and properly create text and label (Can be done using https://explosion.ai/demos/displacy-ent.
- Use trasnformer Roberta architecture for training the ner tagger
- Use hugging face for Robereta Tokenizer
- Train and Deploy model for use-cases
create fresh conda environment
conda create -p ./env python=3.7 -y
activate conda environment
conda activate ./env
Install requirements
pip install -r requirements.txt
To run inferencing
python app.py
To launch swagger ui
http://localhost:8080/docs
- Natural Language processing
- Pytorch
- Transformer
- FastApi
- AWS ECR
- AWS EC2
- Search and Recommendation system
- Content Classification
- Customer Support
- Research Paper Screening
- Automatically Summarizing Resumes
We have shown how to train our own name entity tagger along with proper inplementaion of train and predict pipeline.