GitHub - Rishav-hub/Personal-Information-Tagger-NER: Named entity recognition (NER) helps you easily identify the key elements in a text, like names of people, places, brands, monetary values, and more.Extracting the main entities in a text helps sort unstructured data and detect important information, which is crucial if you have to deal with large datasets. This project has been deployed using AWS

🆕 Personal Information Tagger Based on Named entity recognition

Named entity recognition (NER) helps you easily identify the key elements in a text, like names of people, places, brands, monetary values, and more.Extracting the main entities in a text helps sort unstructured data and detect important information, which is crucial if you have to deal with large datasets. This project has been deployed using AWS ECR and AWS EC2 Instance.

💽 Dataset

XTREME is a benchmark for the evaluation of the cross-lingual generalization ability of pre-trained multilingual models that covers 40 typologically diverse languages and includes nine tasks.

📚 Approach

Get data and properly create text and label (Can be done using https://explosion.ai/demos/displacy-ent.
Use trasnformer Roberta architecture for training the ner tagger
Use hugging face for Robereta Tokenizer
Train and Deploy model for use-cases

🚀 API

🧑‍💻 How to setup

create fresh conda environment

conda create -p ./env python=3.7 -y

activate conda environment

conda activate ./env

Install requirements

pip install -r requirements.txt

To run inferencing

python app.py

To launch swagger ui

http://localhost:8080/docs

🧑‍💻 Tech Used

Natural Language processing
Pytorch
Transformer
FastApi

Deployment

AWS ECR
AWS EC2

🏭 Industrial Use-cases

Search and Recommendation system
Content Classification
Customer Support
Research Paper Screening
Automatically Summarizing Resumes

👋 Conclusion

We have shown how to train our own name entity tagger along with proper inplementaion of train and predict pipeline.

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
.github/workflows		.github/workflows
artifacts		artifacts
assignment		assignment
flowchart		flowchart
ner		ner
notebooks		notebooks
.dockerignore		.dockerignore
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
app.py		app.py
config.yaml		config.yaml
initial_setup.sh		initial_setup.sh
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🆕 Personal Information Tagger Based on Named entity recognition

💽 Dataset

📚 Approach

🚀 API

🧑‍💻 How to setup

🧑‍💻 Tech Used

Deployment

🏭 Industrial Use-cases

👋 Conclusion

About

Releases

Packages

Languages

Rishav-hub/Personal-Information-Tagger-NER

Folders and files

Latest commit

History

Repository files navigation

🆕 Personal Information Tagger Based on Named entity recognition

💽 Dataset

📚 Approach

🚀 API

🧑‍💻 How to setup

🧑‍💻 Tech Used

Deployment

🏭 Industrial Use-cases

👋 Conclusion

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages