- Project Overview
- Features
- Directory Structure
- Installation
- How to Run
- Usage
- Roadmap
- Contributing
- License
- Contact
- Acknowledgments
- MADAR Citation
Welcome to the NLP Arabic Dialect Identification and Next Word Prediction project! This project leverages advanced natural language processing techniques to offer two main functionalities:
- Next Word Prediction (Knowledge-Based): Uses an n-gram model to predict the next word in a given sentence with the MADAR dataset.
- Arabic Dialect Identification (Machine Learning): Utilizes a BERT model with lexicon features to identify the Arabic dialect of a given text, leveraging the MADAR dataset.
✨ Experience the project live on Streamlit! ✨
- Next Word Prediction using n-gram model
- Arabic Dialect Identification using BERT
- Interactive UI with Streamlit
- Comprehensive text preprocessing for Arabic
- Python 3.x
- pip
- Streamlit
- Clone the repository:
git clone https://github.com/maans2001/UJ-NLP-Project cd UJ-NLP-Project
- Open Command Prompt or PowerShell.
- Navigate to the project directory and run:
run.bat
- Open Terminal.
- Navigate to the project directory and make the script executable:
chmod +x run.sh ./run.sh # use run_macosx.sh if you're on a mac machine
- Select "برنامج خمن الكلمة التالية (Knowledge Based)" from the sidebar.
- Enter a sentence in Arabic.
- Click "خمن الكلمات الجاية" to predict the next words.
- Select "برنامج تحديد اللهجات (Machine Learning)" from the sidebar.
- Enter an Arabic text.
- Click "حدد اللهجة" to identify the dialect.
- Add more dialects
- Improve the prediction model
- Enhance the UI/UX
Contributions are welcome! Please fork this repository and submit a pull request. For major changes, please open an issue first to discuss what you would like to change.
- Fork the Project
- Create your Feature Branch (`git checkout -b feature/AmazingFeature`)
- Commit your Changes (`git commit -m 'Add some AmazingFeature'`)
- Push to the Branch (`git push origin feature/AmazingFeature`)
- Open a Pull Request
Distributed under the MIT License.
For any inquiries or feedback, feel free to reach out!