diff --git a/README.md b/README.md index c8606e8..288116d 100644 --- a/README.md +++ b/README.md @@ -1,190 +1,156 @@ -# LLM Alignment Assistant +# ๐ŸŒŒ LLM Alignment Assistant - Your Template for Aligning Language Models -## ๐ŸŒŸ Overview +## ๐Ÿ“Œ Introduction -**LLM Alignment Assistant** is an advanced tool designed to assist in aligning large language models (LLMs) with desired human values and objectives. This project offers a full-stack approach to training, fine-tuning, deploying, and monitoring LLMs using **Reinforcement Learning from Human Feedback (RLHF)**. The system also incorporates evaluation metrics to ensure ethical and effective use of language models. The assistant provides a user-friendly interface for exploring the alignment, visualization of training metrics, and deploying the system at scale using cloud-native technologies. +**LLM Alignment Assistant** is not just a comprehensive tool for aligning large language models (LLMs), but also serves as a **powerful template** for building your own LLM alignment application. This repository is designed to provide a full stack of functionality, acting as a starting point to customize and extend for your own LLM alignment needs. Whether you are a researcher, developer, or data scientist, this template provides a solid foundation for efficiently creating and deploying LLMs tailored to align with human values and objectives. -![โœจ Architecture Diagram](assets/architecture_diagram.png) -## โœจ Key Features +## โœจ Features -- **๐Ÿ–ฅ๏ธ User-Friendly Web Interface**: A sleek, intuitive UI for interacting with the LLM and viewing alignment results. -- **๐Ÿ“Š Interactive Training**: Train models using RLHF, with dynamic metrics displayed in real-time. -- **๐Ÿ› ๏ธ Data Augmentation & Preprocessing**: Advanced preprocessing scripts, including tokenization, cleaning, and data augmentation using NLP techniques. -- **โš™๏ธ Scalable Deployment**: Easy deployment via Docker and Kubernetes, with horizontal scaling capabilities. -- **๐Ÿ” Explainability & Monitoring**: Incorporates SHAP or LIME-based explainability features along with live monitoring dashboards. +- **๐ŸŒ Interactive Web Interface**: A user-friendly interface for interacting with the LLM, training models, and viewing alignment metrics. +- **๐Ÿง  Training with RLHF**: Reinforcement Learning from Human Feedback to ensure model alignment with human preferences. +- **๐Ÿ› ๏ธ Data Augmentation & Preprocessing**: Advanced preprocessing, tokenization, and data augmentation with back-translation and paraphrasing. +- **๐Ÿ”„ Transfer Learning**: Utilize pre-trained models like BERT for improved performance on specific tasks. +- **๐Ÿ“ฆ Scalable Deployment**: Docker and Kubernetes-based deployment with Horizontal Pod Autoscaling (HPA). +- **๐Ÿ” Model Explainability**: SHAP-based dashboards for understanding model decisions. +- **๐Ÿ“Š User Feedback Loop**: Collection of user ratings for fine-tuning models continuously. -## ๐Ÿ—‚๏ธ Project Structure +## ๐Ÿ“‚ Project Structure -- **๐Ÿ“ app/**: Contains the UI and the backend logic of the web interface. - - `ui.py`: Manages routes and interactions with the UI. - - `static/`: Contains styles and JavaScript for an appealing UI. - - `templates/`: HTML templates for rendering the web pages. -- **๐Ÿ“ data/**: Scripts and datasets for generating, downloading, and processing data. -- **๐Ÿ“ deployment/**: Docker, Kubernetes configurations, and Helm charts to manage deployments. -- **๐Ÿ“ src/**: Core functionality, including training, evaluation, and preprocessing scripts. -- **๐Ÿ“ tests/**: Unit and integration tests to ensure quality across the different components. +- **`app/`**: Contains API and UI code. + - `auth.py`, `feedback.py`, `ui.py`: API endpoints for user interaction, feedback collection, and general interface management. + - **Static Files**: JavaScript (`app.js`, `chart.js`), CSS (`styles.css`), and Swagger API documentation (`swagger.json`). + - **Templates**: HTML templates (`chat.html`, `feedback.html`, `index.html`) for UI rendering. -## ๐Ÿ› ๏ธ Setup +- **`src/`**: Core logic and utilities for preprocessing and training. + - **Preprocessing** (`preprocessing/`): + - `preprocess_data.py`: Combines original and augmented datasets and applies text cleaning. + - `tokenization.py`: Handles tokenization. + - **Training** (`training/`): + - `fine_tuning.py`, `transfer_learning.py`, `retrain_model.py`: Scripts for training and retraining models. + - `rlhf.py`, `reward_model.py`: Scripts for reward model training using RLHF. + - **Utilities** (`utils/`): Common utilities (`config.py`, `logging.py`, `validation.py`). -### ๐Ÿ“‹ Prerequisites +- **`dashboards/`**: Performance and explainability dashboards for monitoring and model insights. + - `performance_dashboard.py`: Displays training metrics, validation loss, and accuracy. + - `explainability_dashboard.py`: Visualizes SHAP values to provide insight into model decisions. -- Python 3.8+ -- Docker & Docker Compose -- Kubernetes (Minikube or any cloud provider) -- Node.js (for front-end enhancements) +- **`tests/`**: Unit, integration, and end-to-end tests. + - `test_api.py`, `test_preprocessing.py`, `test_training.py`: Various unit and integration tests. + - **End-to-End Tests** (`e2e/`): Cypress-based UI tests (`ui_tests.spec.js`). + - **Load Testing** (`load_testing/`): Uses Locust (`locustfile.py`) for load testing. -### ๐Ÿ”ง Installation +- **`deployment/`**: Configuration files for deployment and monitoring. + - **Kubernetes Configurations** (`kubernetes/`): Deployment and Ingress configurations for scaling and canary releases. + - **Monitoring** (`monitoring/`): Prometheus (`prometheus.yml`) and Grafana (`grafana_dashboard.json`) for performance and system health monitoring. -1. **๐Ÿ“ฅ Clone the Repository**: - ```bash - git clone https://github.com/yourusername/LLM-Alignment-Assistant.git - cd LLM-Alignment-Assistant - ``` +## โš™๏ธ Setup -2. **๐Ÿ“ฆ Set Up the Virtual Environment**: - ```bash - python3 -m venv venv - source venv/bin/activate - pip install -r requirements.txt - ``` +### Prerequisites + +- ๐Ÿ Python 3.8+ +- ๐Ÿณ Docker & Docker Compose +- โ˜ธ๏ธ Kubernetes (Minikube or a cloud provider) +- ๐ŸŸข Node.js (for front-end dependencies) -3. **๐Ÿ“ฆ Install Node.js Dependencies** (optional for enhanced UI): +### ๐Ÿ“ฆ Installation + +1. **Clone the Repository**: ```bash - cd app/static - npm install + git clone https://github.com/yourusername/LLM-Alignment-Assistant.git + cd LLM-Alignment-Assistant ``` -### ๐Ÿš€ Running Locally +2. **Install Dependencies**: + - Python dependencies: + ```bash + pip install -r requirements.txt + ``` + - Node.js dependencies (optional for UI improvements): + ```bash + cd app/static + npm install + ``` -To run the application locally: +### ๐Ÿƒ Running Locally -1. **๐Ÿณ Build Docker Image**: +1. **Build Docker Images**: ```bash docker-compose up --build ``` -2. **๐ŸŒ Access the UI**: - Visit `http://localhost:5000` in your web browser. +2. **Access the Application**: + - Open a browser and visit `http://localhost:5000`. -## ๐Ÿ“ฆ Deployment +## ๐Ÿšข Deployment -### ๐Ÿšข Docker and Kubernetes +### โ˜ธ๏ธ Kubernetes Deployment -- **๐Ÿณ Docker**: A Dockerfile is provided for containerization. -- **โ˜ธ๏ธ Kubernetes**: Use the provided `deployment/kubernetes/deployment.yml` and `service.yml` files to deploy the app to a Kubernetes cluster. -- **๐Ÿ“œ Helm Charts**: Helm charts are available in the `deployment/helm/` directory for easier reusability and scalability. +- **Deploy to Kubernetes**: + - Apply the deployment and service configurations: + ```bash + kubectl apply -f deployment/kubernetes/deployment.yml + kubectl apply -f deployment/kubernetes/service.yml + ``` + - **Horizontal Pod Autoscaler**: + ```bash + kubectl apply -f deployment/kubernetes/hpa.yml + ``` -### ๐Ÿ”„ CI/CD Pipeline +### ๐ŸŒŸ Canary Deployment -A GitHub Actions workflow is included to automate building, testing, and deployment: +- Canary deployments are configured using `deployment/kubernetes/canary_deployment.yml` to roll out new versions safely. -- **โœ… Lint & Test**: Linting and unit tests are run at every pull request. -- **๐Ÿ‹ Docker Build & Push**: Docker images are built and pushed to Docker Hub. -- **โ˜ธ๏ธ Kubernetes Deployment**: Automatically deploy to the Kubernetes cluster upon merging. +### ๐Ÿ“ˆ Monitoring and Logging -## ๐Ÿค– Training and Fine-Tuning +- **Prometheus and Grafana**: + - Apply Prometheus and Grafana configurations in `deployment/monitoring/` to enable monitoring dashboards. +- **๐Ÿ“‹ Centralized Logging**: The **ELK Stack** is configured with Docker using `docker-compose.logging.yml` for centralized logs. -### ๐Ÿ’ก Reinforcement Learning from Human Feedback (RLHF) +## ๐Ÿง  Training and Evaluation -The training module includes: -- **๐Ÿ“Š Fine-Tuning**: Using the `training/fine_tuning.py` script, models can be fine-tuned on specific datasets. -- **๐Ÿ† Reward Models**: Implemented in `training/reward_model.py` for evaluating the appropriateness of responses. -- **๐ŸŒ Distributed Training**: Support for distributed training using `training/rlhf.py`. +### ๐Ÿ”„ Transfer Learning -### ๐ŸŽ›๏ธ Hyperparameter Tuning +The training module (`src/training/transfer_learning.py`) uses pre-trained models like **BERT** to adapt to custom tasks, providing a significant performance boost. -For hyperparameter tuning, **Optuna** has been integrated to provide automated exploration of the training parameters, ensuring optimal model performance. +### ๐Ÿ“Š Data Augmentation -## ๐Ÿ”„ Data Pipeline +The `data_augmentation.py` script (`src/data/`) applies augmentation techniques like back-translation and paraphrasing to improve data quality. -- **๐Ÿ› ๏ธ Data Augmentation**: Using advanced NLP techniques, including back-translation and embedding-based masking, available in `preprocessing/augmentation.py`. -- **โœ… Validation**: Thorough validation scripts (`preprocess_data.py` and `validate_data.py`) to maintain data quality. -- **โš™๏ธ Automation with Apache Airflow**: Data pipeline orchestration using Airflow, ensuring proper data flow between stages. +### ๐Ÿง  Reinforcement Learning from Human Feedback (RLHF) -## ๐Ÿ“ˆ Evaluation and Monitoring +- **Reward Model Training**: Uses the `rlhf.py` and `reward_model.py` scripts to fine-tune models based on human feedback. +- **Feedback Collection**: Users rate responses via the feedback form (`feedback.html`), and the model retrains with `retrain_model.py`. -- **๐Ÿ“Š Metrics**: The `evaluation/metrics.py` script provides a detailed analysis of model performance, including bias detection and fairness metrics. -- **๐Ÿ›ก๏ธ Safety Testing**: Ethical AI assessments using `evaluation/safety_tests.py`. -- **๐Ÿ“Š Dashboard**: Real-time monitoring with **Streamlit**, displaying key metrics, including training loss, accuracy, and reward trends. +### ๐Ÿ” Explainability Dashboard -## ๐ŸŒ Web Interface Improvements - -- **๐ŸŽจ Improved UI with TailwindCSS**: We've enhanced the CSS for modern and engaging aesthetics. -- **๐Ÿ“ˆ Interactive Visualizations**: Added **Chart.js** visualizations to present alignment metrics in a clear, graphical format. -- **๐Ÿ’ฌ Chatbot Integration**: A conversational UI element to interact directly with the trained LLM. +The `explainability_dashboard.py` script uses **SHAP** values to help users understand why a model made specific predictions. ## ๐Ÿงช Testing -- **โœ… Unit Tests**: Located in `tests/`, covering training, preprocessing, and evaluation. -- **๐Ÿ”„ Integration Tests**: End-to-end tests that simulate full pipeline execution. -- **๐Ÿงช Mock Testing**: Use of `pytest-mock` to simulate API calls and external integrations. - -## ๐Ÿ“Š Monitoring and Logging - -- **๐Ÿ“ˆ Monitoring**: Kubernetes monitoring using **Prometheus** and **Grafana**, with Horizontal Pod Autoscaling (HPA) for scalability. -- **๐Ÿ” Explainability**: SHAP and LIME explainability metrics are added to the evaluation process, providing insights into model behavior. -- **๐Ÿ“œ Logging**: Centralized logging using **ELK Stack** (Elasticsearch, Logstash, Kibana). - -## ๐Ÿš€ Cloud Deployment Instructions (AWS) - -To deploy the LLM Alignment Assistant on **AWS**, you can utilize **Elastic Kubernetes Service (EKS)** or **AWS Sagemaker** for model training: - -1. **AWS Elastic Kubernetes Service (EKS)**: - - Create an EKS cluster using AWS CLI or the console. - - Apply the Kubernetes deployment files: - ```bash - kubectl apply -f deployment/kubernetes/deployment.yml - kubectl apply -f deployment/kubernetes/service.yml - ``` - - Configure the **Horizontal Pod Autoscaler (HPA)** to ensure scalability: - ```bash - kubectl apply -f deployment/kubernetes/hpa.yml - ``` - -2. **AWS Sagemaker for Model Training**: - - Modify the `training/fine_tuning.py` to integrate with AWS Sagemaker. - - Use the Sagemaker Python SDK to launch a training job: - ```python - import sagemaker - from sagemaker.pytorch import PyTorch - - role = "arn:aws:iam::your-account-id:role/service-role/AmazonSageMaker-ExecutionRole-2023" - - pytorch_estimator = PyTorch( - entry_point='training/fine_tuning.py', - role=role, - instance_count=1, - instance_type='ml.p2.xlarge', - framework_version='1.8.0', - py_version='py3' - ) - - pytorch_estimator.fit({'training': 's3://your-bucket-name/training-data'}) - ``` - - Ensure IAM roles and permissions are properly set for accessing **S3** and **Sagemaker**. - +- **โœ… Unit Tests**: Located in `tests/`, covering API, preprocessing, and training functionalities. +- **๐Ÿ–ฅ๏ธ End-to-End Tests**: Uses **Cypress** to test UI interactions. +- **๐Ÿ“Š Load Testing**: Implemented with **Locust** (`tests/load_testing/locustfile.py`) to ensure stability under load. -## ๐Ÿš€ Future Work +## ๐Ÿ”ฎ Future Work -- **๐ŸŒ Multi-Language Support**: Expand the LLM's training to support multiple languages. -- **โš–๏ธ Ethical AI Enhancements**: Further enhance bias detection and mitigation techniques. -- **โ˜๏ธ Cloud-Native Deployment**: Implement cloud services like **AWS SageMaker** for training at scale. +- **๐Ÿ”‘ User Roles and Permissions**: Adding a role-based access control system. +- **๐Ÿ“‰ Advanced Monitoring**: Further enhance Prometheus alerts for anomaly detection. +- **๐Ÿš€ Public Demo Deployment**: Deploy a public version on Heroku or AWS for showcasing. -## ๐Ÿค Getting Involved +## ๐Ÿค Contributing -Contributions are welcome! Feel free to submit issues, pull requests, or suggestions for new features. +Contributions are welcome! Please submit pull requests or issues for improvements or new features. ## ๐Ÿ“œ License -This project is licensed under the MIT License. See the [LICENSE](LICENSE) file for more details. +This project is licensed under the MIT License. See the [LICENSE](LICENSE) file for more information. ## ๐Ÿ“ฌ Contact -- **โœ‰๏ธ Email**: [amirsina.torfi@gmail.com](mailto:amirsina.torfi@gmail.com) -- **๐ŸŒ Website**: [Portfolio](https://astorfi.github.io) +- **๐Ÿ“ง Email**: [amirsina.torfi@gmail.com](mailto:amirsina.torfi@gmail.com) +- **๐ŸŒ Website**: [Your Portfolio](https://astorfi.github.io) --- -

Made with โค๏ธ by Amirsina Torfi

- +

Developed with โค๏ธ by Your Name