Hello, I'm MuhammadBinUsman03 π
I'm a Machine Learning Engineer, currently contributing to and democratizing LLMs space. I explore recent advancements/research in the domain, experiment with them, share insights, and also build side projects.
Domain | Frameworks |
---|---|
Machine Learning | |
LLM-Training | |
LLM-Inferencing | |
Data-Generation | |
RAG | |
DevOps | |
Development |
- 3-Pipeline LLMOps System - Training pipeline finetunes model on a serverless GPU infrastructure, Logs checkpoints on WandB Registry. Streaming pipeline ingests data from a live source, processes, embeds, and stores into Qdrant VectorDB, the pipeline is deployed on AWS using GitHub CI/CD. Inference pipeline loads from model registry, calls LLM with context, and maintains chat history
- OrpoLlama3-8B - Surpassed Llama3-8B by 2 points on OpenLLM Leaderboard with 15K-steps ORPO training on 1xA100.
- apollo-preview-v0.2 - RP/Creative writing/Instruction following dataset curated in collaboration with QuasarResearch.
- QueryRouter - Dynamic routing system deployed on AWS for querying LLMs, boosting efficiency, and optimizing costs.
- AutoPrune - Automatic pruning of LLMs on Runpod-GPUs.
- Chain-QnA - RAG application deployed with LangServe as a REST-API for Basic/PDF QnA.
- ImageGallery-Microservices - A microservices architecture-based Google Photos clone, deployed on Google Cloud.