Motivation:
With the ever-rising need for automation and real-time tracking across sales organizations to minimize human error and detect fraud, I sought to develop a Streamlit application that uses a MySQL Server database and is integrated with Apache Kafka, whose low latency enables real-time data streaming.
This is a real-time Streamlit pharmacy sales tracker that uses a star schema to track sales across several pharmacy outlets for a large pharmaceutical company. The application leverages the star schema, which is:
- Easier to understand and manage.
- Less dependent on table joins.
- High-performing, since denormalization reduces join overhead.
The application also uses a MySQL Server database for data entry, which has several advantages (see the transaction sketch after this list):
- Supports transactions.
- Enforces data integrity.
- Handles several transaction requests simultaneously.
- Offers atomicity, so a write either commits fully or not at all.
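As a minimal sketch of how the app can lean on these transactional guarantees, the snippet below wraps a sales insert in an explicit transaction using the mysql-connector-python driver. The database, table, and column names are illustrative assumptions, not the exact schema used in the repository.

```python
import mysql.connector

# Hypothetical connection settings; adjust to your local MySQL Server instance.
conn = mysql.connector.connect(
    host="localhost", user="root", password="secret", database="pharmacy_db"
)

try:
    cursor = conn.cursor()
    # Insert a sale; the table and column names are illustrative.
    cursor.execute(
        "INSERT INTO fact_sales (drug_id, employee_id, quantity, total_price) "
        "VALUES (%s, %s, %s, %s)",
        (101, 7, 3, 29.97),
    )
    conn.commit()  # Atomicity: the insert is only visible once committed.
except mysql.connector.Error:
    conn.rollback()  # On any failure, the whole transaction is undone.
    raise
finally:
    conn.close()
```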
The application also integrates Apache Kafka for real-time data streaming as well as transformations. Using Kafka offers the following benefits:
- Data durability and reliability, because data is stored on disk across brokers.
- Real-time data processing.
- Flexibility in batch and stream processing.
- Data auditing and compliance: with Change Data Capture (CDC) approaches, Kafka facilitates data replication across multiple systems or databases, ensuring accurate and consistent data for auditing and compliance purposes.
Develop a data model that follows the star-schema approach, with dimension tables and a fact table. The table models can be found here and follow the standard SQL approach.
Defining the tables in a separate file keeps the application flexible should further changes arise, and it also makes the application easier to debug.
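For illustration, a minimal star-schema definition might look like the following sketch, with three dimension tables feeding one fact table through foreign keys. The table and column names here are assumptions for the sketch; the real models live in the repository's table file.

```python
import mysql.connector

# DDL for a minimal star schema: dimension tables plus one fact table.
# Names are illustrative, not the repository's exact models.
DDL = [
    """CREATE TABLE IF NOT EXISTS dim_doctor (
        doctor_id INT PRIMARY KEY AUTO_INCREMENT,
        doctor_name VARCHAR(100) NOT NULL
    )""",
    """CREATE TABLE IF NOT EXISTS dim_employee (
        employee_id INT PRIMARY KEY AUTO_INCREMENT,
        employee_name VARCHAR(100) NOT NULL,
        outlet VARCHAR(100)
    )""",
    """CREATE TABLE IF NOT EXISTS dim_drug (
        drug_id INT PRIMARY KEY AUTO_INCREMENT,
        drug_name VARCHAR(100) NOT NULL,
        unit_price DECIMAL(10, 2)
    )""",
    """CREATE TABLE IF NOT EXISTS fact_sales (
        sale_id INT PRIMARY KEY AUTO_INCREMENT,
        doctor_id INT,
        employee_id INT,
        drug_id INT,
        quantity INT,
        total_price DECIMAL(10, 2),
        sale_time TIMESTAMP DEFAULT CURRENT_TIMESTAMP,
        FOREIGN KEY (doctor_id) REFERENCES dim_doctor (doctor_id),
        FOREIGN KEY (employee_id) REFERENCES dim_employee (employee_id),
        FOREIGN KEY (drug_id) REFERENCES dim_drug (drug_id)
    )""",
]

conn = mysql.connector.connect(
    host="localhost", user="root", password="secret", database="pharmacy_db"
)
cursor = conn.cursor()
for statement in DDL:
    cursor.execute(statement)
conn.commit()
conn.close()
```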
This Python file defines a class using the traditional Python OOP approach, which offers more customization and flexibility to the main Streamlit application. It also allows form sharing across the Doctor, Employee, and Drug items tables, the dimension tables that are vital in providing more context to the fact table. A sketch of this pattern follows below.
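As a rough sketch (not the repository's actual class), the pattern might look like this: one class owns the Streamlit form widgets for the dimension tables so the main app can reuse them. The class and method names here are hypothetical.

```python
import streamlit as st

class SalesForms:
    """Hypothetical helper class grouping the dimension-table entry forms."""

    def doctor_form(self):
        # Form for the Doctor dimension table.
        with st.form("doctor_form"):
            name = st.text_input("Doctor name")
            if st.form_submit_button("Save doctor"):
                return {"doctor_name": name}
        return None

    def drug_form(self):
        # Form for the Drug items dimension table.
        with st.form("drug_form"):
            name = st.text_input("Drug name")
            price = st.number_input("Unit price", min_value=0.0)
            if st.form_submit_button("Save drug"):
                return {"drug_name": name, "unit_price": price}
        return None

forms = SalesForms()
new_drug = forms.drug_form()  # Returns the submitted values, or None.
```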
Integrate Apache Kafka into the Streamlit application to serve as the producer. The data should be in JSON format for easier ingestion into the Kafka topics. This is made possible by a serializer, which transforms the data into JSON format; a sketch follows below.
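A minimal producer sketch using the kafka-python package is shown below; the topic name sales_topic and the record fields are assumptions for illustration.

```python
import json
from kafka import KafkaProducer

# The serializer turns each Python dict into JSON bytes before it hits the topic.
producer = KafkaProducer(
    bootstrap_servers="localhost:9092",
    value_serializer=lambda record: json.dumps(record).encode("utf-8"),
)

# Hypothetical sale record collected from the Streamlit form.
sale = {"drug_id": 101, "employee_id": 7, "quantity": 3, "total_price": 29.97}
producer.send("sales_topic", value=sale)  # "sales_topic" is an assumed topic name.
producer.flush()  # Block until the message is actually delivered.
```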
Read data from the Kafka topics with a consumer to allow for real-time data streaming as well as processing. The consumer can be found here; a sketch is given below.
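A matching consumer sketch with kafka-python might look like this, again assuming the sales_topic name and JSON-encoded values:

```python
import json
from kafka import KafkaConsumer

# The deserializer mirrors the producer: JSON bytes back into Python dicts.
consumer = KafkaConsumer(
    "sales_topic",  # Assumed topic name, matching the producer sketch.
    bootstrap_servers="localhost:9092",
    auto_offset_reset="earliest",
    value_deserializer=lambda raw: json.loads(raw.decode("utf-8")),
)

# Stream records as they arrive from the Streamlit producer.
for message in consumer:
    sale = message.value
    print(f"New sale: {sale}")
```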
To get started with Apache Kafka, ZooKeeper should be running. On Windows, the command to start ZooKeeper is .\bin\windows\zookeeper-server-start.bat .\config\zookeeper.properties. The Kafka server should also be running, which is done with the command .\bin\windows\kafka-server-start.bat .\config\server.properties.
N.B.: Apache Kafka should be correctly configured in the environment variables to allow port communication.
The Streamlit app is deployed locally because the database is only available locally and Apache Kafka occupies local ports. Here is a snippet of the user interface for inputting sales data to provide real-time tracking.
After running the consumer, here is a snapshot of how the data streams in from the Streamlit application.