Skip to content

This project uses SQL to analyze supply chain data, focusing on orders, cancellations, and customer trends. It includes datasets and queries to help understand how SQL can be applied to real-world supply chain problems.

License

Notifications You must be signed in to change notification settings

heyamay/SQL-Supply-Chain-Analysis

Repository files navigation

SQL Supply Chain Analysis

banner-supply-chain-management-vector-illustration-concept-icon-with-management-analysis-plan-product-procurement-logistic-distribution-2RBW8J0_enhanced

Overview

This project demonstrates a comprehensive analysis of supply chain data using SQL. It covers a range of insights such as order trends, customer behavior, item popularity, and advanced classifications. The dataset includes sales data (sales_test.csv) and cancellations data (canceled_test.csv). The queries are categorized into Easy, Intermediate, and Advanced levels based on complexity.

Features

  • Basic Analytics: Total orders, unique customers, and average items ordered.
  • Customer Insights: Top customers, canceled orders, and successful order rates.
  • Item Trends: Top-ordered items, ABC classification, and canceled vs. shipped items.
  • Advanced Techniques: Cumulative sales, order rankings, and customer contributions to sales.

Tools and Technologies

  • Database: MySQL
  • Programming Language: SQL
  • Dataset: sales_test.csv and canceled_test.csv
  • Deployment: Available on GitHub for exploration and learning.

Query Categories

Easy Level

  1. Orders in January 2017: Count of orders placed in January 2017.
  2. Units Ordered in February 2017: Total number of units ordered.
  3. Canceled Orders per Customer: Number of canceled orders for each customer.
  4. Unique Customers: Count of unique customers.
  5. Average Items Ordered: Average number of items ordered per order.
  6. Top 5 Ordered Items: List of the most ordered items.
  7. Successful Orders for Specific Customers: Total successful orders by selected customers.

Intermediate Level

  1. Units Ordered vs. Canceled: Comparison of orders and cancellations for items appearing in both datasets.
  2. Canceled vs. Successful Orders: A detailed comparison for the same items.
  3. Order Classification: Categorization of orders as 'High', 'Medium', or 'Low' based on volume.
  4. Shipped Items Percentage: Calculation of shipped items as a percentage of ordered items for each customer.
  5. Top 3 Customers with Canceled Orders: Identification of customers with the highest cancellations.
  6. Items Canceled More than Shipped: Items with more cancellations than shipments.
  7. Top Ordering Customer in January 2017: Identification of the customer with the most orders.

Advanced Level

  1. Cumulative Ordered Units: Cumulative total of ordered units ranked by date for each customer.
  2. Top 3 Customers by Canceled Orders: Analysis of customers with the most cancellations and their total sales.
  3. Top Customers' Contribution to Sales: Percentage contribution of top 5 customers to total sales.
  4. ABC Classification: Classification of items into A, B, and C categories based on sales contribution.

Dataset

1. sales_test.csv

Contains details about orders placed, including:

  • ORDER_NO: Order ID
  • CUSTOMER_NO: Customer ID
  • ITEM: Item ordered
  • NS_ORDER: Number of items ordered
  • NS_SHIP: Number of items shipped
  • DATE: Date of the order

2. canceled_test.csv

Contains information on cancellations, including:

  • CUSTOMER_NO: Customer ID
  • ITEM: Item canceled
  • NC_ORDER: Number of items canceled
  • NC_SHIP: Number of canceled items shipped

Key Findings

  • Understanding how frequently orders are canceled and which items are more likely to be canceled.
  • Determining service level performance for different customers and identifying top performers.
  • Performing an ABC classification to prioritize items based on sales volume.
  • Identifying peak sales and cancellation days, helping the business make informed operational decisions.

How to Use

  1. Clone the repository:
    git clone https://github.com/heyamay/SQL-Supply-Chain-Analysis.git
  2. Load the datasets (sales_test.csv and canceled_test.csv) into your SQL database.
  3. Execute the provided SQL queries in your SQL environment.
  4. Review the results and insights generated.

🚀 About Me

Hi, I'm Amay Jaiswal! 👋

I am a Data Analytics Enthusiast and Data science practitioner

Linkedin

About

This project uses SQL to analyze supply chain data, focusing on orders, cancellations, and customer trends. It includes datasets and queries to help understand how SQL can be applied to real-world supply chain problems.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published