Analyzing Online Retail Business Performance
Insights required:
The aim of the project is to analyze online retail store data to answer to following business questions:
1)top 10 highest revenue generating products
2)top 5 highest selling products in each region
3)month over month growth comparison for 2022 and 2023 sales
4)for each category which month had highest sales
5)which sub category had highest growth by profit from 2022 to 2023
Methodology:
*Data has been extracted from Kaggle using Kaggle API into Jupyter notebook, for cleaning and pre processing using pandas.
*The cleaned data frame is then imported to MS SQL server Database using SQLAlchemy which is a Python SQL toolkit that allows developers to access and manage SQL databases using Pythonic domain language.
*SQL queries have been used to answer business questions making use of aggregations, CTEs, Case statements and window functions.