๐ Project Overview
This project applies financial analytics and machine learning to the SuperStore dataset, transforming raw sales data into actionable insights that directly impact business performance.
The goal is to move beyond traditional reporting into predictive and prescriptive analytics โ enabling management to anticipate profitability, optimise strategies, and strengthen financial decision-making.
๐ฏ Objectives
- Identify Key Financial Drivers
- Analyse which variables (customer segments, product categories, discount levels, regions) most affect profitability.
- Quantify their financial impact.
- Develop Predictive Models
- Build models to forecast sales and classify transactions as profitable vs. unprofitable.
- Target: 85%+ accuracy in profit classification.
- Segment Customer Base
- Analyse purchasing patterns.
- Score and rank customers for targeted marketing and retention strategies.
- Optimise Discount Strategy
- Find discount thresholds that maximise revenue while protecting margins.
- Recommend dynamic pricing strategies.
- Deliver Actionable Recommendations
- Provide clear strategies with ROI projections.
- Build dashboards to support continuous monitoring.
๐ Dataset Summary
- Source: Kaggle โ SuperStore Sales dataset
- Data Type: Transaction-level data
- Scope: Sales, profit, discount, product, customer, geography, time
Key Features:
- Financial metrics: Sales, profit, profit margin, discount
- Product attributes: Product name, category, subcategory, manufacturer
- Customer attributes: Segment, customer name
- Geographic attributes: Region, city, state, postal code
- Temporal attributes: Order date, ship date
This dataset supports multi-dimensional analysis across customers, products, regions, and time periods.
๐ Approach
1. Exploratory Data Analysis (EDA)
- Analysed revenue, profit trends, and discount patterns.
- Compared profitability by product category, customer segment, and region.
- Visualised sales seasonality and operational efficiency (order vs. ship dates).
2. Feature Engineering
- Calculated profit margins, customer lifetime value, discount effectiveness.
- Created segmentation variables for customer targeting.
- Developed regional performance metrics.
3. Predictive Modelling
- Regression & Classification: Predicted profitability of transactions.
- Forecasting Models: Time-series models for sales and demand prediction.
- Achieved 85%+ classification accuracy in identifying profitable vs. loss-making transactions.
4. Business Intelligence Dashboards
- Designed Power BIโstyle financial dashboards.
- Enabled drill-down by region, segment, and category.
- Provided real-time insights for executives.
โจ Key Insights
Financial Challenges Identified
- Margin Pressure: Aggressive discounting erodes profit.
- Inventory Costs: Excess stock reduces efficiency.
- Acquisition Costs: Rising marketing costs demand smarter targeting.
Operational Findings
- Regional performance varies significantly.
- Some categories drive high sales but low margins.
- Seasonal sales patterns affect cash flow and logistics.
Impact Potential
- 5โ15% margin improvement via optimised pricing/discounts.
- 10โ20% lower inventory costs with better demand forecasting.
- 15โ25% higher customer lifetime value through segmentation.
- 20โ30% stronger regional performance via tailored strategies.
๐ ๏ธ Tools & Tech
- Languages: Python (pandas, NumPy, matplotlib, seaborn, scikit-learn)
- Techniques: Regression, Classification, Forecasting, Feature Engineering
- Visualisation: Power BI (KPIs, drill-down dashboards)
- Business Analytics: Variance analysis, ROI assessment, scenario planning
๐ Outcome
- Built a profit classification model (85%+ accuracy).
- Delivered dashboard-style insights for executives.
- Identified strategies for margin optimisation, customer retention, and regional growth.
- Produced clear ROI-focused recommendations for management.
๐ Links
- ๐ View on GitHub
- ๐ Live Dashboard
๐ Learning Outcomes
- Applied data science in a financial analytics context.
- Strengthened skills in profitability analysis, customer segmentation, and forecasting.
- Demonstrated how to translate analytics into real-world business recommendations.
- Practiced combining machine learning with BI dashboards for maximum impact.
๐ฎ Future Work
- Incorporate deep learning for demand forecasting.
- Add real-time data feeds (sales API, ERP integration).
- Expand discount optimisation into a dynamic pricing engine.
- Enhance explainability with SHAP/feature importance dashboards.
ยฉ Teslim Adeyanju 2025. All Rights Reserved.