Automated Data Cleaning Pipeline with Python and PostgreSQL

Most people clean data. I built a pipeline that does it automatically, loads it into PostgreSQL, and visualises it in Power BI — end to end. Here's what my latest project covers 👇 I analysed 3,900 retail customer records to answer 10 real business questions: — Do subscribers actually spend more? — Which products get discounts the most — and does it hurt revenue? — Who are the loyal customers driving the most revenue? The answers were surprising. 🔧 What I built: → Python ETL pipeline with group-aware null imputation & feature engineering → PostgreSQL database loaded via SQLAlchemy (CTEs, Window Functions, subqueries) → Interactive Power BI dashboard with live KPIs 📊 Key finding: A significant segment of discount users still pays above the average purchase amount — discounts don't always mean lower margins. This is the kind of analysis I'd bring to any data team from day one. 🛠 Python · SQL · PostgreSQL · Power BI · SQLAlchemy Github:-https://lnkd.in/eDQusQWq #DataAnalyst #DataAnalytics #Python #SQL #PowerBI #PostgreSQL #ETL #Pandas #OpenToWork #DataPortfolio #PortfolioProject #BuildInPublic #LondonJobs #TechCareers #DataDriven #BusinessIntelligence #GitHub #DataScience #CareerInData #HiringAlert

  • graphical user interface, application

Looks very professional’

Like
Reply

Insightful dashboard and very beautiful and simple to understand.

Like
Reply
See more comments

To view or add a comment, sign in

Explore content categories