Excited to share the ML pipeline I built to automate the full workflow, from preprocessing to model ensembling!

Key highlights:
• KNNImputer + FunctionTransformer for handling missing values
• OneHotEncoder for categorical encoding
• RobustScaler for numerical scaling
• Ensemble of Random Forest, Gradient Boosting & XGBoost combined with a VotingClassifier

This pipeline ensures clean data, consistent preprocessing, and efficient model training, all in one place!

#MachineLearning #DataScience #Python #ScikitLearn #XGBoost #MLPipeline #AI #DataAnalytics #MLModels #FeatureEngineering #EnsembleLearning #CodingJourney #PortfolioProject
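A minimal sketch of how such a pipeline could be wired up in scikit-learn. The column names (`age`, `income`, `city`) and the toy data are placeholders, the FunctionTransformer step is omitted, and XGBoost is left out to avoid an extra dependency; the original ensemble also includes an XGBClassifier.

```python
import pandas as pd
from sklearn.compose import ColumnTransformer
from sklearn.ensemble import (GradientBoostingClassifier,
                              RandomForestClassifier, VotingClassifier)
from sklearn.impute import KNNImputer
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import OneHotEncoder, RobustScaler

num_cols = ["age", "income"]  # placeholder numeric features
cat_cols = ["city"]           # placeholder categorical feature

# Impute + scale numeric columns, one-hot encode categoricals
preprocess = ColumnTransformer([
    ("num", Pipeline([("impute", KNNImputer(n_neighbors=5)),
                      ("scale", RobustScaler())]), num_cols),
    ("cat", OneHotEncoder(handle_unknown="ignore"), cat_cols),
])

# Soft-voting ensemble (the post's version also adds an XGBClassifier)
ensemble = VotingClassifier(
    estimators=[("rf", RandomForestClassifier(random_state=0)),
                ("gb", GradientBoostingClassifier(random_state=0))],
    voting="soft",
)

model = Pipeline([("prep", preprocess), ("clf", ensemble)])

# Tiny synthetic frame with missing values to exercise the pipeline
df = pd.DataFrame({"age": [25, 32, None, 41, 29, 35],
                   "income": [40000, 52000, 61000, None, 45000, 58000],
                   "city": ["A", "B", "A", "B", "A", "B"]})
y = [0, 1, 0, 1, 0, 1]
preds = model.fit(df, y).predict(df)
```

Keeping the imputer, scaler, and encoder inside the pipeline means the same transforms are fitted on training data only and re-applied consistently at prediction time.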
Today I explored how machine learning models handle categorical features — specifically, converting text data like city names into numbers the model can understand. Using the pandas get_dummies() function, I created dummy variables for the town column in my dataset, merged them back, and trained a Linear Regression model to predict house prices. It was cool to see how encoding categories correctly can change the model's accuracy and make predictions more reliable. #MachineLearning #DataScience #Python #LinearRegression #Pandas #ScikitLearn #StudentLearning #AI
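A minimal sketch of the workflow described; the town names, areas, and prices are made up for illustration, since the post's dataset isn't shown.

```python
import pandas as pd
from sklearn.linear_model import LinearRegression

# Toy housing data; towns and prices are invented for illustration
df = pd.DataFrame({
    "town": ["monroe", "west windsor", "robinsville", "monroe", "robinsville"],
    "area": [2600, 2800, 3200, 3000, 2900],
    "price": [550000, 610000, 690000, 595000, 640000],
})

# One-hot encode the town column; drop one level to avoid
# perfect collinearity (the "dummy variable trap")
dummies = pd.get_dummies(df["town"], drop_first=True)
X = pd.concat([df[["area"]], dummies], axis=1)

model = LinearRegression().fit(X, df["price"])
```

Dropping one dummy level is optional for prediction but keeps the linear model's coefficients interpretable relative to the dropped baseline town.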
🌟 Just learned my first machine learning algorithm — K-Nearest Neighbors (KNN)! KNN is simple but powerful — it predicts based on the nearest data points. What amazed me is how much feature scaling affects accuracy. 💡 Key takeaway: Choosing the right K value and scaling your features properly makes a big difference in performance! Next up: experimenting with Naive Bayes and SVM 🚀 #MachineLearning #Python #DataScience #KNN #LearningJourney #AI
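A small experiment that makes the scaling point concrete. The Wine dataset is an assumption (the post doesn't name its data); it works well here because its features differ in range by orders of magnitude.

```python
from sklearn.datasets import load_wine
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

X, y = load_wine(return_X_y=True)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

# Same KNN model, with and without scaling; unscaled features let the
# large-magnitude columns dominate the Euclidean distance
raw_acc = KNeighborsClassifier(n_neighbors=5).fit(X_tr, y_tr).score(X_te, y_te)
scaled_acc = make_pipeline(
    StandardScaler(), KNeighborsClassifier(n_neighbors=5)
).fit(X_tr, y_tr).score(X_te, y_te)
```

On this dataset the scaled model typically beats the raw one by a wide margin, which is exactly the effect the post describes.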
#Day19 of #100DaysOfCode
K-Nearest Neighbors (KNN) Algorithm

Today I explored K-Nearest Neighbors (KNN), one of the most intuitive machine learning algorithms. KNN predicts outcomes based on the closest data points, following the idea that "similar things stay close to each other."

Achieved 96.67% accuracy on the classic Iris dataset. A simple yet powerful approach for classification tasks!

#MachineLearning #KNN #AI #Python #DataScience #100DaysOfCode #MLProjects
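A sketch of how the Iris experiment might look, including a simple search over K. The split is an assumption, so the post's exact 96.67% figure is not reproduced here.

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier

X, y = load_iris(return_X_y=True)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.2,
                                          random_state=42, stratify=y)

# Try several K values and keep the best held-out accuracy
best_k, best_acc = 1, 0.0
for k in range(1, 11):
    acc = KNeighborsClassifier(n_neighbors=k).fit(X_tr, y_tr).score(X_te, y_te)
    if acc > best_acc:
        best_k, best_acc = k, acc
```

In practice a proper search would use cross-validation rather than a single held-out split, so the chosen K doesn't overfit one particular test set.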
#Day29 of #100DaysOfCode
Today I learned about the Bias–Variance Tradeoff in Machine Learning.

High Bias → Underfitting (the model is too simple)
High Variance → Overfitting (the model is too complex)

The goal is to find the right balance for the best accuracy ✅ Understanding this tradeoff helps in building models that generalize well to unseen data.

#MachineLearning #DataScience #AI #Python #LearningJourney #100DaysOfCode
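The tradeoff can be illustrated with polynomial regression on noisy cubic data (a toy setup, not from the post): degree 1 underfits, degree 3 matches the true signal, and a very high degree tends to chase the noise.

```python
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_squared_error
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import PolynomialFeatures

rng = np.random.default_rng(0)
X = np.linspace(-3, 3, 60).reshape(-1, 1)
y = X.ravel() ** 3 + rng.normal(0, 2, 60)   # cubic signal plus noise

X_tr, X_te = X[::2], X[1::2]                # interleaved train/test split
y_tr, y_te = y[::2], y[1::2]

test_mse = {}
for degree in (1, 3, 15):
    fit = make_pipeline(PolynomialFeatures(degree),
                        LinearRegression()).fit(X_tr, y_tr)
    test_mse[degree] = mean_squared_error(y_te, fit.predict(X_te))
# Degree 1 underfits (high bias); very high degrees risk fitting the
# noise instead of the signal (high variance)
```

The held-out error for the degree-3 model comes out well below the degree-1 model, which is the tradeoff in miniature.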
Reflection Design Pattern in AI Agents, Explained Simply! In this short tutorial, I walk through how reflection works in AI agents. You'll learn how this pattern forms the foundation for self-improving AI systems, and how you can implement it yourself with just a few lines of code. 💻 GitHub repo: https://lnkd.in/gYiurHn9 #AI #MachineLearning #Agents #ReflectionPattern #Gemini #Python #AIDesignPatterns #LLM #GenerativeAI
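Since the linked repo isn't shown here, this is a hypothetical minimal version of the generate/critique/refine loop at the heart of the pattern; `llm` is a stand-in for any chat-model client (the post mentions Gemini), and the prompt wording is invented.

```python
def reflect(task, llm, max_rounds=3):
    """Generate a draft, then repeatedly critique and revise it."""
    draft = llm(f"Complete this task:\n{task}")
    for _ in range(max_rounds):
        critique = llm(f"Critique this answer to '{task}':\n{draft}")
        if "no issues" in critique.lower():   # stop when the critic is satisfied
            break
        draft = llm(f"Task: {task}\nDraft: {draft}\n"
                    f"Critique: {critique}\nRevise the draft.")
    return draft
```

The key idea is that the same model plays both author and critic, and the loop terminates either when the critic approves or after a fixed budget of rounds.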
Level up your AI stack in 2025: these Python tools cover everything from data pipelines to MLOps, so you can ship reliable models faster and prove impact. Prioritize niche expertise, add original takeaways, and spark discussion; the algorithm now rewards helpful insights, focused topics, and meaningful comments over generic virality. What's the one tool here that 10x'd your workflow this year, and why? #AI #ArtificialIntelligence #Python #DataScience #MachineLearning #MLOps #GenerativeAI #Analytics #DataEngineering #LLM #dataanalysis #analysis
#Day32 of #100DaysOfCode
Bagging vs Boosting in Action!

Today's ML deep dive was all about making models smarter 🤖 I explored two powerful ensemble methods: 🌲 Bagging (Random Forest) and ⚡ Boosting (AdaBoost).

📊 Results on the Iris dataset:
✅ Random Forest → 97% accuracy
✅ AdaBoost → 95% accuracy

Both gave great results:
👉 Bagging = stability & less overfitting
👉 Boosting = smarter learning from mistakes

Here's my accuracy comparison

#MachineLearning #Python #AI #DataScience #CodingJourney #100DaysOfCode #EnsembleLearning #Motivation
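A sketch that reproduces the comparison with 5-fold cross-validation; the exact 97%/95% figures depend on the split, so treat them as approximate.

```python
from sklearn.datasets import load_iris
from sklearn.ensemble import AdaBoostClassifier, RandomForestClassifier
from sklearn.model_selection import cross_val_score

X, y = load_iris(return_X_y=True)

# Bagging-style ensemble (independent trees on bootstrap samples)
rf_acc = cross_val_score(RandomForestClassifier(random_state=0),
                         X, y, cv=5).mean()
# Boosting-style ensemble (sequential learners reweighting mistakes)
ada_acc = cross_val_score(AdaBoostClassifier(random_state=0),
                          X, y, cv=5).mean()
```

Cross-validated means are a fairer comparison than a single train/test split, since both methods score very highly on Iris and one split can flip the ranking.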
In Episode 1 of my Learn AI from Scratch series, we build a fun little project: A rule-based machine that plays 'Guess the Number' with you. Watch the 4-min demo and see how a system makes decisions without any learning. Next up: real Machine Learning - where the AI starts to learn from data. Follow along if you're learning AI the hands-on way! #AI #RuleBasedAI #GuessTheNumber #Python #MachineLearning #LearnAI #TechSimplified #LinkedInCreators #videoseries
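The rule-based machine can be sketched as a plain binary search: every decision follows a fixed rule, with no learning from data. This is an assumed implementation, since the video's code isn't shown.

```python
def guess_number(secret, low=1, high=100):
    """Guess `secret` using a fixed rule: always pick the midpoint."""
    guesses = []
    while low <= high:
        guess = (low + high) // 2
        guesses.append(guess)
        if guess == secret:
            break
        if guess < secret:
            low = guess + 1    # rule: the secret is higher
        else:
            high = guess - 1   # rule: the secret is lower
    return guesses
```

Because the rule halves the range each time, any number from 1 to 100 is found in at most seven guesses; contrast that with a learned model, whose behavior comes from data rather than hand-written rules.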
ML Zoomcamp - Module 6: Decision Trees and Ensemble Learning 📊

Decision Trees and Ensemble Learning form a crucial part of machine learning, offering powerful methods for prediction and classification tasks. A decision tree models decisions and their possible consequences using a tree-like structure, making it easy to interpret and visualize. Ensemble learning builds on this by combining multiple models, such as Random Forests, Bagging, and Boosting, to improve performance, accuracy, and generalization compared to individual models.

This module covered:
➡️ Decision trees
➡️ Random forest
➡️ Gradient boosting (XGBoost)
➡️ Hyperparameter tuning
➡️ Feature importance

Together, Decision Trees and Ensemble Learning highlight the balance between simplicity and strength in machine learning models. By leveraging the interpretability of trees and the collective power of ensembles, we can build models that are both accurate and reliable, bridging the gap between data understanding and intelligent decision making.

#DecisionTrees #RandomForest #XGBoost #MachineLearning #MLZoomcamp #DataScience #Python #LearningInPublic Alexey Grigorev DataTalksClub
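A compact sketch covering the module's themes: tree-based boosting, hyperparameter tuning, and feature importance. It uses scikit-learn's GradientBoostingClassifier rather than XGBoost to avoid an extra dependency, and the Breast Cancer dataset is an assumption, not the module's data.

```python
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.model_selection import GridSearchCV, train_test_split

X, y = load_breast_cancer(return_X_y=True)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

# Small grid over two key boosting hyperparameters
search = GridSearchCV(
    GradientBoostingClassifier(random_state=0),
    {"max_depth": [2, 3], "learning_rate": [0.05, 0.1]},
    cv=3,
).fit(X_tr, y_tr)

best = search.best_estimator_
test_acc = best.score(X_te, y_te)
top3 = best.feature_importances_.argsort()[::-1][:3]  # most important features
```

The same structure carries over to XGBoost almost unchanged: `XGBClassifier` is scikit-learn compatible, so it drops into `GridSearchCV` and exposes `feature_importances_` the same way.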