Just wrapped up a deep dive into core ML techniques using Python! In this pet project, I implemented and compared several foundational algorithms to understand their strengths, trade-offs, and real-world applicability:

* Dimensionality Reduction: PCA for linear feature compression, ICA to uncover independent sources, and t-SNE for powerful non-linear visualization
* Unsupervised Learning: DBSCAN for density-based clustering (great for identifying outliers!), Agglomerative Clustering for hierarchical grouping, and One-Class SVM for anomaly detection
* Supervised Learning: Support Vector Machine (SVM)

I evaluated each method on synthetic datasets, visualized the results, and summarized performance in a clear task-comparison table, making it easier to choose the right tool for the job. This exercise reinforced a key lesson: there's no "best" algorithm, only the best choice for your data and problem.

Check out the full notebook on Kaggle: https://www.kaggle.com/code/denisandrikov/7-ml-cases-and-one-dataset-with-overlay-problem

#MachineLearning #DataScience #Python #PCA #tSNE #Clustering #SVM #UnsupervisedLearning #AI #DataAnalysis #ML
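A minimal sketch of this kind of comparison, using scikit-learn on synthetic blobs (the dataset shape and all parameter values here are illustrative assumptions, not the notebook's actual settings):

```python
# Sketch: dimensionality reduction + density-based clustering on synthetic data.
# All parameter values are illustrative assumptions.
from sklearn.datasets import make_blobs
from sklearn.decomposition import PCA
from sklearn.manifold import TSNE
from sklearn.cluster import DBSCAN

X, _ = make_blobs(n_samples=300, n_features=10, centers=4, random_state=0)

# Linear compression: keep the top-2 principal components.
X_pca = PCA(n_components=2).fit_transform(X)

# Non-linear embedding for visualization only (not a reusable transform).
X_tsne = TSNE(n_components=2, random_state=0).fit_transform(X)

# Density-based clustering; points labeled -1 are treated as outliers.
labels = DBSCAN(eps=1.5, min_samples=5).fit_predict(X_pca)
print("clusters found:", len(set(labels) - {-1}), "| outliers:", (labels == -1).sum())
```

DBSCAN's `-1` labels give the outlier flagging mentioned above essentially for free, which is one of its practical advantages over centroid-based methods.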
#AI #ML #ScikitLearn #Post10 🚀 The Scikit-Learn Python library is essential for building and training machine learning models. Key features:
1. Provides a wide range of supervised learning algorithms (classification and regression).
2. Supports unsupervised learning such as clustering and dimensionality reduction.
3. Offers robust model evaluation tools such as cross-validation and metrics.
4. Includes preprocessing utilities for scaling, encoding, and imputing data.
5. Enables feature engineering with tools like PolynomialFeatures and PCA/NMF.
6. Supplies built-in datasets for quick testing and prototyping.
7. Allows creation of clean machine-learning workflows using Pipelines.
8. Supports hyperparameter tuning through GridSearchCV and RandomizedSearchCV.
9. Supports model persistence with joblib and pickle.
10. Built on optimized NumPy/SciPy for fast, efficient performance.
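A minimal sketch tying together points 4, 7, and 8 above: a Pipeline whose hyperparameters are tuned with GridSearchCV (the dataset and parameter grid are illustrative assumptions):

```python
# Sketch: preprocessing + model in one Pipeline, tuned with GridSearchCV.
from sklearn.datasets import load_iris
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC
from sklearn.model_selection import GridSearchCV

X, y = load_iris(return_X_y=True)

pipe = Pipeline([
    ("scale", StandardScaler()),  # preprocessing step (point 4)
    ("clf", SVC()),               # estimator step
])

# Grid keys follow the "<step_name>__<param>" convention.
grid = GridSearchCV(pipe, {"clf__C": [0.1, 1, 10], "clf__kernel": ["linear", "rbf"]}, cv=5)
grid.fit(X, y)
print(grid.best_params_, grid.best_score_)
```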
🤖 Experiment 8: Logistic Regression Algorithm

Delighted to share the completion of Experiment 8 from my Data Science and Statistics practical series, "Logistic Regression Algorithm." This experiment introduced me to the fundamentals of classification problems and how logistic regression is applied to predict categorical outcomes using statistical modeling.

Key learnings from this experiment:
🔹 Understanding the concept and working of logistic regression
🔹 Implementing the algorithm using Scikit-learn
🔹 Evaluating model accuracy and visualizing decision boundaries
🔹 Differentiating between regression and classification models

This experiment enhanced my understanding of supervised learning and how data-driven predictions can be used to make informed decisions in real-world applications.

🔗 Explore the complete notebook here: https://lnkd.in/eY_AynnY

#Python #LogisticRegression #MachineLearning #ScikitLearn #DataScience #AI #DataAnalytics #LearningByDoing #EngineeringJourney
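For readers following along, a minimal sketch of this workflow on synthetic data (the experiment's actual dataset and settings may differ):

```python
# Sketch: logistic regression for binary classification.
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score

X, y = make_classification(n_samples=500, n_features=4, random_state=42)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.25, random_state=42)

model = LogisticRegression(max_iter=1000).fit(X_train, y_train)

# Predicted probabilities come from the sigmoid of a linear score,
# which is what separates this from plain linear regression.
print("accuracy:", accuracy_score(y_test, model.predict(X_test)))
print("P(class=1) for first test row:", model.predict_proba(X_test[:1])[0, 1])
```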
🌳 Experiment 11: Decision Tree Algorithm using Python 🤖

In this lab, I explored the Decision Tree algorithm, one of the most intuitive and powerful techniques in supervised machine learning, used for both classification and regression.

🔍 Key learning outcomes:
• Understanding how decision trees split data using information gain and the Gini index
• Implementing decision trees using scikit-learn
• Visualizing tree structures for better interpretability
• Avoiding overfitting through pruning techniques
• Evaluating model performance and feature importance

This experiment enhanced my understanding of how decision trees form the foundation for ensemble methods like Random Forests and Gradient Boosting, making them crucial in real-world predictive modeling.

📁 Explore the repository here 👉 https://lnkd.in/epWys7e7

#DataScience #MachineLearning #Python #DecisionTree #ScikitLearn #Classification #PredictiveModeling #DataAnalysis #AI #LearningJourney #JupyterNotebook

Ashish Sawant sir
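A minimal sketch of the ideas above: a depth-limited tree (a simple pre-pruning control), Gini splits, and impurity-based feature importances (parameters are illustrative assumptions):

```python
# Sketch: a depth-limited decision tree with Gini splits and feature importances.
from sklearn.datasets import load_iris
from sklearn.tree import DecisionTreeClassifier, export_text

X, y = load_iris(return_X_y=True)

# max_depth acts as simple pre-pruning against overfitting;
# criterion="gini" is the impurity measure mentioned above (and the default).
tree = DecisionTreeClassifier(criterion="gini", max_depth=3, random_state=0).fit(X, y)

print(export_text(tree))          # human-readable split structure
print(tree.feature_importances_)  # impurity-based feature importance
```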
Just wrapped up the Linear Classifiers in Python course on DataCamp! 🎓🐍 It was short, practical, and super helpful for connecting math intuition with hands-on scikit-learn workflows.

What I practiced:
• Logistic Regression: decision boundaries, L1/L2 regularization, and GridSearchCV tuning
• Perceptron & Linear SVM: when to use each, margins, and the impact of C
• Pipelines: clean preprocessing ➜ model flow
• Metrics: beyond accuracy (precision/recall, F1, ROC-AUC, class imbalance awareness)
• Scaling & splits: StandardScaler + stratified splits for fair evaluation

Key takeaways:
• Regularization is your first line of defense against overfitting 💪
• The right metric can change the decision entirely
• Pipelines save time and reduce mistakes when moving to production

If you're building a solid ML foundation, mastering linear models before jumping to deep nets is a great move.

#DataCamp #MachineLearning #Python #ScikitLearn #LinearModels #LogisticRegression #SVM #AI #DataScience #LearningJourney
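A minimal sketch of the regularization and tuning workflow described above, assuming a built-in dataset rather than the course's own data:

```python
# Sketch: comparing L1 vs L2 regularization strength for a linear classifier.
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split, GridSearchCV
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.linear_model import LogisticRegression

X, y = load_breast_cancer(return_X_y=True)
# Stratified split keeps class proportions comparable across train/test.
X_tr, X_te, y_tr, y_te = train_test_split(X, y, stratify=y, random_state=0)

pipe = make_pipeline(StandardScaler(),
                     LogisticRegression(solver="liblinear", max_iter=5000))
params = {"logisticregression__penalty": ["l1", "l2"],
          "logisticregression__C": [0.01, 0.1, 1, 10]}  # smaller C = stronger regularization

grid = GridSearchCV(pipe, params, scoring="roc_auc", cv=5).fit(X_tr, y_tr)
print(grid.best_params_, "| test ROC-AUC:", grid.score(X_te, y_te))
```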
🚢 Titanic Survival Prediction Project

I built a machine learning model to predict passenger survival on the Titanic based on features like age, gender, class, and fare. The project involved data preprocessing, feature engineering, and training models such as Logistic Regression, Random Forest, and XGBoost. Achieved strong accuracy and gained valuable insights into the factors influencing survival rates.

🔹 Tools & Libraries: Python, Pandas, NumPy, Scikit-learn, Matplotlib, Seaborn
🔹 Techniques: Data Cleaning | Feature Selection | Model Evaluation

#MachineLearning #DataScience #Python #AI #TitanicDataset #Classification #Kaggle #InternshipProject #DataAnalytics #MLProject
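A minimal sketch of such a preprocessing-plus-model setup, assuming the standard Kaggle Titanic column names (the project's actual features, file paths, and models may differ):

```python
# Sketch: Titanic-style preprocessing + model, assuming the standard Kaggle schema.
import pandas as pd
from sklearn.compose import ColumnTransformer
from sklearn.impute import SimpleImputer
from sklearn.preprocessing import OneHotEncoder
from sklearn.pipeline import Pipeline
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_val_score

df = pd.read_csv("train.csv")  # assumed local copy of the Kaggle Titanic data
X = df[["Age", "Fare", "Pclass", "Sex"]]
y = df["Survived"]

pre = ColumnTransformer([
    ("num", SimpleImputer(strategy="median"), ["Age", "Fare"]),       # fill missing ages/fares
    ("cat", OneHotEncoder(handle_unknown="ignore"), ["Pclass", "Sex"]),
])

model = Pipeline([("pre", pre), ("rf", RandomForestClassifier(random_state=0))])
print("CV accuracy:", cross_val_score(model, X, y, cv=5).mean())
```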
📘 #100DaysOfML – Day 02

Today's focus was on Encoding Categorical Data, an essential step before feeding data into any machine learning model.

🔹 Explored Ordinal Encoding to convert text features like "category" into numerical values while keeping the order.
🔹 Learned about One-Hot Encoding: how it represents categories as binary vectors and why it's better when the feature has no natural order.
🔹 Also learned that One-Hot Encoding often returns a SciPy sparse matrix, which saves memory for large datasets!

🧠 Concepts are getting clearer step by step; can't wait to move on to Feature Scaling & Transformation next!

#MachineLearning #100DaysOfML #DataPreprocessing #AI #Python #LearningJourney
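A minimal sketch of both encoders (the category names are made up for illustration):

```python
# Sketch: ordinal vs one-hot encoding; categories are illustrative.
import pandas as pd
from sklearn.preprocessing import OrdinalEncoder, OneHotEncoder

df = pd.DataFrame({"size": ["small", "large", "medium", "small"]})

# Ordinal: an explicit category order is preserved as 0, 1, 2.
ord_enc = OrdinalEncoder(categories=[["small", "medium", "large"]])
print(ord_enc.fit_transform(df[["size"]]).ravel())  # [0. 2. 1. 0.]

# One-hot: each category becomes a binary column; no order is implied.
oh_enc = OneHotEncoder()  # returns a SciPy sparse matrix by default
sparse_out = oh_enc.fit_transform(df[["size"]])
print(type(sparse_out), sparse_out.toarray())
```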
🚀 Unlock Machine Learning with Python & Scikit-Learn! 🐍🤖

Scikit-Learn makes ML simple, fast, and powerful:
🔹 Load & Preprocess Data – standardize, normalize, encode, impute missing values
🔹 Supervised Learning – Linear Regression, KNN, SVM, Naive Bayes
🔹 Unsupervised Learning – K-Means, PCA
🔹 Model Tuning – Grid Search, Randomized Search
🔹 Evaluate Performance – Accuracy, Confusion Matrix, Classification Report, MAE, MSE, R²

💡 Pro Tip: Keep your workflow clean (preprocess, train, tune, evaluate). Scikit-Learn provides a unified interface for every step of your ML journey!

#Python #ScikitLearn #MachineLearning #DataScience #AI #ML #DataAnalytics
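A minimal sketch of the evaluation step on the classification side (dataset and model choices are illustrative assumptions):

```python
# Sketch: evaluating a classifier with a confusion matrix and per-class metrics.
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import GaussianNB
from sklearn.metrics import confusion_matrix, classification_report

X, y = make_classification(n_samples=400, random_state=1)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=1)

y_pred = GaussianNB().fit(X_tr, y_tr).predict(X_te)
print(confusion_matrix(y_te, y_pred))
print(classification_report(y_te, y_pred))  # precision, recall, F1 per class
```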
Random Forest is one of the most powerful and widely used algorithms in machine learning. It combines the predictions of multiple decision trees to improve accuracy, reduce overfitting, and handle large, complex datasets with ease.

✨ Key Highlights:
• Uses bootstrap sampling and ensemble classification
• Reduces variance and improves robustness
• Works great for both classification and regression tasks
• Handles missing values and noisy data effectively
• Implemented using Python (Scikit-Learn, TensorFlow)

💡 Why It's Special: each decision tree "votes," and the majority wins; this collective wisdom leads to more stable and accurate predictions! 🌲🌲🌲

📊 Applications:
✅ Disease prediction
✅ Stock market analysis
✅ Fraud detection
✅ Recommendation systems

👨💻 Team Members: M Arun Kumar Reddy | B Tharun Sujith | B Venkata Anil Kumar | A Pooja Samanvitha | P Venu Gopala Krishna

#MachineLearning #RandomForest #AI #DataScience #EnsembleLearning #Python #ScikitLearn #TensorFlow #MLProject
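A minimal sketch of the variance-reduction claim: the same synthetic data scored with one tree versus a voting forest (all settings are illustrative assumptions):

```python
# Sketch: single tree vs forest, illustrating the ensemble effect described above.
from sklearn.datasets import make_classification
from sklearn.model_selection import cross_val_score
from sklearn.tree import DecisionTreeClassifier
from sklearn.ensemble import RandomForestClassifier

X, y = make_classification(n_samples=600, n_features=20, n_informative=8, random_state=0)

# Each forest tree is trained on a bootstrap sample of the rows
# (plus a random feature subset per split); the trees then vote.
tree = DecisionTreeClassifier(random_state=0)
forest = RandomForestClassifier(n_estimators=200, random_state=0)

print("single tree CV:", cross_val_score(tree, X, y, cv=5).mean())
print("random forest CV:", cross_val_score(forest, X, y, cv=5).mean())
```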
Working with Numbers and AI 😊

Many people think working with numbers is difficult, but I've always enjoyed it. What fascinates me even more is realizing that behind the ease artificial intelligence brings us, everything is ultimately reduced to numbers. Text, images, audio, and even human behaviour are all converted into numerical representations before a machine can understand them.

It's a timely reminder of how powerful structured data can be, and why strengthening our analytical skills is more critical than ever. The more fluent we become with numbers, the better we can understand and shape the systems around us.

#DataScience #Analytics #MachineLearning #ArtificialIntelligence #Python #DataDriven
Excited to share my recent project comparing five popular classification algorithms: Logistic Regression, KNN, SVM, Decision Tree, and Random Forest! Through this experiment, I learned how different models handle data patterns and where each shines in terms of accuracy and performance.

Ashish Sawant
https://lnkd.in/eYH54psE

#MachineLearning #DataScience #AI #Python #MLProject #Classification
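A minimal sketch of how such a comparison can be run, assuming a built-in dataset and default hyperparameters rather than the project's actual setup:

```python
# Sketch: same data, five models, one cross-validation loop.
from sklearn.datasets import load_wine
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.linear_model import LogisticRegression
from sklearn.neighbors import KNeighborsClassifier
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier
from sklearn.ensemble import RandomForestClassifier

X, y = load_wine(return_X_y=True)

models = {
    "Logistic Regression": LogisticRegression(max_iter=5000),
    "KNN": KNeighborsClassifier(),
    "SVM": SVC(),
    "Decision Tree": DecisionTreeClassifier(random_state=0),
    "Random Forest": RandomForestClassifier(random_state=0),
}

for name, clf in models.items():
    # Scaling matters for KNN/SVM/LogReg; trees are scale-invariant but unharmed.
    score = cross_val_score(make_pipeline(StandardScaler(), clf), X, y, cv=5).mean()
    print(f"{name}: {score:.3f}")
```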