Creepypasta Data Analysis with Machine Learning

Project Title: Creepypasta Data Analysis & Prediction 🚀 I'm excited to share my latest Machine Learning project! In this project, I performed data cleaning and applied ML algorithms to analyze Creepypasta data. Key Highlights: Data Cleaning: Processed messy data into a structured format. Models Used: Explored algorithms like Linear Regression and Random Forest. Tools: Python, Google Colab, and Pandas. You can check out the full code and dataset on my GitHub here: [https://lnkd.in/dFASwtJh] #MachineLearning #DataScience #Python #GitHub #MLProjects

1 Comment

Beenish . 1w

Nayab Bano Adeel Ahmed Satti

To view or add a comment, sign in

More Relevant Posts

Sinchana B S
4w
Report this post
🚀 Day 54 of My 90-Day Data Science Challenge Today I worked on Loss Functions in Machine Learning. 📊 Business Question: How do we measure how wrong a model’s predictions are? Loss functions calculate the difference between actual and predicted values. Using Python concepts: • Learned Mean Squared Error (MSE) • Understood Mean Absolute Error (MAE) • Explored Log Loss (Binary Cross-Entropy) • Compared regression vs classification loss • Understood impact on model training 📈 Key Understanding: Loss functions guide the model to improve by minimizing error. 💡 Insight: Choosing the right loss function is crucial for correct model learning. 🎯 Takeaway: Better loss function → better learning → better predictions. Day 54 complete ✅ Understanding model errors 🚀 #DataScience #MachineLearning #DeepLearning #LossFunction #Python #LearningInPublic #90DaysChallenge
Like Comment
To view or add a comment, sign in
Shreya Borate
3w Edited
Report this post
Day 43 of #100DaysOfDataScience 📊 Built Student Performance Analysis using K-Means Clustering 🎓 ✔️ Data preprocessing & feature selection ✔️ Feature scaling ✔️ Applied K-Means clustering ✔️ Identified clusters: Struggling, Average & Top Performers ✔️ Cluster analysis & insights #DataScience #MachineLearning #Python #KMeans #AI GitHub : https://lnkd.in/dBG3wGAE
Like Comment
To view or add a comment, sign in
Osita Jerry
3w
Report this post
🚀 Learning update: Visualization Today felt different, less about models, more about seeing the data. 🌳 Hierarchical Clustering Instead of forcing k clusters, it builds a tree of clusters 🔍 How It Works - Start with each point as its own cluster - Merge closest clusters step by step - End with one big cluster 📊 Dendrogram A tree-like diagram that shows: - How clusters are formed - Distance between clusters We can “cut” the tree at any level to get clusters. 🗺️ t-SNE (Visualization) This one blew my mind a bit. t-SNE converts high-dimensional data into 2D or 3D so we can see it. ⚠️ Important Insight - Points close together → similar - Clusters matter - Axes don’t mean anything 💡 My Takeaway Some tools are not for prediction, they’re for understanding and explaining data. And honestly, this is where things start to feel visual and intuitive. #DataVisualization #MachineLearning #DataScience #Python #DataCamp #DataCampAfrica
Like Comment
To view or add a comment, sign in
Statistics Globe

14,942 followers
1w
Report this post
The Statistics Globe Hub is moving forward quickly and is about to enter its third month, with new content released each week. Access to the April modules is only available to those who join this month. If you are interested in these modules, you have seven days left to register until April 30. If you sign up by April 30, you will receive immediate access to all modules released in April. After April 30, these modules will no longer be available to new members. The April modules include: 🔹 Draw Synthetic Datasets with drawdata in Python 🔹 Monte Carlo Simulation 🔹 AI-Assisted Coding with gander in R 🔹 Animated Visualization with magick in R The visualization below shows some of the topics and graphs covered this month. More information about the Statistics Globe Hub: https://lnkd.in/e5YB7k4d #Statistics #DataScience #AI #RStats #Python #MachineLearning #DataVisualization #StatisticsGlobeHub
Like Comment
To view or add a comment, sign in
Harivansh Dogne
4w
Report this post
🚀 Hands-on Machine Learning Project: Decision Tree Classifier Recently, I worked on a small but insightful project where I implemented a Decision Tree Classifier using Python and Scikit-learn. 📊 What I did: Created a structured dataset with features like Age, Salary, and Experience Applied data preprocessing techniques Built and trained a Decision Tree model Evaluated performance using Confusion Matrix & Classification Report Visualized patterns using Seaborn 📈 Key Learnings: How Decision Trees split data based on feature importance Importance of handling data properly before modeling Understanding evaluation metrics like precision, recall, and F1-score 💡 This project helped me strengthen my fundamentals in machine learning and model evaluation. 🔗 I’ll be sharing the GitHub repository soon! #MachineLearning #DataScience #Python #ScikitLearn #DecisionTree #DataAnalytics #LearningJourney

2 Comments
Like Comment
To view or add a comment, sign in
Yash Kumar Choudhary
1mo
Report this post
🚀 Recently, I explored the powerful NumPy library as a part of my Data Science journey. Starting with understanding the origin and need of NumPy, I learned why it is widely used for numerical computations and how it overcomes the limitations of traditional Python lists. Here’s what I covered: 🔹 Difference between NumPy arrays and Python lists 🔹 Creation of 1D and 2D arrays 🔹 Various array generation functions 🔹 Random array generation techniques 🔹 Understanding array attributes 🔹 Working with useful array methods 🔹 Reshaping and resizing arrays 🔹 Indexing and slicing of vectors 🔹 Boolean indexing 🔹 Performing array operations 🔹 Concept of deep copy vs shallow copy 🔹 Basics of matrix operations 🔹 Advanced array manipulations like vstack, hstack, and column_stack This learning has strengthened my foundation in handling data efficiently and performing fast computations, which is a crucial step in my journey towards Data Science. Looking forward to exploring more libraries and building exciting projects ahead! 💡 #NumPy #Python #DataScience #LearningJourney #Programming #AI #MachineLearning
Like Comment
To view or add a comment, sign in
Muzamil Naik
4w
Report this post
🌸 Iris Flower Classification — End-to-End ML Project Completed an end-to-end machine learning project focused on classifying iris flower species using data analysis and modeling techniques. 🔹 Key Highlights: Performed exploratory data analysis to understand dataset structure and quality Visualized feature relationships to identify important patterns Observed that petal length and petal width are key features for classification Built a Logistic Regression model for multi-class classification 🔹 Results: Achieved 100% accuracy on test data Precision, recall, and F1-score all indicate perfect performance Confusion matrix confirmed zero misclassifications 🔹 Key Takeaways: Data understanding and visualization play a crucial role in model performance Clean and well-separated datasets can lead to highly accurate models Proper evaluation is essential to validate model performance GitHub: https://lnkd.in/gTwJEjVa 📊 Tools Used: Python, Pandas, Seaborn, Scikit-learn #datascience #machinelearning #dataanalysis #python #analytics
Like Comment
To view or add a comment, sign in
Joachim Schork
1w
Report this post
The Statistics Globe Hub is moving forward quickly and is about to enter its third month, with new content released each week. Access to the April modules is only available to those who join this month. If you are interested in these modules, you have seven days left to register until April 30. If you sign up by April 30, you will receive immediate access to all modules released in April. After April 30, these modules will no longer be available to new members. The April modules include: 🔹 Draw Synthetic Datasets with drawdata in Python 🔹 Monte Carlo Simulation 🔹 AI-Assisted Coding with gander in R 🔹 Animated Visualization with magick in R The visualization below shows some of the topics and graphs covered this month. More information about the Statistics Globe Hub: https://lnkd.in/exBRgHh2 #Statistics #DataScience #AI #RStats #Python #MachineLearning #DataVisualization #StatisticsGlobeHub
Like Comment
To view or add a comment, sign in
Aryav Khadka
3w
Report this post
📊 Day 7 of My Data Science Journey Today I explored techniques used to understand relationships between variables in a dataset. Topics covered: • Scatter plots for visualizing relationships between variables • Correlation analysis to measure how features are related • Correlation heatmaps to visualize feature relationships across the dataset Learning how to identify patterns and relationships in data is an important step before building machine learning models. Continuing to strengthen my data analysis and visualization skills. #DataScience #Python #DataVisualization #Seaborn #MachineLearning #LearningJourney
Like Comment
To view or add a comment, sign in

7 followers

3 Posts

View Profile Connect

Creepypasta Data Analysis with Machine Learning

More Relevant Posts

Explore related topics

Explore content categories