Machine Learning
39.4K subscribers
4.36K photos
40 videos
50 files
1.42K links
Real Machine Learning — simple, practical, and built on experience.
Learn step by step with clear explanations and working code.

Admin: @HusseinSheikho || @Hussein_Sheikho
Download Telegram
Cheat Sheet for Data Cleaning in Python Pandas

1. Data Overview

import pandas as pd
df = pd.read_csv('data.csv')   # Load CSV
df.head(5)                     # First 5 rows
df.info()                      # Data types and null counts
df.describe()                  # Summary statistics


2. Handling Missing Values and Duplicates

df.isnull().sum()              # Count missing values
df.dropna()                    # Remove rows with nulls
df.fillna(0)                   # Fill missing values with zeros
df.duplicated().sum()          # Count duplicates
df.drop_duplicates()           # Remove duplicates


3. Renaming and Type Conversion

df.rename(columns={'old': 'new'})   # Rename column
df.astype({'col': 'int'})           # Change data type
df.columns.str.strip()              # Remove spaces


4. Filtering and Selection

df.loc[df['col'] > 100]             # Filter by condition
df.iloc[0:5]                        # Select by row index
df['col'].isin(['A', 'B'])          # Filter by values


5. Sorting and Grouping

df.sort_values('col')               # Sort ascending
df.groupby('group')['val'].mean()   # Group and aggregate
df['col'].value_counts()            # Count unique values


6. Column Operations

df['new'] = df['col1'] + df['col2']               # New column
df['col'] = df['col'].apply(lambda x: x*2)        # Apply function
df.drop('col', axis=1)                            # Drop column


7. Saving Cleaned Data

df.to_csv('cleaned.csv', index=False)  # Export to CSV
6
🔥 Trending Repository: free-programming-books

📝 Description: 📚 Freely available programming books

🔗 Repository URL: https://github.com/EbookFoundation/free-programming-books

🌐 Website: https://ebookfoundation.github.io/free-programming-books/

📖 Readme: https://github.com/EbookFoundation/free-programming-books#readme

📊 Statistics:
🌟 Stars: 366K stars
👀 Watchers:
🍴 Forks: 63.9K forks

💻 Programming Languages: Python - HTML

🏷️ Related Topics:
#education #list #books #resource #hacktoberfest


==================================
🧠 By: https://t.me/DataScienceM
🔥 Trending Repository: midday

📝 Description: Invoicing, Time tracking, File reconciliation, Storage, Financial Overview & your own Assistant made for Freelancers

🔗 Repository URL: https://github.com/midday-ai/midday

🌐 Website: https://midday.ai

📖 Readme: https://github.com/midday-ai/midday#readme

📊 Statistics:
🌟 Stars: 11.1K stars
👀 Watchers: 68
🍴 Forks: 1K forks

💻 Programming Languages: TypeScript - MDX - Rust - CSS - JavaScript - Dockerfile

🏷️ Related Topics:
#finance #typescript #nextjs #tailwind #supabase


==================================
🧠 By: https://t.me/DataScienceM
🔥1
🔥 Trending Repository: yt-dlp

📝 Description: A feature-rich command-line audio/video downloader

🔗 Repository URL: https://github.com/yt-dlp/yt-dlp

🌐 Website: https://discord.gg/H5MNcFW63r

📖 Readme: https://github.com/yt-dlp/yt-dlp#readme

📊 Statistics:
🌟 Stars: 124K stars
👀 Watchers: 640
🍴 Forks: 9.9K forks

💻 Programming Languages: Python

🏷️ Related Topics:
#python #cli #downloader #youtube_dl #youtube_downloader #sponsorblock #yt_dlp


==================================
🧠 By: https://t.me/DataScienceM
🔥 Trending Repository: django

📝 Description: The Web framework for perfectionists with deadlines.

🔗 Repository URL: https://github.com/django/django

🌐 Website: https://www.djangoproject.com/

📖 Readme: https://github.com/django/django#readme

📊 Statistics:
🌟 Stars: 84.7K stars
👀 Watchers: 2.3k
🍴 Forks: 32.8K forks

💻 Programming Languages: Python - Jinja

🏷️ Related Topics:
#python #django #views #framework #orm #web #apps #templates #models


==================================
🧠 By: https://t.me/DataScienceM
🔥 Trending Repository: ComfyUI-nunchaku

📝 Description: ComfyUI Plugin of Nunchaku

🔗 Repository URL: https://github.com/nunchaku-tech/ComfyUI-nunchaku

🌐 Website: https://nunchaku.tech/docs/ComfyUI-nunchaku/

📖 Readme: https://github.com/nunchaku-tech/ComfyUI-nunchaku#readme

📊 Statistics:
🌟 Stars: 1.9K stars
👀 Watchers: 21
🍴 Forks: 68 forks

💻 Programming Languages: Python

🏷️ Related Topics:
#flux #quantization #diffusion #mlsys #comfyui #genai


==================================
🧠 By: https://t.me/DataScienceM
🔥 Trending Repository: klipper

📝 Description: Klipper is a 3d-printer firmware

🔗 Repository URL: https://github.com/Klipper3d/klipper

📖 Readme: https://github.com/Klipper3d/klipper#readme

📊 Statistics:
🌟 Stars: 10.7K stars
👀 Watchers: 368
🍴 Forks: 5.6K forks

💻 Programming Languages: C - Python - Jupyter Notebook - C++ - Assembly - Makefile

🏷️ Related Topics: Not available

==================================
🧠 By: https://t.me/DataScienceM
1
🔥 Trending Repository: Java

📝 Description: All Algorithms implemented in Java

🔗 Repository URL: https://github.com/TheAlgorithms/Java

📖 Readme: https://github.com/TheAlgorithms/Java#readme

📊 Statistics:
🌟 Stars: 62.8K stars
👀 Watchers: 2.2k
🍴 Forks: 20.2K forks

💻 Programming Languages: Java - Dockerfile

🏷️ Related Topics:
#search #java #algorithm #algorithms #sort #data_structures #sorting_algorithms #algorithm_challenges #hacktoberfest #algorithms_datastructures


==================================
🧠 By: https://t.me/DataScienceM
🔥 Trending Repository: Folo

📝 Description: 🧡 Follow everything in one place

🔗 Repository URL: https://github.com/RSSNext/Folo

🌐 Website: https://folo.is

📖 Readme: https://github.com/RSSNext/Folo#readme

📊 Statistics:
🌟 Stars: 32K stars
👀 Watchers: 119
🍴 Forks: 1.5K forks

💻 Programming Languages: TypeScript - Swift - JavaScript - CSS - Kotlin - HTML

🏷️ Related Topics:
#rss #ai #rss_reader #blockchain #reader


==================================
🧠 By: https://t.me/DataScienceM
🔥 Trending Repository: ai-agents-for-beginners

📝 Description: 11 Lessons to Get Started Building AI Agents

🔗 Repository URL: https://github.com/microsoft/ai-agents-for-beginners

🌐 Website: https://microsoft.github.io/ai-agents-for-beginners/

📖 Readme: https://github.com/microsoft/ai-agents-for-beginners#readme

📊 Statistics:
🌟 Stars: 34.9K stars
👀 Watchers: 328
🍴 Forks: 11.1K forks

💻 Programming Languages: Jupyter Notebook - Python - Dockerfile

🏷️ Related Topics:
#ai_agents #autogen #generative_ai #semantic_kernel #ai_agents_framework #agentic_framework #agentic_rag #agentic_ai


==================================
🧠 By: https://t.me/DataScienceM
1
🔥 Trending Repository: verifiers

📝 Description: Verifiers for LLM Reinforcement Learning

🔗 Repository URL: https://github.com/willccbb/verifiers

📖 Readme: https://github.com/willccbb/verifiers#readme

📊 Statistics:
🌟 Stars: 2.1K stars
👀 Watchers: 26
🍴 Forks: 267 forks

💻 Programming Languages: Python

🏷️ Related Topics: Not available

==================================
🧠 By: https://t.me/DataScienceM
🔥 Trending Repository: spotify-downloader

📝 Description: Download your Spotify playlists and songs along with album art and metadata (from YouTube if a match is found).

🔗 Repository URL: https://github.com/spotDL/spotify-downloader

🌐 Website: https://spotdl.readthedocs.io/en/latest/

📖 Readme: https://github.com/spotDL/spotify-downloader#readme

📊 Statistics:
🌟 Stars: 21.3K stars
👀 Watchers: 197
🍴 Forks: 1.9K forks

💻 Programming Languages: Python

🏷️ Related Topics:
#python #music #playlists #spotify #mp3 #song #hacktoberfest #youtube_music #song_lyrics #download_music #spotdl #spotdl_cli


==================================
🧠 By: https://t.me/DataScienceM
🔥 Trending Repository: github-readme-stats

📝 Description: Dynamically generated stats for your github readmes

🔗 Repository URL: https://github.com/anuraghazra/github-readme-stats

🌐 Website: https://github-readme-stats.vercel.app

📖 Readme: https://github.com/anuraghazra/github-readme-stats#readme

📊 Statistics:
🌟 Stars: 75.3K stars
👀 Watchers: 291
🍴 Forks: 25.7K forks

💻 Programming Languages: JavaScript - Shell

🏷️ Related Topics:
#serverless #dynamic #hacktoberfest #readme_generator #profile_readme #readme_stats


==================================
🧠 By: https://t.me/DataScienceM
🔥 Trending Repository: nn-zero-to-hero

📝 Description: Neural Networks: Zero to Hero

🔗 Repository URL: https://github.com/karpathy/nn-zero-to-hero

📖 Readme: https://github.com/karpathy/nn-zero-to-hero#readme

📊 Statistics:
🌟 Stars: 15.5K stars
👀 Watchers: 319
🍴 Forks: 2.2K forks

💻 Programming Languages: Jupyter Notebook

🏷️ Related Topics: Not available

==================================
🧠 By: https://t.me/DataScienceM
🔥 Trending Repository: system_prompts_leaks

📝 Description: Collection of extracted System Prompts from popular chatbots like ChatGPT, Claude & Gemini

🔗 Repository URL: https://github.com/asgeirtj/system_prompts_leaks

📖 Readme: https://github.com/asgeirtj/system_prompts_leaks#readme

📊 Statistics:
🌟 Stars: 9.2K stars
👀 Watchers: 200
🍴 Forks: 2K forks

💻 Programming Languages: JavaScript

🏷️ Related Topics:
#ai #gemini #openai #chatbots #prompts #claude #google_deepmind #large_language_models #llm #prompt_engineering #generative_ai #chatgpt #prompt_injection #anthropic


==================================
🧠 By: https://t.me/DataScienceM
1
🔥 Trending Repository: generative-ai-for-beginners

📝 Description: 21 Lessons, Get Started Building with Generative AI

🔗 Repository URL: https://github.com/microsoft/generative-ai-for-beginners

📖 Readme: https://github.com/microsoft/generative-ai-for-beginners#readme

📊 Statistics:
🌟 Stars: 95.7K stars
👀 Watchers: 827
🍴 Forks: 50.1K forks

💻 Programming Languages: Jupyter Notebook - Python - JavaScript - TypeScript - Shell - PowerShell

🏷️ Related Topics:
#ai #azure #transformers #openai #gpt #language_model #semantic_search #dall_e #prompt_engineering #llms #generative_ai #generativeai #chatgpt #microsoft_for_beginners


==================================
🧠 By: https://t.me/DataScienceM
🔥 Trending Repository: awesome-tuis

📝 Description: List of projects that provide terminal user interfaces

🔗 Repository URL: https://github.com/rothgar/awesome-tuis

📖 Readme: https://github.com/rothgar/awesome-tuis#readme

📊 Statistics:
🌟 Stars: 11.1K stars
👀 Watchers: 155
🍴 Forks: 430 forks

💻 Programming Languages: Not available

🏷️ Related Topics: Not available

==================================
🧠 By: https://t.me/DataScienceM
Meet BLIP: The Vision-Language Model Powering Image Captioning

📖 Table of Contents Meet BLIP: The Vision-Language Model Powering Image Captioning What Is Image Captioning and Why Is It Challenging? Why It’s Challenging Why Traditional Vision Tasks Aren’t Enough Configuring Your Development Environment A Brief History of Image Captioning Models…...

🏷️ #ComputerVision #DeepLearning #ImageCaptioning #MultimodalAI #Tutorial
1
🔥 Trending Repository: SurfSense

📝 Description: Open Source Alternative to NotebookLM / Perplexity, connected to external sources such as Search Engines, Slack, Linear, Jira, ClickUp, Confluence, Notion, YouTube, GitHub, Discord and more. Join our discord:https://discord.gg/ejRNvftDp9

🔗 Repository URL: https://github.com/MODSetter/SurfSense

🌐 Website: https://www.surfsense.net

📖 Readme: https://github.com/MODSetter/SurfSense#readme

📊 Statistics:
🌟 Stars: 6.7K stars
👀 Watchers: 46
🍴 Forks: 507 forks

💻 Programming Languages: Python - TypeScript - MDX - CSS - JavaScript - Dockerfile

🏷️ Related Topics:
#python #chrome_extension #slack #agent #jira #typescript #extension #ai #nextjs #agents #notion #perplexity #rag #fastapi #langchain #ollama #langgraph #nextjs15 #aceternity_ui #notebooklm


==================================
🧠 By: https://t.me/DataScienceM
🔥 Trending Repository: Seelen-UI

📝 Description: The Fully Customizable Desktop Environment for Windows 10/11.

🔗 Repository URL: https://github.com/eythaann/Seelen-UI

📖 Readme: https://github.com/eythaann/Seelen-UI#readme

📊 Statistics:
🌟 Stars: 10.7K stars
👀 Watchers: 45
🍴 Forks: 327 forks

💻 Programming Languages: Rust - TypeScript - CSS - NSIS - SCSS - JavaScript

🏷️ Related Topics:
#windows #wallpaper #web #tools #toolbar #finder #desktop_application #dock #taskbar #tiling_window_manager #multilanguage #topbar #tauri #web_technologies #app_launcher #windows_11 #tauri_app #seelen_ui #seelen


==================================
🧠 By: https://t.me/DataScienceM