Python | Machine Learning | Coding | R
62.7K subscribers
1.13K photos
68 videos
143 files
788 links
List of our channels:
https://t.me/addlist/8_rRW2scgfRhOTc0

Discover powerful insights with Python, Machine Learning, Coding, and R—your essential toolkit for data-driven solutions, smart alg

Help and ads: @hussein_sheikho

https://telega.io/?r=nikapsOH
Download Telegram
🐼 20 of the most used Pandas + PDF functions

👨🏻‍💻 The first time I used Pandas, I was supposed to quickly clean and organize a raw and complex dataset with the help of Pandas functions. Using the groupby function, I was able to categorize the data and get in-depth analysis of customer behavior. Best of all, it was when I used loc and iloc that I could easily filter the data.

✔️ Since then I decided to prepare a list of the most used Pandas functions that I use on a daily basis. Now this list is ready! In the following, I will introduce 20 of the best and most used Pandas functions:



🏳️‍🌈 read_csv(): Fast data upload from CSV files

🏳️‍🌈 head(): look at the first five rows of the database to start..

🏳️‍🌈 info(): Checking data structure such as data type and empty values.

🏳️‍🌈 describe(): Generate descriptive statistics for numeric columns.

🏳️‍🌈 loc[ ]: accesses rows and columns by label or condition.

🏳️‍🌈 iloc[ ]: Access data by row number.

🏳️‍🌈 merge(): Merge dataframes with common columns.

🏳️‍🌈 groupby(): Grouping for easier analysis.

🏳️‍🌈 pivot_table(): Summarize data in pivot table format.

🏳️‍🌈 to_csv(): Save data as a CSV file.

🏳️‍🌈 pd.concat(): Concatenate multiple dataframes in rows or columns.

🏳️‍🌈 pd.melt(): Convert wide format data to long format.

🏳️‍🌈 pd.pivot_table(): Create a pivot table with multiple levels.

🏳️‍🌈 pd.cut(): Split the data into specific intervals.

🏳️‍🌈 pd.qcut(): Sort data by percentage.

🏳️‍🌈 pd.merge(): Merge data in database style for advanced linking.

🏳️‍🌈 DataFrame.apply(): Apply a custom function to the data.

🏳️‍🌈 DataFrame.groupby(): Analyze grouped data.

🏳️‍🌈 DataFrame.drop_duplicates(): Drop duplicate rows.

🏳️‍🌈 DataFrame.to_excel(): Save data directly to Excel file.


🐼 Pandas Functions
📄 PDF

#MachineLearning #DeepLearning #BigData #Datascience #ML #Pandas #DataVisualization #ArtificialInteligence #SoftwareEngineering #GenAI #deeplearning #ChatGPT #OpenAI #python #AI #keras #SQL #Statistics #LLMs #AIagents

http://t.me/codeprogrammer ⭐️
Please open Telegram to view this post
VIEW IN TELEGRAM
👍262
The Big Book of Large Language Models by Damien Benveniste

Chapters:
1⃣ Introduction

🔢 Language Models Before Transformers

🔢 Attention Is All You Need: The Original Transformer Architecture

🔢 A More Modern Approach To The Transformer Architecture

🔢 Multi-modal Large Language Models

🔢 Transformers Beyond Language Models

🔢 Non-Transformer Language Models

🔢 How LLMs Generate Text

🔢 From Words To Tokens

1⃣0⃣ Training LLMs to Follow Instructions

1⃣1⃣ Scaling Model Training

1⃣🔢 Fine-Tuning LLMs

1⃣🔢 Deploying LLMs

Read it: https://book.theaiedge.io/

#ArtificialIntelligence #AI #MachineLearning #LargeLanguageModels #LLMs #DeepLearning #NLP #NaturalLanguageProcessing #AIResearch #TechBooks #AIApplications #DataScience #FutureOfAI #AIEducation #LearnAI #TechInnovation #AIethics #GPT #BERT #T5 #AIBook #AIEnthusiast

https://t.me/CodeProgrammer
Please open Telegram to view this post
VIEW IN TELEGRAM
👍174👎1
🔰 How to become a data scientist in 2025?

👨🏻‍💻 If you want to become a data science professional, follow this path! I've prepared a complete roadmap with the best free resources where you can learn the essential skills in this field.


🔢 Step 1: Strengthen your math and statistics!

✏️ The foundation of learning data science is mathematics, linear algebra, statistics, and probability. Topics you should master:

Linear algebra: matrices, vectors, eigenvalues.

🔗 Course: MIT 18.06 Linear Algebra


Calculus: derivative, integral, optimization.

🔗 Course: MIT Single Variable Calculus


Statistics and probability: Bayes' theorem, hypothesis testing.

🔗 Course: Statistics 110



🔢 Step 2: Learn to code.

✏️ Learn Python and become proficient in coding. The most important topics you need to master are:

Python: Pandas, NumPy, Matplotlib libraries

🔗 Course: FreeCodeCamp Python Course

SQL language: Join commands, Window functions, query optimization.

🔗 Course: Stanford SQL Course

Data structures and algorithms: arrays, linked lists, trees.

🔗 Course: MIT Introduction to Algorithms



🔢 Step 3: Clean and visualize data

✏️ Learn how to process and clean data and then create an engaging story from it!

Data cleaning: Working with missing values ​​and detecting outliers.

🔗 Course: Data Cleaning

Data visualization: Matplotlib, Seaborn, Tableau

🔗 Course: Data Visualization Tutorial



🔢 Step 4: Learn Machine Learning

✏️ It's time to enter the exciting world of machine learning! You should know these topics:

Supervised learning: regression, classification.

Unsupervised learning: clustering, PCA, anomaly detection.

Deep learning: neural networks, CNN, RNN


🔗 Course: CS229: Machine Learning



🔢 Step 5: Working with Big Data and Cloud Technologies

✏️ If you're going to work in the real world, you need to know how to work with Big Data and cloud computing.

Big Data Tools: Hadoop, Spark, Dask

Cloud platforms: AWS, GCP, Azure

🔗 Course: Data Engineering



🔢 Step 6: Do real projects!

✏️ Enough theory, it's time to get coding! Do real projects and build a strong portfolio.

Kaggle competitions: solving real-world challenges.

End-to-End projects: data collection, modeling, implementation.

GitHub: Publish your projects on GitHub.

🔗 Platform: Kaggle🔗 Platform: ods.ai



🔢 Step 7: Learn MLOps and deploy models

✏️ Machine learning is not just about building a model! You need to learn how to deploy and monitor a model.

MLOps training: model versioning, monitoring, model retraining.

Deployment models: Flask, FastAPI, Docker

🔗 Course: Stanford MLOps Course



🔢 Step 8: Stay up to date and network

✏️ Data science is changing every day, so it is necessary to update yourself every day and stay in regular contact with experienced people and experts in this field.

Read scientific articles: arXiv, Google Scholar

Connect with the data community:

🔗 Site: Papers with code
🔗 Site: AI Research at Google


#ArtificialIntelligence #AI #MachineLearning #LargeLanguageModels #LLMs #DeepLearning #NLP #NaturalLanguageProcessing #AIResearch #TechBooks #AIApplications #DataScience #FutureOfAI #AIEducation #LearnAI #TechInnovation #AIethics #GPT #BERT #T5 #AIBook #AIEnthusiast

https://t.me/CodeProgrammer
Please open Telegram to view this post
VIEW IN TELEGRAM
👍3515👏1
10 Must-Know Python Libraries for LLMs in 2025

Large language models (LLMs) are changing the way we think about AI. They help with #chatbots, text generation, and search tools, among other natural language processing tasks and beyond. To work with #LLMs, you need the right #Python libraries.

In this article, we explore 10 of the Python libraries every developer should know in 2025.

Read and learn:
https://machinelearningmastery.com/10-must-know-python-libraries-for-llms-in-2025/

https://t.me/CodeProgrammer
Please open Telegram to view this post
VIEW IN TELEGRAM
👍7🔥2
Anyone trying to deeply understand Large Language Models.

Checkout
Foundations of Large Language Models


by Tong Xiao & Jingbo Zhu. It’s one of the clearest, most comprehensive resource.

⭐️ Paper Link: arxiv.org/pdf/2501.09223

#LLMs #LargeLanguageModels #AIResearch #DeepLearning #MachineLearning #AIResources #NLP #AITheory #FoundationModels #AIUnderstanding



✉️ Our Telegram channels: https://t.me/addlist/0f6vfFbEMdAwODBk

📱 Our WhatsApp channel: https://whatsapp.com/channel/0029VaC7Weq29753hpcggW2A
Please open Telegram to view this post
VIEW IN TELEGRAM
14
Please open Telegram to view this post
VIEW IN TELEGRAM
8💯2👨‍💻1
The 2025 MIT deep learning course is excellent, covering neural networks, CNNs, RNNs, and LLMs. You build three projects for hands-on experience as part of the course. It is entirely free. Highly recommended for beginners.

Enroll Free: https://introtodeeplearning.com/

#DeepLearning #MITCourse #NeuralNetworks #CNN #RNN #LLMs #AIForBeginners #FreeCourse #MachineLearning #IntroToDeepLearning #AIProjects #LearnAI #AI2025

✉️ Our Telegram channels: https://t.me/addlist/0f6vfFbEMdAwODBk

📱 Our WhatsApp channel: https://whatsapp.com/channel/0029VaC7Weq29753hpcggW2A
Please open Telegram to view this post
VIEW IN TELEGRAM
7
This media is not supported in your browser
VIEW IN TELEGRAM
Over the last year, several articles have been written to help candidates prepare for data science technical interviews. These resources cover a wide range of topics including machine learning, SQL, programming, statistics, and probability.

1️⃣ Machine Learning (ML) Interview
Types of ML Q&A in Data Science Interview
https://shorturl.at/syN37

ML Interview Q&A for Data Scientists
https://shorturl.at/HVWY0

Crack the ML Coding Q&A
https://shorturl.at/CDW08

Deep Learning Interview Q&A
https://shorturl.at/lHPZ6

Top LLMs Interview Q&A
https://shorturl.at/wGRSZ

Top CV Interview Q&A [Part 1]
https://rb.gy/51jcfi

Part 2
https://rb.gy/hqgkbg

Part 3
https://rb.gy/5z87be

2️⃣ SQL Interview Preparation
13 SQL Statements for 90% of Data Science Tasks
https://rb.gy/dkdcl1

SQL Window Functions: Simplifying Complex Queries
https://t.ly/EwSlH

Ace the SQL Questions in the Technical Interview
https://lnkd.in/gNQbYMX9

Unlocking the Power of SQL: How to Ace Top N Problem Questions
https://lnkd.in/gvxVwb9n

How To Ace the SQL Ratio Problems
https://lnkd.in/g6JQqPNA

Cracking the SQL Window Function Coding Questions
https://lnkd.in/gk5u6hnE

SQL & Database Interview Q&A
https://lnkd.in/g75DsEfw

6 Free Resources for SQL Interview Preparation
https://lnkd.in/ghhiG79Q

3️⃣ Programming Questions
Foundations of Data Structures [Part 1]
https://lnkd.in/gX_ZcmRq

Part 2
https://lnkd.in/gATY4rTT

Top Important Python Questions [Conceptual]
https://lnkd.in/gJKaNww5

Top Important Python Questions [Data Cleaning and Preprocessing]
https://lnkd.in/g-pZBs3A

Top Important Python Questions [Machine & Deep Learning]
https://lnkd.in/gZwcceWN

Python Interview Q&A
https://lnkd.in/gcaXc_JE

5 Python Tips for Acing DS Coding Interview
https://lnkd.in/gsj_Hddd

4️⃣ Statistics
Mastering 5 Statistics Concepts to Boost Success
https://lnkd.in/gxEuHiG5

Mastering Hypothesis Testing for Interviews
https://lnkd.in/gSBbbmF8

Introduction to A/B Testing
https://lnkd.in/g35Jihw6

Statistics Interview Q&A for Data Scientists
https://lnkd.in/geHCCt6Q

5️⃣ Probability
15 Probability Concepts to Review [Part 1]
https://lnkd.in/g2rK2tQk

Part 2
https://lnkd.in/gQhXnKwJ

Probability Interview Q&A [Conceptual Questions]
https://lnkd.in/g5jyKqsp

Probability Interview Q&A [Mathematical Questions]
https://lnkd.in/gcWvPhVj

🔜 All links are available in the GitHub repository:
https://lnkd.in/djcgcKRT

#DataScience #InterviewPrep #MachineLearning #SQL #Python #Statistics #Probability #CodingInterview #AIBootcamp #DeepLearning #LLMs #ComputerVision #GitHubResources #CareerInDataScience


✉️ Our Telegram channels: https://t.me/addlist/0f6vfFbEMdAwODBk

📱 Our WhatsApp channel: https://whatsapp.com/channel/0029VaC7Weq29753hpcggW2A
Please open Telegram to view this post
VIEW IN TELEGRAM
8👍2💯2
10 GitHub repos to build a career in AI engineering:

(100% free step-by-step roadmap)

1️⃣ ML for Beginners by Microsoft

A 12-week project-based curriculum that teaches classical ML using Scikit-learn on real-world datasets.

Includes quizzes, lessons, and hands-on projects, with some videos.

GitHub repo → https://lnkd.in/dCxStbYv

2️⃣ AI for Beginners by Microsoft

This repo covers neural networks, NLP, CV, transformers, ethics & more. There are hands-on labs in PyTorch & TensorFlow using Jupyter.

Beginner-friendly, project-based, and full of real-world apps.

GitHub repo → https://lnkd.in/dwS5Jk9E

3️⃣ Neural Networks: Zero to Hero

Now that you’ve grasped the foundations of AI/ML, it’s time to dive deeper.

This repo by Andrej Karpathy builds modern deep learning systems from scratch, including GPTs.

GitHub repo → https://lnkd.in/dXAQWucq

4️⃣ DL Paper Implementations

So far, you have learned the fundamentals of AI, ML, and DL. Now study how the best architectures work.

This repo covers well-documented PyTorch implementations of 60+ research papers on Transformers, GANs, Diffusion models, etc.

GitHub repo → https://lnkd.in/dTrtDrvs

5️⃣ Made With ML

Now it’s time to learn how to go from notebooks to production.

Made With ML teaches you how to design, develop, deploy, and iterate on real-world ML systems using MLOps, CI/CD, and best practices.

GitHub repo → https://lnkd.in/dYyjjBGb

6️⃣ Hands-on LLMs

- You've built neural nets.
- You've explored GPTs and LLMs.

Now apply them. This is a visually rich repo that covers everything about LLMs, like tokenization, fine-tuning, RAG, etc.

GitHub repo → https://lnkd.in/dh2FwYFe

7️⃣ Advanced RAG Techniques

Hands-on LLMs will give you a good grasp of RAG systems. Now learn advanced RAG techniques.

This repo covers 30+ methods to make RAG systems faster, smarter, and accurate, like HyDE, GraphRAG, etc.

GitHub repo → https://lnkd.in/dBKxtX-D

8️⃣ AI Agents for Beginners by Microsoft

After diving into LLMs and mastering RAG, learn how to build AI agents.

This hands-on course covers building AI agents using frameworks like AutoGen.

GitHub repo → https://lnkd.in/dbFeuznE

9️⃣ Agents Towards Production

The above course will teach what AI agents are. Next, learn how to ship them.

This is a practical playbook for building agents covering memory, orchestration, deployment, security & more.

GitHub repo → https://lnkd.in/dcwmamSb

🔟 AI Engg. Hub

To truly master LLMs, RAG, and AI agents, you need projects.

This covers 70+ real-world examples, tutorials, and agent app you can build, adapt, and ship.

GitHub repo → https://lnkd.in/geMYm3b6

#AIEngineering #MachineLearning #DeepLearning #LLMs #RAG #MLOps #Python #GitHubProjects #AIForBeginners #ArtificialIntelligence #NeuralNetworks #OpenSourceAI #DataScienceCareers


✉️ Our Telegram channels: https://t.me/addlist/0f6vfFbEMdAwODBk

📱 Our WhatsApp channel: https://whatsapp.com/channel/0029VaC7Weq29753hpcggW2A
Please open Telegram to view this post
VIEW IN TELEGRAM
6
Auto-Encoder & Backpropagation by hand ✍️ lecture video ~ 📺 https://byhand.ai/cv/10

It took me a few years to invent this method to show both forward and backward passes for a non-trivial case of a multi-layer perceptron over a batch of inputs, plus gradient descents over multiple epochs, while being able to hand calculate each step and code in Excel at the same time.

= Chapters =
• Encoder & Decoder (00:00)
• Equation (10:09)
• 4-2-4 AutoEncoder (16:38)
• 6-4-2-4-6 AutoEncoder (18:39)
• L2 Loss (20:49)
• L2 Loss Gradient (27:31)
• Backpropagation (30:12)
• Implement Backpropagation (39:00)
• Gradient Descent (44:30)
• Summary (51:39)

#AIEngineering #MachineLearning #DeepLearning #LLMs #RAG #MLOps #Python #GitHubProjects #AIForBeginners #ArtificialIntelligence #NeuralNetworks #OpenSourceAI #DataScienceCareers


✉️ Our Telegram channels: https://t.me/addlist/0f6vfFbEMdAwODBk
Please open Telegram to view this post
VIEW IN TELEGRAM
3
This media is not supported in your browser
VIEW IN TELEGRAM
GPU by hand ✍️ I drew this to show how a GPU speeds up an array operation of 8 elements in parallel over 4 threads in 2 clock cycles. Read more 👇

CPU
• It has one core.
• Its global memory has 120 locations (0-119).
• To use the GPU, it needs to copy data from the global memory to the GPU.
• After GPU is done, it will copy the results back.

GPU
• It has four cores to run four threads (0-3).
• It has a register file of 28 locations (0-27)
• This register file has four banks (0-3).
• All threads share the same register file.
• But they must read/write using the four banks.
• Each bank allows 2 reads (Read 0, Read 1) and 1 write in a single clock cycle.

#AIEngineering #MachineLearning #DeepLearning #LLMs #RAG #MLOps #Python #GitHubProjects #AIForBeginners #ArtificialIntelligence #NeuralNetworks #OpenSourceAI #DataScienceCareers


✉️ Our Telegram channels: https://t.me/addlist/0f6vfFbEMdAwODBk
Please open Telegram to view this post
VIEW IN TELEGRAM
👍4