Machine Learning
39.9K subscribers
3.59K photos
26 videos
47 files
605 links
Real Machine Learning โ€” simple, practical, and built on experience.
Learn step by step with clear explanations and working code.

Admin: @HusseinSheikho || @Hussein_Sheikho
Download Telegram
๐Ÿ”ฅ Primer pedido con B4U Prime por solo 1โ‚ฌ
B4U Prime te ayuda a ahorrar en comida y viajes.
๐Ÿ” Uber Eats hasta -50%
๐Ÿš• Uber Rides hasta -50%
โšก๏ธ Rรกpido y cรณmodo
๐Ÿ’Ž Servicio premium
๐ŸŽ Oferta para nuevos clientes:
tu primer pedido cuesta solo 1โ‚ฌ.
ยฟCรณmo funciona?
1๏ธโƒฃ Nos envรญas tu pedido por DM
2๏ธโƒฃ Te damos el precio con descuento
3๏ธโƒฃ Pagas el importe confirmado
4๏ธโƒฃ Procesamos tu pedido
Ejemplo:
Tu pedido cuesta 40โ‚ฌ โ†’ pagas 20โ‚ฌ
Despuรฉs del primer pedido, sigues disfrutando de hasta 50% de descuento.
๐Ÿ“ฉ Escrรญbenos ahora para hacer tu pedido.
B4U Prime โ€” ahorra antes de pagar.
Canal: @b4u_prime_channel
โค2
Found an easy way to learn math for ML: Mathematics for Machine Learning ๐ŸŽ“๐Ÿ“š

This is a curated collection on GitHub, including books, research papers, video lectures, and basic materials on math for studying and reviewing the mathematical foundations of machine learning. ๐Ÿ“–๐Ÿ“Š

It helps build a stronger knowledge base by bringing together trusted resources around topics that machine learning engineers constantly encounter: linear algebra, mathematical analysis, probability theory, statistics, information theory, matrix calculus, and deep learning mathematics. ๐Ÿงฎ๐Ÿค–

Free public repository on GitHub. ๐Ÿ’ปโœจ

https://github.com/dair-ai/Mathematics-for-ML

#MachineLearning #Mathematics #DataScience #Learning #GitHub #AI
โค6
๐Ÿ”– A huge open-source course on AI Engineering from scratch

In the repository, we've collected:
โ€” 435 lessons;
โ€” 320+ hours of content;
โ€” Python, TypeScript, and Rust;
โ€” AI agents, MCP servers, prompts, and AI skills.

Moreover, almost every lesson includes practical tasks, so this isn't just theory, but a full-fledged roadmap for AI Engineering. ๐Ÿš€

โ›“๏ธ Link to the repository
https://github.com/rohitg00/ai-engineering-from-scratch

#AI #MachineLearning #Python #Rust #OpenSource #Tech

โœจ Join Best TG Channels https://t.me/addlist/0f6vfFbEMdAwODBk

โญ๏ธ Join Our WhatsApp Channel https://whatsapp.com/channel/0029VaC7Weq29753hpcggW2A
โค6๐Ÿ‘1
Transformer implementations for vision, audio, and AI agents ๐Ÿค–๐Ÿ‘๏ธ๐ŸŽต

Repo: https://github.com/Nicolepcx/transformers-the-definitive-guide

#AI #MachineLearning #Vision #Audio #Agents #Tech

โœจ Join Best TG Channels https://t.me/addlist/0f6vfFbEMdAwODBk

โญ๏ธ Join Our WhatsApp Channel https://whatsapp.com/channel/0029VaC7Weq29753hpcggW2A
โค3๐Ÿ‘2
๐Ÿš€ HelloEncyclo Presale is LIVE!

Master the skills that matter โ€” Gen-AI, Data Science, Machine Learning and more โ€” all in one place.

๐ŸŽ First 250 members get a flat 40% OFF

Use code: PRESALE-BOOK-WAVE-2GFG

โœ… 13 full courses live right now

โœ… 40+ more dropping in the next 2โ€“3 weeks

โœ… Complete library within 2 months โ€” built and refined by industry experts

โœ… 15-day money-back guarantee โ€” don't love it? Get a full refund.

โš ๏ธ Coupon works only after you log in with Gmail, and it's valid once per member.

๐Ÿ‘‰ Log in now and start learning:

https://helloencyclo.com

Don't wait โ€” the 40% deal disappears after the first 250 seats. ๐Ÿ”ฅ
1โค3
"Calculus: Early Transcendentals" is an excellent free textbook for building a solid foundation in mathematical analysis. ๐Ÿ“˜

The book is written in a clear and accessible language, while maintaining the necessary mathematical rigor. It contains a large number of examples and problems, making it suitable for both self-study and use in the educational process. ๐ŸŽ“

The textbook covers a wide range of topics, including:
โ€ข limits;
โ€ข derivatives;
โ€ข integrals;
โ€ข sequences and series;
โ€ข differential equations;
โ€ข multivariate analysis.

I consider this book another valuable tool in the arsenal of anyone studying mathematics. ๐Ÿ› ๏ธ

If you are a student and want to master or review key topics in mathematical analysis, or a teacher looking for new ideas and alternative explanations, this textbook is definitely worth attention.


https://open.umn.edu/opentextbooks/textbooks/415

https://github.com/antoniolupetti/algebrica

#Calculus #Math #FreeTextbook #StudyGuide #Mathematics #STEM

โœจ Join Best TG Channels https://t.me/addlist/0f6vfFbEMdAwODBk

โญ๏ธ Join Our WhatsApp Channel https://whatsapp.com/channel/0029VaC7Weq29753hpcggW2A
โค5
Data leakage is one of the main reasons why ML demos look impressive... and then fail in production. ๐Ÿ“‰

The model didn't become smarter.
It just happened to see the correct answers in advance.

In 4 minutes, you'll understand where data leaks hide. ๐Ÿ”

Let's break it down below: ๐Ÿ‘‡

1. Data Leakage ๐Ÿ•ณ๏ธ

Data leakage occurs when information that won't be available at the time of actual prediction is used during the model training process.

Because of this, metrics on the validation stage can look much better than the actual quality of the model on new, previously unseen data.

2. Model Evaluation โš–๏ธ

The test set isn't just "additional data".
It's a simulation of the future.

Only train the model on the information that would have been available to you at the time of prediction.
Evaluate it on examples that the model couldn't have influenced during training.

3. Direct Leakage ๐Ÿšจ

This is the most obvious type of leakage.

Examples:
- a field with information from the future;
- an ID that encodes the target variable;
- a variable that appears only after an event has occurred;
- duplicate records in both the training and test sets.

If a feature doesn't exist at the time of inference (prediction), then it's likely a source of data leakage.

4. Indirect Leakage ๐Ÿ•ต๏ธ

This is the type of leakage that most often traps teams.

You perform normalization, imputation, feature selection, outlier removal, or dimensionality reduction before splitting the data into a training and test set.

The model didn't directly see the data from the test set.
But your preprocessing pipeline already saw it.

5. Train/Test Split โœ‚๏ธ

Wrong:
fit the scaler on all data โ†’ split the data โ†’ evaluate

Right:
split the data โ†’ fit the scaler only on the training set โ†’ apply it to both the training and test sets

The same idea applies to imputers, encoders, feature selection, PCA, and any preprocessing step that is trained on the data.

6. Cross-Validation ๐Ÿ”„

Each fold is a mini-experiment with a training and test set.
Therefore, preprocessing should be performed within each fold.

If you prepared the entire dataset once and then ran cross-validation, each fold would already have had access to its held-out data.

7. Pipelines ๐Ÿ› ๏ธ

A pipeline isn't just a way to make the code cleaner.
It's also a defense against data leakage.

Combine preprocessing, feature selection, and the model into a single pipeline, and then pass this pipeline to cross-validation or hyperparameter search (grid search).

8. AI Engineering Version ๐Ÿค–

Data leaks also occur in RAG systems and when evaluating LLMs.

Leakage occurs when you tune chunks, prompts, re-rankers, thresholds, or examples on the same evaluation dataset that you later present as "held-out".

As a result, your benchmark turns into training data.

9. Leakage Checklist โœ…

Before trusting the obtained metric, ask yourself:

- Could this feature exist at the time of prediction?
- Was any transformation (transform) step trained (fit) on the test data?
- Did cross-validation include the entire pipeline?
- Were we tuning parameters on the final evaluation dataset?

If the answer is "yes", then the metric likely doesn't reflect the actual quality of the model.

#MachineLearning #DataScience #MLOps #DataLeakage #ArtificialIntelligence #TechTips

โœจ Join Best TG Channels https://t.me/addlist/0f6vfFbEMdAwODBk

โญ๏ธ Join Our WhatsApp Channel https://whatsapp.com/channel/0029VaC7Weq29753hpcggW2A
โค4๐Ÿ‘3
FREE MIT books on AI and Machine Learning: ๐Ÿ“š๐Ÿค–

1. Foundations of Machine Learning cs.nyu.edu/~mohri/mlbook/
2. Understanding Deep Learning udlbook.github.io/udlbook/
3. Introduction to Machine Learning Systems โฏ Vol 1: mlsysbook.ai/vol1/assets/do โฏ Vol 2: mlsysbook.ai/vol2/assets/do
4. Algorithms for ML algorithmsbook.com
5. Deep Learning deeplearningbook.org
6. Reinforcement Learning andrew.cmu.edu/course/10-703/
7. Distributional Reinforcement Learning direct.mit.edu/books/oa-monog
8. Multi Agent Reinforcement Learning marl-book.com
9. Agents in the Long Game of AI direct.mit.edu/books/oa-monog
10. Fairness and Machine Learning fairmlbook.org
11. Probabilistic Machine Learning
โฏ Part 1 : probml.github.io/pml-book/book1
โฏ Part 2 : probml.github.io/pml-book/book2

#MIT #AI #MachineLearning #DeepLearning #ReinforcementLearning #FreeBooks

โœจ Join Best TG Channels https://t.me/addlist/0f6vfFbEMdAwODBk

โญ๏ธ Join Our WhatsApp Channel https://whatsapp.com/channel/0029VaC7Weq29753hpcggW2A
โค6
# Algebra 2 ๐Ÿ“š

"Algebra 2" is another completely free textbook that covers a significant portion of algebra at both the pre-university and initial university levels. ๐ŸŽ“

With over 1,100 pages and a large number of worked examples, practical problems, and exercises, it covers linear equations, quadratic equations, polynomial equations, rational equations, irrational equations, exponential and logarithmic equations, systems of equations, inequalities, and many fundamental concepts underlying algebra. ๐Ÿงฎ

In my opinion, this is one of the most comprehensive free resources for studying equation theory and algebraic methods typically encountered in the first years of university study. ๐Ÿ’ก

Source: https://openstax.org/details/books/algebra-and-trigonometry-2e

#Algebra #Math #FreeTextbook #Education #Study #University

โœจ Join Best TG Channels https://t.me/addlist/0f6vfFbEMdAwODBk

โญ๏ธ Join Our WhatsApp Channel https://whatsapp.com/channel/0029VaC7Weq29753hpcggW2A
โค6
Introduction to Deep RL and DQN

Link: https://www.dailydoseofds.com/rl-course-part-6/

๐Ÿค– #DeepRL #DQN #ReinforcementLearning #AI #MachineLearning #DataScience

โœจ Join Best TG Channels https://t.me/addlist/0f6vfFbEMdAwODBk

โญ๏ธ Join Our WhatsApp Channel https://whatsapp.com/channel/0029VaC7Weq29753hpcggW2A

๐Ÿš€ Level up your AI & Data Science skills with HelloEncyclo โ€” a growing all-in-one platform featuring hands-on courses in LLMs, Deep Learning, MLOps, Data Engineering, and more.
โœ… 13 courses live + 40+ coming soon
๐ŸŽฏ One access, lifetime updates
๐Ÿ”‘ Use code: PRESALE-BOOK-WAVE-2GFG
๐Ÿ‘‰ https://helloencyclo.com/?ref=HUSSEINSHEIKHO
โค6
Optimizing the model's performance through Prompt Tuning with the PEFT library.

โœจ Full-fledged fine-tuning of language models requires a huge amount of video memory and completely overwrites the network's weights. We will apply the Prompt Tuning method (retraining virtual token prompts), which freezes the main model and adjusts only a tiny matrix of virtual embeddings. This allows adapting AI to a narrow task using a regular user's graphics card and without the risk of destroying the neural network's basic knowledge.

๐Ÿ“ฆ First, we will install the necessary libraries for working with transformers and effective fine-tuning methods (PEFT).

pip install torch transformers peft

โœ… The packages have been successfully installed in the system and are ready for configuring lightweight training. We will create a basic Prompt Tuning configuration for training just twenty virtual tokens instead of billions of model parameters.

from peft import PromptTuningConfig, PromptTuningInit, get_peft_model
from transformers import AutoModelForCausalLM

peft_config = PromptTuningConfig(
task_type="CAUSAL_LM",
prompt_tuning_init=PromptTuningInit.TEXT,
num_virtual_tokens=20,
prompt_tuning_init_text="Classify the sentiment of this text:",
tokenizer_name_or_path="gpt2"
)

๐Ÿ”„ The configuration is initialized and links the text prompt to the trainable virtual embeddings. We will wrap the base model in a PEFT container to freeze the main weights and leave only the new tokens available for gradient descent.

base_model = AutoModelForCausalLM.from_pretrained("gpt2")
peft_model = get_peft_model(base_model, peft_config)
peft_model.print_trainable_parameters()

๐Ÿš€ The model is ready for training, and the percentage of active parameters will be displayed on the screen (usually less than 0.01%).

python3 -c "from peft import PromptTuningConfig; print('PEFT Setup: OK')"

๐Ÿ“ Expected output: PEFT Setup: OK

pip uninstall peft -y

๐Ÿ’ก Prompt Tuning โ€” an ideal choice when you need to train a model for many different customers or tasks simultaneously. Instead of gigabyte-sized copies of neural networks, you store only lightweight configuration files weighing a few kilobytes, dynamically substituting them at inference.

#PromptTuning #PEFT #AI #MachineLearning #DeepLearning #DataScience

โœจ Join Best TG Channels https://t.me/addlist/0f6vfFbEMdAwODBk

โญ๏ธ Join Our WhatsApp Channel https://whatsapp.com/channel/0029VaC7Weq29753hpcggW2A

๐Ÿš€ Level up your AI & Data Science skills with HelloEncyclo โ€” a growing all-in-one platform featuring hands-on courses in LLMs, Deep Learning, MLOps, Data Engineering, and more.
โœ… 13 courses live + 40+ coming soon
๐ŸŽฏ One access, lifetime updates
๐Ÿ”‘ Use code: PRESALE-BOOK-WAVE-2GFG
๐Ÿ‘‰ https://helloencyclo.com/?ref=HUSSEINSHEIKHO
โค4
If you want to finally understand how neural networks actually learn, I recommend these notes from Stanford CS224N. ๐Ÿง 

"Computing Neural Network Gradients" explains the calculation of gradients and backpropagation without black-box formulas. ๐Ÿ“‰

Inside:
โ€ข Chain Rule
โ€ข Computational Graphs
โ€ข Vectorized derivatives
โ€ข Efficient gradient calculation
โ€ข Step-by-step examples with formula analysis

Many people use PyTorch or TensorFlow every day, but never understood what happens after calling .backward(). ๐Ÿ”ฅ

These notes just fill this gap. ๐Ÿ› ๏ธ

PDF:
https://web.stanford.edu/class/cs224n/readings/gradient-notes.pdf

#NeuralNetworks #DeepLearning #StanfordCS #Backpropagation #MachineLearning #AIResearch

โœจ Join Best TG Channels https://t.me/addlist/0f6vfFbEMdAwODBk

โญ๏ธ Join Our WhatsApp Channel https://whatsapp.com/channel/0029VaC7Weq29753hpcggW2A

๐Ÿš€ Level up your AI & Data Science skills with HelloEncyclo โ€” a growing all-in-one platform featuring hands-on courses in LLMs, Deep Learning, MLOps, Data Engineering, and more.
โœ… 13 courses live + 40+ coming soon
๐ŸŽฏ One access, lifetime updates
๐Ÿ”‘ Use code: PRESALE-BOOK-WAVE-2GFG
๐Ÿ‘‰ https://helloencyclo.com/?ref=HUSSEINSHEIKHO
โค2
๐ŸŽ SPOTO Mid-Year Sale โ€“ Grab Your IT Certification Success Kit!

๐Ÿ”ฅ Whether you're prepping for #Python, #AI, #Cisco, #PMI, #Fortinet, #AWS, #Azure, #Excel, #Comptia, #ITIL, #Cloud or any other hot certification โ€“ SPOTO has your back with real exam dumps and hands-on training!

โœ… Free Resources:
ใƒปFree Python, Excel, Cyber Security, Cisco, SQL, ITIL, PMP, AWS courses: https://bit.ly/4alTSfk
ใƒปIT Certs E-book: https://bit.ly/49ub0zq
ใƒปIT Exams Skill Test: https://bit.ly/4dVPapB
ใƒปFree AI material and support tools: https://bit.ly/4elzcpl
ใƒปFree Cloud Study Guide: https://bit.ly/4u7sdG0

๐ŸŽ Join SPOTO Mid-Year Lucky Draw:
๐Ÿ“ฑ iPhone 17 ๐Ÿ›’ Free Order
๐Ÿ›’ Amazon Gift $100 ๐Ÿ“˜PMP/ AWS/ CCNA Course


๐Ÿ‘‰ Enter the Draw Now โ†’ https://bit.ly/4uN3lVt

๐Ÿ‘‰ Join Our IT Learning Community for free resources & support:
https://chat.whatsapp.com/FmbIbbqm2QhKglVpVTSH4d
๐Ÿ’ฌ Want exam help? Chat with an admin now:
https://wa.link/knicza

โฐ Mid-Year Deal Ends Soon โ€“ Don't Miss Out!
โค2๐Ÿคฉ1
Data Science Interview Questions.pdf
1.4 MB
Data Science Interview Questions

๐Ÿ’ก Here is your curated list for Data Science interviews!

โœจ Join Best TG Channels https://t.me/addlist/0f6vfFbEMdAwODBk

โญ๏ธ Join Our WhatsApp Channel https://whatsapp.com/channel/0029VaC7Weq29753hpcggW2A

๐Ÿš€ Level up your AI & Data Science skills with HelloEncyclo โ€” a growing all-in-one platform featuring hands-on courses in LLMs, Deep Learning, MLOps, Data Engineering, and more.
โœ… 13 courses live + 40+ coming soon
๐ŸŽฏ One access, lifetime updates
๐Ÿ”‘ Use code: PRESALE-BOOK-WAVE-2GFG
๐Ÿ‘‰ https://helloencyclo.com/?ref=HUSSEINSHEIKHO

#DataScience #AI #MachineLearning #LLM #TechJobs #InterviewPrep
โค4
๐Ÿšจ ONLY THE FIRST 5 GET THIS.

I'm sharing this link with my network once โ€” and only the first 5 people who enroll through it lock in a deal that has never been offered before.

๐Ÿ‘‘ Lifetime access to HelloEncyclo โ€” every AI, ML & Data Science course ever built โ€” for ~$41. Once. Forever.
This isn't a drill. This isn't a rerun.
This is the founding-member price โ€” and it disappears the moment the first 250 seats globally are gone.


โœ… 13 courses live right now
โœ… 40+ more in 2โ€“3 weeks
โœ… Every future course included automatically
โœ… 15-day money-back โ€” full refund, no questions

Code: PRESALE-BOOK-WAVE-2GFG

(Log in with Gmail ยท valid once ยท applies at checkout)

๐Ÿ‘‡ First 5. That's it.

https://helloencyclo.com/?ref=HUSSEINSHEIKHO

โณ Once those 5 seats go through this link โ€”

I'm not sharing it again. ๐Ÿ”ฅ