Machine Learning
39.4K subscribers
4.35K photos
40 videos
50 files
1.42K links
Real Machine Learning โ€” simple, practical, and built on experience.
Learn step by step with clear explanations and working code.

Admin: @HusseinSheikho || @Hussein_Sheikho
Download Telegram
Hugging Face has literally gathered all the key "secrets". ๐Ÿค”

It's important to understand the evaluation of large language models. ๐Ÿ“Š

While you're working with language models:
> training or retraining your models, ๐Ÿ”„
> selecting a model for a task, ๐ŸŽฏ
> or trying to understand the current state of the field, ๐ŸŒ

the question almost inevitably arises:
how to understand that a model is good? โ“

The answer is quality evaluation. It's everywhere:
> leaderboards with model ratings, ๐Ÿ†
> benchmarks that supposedly measure reasoning, ๐Ÿง 
> knowledge, coding or mathematics, ๐Ÿ‘จโ€๐Ÿ’ป
> articles with claimed new best results. ๐Ÿ“ˆ

But what is evaluation actually? ๐Ÿคทโ€โ™‚๏ธ
And what does it really show? ๐Ÿ”

This guide helps to understand everything. ๐Ÿ“š
https://huggingface.co/spaces/OpenEvals/evaluation-guidebook#what-is-model-evaluation-about


What is model evaluation all about ๐Ÿค–
Basic concepts of large language models for understanding evaluation ๐Ÿ—๏ธ
Evaluation through ready-made benchmarks ๐Ÿ“
Creating your own evaluation system ๐Ÿ”ง
The main problem of evaluation โš ๏ธ
Evaluation of free text ๐Ÿ“
Statistical correctness of evaluation ๐Ÿ“‰
Cost and efficiency of evaluation ๐Ÿ’ฐ

https://t.me/CodeProgrammer ๐ŸŸข
Please open Telegram to view this post
VIEW IN TELEGRAM
โค2
Channel photo removed
Channel photo updated
๐Ÿ›  ๐๐ž๐ฒ๐จ๐ง๐ ๐ญ๐ก๐ž ๐†๐ซ๐š๐๐ข๐ž๐ง๐ญ: ๐“๐ก๐ž ๐Œ๐š๐ญ๐ก๐ž๐ฆ๐š๐ญ๐ข๐œ๐ฌ ๐๐ž๐ก๐ข๐ง๐ ๐‹๐จ๐ฌ๐ฌ ๐…๐ฎ๐ง๐œ๐ญ๐ข๐จ๐ง๐ฌ

ML engineers often treat loss functions as โ€œset-and-forgetโ€ hyperparameters. But the loss is not just a training detail; it is the mathematical statement of what the model is supposed to care about.

โžก๏ธ In ๐ซ๐ž๐ ๐ซ๐ž๐ฌ๐ฌ๐ข๐จ๐ง, ๐Œ๐’๐„ pushes the model to reduce large errors aggressively, which makes it sensitive to outliers, while ๐Œ๐€๐„ treats all errors more evenly and is often more robust.
โ†ณ ๐‡๐ฎ๐›๐ž๐ซ ๐ฅ๐จ๐ฌ๐ฌ sits between the two, using squared error for small deviations and absolute error for larger ones.
โ†ณ ๐๐ฎ๐š๐ง๐ญ๐ข๐ฅ๐ž ๐ฅ๐จ๐ฌ๐ฌ becomes useful when the goal is not a single prediction, but an interval or asymmetric risk, and ๐๐จ๐ข๐ฌ๐ฌ๐จ๐ง ๐ฅ๐จ๐ฌ๐ฌ fits naturally when the target is a count or rate.
โžก๏ธ In ๐œ๐ฅ๐š๐ฌ๐ฌ๐ข๐Ÿ๐ข๐œ๐š๐ญ๐ข๐จ๐ง, ๐‚๐ซ๐จ๐ฌ๐ฌ-๐„๐ง๐ญ๐ซ๐จ๐ฉ๐ฒ remains the core objective because it trains the model to produce good probabilities, not just correct labels.
โ†ณ ๐๐ข๐ง๐š๐ซ๐ฒ ๐‚๐ซ๐จ๐ฌ๐ฌ-๐„๐ง๐ญ๐ซ๐จ๐ฉ๐ฒ is the natural choice for two-class or multi-label settings, while ๐‚๐š๐ญ๐ž๐ ๐จ๐ซ๐ข๐œ๐š๐ฅ ๐‚๐ซ๐จ๐ฌ๐ฌ-๐„๐ง๐ญ๐ซ๐จ๐ฉ๐ฒ extends that idea to multi-class softmax outputs.
โ†ณ ๐Š๐‹ ๐ƒ๐ข๐ฏ๐ž๐ซ๐ ๐ž๐ง๐œ๐ž is especially important when the task involves matching distributions, such as distillation, variational inference, or probabilistic modeling.
โ†ณ ๐‡๐ข๐ง๐ ๐ž ๐ฅ๐จ๐ฌ๐ฌ and squared hinge loss reflect the margin-based logic behind SVM-style learning, and focal loss is particularly valuable when easy examples dominate and the hard cases need more attention.
โžก๏ธ In ๐ฌ๐ฉ๐ž๐œ๐ข๐š๐ฅ๐ข๐ณ๐ž๐ ๐ญ๐š๐ฌ๐ค๐ฌ, the choice of loss becomes even more meaningful.
โ†ณ ๐ƒ๐ข๐œ๐ž ๐ฅ๐จ๐ฌ๐ฌ works well in segmentation because it focuses on overlap and helps with class imbalance.
โ†ณ ๐†๐€๐ ๐ฅ๐จ๐ฌ๐ฌ drives the generatorโ€“discriminator game in adversarial learning.
โ†ณ ๐“๐ซ๐ข๐ฉ๐ฅ๐ž๐ญ ๐ฅ๐จ๐ฌ๐ฌ and contrastive loss shape embedding spaces so that similarity is learned directly.
โ†ณ ๐‚๐“๐‚ ๐ฅ๐จ๐ฌ๐ฌ solves alignment problems in sequence tasks like speech recognition and OCR, where labels are unsegmented.
โ†ณ ๐‚๐จ๐ฌ๐ข๐ง๐ž ๐ฉ๐ซ๐จ๐ฑ๐ข๐ฆ๐ข๐ญ๐ฒ is useful when vector direction matters more than magnitude.

๐Ÿ’ก ๐‘ป๐’‰๐’† ๐’ƒ๐’Š๐’ˆ๐’ˆ๐’†๐’“ ๐’•๐’‚๐’Œ๐’†๐’‚๐’˜๐’‚๐’š: ๐‘‡โ„Ž๐‘’ ๐‘™๐‘œ๐‘ ๐‘  ๐‘“๐‘ข๐‘›๐‘๐‘ก๐‘–๐‘œ๐‘› ๐‘’๐‘›๐‘๐‘œ๐‘‘๐‘’๐‘  ๐‘ฆ๐‘œ๐‘ข๐‘Ÿ ๐‘Ž๐‘ ๐‘ ๐‘ข๐‘š๐‘๐‘ก๐‘–๐‘œ๐‘›๐‘  ๐‘Ž๐‘๐‘œ๐‘ข๐‘ก ๐‘กโ„Ž๐‘’ ๐‘๐‘Ÿ๐‘œ๐‘๐‘™๐‘’๐‘š. ๐ผ๐‘ก ๐‘Ž๐‘“๐‘“๐‘’๐‘๐‘ก๐‘  ๐‘๐‘œ๐‘›๐‘ฃ๐‘’๐‘Ÿ๐‘”๐‘’๐‘›๐‘๐‘’, ๐‘ ๐‘ก๐‘Ž๐‘๐‘–๐‘™๐‘–๐‘ก๐‘ฆ, ๐‘๐‘Ž๐‘™๐‘–๐‘๐‘Ÿ๐‘Ž๐‘ก๐‘–๐‘œ๐‘›, ๐‘Ÿ๐‘œ๐‘๐‘ข๐‘ ๐‘ก๐‘›๐‘’๐‘ ๐‘ , ๐‘Ž๐‘›๐‘‘ ๐‘”๐‘’๐‘›๐‘’๐‘Ÿ๐‘Ž๐‘™๐‘–๐‘ง๐‘Ž๐‘ก๐‘–๐‘œ๐‘›; ๐‘ ๐‘œ๐‘š๐‘’๐‘ก๐‘–๐‘š๐‘’๐‘  ๐‘—๐‘ข๐‘ ๐‘ก ๐‘Ž๐‘  ๐‘š๐‘ข๐‘โ„Ž ๐‘Ž๐‘  ๐‘กโ„Ž๐‘’ ๐‘Ž๐‘Ÿ๐‘โ„Ž๐‘–๐‘ก๐‘’๐‘๐‘ก๐‘ข๐‘Ÿ๐‘’ ๐‘–๐‘ก๐‘ ๐‘’๐‘™๐‘“.
โžœ ๐‘†๐‘œ ๐‘กโ„Ž๐‘’ ๐‘Ÿ๐‘’๐‘Ž๐‘™ ๐‘ž๐‘ข๐‘’๐‘ ๐‘ก๐‘–๐‘œ๐‘› ๐‘–๐‘  ๐‘›๐‘œ๐‘ก ๐‘œ๐‘›๐‘™๐‘ฆ โ€œ๐‘Šโ„Ž๐‘–๐‘โ„Ž ๐‘š๐‘œ๐‘‘๐‘’๐‘™ ๐‘ โ„Ž๐‘œ๐‘ข๐‘™๐‘‘ ๐ผ ๐‘ข๐‘ ๐‘’?โ€
โžœ ๐ผ๐‘ก ๐‘–๐‘  ๐‘Ž๐‘™๐‘ ๐‘œ: โ€œ๐‘Šโ„Ž๐‘Ž๐‘ก ๐‘๐‘’โ„Ž๐‘Ž๐‘ฃ๐‘–๐‘œ๐‘Ÿ ๐‘–๐‘  ๐‘กโ„Ž๐‘–๐‘  ๐‘™๐‘œ๐‘ ๐‘  ๐‘’๐‘›๐‘๐‘œ๐‘ข๐‘Ÿ๐‘Ž๐‘”๐‘–๐‘›๐‘”?โ€

https://t.me/MachineLearning9
โค6๐Ÿ‘1๐Ÿ”ฅ1
๐Ÿ”– 10 Stanford courses on AI and ML โ€” with official pages and all materials

โ–ถ๏ธ CS221: Artificial Intelligence
โ–ถ๏ธ CS229: Machine Learning
โ–ถ๏ธ CS229M: Theory of Machine Learning
โ–ถ๏ธ CS230: Deep Learning
โ–ถ๏ธ CS234: Reinforcement Learning
โ–ถ๏ธ CS224N: Natural Language Processing
โ–ถ๏ธ CS231N: Deep Learning for Computer Vision
โ–ถ๏ธ CME295: Large Language Models
โ–ถ๏ธ CS236: Deep Generative Models
โ–ถ๏ธ CS336: Modeling Language from Scratch

They cover the entire spectrum: classic ML, LLM, and generative models โ€” with theory and practice.

tags: #python #ML #LLM #AI

โžก https://t.me/MachineLearning9
Please open Telegram to view this post
VIEW IN TELEGRAM
โค9
Algorithms by Jeff Erickson - one of the best algorithm books out there ๐Ÿ“š.

The illustrations make complex concepts surprisingly easy to follow ๐ŸŽจ. Highly recommend this ๐Ÿ‘.

Link: https://jeffe.cs.illinois.edu/teaching/algorithms/ ๐Ÿ”—

https://t.me/MachineLearning9
โค3๐Ÿ‘3๐Ÿ”ฅ1
Every data professional forgets which statistical test to use. Here's the fix. ๐Ÿ› 

(Bookmark it. Seriously. ๐Ÿ“Œ)

I've been there:
โ†ณ Staring at two datasets wondering which test to run ๐Ÿค”
โ†ณ Googling "t-test vs ANOVA" for the 10th time ๐Ÿ”
โ†ณ Second-guessing myself in an interview ๐Ÿ˜ฐ

Choosing the wrong statistical test can invalidate your findings and lead to flawed conclusions. โš ๏ธ

Here's your quick reference guide:

๐‚๐จ๐ฆ๐ฉ๐š๐ซ๐ข๐ง๐  ๐Œ๐ž๐š๐ง๐ฌ: ๐Ÿ“Š
โ†ณ 2 independent groups โ†’ Independent t-Test
โ†ณ Same group, before/after โ†’ Paired t-Test
โ†ณ 3+ groups โ†’ ANOVA

๐๐จ๐ง-๐๐จ๐ซ๐ฆ๐š๐ฅ ๐ƒ๐š๐ญ๐š: ๐Ÿ“‰
โ†ณ 2 groups โ†’ Mann-Whitney U Test
โ†ณ Paired samples โ†’ Wilcoxon Signed-Rank Test
โ†ณ 3+ groups โ†’ Kruskal-Wallis Test

๐‘๐ž๐ฅ๐š๐ญ๐ข๐จ๐ง๐ฌ๐ก๐ข๐ฉ๐ฌ: ๐Ÿ”—
โ†ณ Linear relationship โ†’ Pearson Correlation
โ†ณ Ranked/non-linear โ†’ Spearman Correlation
โ†ณ Two categorical variables โ†’ Chi-Square Test

๐๐ซ๐ž๐๐ข๐œ๐ญ๐ข๐จ๐ง: ๐Ÿ”ฎ
โ†ณ Continuous outcome โ†’ Linear Regression
โ†ณ Binary outcome (yes/no) โ†’ Logistic Regression

๐•๐š๐ซ๐ข๐š๐ง๐œ๐ž: โš–๏ธ
โ†ณ Compare spread between groups โ†’ Levene's Test / F-Test

Here are 5 resources to help you: ๐Ÿ“š

1. Khan Academy Statistics: https://lnkd.in/statistics-khan
2. StatQuest YouTube Channel: https://lnkd.in/statquest-yt
3. Seeing Theory (Visual Stats): https://lnkd.in/seeing-theory
4. Statistics by Jim Blog: https://lnkd.in/stats-jim
5. OpenIntro Statistics (Free Textbook): https://lnkd.in/openintro-stats
โค4
๐Ÿš€ ๐—ฆ๐˜๐—ถ๐—น๐—น ๐—ง๐—ต๐—ถ๐—ป๐—ธ ๐——๐—ฎ๐˜๐—ฎ ๐—ฆ๐—ฐ๐—ถ๐—ฒ๐—ป๐—ฐ๐—ฒ ๐—ถ๐˜€ ๐—๐˜‚๐˜€๐˜ ๐—”๐—ฏ๐—ผ๐˜‚๐˜ ๐—ฃ๐˜†๐˜๐—ต๐—ผ๐—ป & ๐—ง๐—ผ๐—ผ๐—น๐˜€? ๐—ง๐—ต๐—ถ๐—ป๐—ธ ๐—”๐—ด๐—ฎ๐—ถ๐—ป.

Behind every powerful model, every accurate prediction, and every data-driven decisionโ€ฆ lies mathematics.

Whether you're starting out or advancing in data science, mastering core mathematics is what separates tool users from true problem solvers.

Here are some of the most important mathematical concepts every data professional should be comfortable with:

๐Ÿ”น ๐—ข๐—ฝ๐˜๐—ถ๐—บ๐—ถ๐˜‡๐—ฎ๐˜๐—ถ๐—ผ๐—ป ๐—ง๐—ฒ๐—ฐ๐—ต๐—ป๐—ถ๐—พ๐˜‚๐—ฒ๐˜€ (๐—š๐—ฟ๐—ฎ๐—ฑ๐—ถ๐—ฒ๐—ป๐˜ ๐——๐—ฒ๐˜€๐—ฐ๐—ฒ๐—ป๐˜)
Drives how models learn by minimizing error step-by-step.

๐Ÿ”น ๐—ฃ๐—ฟ๐—ผ๐—ฏ๐—ฎ๐—ฏ๐—ถ๐—น๐—ถ๐˜๐˜† & ๐——๐—ถ๐˜€๐˜๐—ฟ๐—ถ๐—ฏ๐˜‚๐˜๐—ถ๐—ผ๐—ป๐˜€ (๐—ก๐—ผ๐—ฟ๐—บ๐—ฎ๐—น ๐——๐—ถ๐˜€๐˜๐—ฟ๐—ถ๐—ฏ๐˜‚๐˜๐—ถ๐—ผ๐—ป, ๐—ก๐—ฎ๐—ถ๐˜ƒ๐—ฒ ๐—•๐—ฎ๐˜†๐—ฒ๐˜€)
Helps in understanding uncertainty and making predictions.

๐Ÿ”น ๐—ฆ๐˜๐—ฎ๐˜๐—ถ๐˜€๐˜๐—ถ๐—ฐ๐˜€ ๐—™๐˜‚๐—ป๐—ฑ๐—ฎ๐—บ๐—ฒ๐—ป๐˜๐—ฎ๐—น๐˜€ (๐—ญ-๐—ฆ๐—ฐ๐—ผ๐—ฟ๐—ฒ, ๐—–๐—ผ๐—ฟ๐—ฟ๐—ฒ๐—น๐—ฎ๐˜๐—ถ๐—ผ๐—ป)
Essential for interpreting data and identifying meaningful patterns.

๐Ÿ”น ๐—”๐—ฐ๐˜๐—ถ๐˜ƒ๐—ฎ๐˜๐—ถ๐—ผ๐—ป ๐—™๐˜‚๐—ป๐—ฐ๐˜๐—ถ๐—ผ๐—ป๐˜€ (๐—ฆ๐—ถ๐—ด๐—บ๐—ผ๐—ถ๐—ฑ, ๐—ฅ๐—ฒ๐—Ÿ๐—จ, ๐—ฆ๐—ผ๐—ณ๐˜๐—บ๐—ฎ๐˜…)
Power the intelligence behind neural networks.

๐Ÿ”น ๐— ๐—ผ๐—ฑ๐—ฒ๐—น ๐—˜๐˜ƒ๐—ฎ๐—น๐˜‚๐—ฎ๐˜๐—ถ๐—ผ๐—ป ๐— ๐—ฒ๐˜๐—ฟ๐—ถ๐—ฐ๐˜€ (๐—™๐Ÿญ ๐—ฆ๐—ฐ๐—ผ๐—ฟ๐—ฒ, ๐—ฅยฒ, ๐— ๐—ฆ๐—˜, ๐—Ÿ๐—ผ๐—ด ๐—Ÿ๐—ผ๐˜€๐˜€)
Measure how well your model is actually performing.

๐Ÿ”น ๐—Ÿ๐—ถ๐—ป๐—ฒ๐—ฎ๐—ฟ ๐—”๐—น๐—ด๐—ฒ๐—ฏ๐—ฟ๐—ฎ (๐—˜๐—ถ๐—ด๐—ฒ๐—ป๐˜ƒ๐—ฒ๐—ฐ๐˜๐—ผ๐—ฟ๐˜€, ๐—ฆ๐—ฉ๐——)
The backbone of dimensionality reduction and complex transformations.

๐Ÿ”น ๐—ข๐—ฝ๐˜๐—ถ๐—บ๐—ถ๐˜‡๐—ฎ๐˜๐—ถ๐—ผ๐—ป & ๐—ฅ๐—ฒ๐—ด๐˜‚๐—น๐—ฎ๐—ฟ๐—ถ๐˜‡๐—ฎ๐˜๐—ถ๐—ผ๐—ป (๐— ๐—Ÿ๐—˜, ๐—Ÿ๐Ÿฎ ๐—ฅ๐—ฒ๐—ด๐˜‚๐—น๐—ฎ๐—ฟ๐—ถ๐˜‡๐—ฎ๐˜๐—ถ๐—ผ๐—ป)
Prevents overfitting and improves model generalization.

๐Ÿ”น ๐—–๐—น๐˜‚๐˜€๐˜๐—ฒ๐—ฟ๐—ถ๐—ป๐—ด & ๐— ๐—ฒ๐˜๐—ฟ๐—ถ๐—ฐ๐˜€ (๐—ž-๐— ๐—ฒ๐—ฎ๐—ป๐˜€, ๐—–๐—ผ๐˜€๐—ถ๐—ป๐—ฒ ๐—ฆ๐—ถ๐—บ๐—ถ๐—น๐—ฎ๐—ฟ๐—ถ๐˜๐˜†)
Helps in grouping and understanding hidden structures in data.

๐Ÿ”น ๐—œ๐—ป๐—ณ๐—ผ๐—ฟ๐—บ๐—ฎ๐˜๐—ถ๐—ผ๐—ป ๐—ง๐—ต๐—ฒ๐—ผ๐—ฟ๐˜† (๐—˜๐—ป๐˜๐—ฟ๐—ผ๐—ฝ๐˜†, ๐—ž๐—Ÿ ๐——๐—ถ๐˜ƒ๐—ฒ๐—ฟ๐—ด๐—ฒ๐—ป๐—ฐ๐—ฒ)
Used in decision trees and probabilistic models.

๐Ÿ”น ๐—”๐—ฑ๐˜ƒ๐—ฎ๐—ป๐—ฐ๐—ฒ๐—ฑ ๐—ข๐—ฝ๐˜๐—ถ๐—บ๐—ถ๐˜‡๐—ฎ๐˜๐—ถ๐—ผ๐—ป (๐—ฆ๐—ฉ๐— , ๐—Ÿ๐—ฎ๐—ด๐—ฟ๐—ฎ๐—ป๐—ด๐—ฒ ๐— ๐˜‚๐—น๐˜๐—ถ๐—ฝ๐—น๐—ถ๐—ฒ๐—ฟ)
Crucial for constrained optimization problems.

๐Ÿ’ก ๐—ฅ๐—ฒ๐—ฎ๐—น๐—ถ๐˜๐˜† ๐—–๐—ต๐—ฒ๐—ฐ๐—ธ:

You donโ€™t need to master all of these at onceโ€”but ignoring them will limit your growth.

๐Ÿ‘‰ Start small.

๐Ÿ‘‰ Focus on intuition over memorization.

๐Ÿ‘‰ Learn how these concepts connect to real-world problems.

Because in data science, math is not optionalโ€”itโ€™s your competitive advantage.

https://t.me/MachineLearning9 ๐Ÿงก
Please open Telegram to view this post
VIEW IN TELEGRAM
โค3๐Ÿ‘1
Convolutional Neural Network

https://t.me/MachineLearning9
โค5
This Machine Learning Cheat Sheet Saved Me Hours of Revision โณ

It includes:
โœ… Supervised & Unsupervised algorithms
โœ… Regression, Classification & Clustering techniques
โœ… PCA & Dimensionality Reduction
โœ… Neural Networks, CNN, RNN & Transformers
โœ… Assumptions, Pros/Cons & Real-world use cases

Whether you're:
๐Ÿ”น Preparing for data science interviews
๐Ÿ”น Working on ML projects
๐Ÿ”น Or strengthening your fundamentals
this one-page guide is a must-save.

โ™ป๏ธ Repost and share with your ML circle.

#MachineLearning #DataScience #AI #MLAlgorithms #InterviewPrep #LearnML
โค3
Linear Regression explained in a simple geometric way

https://t.me/MachineLearning9 ๐Ÿ’—
Please open Telegram to view this post
VIEW IN TELEGRAM
โค2