Data science/ML/AI
13.7K subscribers
561 photos
2 videos
145 files
320 links
Data science and machine learning hub

Python, SQL, stats, ML, deep learning, projects, PDFs, roadmaps and AI resources.

For beginners, data scientists and ML engineers
๐Ÿ‘‰ https://rebrand.ly/bigdatachannels

DMCA: @disclosure_bds
Contact: @mldatascientist
Download Telegram
Data Science Minimum: 10 Essential Skills You Need to Know to Start Doing Data Science
๐Ÿ‘5๐Ÿ˜1
Data Science : Definition, Challenges and Use cases
๐Ÿ‘4
Mastering Probability and Combinatorics

"Mastering the Essentials: Probability and Combinatorics Explained"

Rating โญ๏ธ: 4.0 out 5
Students ๐Ÿ‘จโ€๐ŸŽ“ : 1,129
Duration โฐ : 1hr 24min of on-demand video
Created by ๐Ÿ‘จโ€๐Ÿซ: Akhil Vydyula

๐Ÿ”— Course Link

#probability
โž–โž–โž–โž–โž–โž–โž–โž–โž–โž–โž–โž–โž–โž–
๐Ÿ‘‰Join @datascience_bds for more๐Ÿ‘ˆ
๐Ÿ‘3โค2
Data Science Portfolios, Speeding Up Python, KANs, and Other May Must-Reads

Python One Billion Row Challenge โ€” From 10 Minutes to 4 Seconds
With a longstanding reputation for slowness, youโ€™d think that Python wouldnโ€™t stand a chance at doing well in the popular โ€œone billion rowโ€ challenge. Dario Radeฤiฤ‡โ€™s viral post aims to show that with some flexibility and outside-the-box thinking, you can still squeeze impressive time savings out of your code.

N-BEATS โ€” The First Interpretable Deep Learning Model That Worked for Time Series Forecasting
Anyone who enjoys a thorough look into a modelโ€™s inner workings should bookmark Jonte Danckerโ€™s excellent explainer on N-BEATS, the โ€œfirst pure deep learning approach that outperformed well-established statistical approachesโ€ for time-series forecasting tasks.

Build a Data Science Portfolio Website with ChatGPT: Complete Tutorial
In a competitive job market, data scientists canโ€™t afford to be coy about their achievements and expertise. A portfolio website can be a powerful way to showcase both, and Natassha Selvarajโ€™s patient guide demonstrates how you can build one from scratch with the help of generative-AI tools.

A Complete Guide to BERT with Code
Why not take a step back from the latest buzzy model to learn about those precursors that made todayโ€™s innovations possible? Bradney Smith invites us to go all the way back to 2018 (or several decades ago, in AI time) to gain a deep understanding of the groundbreaking BERT (Bidirectional Encoder Representations from Transformers) model.

Why LLMs Are Not Good for Coding โ€” Part II
Back in the present day, we keep hearing about the imminent obsolescence of programmers as LLMs continue to improve. Andrea Valenzuelaโ€™s latest article serves as a helpful โ€œnot so fast!โ€ interjection, as she focuses on their inherent limitations when it comes to staying up-to-date with the latest libraries and code functionalities.

PCA & K-Means for Traffic Data in Python
What better way to round out our monthly selection than with a hands-on tutorial on a core data science workflow? In her debut TDS post, Beth Ou Yang walks us through a real-world exampleโ€”traffic data from Taiwan, in this caseโ€”of using principle component analysis (PCA) and K-means clustering.
๐Ÿ‘3โค2
Data Analysis Skills
๐Ÿ‘4โค2
Data Science in health care
๐Ÿ‘10
Forwarded from Data visualization
Data Analyst Skills Required by Employers
๐Ÿ‘3โค2
12 Fundamental Math Theories Needed to Understand AI

1. Curse of Dimensionality
This phenomenon occurs when analyzing data in high-dimensional spaces. As dimensions increase, the volume of the space grows exponentially, making it challenging for algorithms to identify meaningful patterns due to the sparse nature of the data.
2. Law of Large Numbers
A cornerstone of statistics, this theorem states that as a sample size grows, its mean will converge to the expected value. This principle assures that larger datasets yield more reliable estimates, making it vital for statistical learning methods.
3. Central Limit Theorem
This theorem posits that the distribution of sample means will approach a normal distribution as the sample size increases, regardless of the original distribution. Understanding this concept is crucial for making inferences in machine learning.
4. Bayesโ€™ Theorem
A fundamental concept in probability theory, Bayesโ€™ Theorem explains how to update the probability of your belief based on new evidence. It is the backbone of Bayesian inference methods used in AI.
5. Overfitting and Underfitting
Overfitting occurs when a model learns the noise in training data, while underfitting happens when a model is too simplistic to capture the underlying patterns. Striking the right balance is essential for effective modeling and performance.
6. Gradient Descent
This optimization algorithm is used to minimize the loss function in machine learning models. A solid understanding of gradient descent is key to fine-tuning neural networks and AI models.
7. Information Theory
Concepts like entropy and mutual information are vital for understanding data compression and feature selection in machine learning, helping to improve model efficiency.
8. Markov Decision Processes (MDP)
MDPs are used in reinforcement learning to model decision-making scenarios where outcomes are partly random and partly under the control of a decision-maker. This framework is crucial for developing effective AI agents.
9. Game Theory
Old school AI is based off game theory. This theory provides insights into multi-agent systems and strategic interactions among agents, particularly relevant in reinforcement learning and competitive environments.
10. Statistical Learning Theory
This theory is the foundation of regression, regularization and classification. It addresses the relationship between data and learning algorithms, focusing on the theoretical aspects that govern how models learn from data and make predictions.
11. Hebbian Theory
This theory is the basis of neural networks, โ€œNeurons that fire together, wire togetherโ€. Its a biology theory on how learning is done on a cellular level, and as you would have it โ€” Neural Networks are based off this theory.
12. Convolution (Kernel)
Not really a theory and you donโ€™t need to fully understand it, but this is the mathematical process on how masks work in image processing. Convolution matrix is used to combine two matrixes and describes the overlap.

Special thanks to Jiji Veronica Kim for this list.


โž–โž–โž–โž–โž–โž–โž–โž–โž–โž–โž–โž–โž–โž–
Join @datascience_bds for more cool repositories.
*This channel belongs to @bigdataspecialist group
๐Ÿ‘9โค2
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ‘4โค2๐Ÿ”ฅ2๐Ÿ‘Ž1
Salaries of In-demand data science jobs
โค12
streamlit

Streamlit โ€” A faster way to build and share data apps.

Creator: Streamlit
Stars โญ๏ธ: 35.4k
Forked By: 3.1k
https://github.com/streamlit/streamlit

#datascience
โž–โž–โž–โž–โž–โž–โž–โž–โž–โž–โž–โž–โž–โž–
Join @datascience_bds for more cool repositories.
*This channel belongs to @bigdataspecialist group
๐Ÿ‘2
Essential Machine Learning Algorithms for Data Scientists

Master essential machine learning algorithms and elevate your data science skills

Rating โญ๏ธ: 4.6 out 5
Students ๐Ÿ‘จโ€๐ŸŽ“ : 791
Duration โฐ : 43min of on-demand video
Created by ๐Ÿ‘จโ€๐Ÿซ: Arunkumar Krishnan

๐Ÿ”— Course Link

#ml #algorithm
โž–โž–โž–โž–โž–โž–โž–โž–โž–โž–โž–โž–โž–โž–
๐Ÿ‘‰Join @datascience_bds for more๐Ÿ‘ˆ
๐Ÿ‘3
Forecasting vs. Predictive Analytics: The Obama Example
Analytics can influence elections, not just predict them. This article explores how the Obama campaign used predictive analytics to outmaneuver traditional forecasting.

Forecasting vs. Predictive Analytics
Nate Silverโ€™s forecasting predicted state outcomes, while Obamaโ€™s team used predictive analytics to score individual voters, targeting those most likely to be persuaded.

Impact of Predictive Analytics
The Obama campaign optimized interactions, avoiding โ€œdo-not-disturbโ€ voters and improving ad spending effectiveness by 18%.

Conclusion
Predictive analytics enables organizations to shape outcomes through personalized insights, distinguishing it from forecastingโ€™s broad predictions.
๐Ÿ‘5โค3
Data Science Trends in 2024
๐Ÿ‘5๐Ÿ”ฅ4
Characteristics of a Data whisperer
๐Ÿ‘7โค3
Top 10 Data Libraries for Python
๐Ÿ‘8