Data Science Portfolio - Kaggle Datasets & AI Projects | Artificial Intelligence
37.6K subscribers
287 photos
76 files
339 links
Free Datasets For Data Science Projects & Portfolio

Buy ads: https://telega.io/c/DataPortfolio

For Promotions/ads: @coderfun @love_data
Download Telegram
Kandinsky 5.0 Video Lite and Kandinsky 5.0 Video Pro generative models on the global text-to-video landscape

๐Ÿ”˜Pro is currently the #1 open-source model worldwide
๐Ÿ”˜Lite (2B parameters) outperforms Sora v1.
๐Ÿ”˜Only Google (Veo 3.1, Veo 3), OpenAI (Sora 2), Alibaba (Wan 2.5), and KlingAI (Kling 2.5, 2.6) outperform Pro โ€” these are objectively the strongest video generation models in production today. We are on par with Luma AI (Ray 3) and MiniMax (Hailuo 2.3): the maximum ELO gap is 3 points, with a 95% CI of ยฑ21.

Useful links
๐Ÿ”˜Full leaderboard: LM Arena
๐Ÿ”˜Kandinsky 5.0 details: technical report
๐Ÿ”˜Open-source Kandinsky 5.0: GitHub and Hugging Face
โค2๐Ÿ‘2
How to send follow up email to a recruiter ๐Ÿ‘‡๐Ÿ‘‡

Dear [Recruiterโ€™s Name],

I hope this email finds you doing well. I wanted to take a moment to express my sincere gratitude for the time and consideration you have given me throughout the recruitment process for the [position] role at [company].

I understand that you must be extremely busy and receive countless applications, so I wanted to reach out and follow up on the status of my application. If itโ€™s not too much trouble, could you kindly provide me with any updates or feedback you may have?

I want to assure you that I remain genuinely interested in the opportunity to join the team at [company] and I would be honored to discuss my qualifications further. If there are any additional materials or information you require from me, please donโ€™t hesitate to let me know.

Thank you for your time and consideration. I appreciate the effort you put into recruiting and look forward to hearing from you soon.


Warmest regards,

(Tap to copy)
โค11
The Shift in Data Analyst Roles: What You Should Apply for in 2025

The traditional โ€œData Analystโ€ title is gradually declining in demand in 2025 not because data is any less important, but because companies are getting more specific in what theyโ€™re looking for.

Today, many roles that were once grouped under โ€œData Analystโ€ are now split into more domain-focused titles, depending on the team or function they support.

Here are some roles gaining traction:
* Business Analyst
* Product Analyst
* Growth Analyst
* Marketing Analyst
* Financial Analyst
* Operations Analyst
* Risk Analyst
* Fraud Analyst
* Healthcare Analyst
* Technical Analyst
* Business Intelligence Analyst
* Decision Support Analyst
* Power BI Developer
* Tableau Developer

Focus on the skillsets and business context these roles demand.

Whether you're starting out or transitioning, look beyond "Data Analyst" and align your profile with industry-specific roles. Itโ€™s not about the titleโ€”itโ€™s about the value you bring to a team.
โค12๐Ÿ‘1๐Ÿ”ฅ1
โœ… Data Analyst Mock Interview Questions with Answers ๐Ÿ“Š๐ŸŽฏ

1๏ธโƒฃ Q: Explain the difference between a primary key and a foreign key.
A:
โ€ข Primary Key: Uniquely identifies each record in a table; cannot be null.
โ€ข Foreign Key: A field in one table that refers to the primary key of another table; establishes a relationship between the tables.

2๏ธโƒฃ Q: What is the difference between WHERE and HAVING clauses in SQL?
A:
โ€ข WHERE: Filters rows before grouping.
โ€ข HAVING: Filters groups after aggregation (used with GROUP BY).

3๏ธโƒฃ Q: How do you handle missing values in a dataset?
A: Common techniques include:
โ€ข Imputation: Replacing missing values with mean, median, mode, or a constant.
โ€ข Removal: Removing rows or columns with too many missing values.
โ€ข Using algorithms that handle missing data: Some machine learning algorithms can handle missing values natively.

4๏ธโƒฃ Q: What is the difference between a line chart and a bar chart, and when would you use each?
A:
โ€ข Line Chart: Shows trends over time or continuous values.
โ€ข Bar Chart: Compares discrete categories or values.
โ€ข Use a line chart to show sales trends over months; use a bar chart to compare sales across different product categories.

5๏ธโƒฃ Q: Explain what a p-value is and its significance.
A: The p-value is the probability of obtaining results as extreme as, or more extreme than, the observed results, assuming the null hypothesis is true. A small p-value (typically โ‰ค 0.05) indicates strong evidence against the null hypothesis.

6๏ธโƒฃ Q: How would you deal with outliers in a dataset?
A:
โ€ข Identify Outliers: Using box plots, scatter plots, or statistical methods (e.g., Z-score).
โ€ข Treatment:
โ€ข Remove Outliers: If they are due to errors or anomalies.
โ€ข Transform Data: Using techniques like log transformation.
โ€ข Keep Outliers: If they represent genuine data points and provide valuable insights.

7๏ธโƒฃ Q: What are the different types of joins in SQL?
A:
โ€ข INNER JOIN: Returns rows only when there is a match in both tables.
โ€ข LEFT JOIN (or LEFT OUTER JOIN): Returns all rows from the left table, and the matching rows from the right table. If there is no match, the right side will contain NULL values.
โ€ข RIGHT JOIN (or RIGHT OUTER JOIN): Returns all rows from the right table, and the matching rows from the left table. If there is no match, the left side will contain NULL values.
โ€ข FULL OUTER JOIN: Returns all rows from both tables, filling in NULLs when there is no match.

8๏ธโƒฃ Q: How would you approach a data analysis project from start to finish?
A:
โ€ข Define the Problem: Understand the business question you're trying to answer.
โ€ข Collect Data: Gather relevant data from various sources.
โ€ข Clean and Preprocess Data: Handle missing values, outliers, and inconsistencies.
โ€ข Explore and Analyze Data: Use statistical methods and visualizations to identify patterns.
โ€ข Draw Conclusions and Make Recommendations: Summarize your findings and provide actionable insights.
โ€ข Communicate Results: Present your analysis to stakeholders.

๐Ÿ‘ Tap โค๏ธ for more!
โค11๐Ÿ‘2๐Ÿ”ฅ1
The best way to learn data analytics skills is to:

1. Watch a tutorial

2. Immediately practice what you just learned

3. Do projects to apply your learning to real-life applications

If you only watch videos and never practice, you wonโ€™t retain any of your teaching.

If you never apply your learning with projects, you wonโ€™t be able to solve problems on the job. (You also will have a much harder time attracting recruiters without a recruiter.)
โค6
๐—Ÿ๐—ฒ๐—ฎ๐—ฟ๐—ป ๐——๐—ฎ๐˜๐—ฎ ๐—ฆ๐—ฐ๐—ถ๐—ฒ๐—ป๐—ฐ๐—ฒ ๐—ณ๐—ผ๐—ฟ ๐—™๐—ฅ๐—˜๐—˜ (๐—ก๐—ผ ๐—ฆ๐˜๐—ฟ๐—ถ๐—ป๐—ด๐˜€ ๐—”๐˜๐˜๐—ฎ๐—ฐ๐—ต๐—ฒ๐—ฑ)

๐—ก๐—ผ ๐—ณ๐—ฎ๐—ป๐—ฐ๐˜† ๐—ฐ๐—ผ๐˜‚๐—ฟ๐˜€๐—ฒ๐˜€, ๐—ป๐—ผ ๐—ฐ๐—ผ๐—ป๐—ฑ๐—ถ๐˜๐—ถ๐—ผ๐—ป๐˜€, ๐—ท๐˜‚๐˜€๐˜ ๐—ฝ๐˜‚๐—ฟ๐—ฒ ๐—น๐—ฒ๐—ฎ๐—ฟ๐—ป๐—ถ๐—ป๐—ด.

๐—›๐—ฒ๐—ฟ๐—ฒโ€™๐˜€ ๐—ต๐—ผ๐˜„ ๐˜๐—ผ ๐—ฏ๐—ฒ๐—ฐ๐—ผ๐—บ๐—ฒ ๐—ฎ ๐——๐—ฎ๐˜๐—ฎ ๐—ฆ๐—ฐ๐—ถ๐—ฒ๐—ป๐˜๐—ถ๐˜€๐˜ ๐—ณ๐—ผ๐—ฟ ๐—™๐—ฅ๐—˜๐—˜:

1๏ธโƒฃ Python Programming for Data Science โ†’ Harvardโ€™s CS50P
The best intro to Python for absolute beginners:
โ†ฌ Covers loops, data structures, and practical exercises.
โ†ฌ Designed to help you build foundational coding skills.

Link: https://cs50.harvard.edu/python/

https://t.me/datasciencefun

2๏ธโƒฃ Statistics & Probability โ†’ Khan Academy
Want to master probability, distributions, and hypothesis testing? This is where to start:
โ†ฌ Clear, beginner-friendly videos.
โ†ฌ Exercises to test your skills.

Link: https://www.khanacademy.org/math/statistics-probability

https://whatsapp.com/channel/0029Vat3Dc4KAwEcfFbNnZ3O

3๏ธโƒฃ Linear Algebra for Data Science โ†’ 3Blue1Brown
โ†ฌ Learn about matrices, vectors, and transformations.
โ†ฌ Essential for machine learning models.

Link: https://www.youtube.com/playlist?list=PLZHQObOWTQDMsr9KzVk3AjplI5PYPxkUr

4๏ธโƒฃ SQL Basics โ†’ Mode Analytics
SQL is the backbone of data manipulation. This tutorial covers:
โ†ฌ Writing queries, joins, and filtering data.
โ†ฌ Real-world datasets to practice.

Link: https://mode.com/sql-tutorial

https://whatsapp.com/channel/0029VanC5rODzgT6TiTGoa1v

5๏ธโƒฃ Data Visualization โ†’ freeCodeCamp
Learn to create stunning visualizations using Python libraries:
โ†ฌ Covers Matplotlib, Seaborn, and Plotly.
โ†ฌ Step-by-step projects included.

Link: https://www.youtube.com/watch?v=JLzTJhC2DZg

https://whatsapp.com/channel/0029VaxaFzoEQIaujB31SO34

6๏ธโƒฃ Machine Learning Basics โ†’ Googleโ€™s Machine Learning Crash Course
An in-depth introduction to machine learning for beginners:
โ†ฌ Learn supervised and unsupervised learning.
โ†ฌ Hands-on coding with TensorFlow.

Link: https://developers.google.com/machine-learning/crash-course

7๏ธโƒฃ Deep Learning โ†’ Fast.aiโ€™s Free Course
Fast.ai makes deep learning easy and accessible:
โ†ฌ Build neural networks with PyTorch.
โ†ฌ Learn by coding real projects.

Link: https://course.fast.ai/

8๏ธโƒฃ Data Science Projects โ†’ Kaggle
โ†ฌ Compete in challenges to practice your skills.
โ†ฌ Great way to build your portfolio.

Link: https://www.kaggle.com/
โค11๐Ÿ”ฅ2
๐Ÿ”ฐ Python program to convert text to speech
โค8
โš ๏ธ Mistakes Beginners Repeat for Years

โŒ Ignoring fundamentals
โŒ Copy-pasting without understanding
โŒ Overusing frameworks
โŒ Avoiding debugging
โŒ Skipping tests
โŒ Fear of refactoring

React ๐Ÿงก if you want more of this type of content

#techinfo
โค15๐Ÿ”ฅ1
โœ… GitHub Profile Tips for Data Analysts ๐ŸŒ๐Ÿ’ผ

Your GitHub is more than code โ€” itโ€™s your digital resume. Here's how to make it stand out:

1๏ธโƒฃ Clean README (Profile)
โ€ข Add your name, title & tools
โ€ข Short about section
โ€ข Include: skills, top projects, certificates, contact
โœ… Example:
โ€œHi, Iโ€™m Rahul โ€“ a Data Analyst skilled in SQL, Python & Power BI.โ€

2๏ธโƒฃ Pin Your Best Projects
โ€ข Show 3โ€“6 strong repos
โ€ข Add clear README for each project:
- What it does
- Tools used
- Screenshots or demo links
โœ… Bonus: Include real data or visuals

3๏ธโƒฃ Use Commits & Contributions
โ€ข Contribute regularly
โ€ข Avoid empty profiles
โœ… Daily commits > 1 big push once a month

4๏ธโƒฃ Upload Resume Projects
โ€ข Excel dashboards
โ€ข SQL queries
โ€ข Python notebooks (Jupyter)
โ€ข BI project links (Power BI/Tableau public)

5๏ธโƒฃ Add Descriptions & Tags
โ€ข Use repo tags: sql, python, EDA, dashboard
โ€ข Write short project summary in repo description

๐Ÿง  Tips:
โ€ข Push only clean, working code
โ€ข Use folders, not messy files
โ€ข Update your profile bio with your LinkedIn

๐Ÿ“Œ Practice Task:
Upload your latest project โ†’ Write a README โ†’ Pin it to your profile

๐Ÿ’ฌ Tap โค๏ธ for more!
โค12
๐Ÿšจ Anthropic dropped a FREE 33-page playbook revealing Claude's very own cheat code:

The 'Skills' folder.

Spend 30 minutes building it,
and youโ€™ll never have to explain your process again.

Top-tier users don't just type commands, they build systems.

Grab your free copy of Anthropic's official guide to building Claude skills right here: https://resources.anthropic.com/hubfs/The-Complete-Guide-to-Building-Skill-for-Claude.pdf
โค9
๐Ÿ“ข Advertising in this channel

You can place an ad via Telegaโ€คio. It takes just a few minutes.

Formats and current rates: View details
โœ… Useful Platform to Practice SQL Programming ๐Ÿง ๐Ÿ–ฅ๏ธ

Learning SQL is just the first step โ€” practice is what builds real skill. Here are the best platforms for hands-on SQL:

1๏ธโƒฃ LeetCode โ€“ For Interview-Oriented SQL Practice
โ€ข Focus: Real interview-style problems
โ€ข Levels: Easy to Hard
โ€ข Schema + Sample Data Provided
โ€ข Great for: Data Analyst, Data Engineer, FAANG roles
โœ” Tip: Start with Easy โ†’ filter by โ€œDatabaseโ€ tag
โœ” Popular Section: Database โ†’ Top 50 SQL Questions
Example Problem: โ€œFind duplicate emails in a user tableโ€ โ†’ Practice filtering, GROUP BY, HAVING

2๏ธโƒฃ HackerRank โ€“ Structured & Beginner-Friendly
โ€ข Focus: Step-by-step SQL track
โ€ข Has certification tests (SQL Basic, Intermediate)
โ€ข Problem sets by topic: SELECT, JOINs, Aggregations, etc.
โœ” Tip: Follow the full SQL track
โœ” Bonus: Company-specific challenges
Try: โ€œRevising Aggregations โ€“ The Count Functionโ€ โ†’ Build confidence with small wins

3๏ธโƒฃ Mode Analytics โ€“ Real-World SQL in Business Context
โ€ข Focus: Business intelligence + SQL
โ€ข Uses real-world datasets (e.g., e-commerce, finance)
โ€ข Has an in-browser SQL editor with live data
โœ” Best for: Practicing dashboard-level queries
โœ” Tip: Try the SQL case studies & tutorials

4๏ธโƒฃ StrataScratch โ€“ Interview Questions from Real Companies
โ€ข 500+ problems from companies like Uber, Netflix, Google
โ€ข Split by company, difficulty, and topic
โœ” Best for: Intermediate to advanced level
โœ” Tip: Try โ€œHardโ€ questions after doing 30โ€“50 easy/medium

5๏ธโƒฃ DataLemur โ€“ Short, Practical SQL Problems
โ€ข Crisp and to the point
โ€ข Good UI, fast learning
โ€ข Real interview-style logic
โœ” Use when: You want fast, smart SQL drills

๐Ÿ“Œ How to Practice Effectively:
โ€ข Spend 20โ€“30 mins/day
โ€ข Focus on JOINs, GROUP BY, HAVING, Subqueries
โ€ข Analyze problem โ†’ write โ†’ debug โ†’ re-write
โ€ข After solving, explain your logic out loud

๐Ÿงช Practice Task:
Try solving 5 SQL questions from LeetCode or HackerRank this week. Start with SELECT, WHERE, and GROUP BY.

๐Ÿ’ฌ Tap โค๏ธ for more!
โค5