Python for Data Analysts
47.7K subscribers
492 photos
64 files
318 links
Find top Python resources from global universities, cool projects, and learning materials for data analytics.

For promotions: @coderfun

Useful links: heylink.me/DataAnalytics
Download Telegram
Excel vs SQL vs Python (pandas):

1️⃣ Filtering Data
↳ Excel: =FILTER(A2:D100, B2:B100>50) (Excel 365 users)
↳ SQL: SELECT * FROM table WHERE column > 50;
↳ Python: df_filtered = df[df['column'] > 50]

2️⃣ Sorting Data
↳ Excel: Data → Sort (or =SORT(A2:A100, 1, TRUE))
↳ SQL: SELECT * FROM table ORDER BY column ASC;
↳ Python: df_sorted = df.sort_values(by="column")

3️⃣ Counting Rows
↳ Excel: =COUNTA(A:A)
↳ SQL: SELECT COUNT(*) FROM table;
↳ Python: row_count = len(df)

4️⃣ Removing Duplicates
↳ Excel: Data → Remove Duplicates
↳ SQL: SELECT DISTINCT * FROM table;
↳ Python: df_unique = df.drop_duplicates()

5️⃣ Joining Tables
↳ Excel: Power Query → Merge Queries (or VLOOKUP/XLOOKUP)
↳ SQL: SELECT * FROM table1 JOIN table2 ON table1.id = table2.id;
↳ Python: df_merged = pd.merge(df1, df2, on="id")

6️⃣ Ranking Data
↳ Excel: =RANK.EQ(A2, $A$2:$A$100)
↳ SQL: SELECT column, RANK() OVER (ORDER BY column DESC) AS rank FROM table;
↳ Python: df["rank"] = df["column"].rank(method="min", ascending=False)

7️⃣ Moving Average Calculation
↳ Excel: =AVERAGE(B2:B4) (manually for rolling window)
↳ SQL: SELECT date, AVG(value) OVER (ORDER BY date ROWS BETWEEN 2 PRECEDING AND CURRENT ROW) AS moving_avg FROM table;
↳ Python: df["moving_avg"] = df["value"].rolling(window=3).mean()

8️⃣ Running Total
↳ Excel: =SUM($B$2:B2) (drag down)
↳ SQL: SELECT date, SUM(value) OVER (ORDER BY date) AS running_total FROM table;
↳ Python: df["running_total"] = df["value"].cumsum()
👍5
𝗧𝗼𝗽 𝗣𝘆𝘁𝗵𝗼𝗻 𝗜𝗻𝘁𝗲𝗿𝘃𝗶𝗲𝘄 𝗤𝘂𝗲𝘀𝘁𝗶𝗼𝗻𝘀 𝗳𝗼𝗿 𝟮𝟬𝟮𝟱 — 𝗥𝗲𝗰𝗲𝗻𝘁𝗹𝘆 𝗔𝘀𝗸𝗲𝗱 𝗯𝘆 𝗠𝗡𝗖𝘀😍

📌 Preparing for Python Interviews in 2025?🗣

If you’re aiming for roles in data analysis, backend development, or automation, Python is your key weapon—and so is preparing with the right questions.💻✨️

𝐋𝐢𝐧𝐤👇:-

https://pdlink.in/3ZbAtrW

Crack your next Python interview✅️
👍2
Here is a powerful 𝗜𝗡𝗧𝗘𝗥𝗩𝗜𝗘𝗪 𝗧𝗜𝗣 to help you land a job!

Most people who are skilled enough would be able to clear technical rounds with ease.

But when it comes to 𝗯𝗲𝗵𝗮𝘃𝗶𝗼𝗿𝗮𝗹/𝗰𝘂𝗹𝘁𝘂𝗿𝗲 𝗳𝗶𝘁 rounds, some folks may falter and lose the potential offer.

Many companies schedule a behavioral round with a top-level manager in the organization to understand the culture fit (except for freshers).

One needs to clear this round to reach the salary negotiation round.

Here are some tips to clear such rounds:

1️⃣ Once the HR schedules the interview, try to find the LinkedIn profile of the interviewer using the name in their email ID.

2️⃣ Learn more about his/her past experiences and try to strike up a conversation on that during the interview.

3️⃣ This shows that you have done good research and also helps strike a personal connection.

4️⃣ Also, this is the round not just to evaluate if you're a fit for the company, but also to assess if the company is a right fit for you.

5️⃣ Hence, feel free to ask many questions about your role and company to get a clear understanding before taking the offer. This shows that you really care about the role you're getting into.

💡 𝗕𝗼𝗻𝘂𝘀 𝘁𝗶𝗽 - Be polite yet assertive in such interviews. It impresses a lot of senior folks.
👍4
𝟱 𝗙𝗿𝗲𝗲 𝗠𝗜𝗧 𝗣𝗿𝗼𝗴𝗿𝗮𝗺𝗺𝗶𝗻𝗴 𝗖𝗼𝘂𝗿𝘀𝗲𝘀 𝗧𝗵𝗮𝘁 𝗘𝘃𝗲𝗿𝘆 𝗕𝗲𝗴𝗶𝗻𝗻𝗲𝗿 𝗦𝗵𝗼𝘂𝗹𝗱 𝗦𝘁𝗮𝗿𝘁 𝗪𝗶𝘁𝗵😍

💻 Want to Learn Coding but Don’t Know Where to Start?🎯

Whether you’re a student, career switcher, or complete beginner, this curated list is your perfect launchpad into tech💻🚀

𝐋𝐢𝐧𝐤👇:-

https://pdlink.in/437ow7Y

All The Best 🎊
👍1
𝟳 𝗙𝗥𝗘𝗘 𝗖𝗼𝘂𝗿𝘀𝗲𝘀 𝗧𝗼 𝗠𝗮𝘀𝘁𝗲𝗿 𝗧𝗲𝗰𝗵 𝗦𝗸𝗶𝗹𝗹𝘀 𝗜𝗻 𝟮𝟬𝟮𝟱 😍

If you dream of a tech career but don’t want to break the bank, you’re in the right place.

These 7 hand-picked resources are free and help you build real, job-ready skills—from web development to machine learning and AI.

𝐋𝐢𝐧𝐤 👇:-

https://pdlink.in/4j1lqbJ

Enroll for FREE & Get Certified 🎓
👍2
Deloitte Recent Data Analyst Interview Questions Part-1
👍21
Deloitte Recent Data Analyst Interview Questions Part-2
1👍1
𝟱 𝗙𝗿𝗲𝗲 𝗠𝗜𝗧 𝗗𝗮𝘁𝗮 𝗔𝗻𝗮𝗹𝘆𝘁𝗶𝗰𝘀 𝗖𝗼𝘂𝗿𝘀𝗲𝘀 𝗧𝗵𝗮𝘁 𝗪𝗶𝗹𝗹 𝗕𝗼𝗼𝘀𝘁 𝗬𝗼𝘂𝗿 𝗖𝗮𝗿𝗲𝗲𝗿😍

📊 Want to Learn Data Analytics but Hate the High Price Tags?💰📌

Good news: MIT is offering free, high-quality data analytics courses through their OpenCourseWare platform💻🎯

𝐋𝐢𝐧𝐤👇:-

https://pdlink.in/4iXNfS3

All The Best 🎊
👍2
WhatsApp is no longer a platform just for chat.

It's an educational goldmine.

If you do, you’re sleeping on a goldmine of knowledge and community. WhatsApp channels are a great way to practice data science, make your own community, and find accountability partners.

I have curated the list of best WhatsApp channels to learn coding & data science for FREE

Free Courses with Certificate
👇👇
https://whatsapp.com/channel/0029Vamhzk5JENy1Zg9KmO2g

Jobs & Internship Opportunities
👇👇
https://whatsapp.com/channel/0029VaI5CV93AzNUiZ5Tt226

Web Development
👇👇
https://whatsapp.com/channel/0029VaiSdWu4NVis9yNEE72z

Python Free Books & Projects
👇👇
https://whatsapp.com/channel/0029VaiM08SDuMRaGKd9Wv0L

Java Free Resources
👇👇
https://whatsapp.com/channel/0029VamdH5mHAdNMHMSBwg1s

Coding Interviews
👇👇
https://whatsapp.com/channel/0029VammZijATRSlLxywEC3X

SQL For Data Analysis
👇👇
https://whatsapp.com/channel/0029VanC5rODzgT6TiTGoa1v

Power BI Resources
👇👇
https://whatsapp.com/channel/0029Vai1xKf1dAvuk6s1v22c

Programming Free Resources
👇👇
https://whatsapp.com/channel/0029VahiFZQ4o7qN54LTzB17

Data Science Projects
👇👇
https://whatsapp.com/channel/0029Va4QUHa6rsQjhITHK82y

Learn Data Science & Machine Learning
👇👇
https://whatsapp.com/channel/0029Va8v3eo1NCrQfGMseL2D

Coding Projects
👇👇
https://whatsapp.com/channel/0029VamhFMt7j6fx4bYsX908

Excel for Data Analyst
👇👇
https://whatsapp.com/channel/0029VaifY548qIzv0u1AHz3i

ENJOY LEARNING 👍👍
👍41
𝗙𝗥𝗘𝗘 𝗖𝗲𝗿𝘁𝗶𝗳𝗶𝗰𝗮𝘁𝗶𝗼𝗻𝘀 𝗙𝗿𝗼𝗺 𝗧𝗼𝗽 𝗖𝗼𝗺𝗽𝗮𝗻𝗶𝗲𝘀😍

Top Companies Offering FREE Certification Courses To Upskill In 2025 

Google:- https://pdlink.in/3YsujTV

Microsoft :- https://pdlink.in/4jpmI0I

Cisco :- https://pdlink.in/4fYr1xO

HP :- https://pdlink.in/3DrNsxI

IBM :- https://pdlink.in/44GsWoC

Qualc :- https://pdlink.in/3YrFTyK

TCS :- https://pdlink.in/4cHavCa

Infosys :- https://pdlink.in/4jsHZXf

Enroll For FREE & Get Certified 🎓
👍3
𝐖𝐡𝐲 𝐄𝐯𝐞𝐫𝐲 𝐃𝐚𝐭𝐚 𝐀𝐧𝐚𝐥𝐲𝐬𝐭 & 𝐃𝐚𝐭𝐚 𝐒𝐜𝐢𝐞𝐧𝐭𝐢𝐬𝐭 𝐒𝐡𝐨𝐮𝐥𝐝 𝐌𝐚𝐬𝐭𝐞𝐫 𝐏𝐚𝐧𝐝𝐚𝐬

When it comes to data analysis and machine learning, Pandas is non-negotiable. It’s the 𝐟𝐨𝐮𝐧𝐝𝐚𝐭𝐢𝐨𝐧 𝐨𝐟 𝐝𝐚𝐭𝐚 𝐦𝐚𝐧𝐢𝐩𝐮𝐥𝐚𝐭𝐢𝐨𝐧 𝐢𝐧 𝐏𝐲𝐭𝐡𝐨𝐧, turning messy datasets into meaningful insights — and that’s exactly what makes it a 𝐠𝐚𝐦𝐞-𝐜𝐡𝐚𝐧𝐠𝐞𝐫 in real-world projects.

Recently, I explored an in-depth guide on 𝐏𝐚𝐧𝐝𝐚𝐬 𝐟𝐫𝐨𝐦 𝐁𝐚𝐬𝐢𝐜𝐬 𝐭𝐨 𝐀𝐝𝐯𝐚𝐧𝐜𝐞𝐝, and here’s what stood out:-

- Use len() to analyze string data (e.g., name lengths in the Titanic dataset).
- Create pivot tables for grouped insights (like finding top batting averages per team).
- Simplify categories (e.g., replacing “male”/“female” with “M”/“F”).
- Merge and join datasets seamlessly, even with missing values.

𝐇𝐞𝐫𝐞’𝐬 𝐰𝐡𝐲 𝐏𝐚𝐧𝐝𝐚𝐬 𝐢𝐬 𝐜𝐫𝐢𝐭𝐢𝐜𝐚𝐥 𝐢𝐧 𝐃𝐚𝐭𝐚 𝐀𝐧𝐚𝐥𝐲𝐬𝐢𝐬 & 𝐃𝐚𝐭𝐚 𝐒𝐜𝐢𝐞𝐧𝐜𝐞:

- 𝐃𝐚𝐭𝐚 𝐂𝐥𝐞𝐚𝐧𝐢𝐧𝐠:- Handle missing values, duplicates, and inconsistent formats.
- 𝐄𝐱𝐩𝐥𝐨𝐫𝐚𝐭𝐨𝐫𝐲 𝐃𝐚𝐭𝐚 𝐀𝐧𝐚𝐥𝐲𝐬𝐢𝐬 (𝐄𝐃𝐀):- Quickly summarize patterns and anomalies.
- 𝐅𝐞𝐚𝐭𝐮𝐫𝐞 𝐄𝐧𝐠𝐢𝐧𝐞𝐞𝐫𝐢𝐧𝐠:- Create meaningful features to improve model performance.
- 𝐃𝐚𝐭𝐚 𝐈𝐧𝐭𝐞𝐠𝐫𝐚𝐭𝐢𝐨𝐧:- Combine multiple data sources with ease.
- 𝐓𝐢𝐦𝐞 𝐒𝐞𝐫𝐢𝐞𝐬 𝐒𝐮𝐩𝐩𝐨𝐫𝐭:- Ideal for forecasting and trend analysis.

In short — 𝐏𝐚𝐧𝐝𝐚𝐬 𝐭𝐫𝐚𝐧𝐬𝐟𝐨𝐫𝐦𝐬 𝐫𝐚𝐰 𝐝𝐚𝐭𝐚 𝐢𝐧𝐭𝐨 𝐚𝐜𝐭𝐢𝐨𝐧𝐚𝐛𝐥𝐞 𝐢𝐧𝐬𝐢𝐠𝐡𝐭𝐬.

If you’re learning Python for ML or analytics, make Pandas your priority.

👍 𝗟𝗶𝗸𝗲 for more such content.
👍4
𝗖𝗶𝘀𝗰𝗼 𝗙𝗥𝗘𝗘 𝗣𝗿𝗼𝗳𝗲𝘀𝘀𝗶𝗼𝗻𝗮𝗹 𝗖𝗲𝗿𝘁𝗶𝗳𝗶𝗰𝗮𝘁𝗶𝗼𝗻 𝗖𝗼𝘂𝗿𝘀𝗲𝘀 😍

 Stand out in the competitive job market.Cisco Networking Academy has you covered with free courses designed to enhance your professional skills.

 Learn the Most In-Demand Skills:
 Perfect for Everyone
 Earn Recognized Certificates

𝗟𝗶𝗻𝗸👇:- 

https://pdlink.in/3PeiTOW

Enroll for FREE & Get Certified 🎓
👍1
Essential Pandas Functions for Data Analysis

Data Loading:

pd.read_csv() - Load data from a CSV file.

pd.read_excel() - Load data from an Excel file.


Data Inspection:

df.head(n) - View the first n rows.

df.info() - Get a summary of the dataset.

df.describe() - Generate summary statistics.


Data Manipulation:

df.drop(columns=['col1', 'col2']) - Remove specific columns.

df.rename(columns={'old_name': 'new_name'}) - Rename columns.

df['col'] = df['col'].apply(func) - Apply a function to a column.


Filtering and Sorting:

df[df['col'] > value] - Filter rows based on a condition.

df.sort_values(by='col', ascending=True) - Sort rows by a column.


Aggregation:

df.groupby('col').sum() - Group data and compute the sum.

df['col'].value_counts() - Count unique values in a column.


Merging and Joining:

pd.merge(df1, df2, on='key') - Merge two DataFrames.

pd.concat([df1, df2]) - Concatenate

Here you can find essential Python Interview Resources👇
https://whatsapp.com/channel/0029VaGgzAk72WTmQFERKh02

Like this post for more resources like this 👍♥️

Share with credits: https://t.me/sqlspecialist

Hope it helps :)
👍2
Python is a popular programming language in the field of data analysis due to its versatility, ease of use, and extensive libraries for data manipulation, visualization, and analysis. Here are some key Python skills that are important for data analysts:

1. Basic Python Programming: Understanding basic Python syntax, data types, control structures, functions, and object-oriented programming concepts is essential for data analysis in Python.

2. NumPy: NumPy is a fundamental package for scientific computing in Python. It provides support for large multidimensional arrays and matrices, along with a collection of mathematical functions to operate on these arrays.

3. Pandas: Pandas is a powerful library for data manipulation and analysis in Python. It provides data structures like DataFrames and Series that make it easy to work with structured data and perform tasks such as filtering, grouping, joining, and reshaping data.

4. Matplotlib and Seaborn: Matplotlib is a versatile library for creating static, interactive, and animated visualizations in Python. Seaborn is built on top of Matplotlib and provides a higher-level interface for creating attractive statistical graphics.

5. Scikit-learn: Scikit-learn is a popular machine learning library in Python that provides tools for building predictive models, performing clustering and classification tasks, and evaluating model performance.

6. Jupyter Notebooks: Jupyter Notebooks are an interactive computing environment that allows you to create and share documents containing live code, equations, visualizations, and narrative text. They are commonly used by data analysts for exploratory data analysis and sharing insights.

7. SQLAlchemy: SQLAlchemy is a Python SQL toolkit and Object-Relational Mapping (ORM) library that provides a high-level interface for interacting with relational databases using Python.

8. Regular Expressions: Regular expressions (regex) are powerful tools for pattern matching and text processing in Python. They are useful for extracting specific information from text data or performing data cleaning tasks.

9. Data Visualization Libraries: In addition to Matplotlib and Seaborn, data analysts may also use other visualization libraries like Plotly, Bokeh, or Altair to create interactive visualizations in Python.

10. Web Scraping: Knowledge of web scraping techniques using libraries like BeautifulSoup or Scrapy can be useful for collecting data from websites for analysis.

By mastering these Python skills and applying them to real-world data analysis projects, you can enhance your proficiency as a data analyst and unlock new opportunities in the field.
3👍2
𝟱 𝗙𝗿𝗲𝗲 𝗗𝗮𝘁𝗮 𝗔𝗻𝗮𝗹𝘆𝘁𝗶𝗰𝘀 𝗖𝗼𝘂𝗿𝘀𝗲𝘀 𝘁𝗼 𝗦𝗸𝘆𝗿𝗼𝗰𝗸𝗲𝘁 𝗬𝗼𝘂𝗿 𝗖𝗮𝗿𝗲𝗲𝗿 𝗶𝗻 𝟮𝟬𝟮𝟱😍

Whether you’re a beginner, career switcher, or just curious about data analytics, these 5 free online courses are your perfect starting point!🎯

𝐋𝐢𝐧𝐤👇:-

https://pdlink.in/3FdLMcv

Gain the skills to manage analytics projects✅️
👍2
Data Analyst INTERVIEW QUESTIONS AND ANSWERS
👇👇

1.Can you name the wildcards in Excel?

Ans: There are 3 wildcards in Excel that can ve used in formulas.

Asterisk (*) – 0 or more characters. For example, Ex* could mean Excel, Extra, Expertise, etc.

Question mark (?) – Represents any 1 character. For example, R?ain may mean Rain or Ruin.

Tilde (~) – Used to identify a wildcard character (~, *, ?). For example, If you need to find the exact phrase India* in a list. If you use India* as the search string, you may get any word with India at the beginning followed by different characters (such as Indian, Indiana). If you have to look for India” exclusively, use ~.

Hence, the search string will be india~*. ~ is used to ensure that the spreadsheet reads the following character as is, and not as a wildcard.


2.What is cascading filter in tableau?

Ans: Cascading filters can also be understood as giving preference to a particular filter and then applying other filters on previously filtered data source. Right-click on the filter you want to use as a main filter and make sure it is set as all values in dashboard then select the subsequent filter and select only relevant values to cascade the filters. This will improve the performance of the dashboard as you have decreased the time wasted in running all the filters over complete data source.


3.What is the difference between .twb and .twbx extension?

Ans:
A .twb file contains information on all the sheets, dashboards and stories, but it won’t contain any information regarding data source. Whereas .twbx file contains all the sheets, dashboards, stories and also compressed data sources. For saving a .twbx extract needs to be performed on the data source. If we forward .twb file to someone else than they will be able to see the worksheets and dashboards but won’t be able to look into the dataset.


4.What are the various Power BI versions?

Power BI Premium capacity-based license, for example, allows users with a free license to act on content in workspaces with Premium capacity. A user with a free license can only use the Power BI service to connect to data and produce reports and dashboards in My Workspace outside of Premium capacity. They are unable to exchange material or publish it in other workspaces. To process material, a Power BI license with a free or Pro per-user license only uses a shared and restricted capacity. Users with a Power BI Pro license can only work with other Power BI Pro users if the material is stored in that shared capacity. They may consume user-generated information, post material to app workspaces, share dashboards, and subscribe to dashboards and reports. Pro users can share material with users who don’t have a Power BI Pro subscription while workspaces are at Premium capacity.

ENJOY LEARNING 👍👍
👍6
Exploratory Data Analysis ( EDA)
👍42
Python for Data Analysis: Must-Know Libraries 👇👇

Python is one of the most powerful tools for Data Analysts, and these libraries will supercharge your data analysis workflow by helping you clean, manipulate, and visualize data efficiently.

🔥 Essential Python Libraries for Data Analysis:

Pandas – The go-to library for data manipulation. It helps in filtering, grouping, merging datasets, handling missing values, and transforming data into a structured format.

📌 Example: Loading a CSV file and displaying the first 5 rows:

import pandas as pd df = pd.read_csv('data.csv') print(df.head()) 


NumPy – Used for handling numerical data and performing complex calculations. It provides support for multi-dimensional arrays and efficient mathematical operations.

📌 Example: Creating an array and performing basic operations:

import numpy as np arr = np.array([10, 20, 30]) print(arr.mean()) # Calculates the average 


Matplotlib & Seaborn – These are used for creating visualizations like line graphs, bar charts, and scatter plots to understand trends and patterns in data.

📌 Example: Creating a basic bar chart:

import matplotlib.pyplot as plt plt.bar(['A', 'B', 'C'], [5, 7, 3]) plt.show() 


Scikit-Learn – A must-learn library if you want to apply machine learning techniques like regression, classification, and clustering on your dataset.

OpenPyXL – Helps in automating Excel reports using Python by reading, writing, and modifying Excel files.

💡 Challenge for You!
Try writing a Python script that:
1️⃣ Reads a CSV file
2️⃣ Cleans missing data
3️⃣ Creates a simple visualization

React with ♥️ if you want me to post the script for above challenge! ⬇️

Share with credits: https://t.me/sqlspecialist

Hope it helps :)
👍5👏1
𝟯𝟬+ 𝗙𝗿𝗲𝗲 𝗖𝗲𝗿𝘁𝗶𝗳𝗶𝗲𝗱 𝗖𝗼𝘂𝗿𝘀𝗲𝘀 𝗯𝘆 𝗛𝗣 𝗟𝗜𝗙𝗘 𝘁𝗼 𝗦𝘂𝗽𝗲𝗿𝗰𝗵𝗮𝗿𝗴𝗲 𝗬𝗼𝘂𝗿 𝗖𝗮𝗿𝗲𝗲𝗿😍

Whether you’re a student, jobseeker, aspiring entrepreneur, or working professional—HP LIFE offers the perfect opportunity to learn, grow, and earn certifications for free📊🚀

𝐋𝐢𝐧𝐤👇:-

https://pdlink.in/45ci02k

Join millions of learners worldwide who are already upgrading their skillsets through HP LIFE✅️
👍1