Excel Basics for Data Analytics
Excel sits at the start of most analysis work.
What you use Excel for
• Cleaning raw data
• Exploring patterns
• Quick summaries for teams
Core concepts you must know
• Data setup
– Freeze header row. View → Freeze Top Row.
– Convert range to table. Ctrl + T.
– Use proper headers. No merged cells. One value per cell.
• Data cleaning
– Remove duplicates. Data → Remove Duplicates.
– Trim extra spaces. =TRIM(A2)
– Convert text to numbers. =VALUE(A2)
– Fix date format. Format Cells → Date.
– Handle blanks. Filter blanks, fill or delete.
– Find and replace. Ctrl + H.
• Essential formulas
– Math and counts
▪ SUM. =SUM(A2:A100)
▪ AVERAGE. =AVERAGE(A2:A100)
▪ MIN. =MIN(A2:A100)
▪ MAX. =MAX(A2:A100)
▪ COUNT. Counts numbers.
▪ COUNTA. Counts non blanks.
▪ COUNTBLANK. Counts blanks.
– Conditional formulas
▪ IF. =IF(A2>5000,"High","Low")
▪ IFS. Multiple conditions.
▪ AND. =AND(A2>5000,B2="West")
▪ OR. =OR(A2>5000,A2<1000)
– Lookup formulas
▪ XLOOKUP. =XLOOKUP(A2,Sheet2!A:A,Sheet2!B:B)
▪ VLOOKUP. Old but common.
▪ INDEX + MATCH. Powerful alternative.
– Text formulas
▪ LEFT. =LEFT(A2,4)
▪ RIGHT. =RIGHT(A2,2)
▪ MID. =MID(A2,2,3)
▪ LEN. =LEN(A2)
▪ CONCAT or TEXTJOIN.
▪ LOWER, UPPER, PROPER.
– Date formulas
▪ TODAY. Current date.
▪ NOW. Date and time.
▪ YEAR, MONTH, DAY.
▪ DATEDIF. Date difference.
▪ EOMONTH. Month end.
• Sorting and filtering
– Sort by multiple columns.
– Filter by value, color, condition.
– Top 10 filter for quick insights.
• Conditional formatting
– Highlight duplicates.
– Color scales for trends.
– Rules for thresholds. Example. Sales > 10000 in green.
• Pivot tables
– Insert → PivotTable.
– Rows. Category or Product.
– Values. Sum, Count, Average.
– Filters. Date, Region.
– Refresh after data update.
• Charts you must know
– Column. Comparison.
– Bar. Ranking.
– Line. Trends over time.
– Pie. Share or percentage.
– Combo. Actual vs target.
• Data validation
– Dropdown list. Data → Data Validation → List.
– Prevent wrong entries.
• Useful shortcuts
– Ctrl + Arrow. Jump data.
– Ctrl + Shift + Arrow. Select range.
– Ctrl + 1. Format cells.
– Ctrl + L. Apply filter.
– Alt + =. Auto sum.
– Ctrl + Z / Y. Undo redo.
• Common analyst mistakes to avoid
– Merged cells.
– Hard coded totals.
– Mixed data types in one column.
– No backup before cleaning.
• Daily practice task
– Download any sales CSV.
– Clean it.
– Build one pivot table.
– Create one chart.
Excel Resources: https://whatsapp.com/channel/0029VaifY548qIzv0u1AHz3i
Data Analytics Roadmap: https://whatsapp.com/channel/0029VaGgzAk72WTmQFERKh02/1354
Double Tap ♥️ For More
Excel sits at the start of most analysis work.
What you use Excel for
• Cleaning raw data
• Exploring patterns
• Quick summaries for teams
Core concepts you must know
• Data setup
– Freeze header row. View → Freeze Top Row.
– Convert range to table. Ctrl + T.
– Use proper headers. No merged cells. One value per cell.
• Data cleaning
– Remove duplicates. Data → Remove Duplicates.
– Trim extra spaces. =TRIM(A2)
– Convert text to numbers. =VALUE(A2)
– Fix date format. Format Cells → Date.
– Handle blanks. Filter blanks, fill or delete.
– Find and replace. Ctrl + H.
• Essential formulas
– Math and counts
▪ SUM. =SUM(A2:A100)
▪ AVERAGE. =AVERAGE(A2:A100)
▪ MIN. =MIN(A2:A100)
▪ MAX. =MAX(A2:A100)
▪ COUNT. Counts numbers.
▪ COUNTA. Counts non blanks.
▪ COUNTBLANK. Counts blanks.
– Conditional formulas
▪ IF. =IF(A2>5000,"High","Low")
▪ IFS. Multiple conditions.
▪ AND. =AND(A2>5000,B2="West")
▪ OR. =OR(A2>5000,A2<1000)
– Lookup formulas
▪ XLOOKUP. =XLOOKUP(A2,Sheet2!A:A,Sheet2!B:B)
▪ VLOOKUP. Old but common.
▪ INDEX + MATCH. Powerful alternative.
– Text formulas
▪ LEFT. =LEFT(A2,4)
▪ RIGHT. =RIGHT(A2,2)
▪ MID. =MID(A2,2,3)
▪ LEN. =LEN(A2)
▪ CONCAT or TEXTJOIN.
▪ LOWER, UPPER, PROPER.
– Date formulas
▪ TODAY. Current date.
▪ NOW. Date and time.
▪ YEAR, MONTH, DAY.
▪ DATEDIF. Date difference.
▪ EOMONTH. Month end.
• Sorting and filtering
– Sort by multiple columns.
– Filter by value, color, condition.
– Top 10 filter for quick insights.
• Conditional formatting
– Highlight duplicates.
– Color scales for trends.
– Rules for thresholds. Example. Sales > 10000 in green.
• Pivot tables
– Insert → PivotTable.
– Rows. Category or Product.
– Values. Sum, Count, Average.
– Filters. Date, Region.
– Refresh after data update.
• Charts you must know
– Column. Comparison.
– Bar. Ranking.
– Line. Trends over time.
– Pie. Share or percentage.
– Combo. Actual vs target.
• Data validation
– Dropdown list. Data → Data Validation → List.
– Prevent wrong entries.
• Useful shortcuts
– Ctrl + Arrow. Jump data.
– Ctrl + Shift + Arrow. Select range.
– Ctrl + 1. Format cells.
– Ctrl + L. Apply filter.
– Alt + =. Auto sum.
– Ctrl + Z / Y. Undo redo.
• Common analyst mistakes to avoid
– Merged cells.
– Hard coded totals.
– Mixed data types in one column.
– No backup before cleaning.
• Daily practice task
– Download any sales CSV.
– Clean it.
– Build one pivot table.
– Create one chart.
Excel Resources: https://whatsapp.com/channel/0029VaifY548qIzv0u1AHz3i
Data Analytics Roadmap: https://whatsapp.com/channel/0029VaGgzAk72WTmQFERKh02/1354
Double Tap ♥️ For More
❤28👍2👌1
Now, let's move to the next topic of data analytics roadmap:
SQL Basics for Data Analytics
What SQL does
- Pull data from databases
- Filter large datasets
- Combine tables
- Summarize metrics
Core clauses
- SELECT: Choose columns
Example:
- FROM: Source table
Example:
- WHERE: Filter rows
Example:
- ORDER BY: Sort results
Example:
- LIMIT: Restrict rows
Example:
Filtering operators
-
-
-
-
Example:
Logical conditions
-
-
-
Aggregations
- GROUP BY: Group rows
Example:
- Aggregate functions:
- HAVING: Filter after aggregation
Example:
JOINS
- INNER JOIN: Matching rows only
- LEFT JOIN: All left rows, matching right
- RIGHT JOIN: All right rows, matching left
- FULL JOIN: All rows from both tables
Example:
-
-
-
Subqueries
Query inside a query
Example:
- ROW_NUMBER: Unique row number
- RANK: Ranking with gaps
- PARTITION BY: Reset calculation per group
Example:
Common mistakes
- Forgetting
- Using
- Wrong join condition
- Ignoring NULLs
Daily practice
- Write 5
- Use 1
- Use 1
- Handle NULL values
SQL Resources: https://whatsapp.com/channel/0029VanC5rODzgT6TiTGoa1v
Double Tap ♥️ For More
SQL Basics for Data Analytics
What SQL does
- Pull data from databases
- Filter large datasets
- Combine tables
- Summarize metrics
Core clauses
- SELECT: Choose columns
Example:
SELECT name, sales FROM orders;- FROM: Source table
Example:
FROM orders;- WHERE: Filter rows
Example:
WHERE sales > 5000;- ORDER BY: Sort results
Example:
ORDER BY sales DESC;- LIMIT: Restrict rows
Example:
LIMIT 10;Filtering operators
-
=, <>, >, <, >=, <=-
BETWEEN for ranges-
IN for lists-
LIKE for patternsExample:
WHERE region IN ('East','West');Logical conditions
-
AND-
OR-
NOTAggregations
- GROUP BY: Group rows
Example:
GROUP BY product;- Aggregate functions:
COUNT, SUM, AVG, MIN, MAX- HAVING: Filter after aggregation
Example:
HAVING SUM(sales) > 100000;JOINS
- INNER JOIN: Matching rows only
- LEFT JOIN: All left rows, matching right
- RIGHT JOIN: All right rows, matching left
- FULL JOIN: All rows from both tables
Example:
SELECT o.order_id, c.customer_nameNULL handling
FROM orders o
INNER JOIN customers c
ON o.customer_id = c.customer_id;
-
IS NULL-
IS NOT NULL-
COALESCE(column, 0)Subqueries
Query inside a query
Example:
SELECT *Window functions
FROM orders
WHERE sales > (SELECT AVG(sales) FROM orders);
- ROW_NUMBER: Unique row number
- RANK: Ranking with gaps
- PARTITION BY: Reset calculation per group
Example:
ROW_NUMBER() OVER (PARTITION BY department ORDER BY salary DESC)Common mistakes
- Forgetting
GROUP BY columns- Using
WHERE instead of HAVING- Wrong join condition
- Ignoring NULLs
Daily practice
- Write 5
SELECT queries- Use 1
JOIN- Use 1
GROUP BY- Handle NULL values
SQL Resources: https://whatsapp.com/channel/0029VanC5rODzgT6TiTGoa1v
Double Tap ♥️ For More
❤16
Now, let's move to the next topic of data analytics roadmap:
Power BI Basics for Data Analytics ✅
What Power BI Does
- Connects to data sources
- Transforms data
- Builds dashboards
- Shares insights
Core Components
- Power BI Desktop: main tool for reports, modeling, and visuals
- Power BI Service: cloud sharing and collaboration
Data Sources
- Excel
- CSV
- SQL Server
- MySQL, PostgreSQL
- Web APIs
Data Loading
- Home → Get Data
- Choose source
- Load or Transform
Power Query Basics
- Clean data before analysis
- Remove duplicates
- Change data types
- Split columns
- Rename columns
- Filter rows
Data Model
- Tables connect using relationships
- One to many is standard
- Avoid many to many early
- Use proper keys
DAX Basics
- Measures run at report level
- Calculated columns run row by row
- Common DAX measures:
- Total Sales = SUM(Sales[Amount])
- Total Orders = COUNT(Sales[OrderID])
- Average Sales = AVERAGE(Sales[Amount])
Time Intelligence Basics
- YTD sales
- MTD sales
- Previous month comparison
Visuals You Must Know
- Table
- Matrix
- Bar chart
- Line chart
- KPI card
- Pie chart
Filters and Slicers
- Page level filters
- Visual level filters
- Slicers for user interaction
Dashboard Design Rules
- One page focus
- Use consistent colors
- Show KPIs on top
- Avoid clutter
Daily Practice Task
- Load a sales Excel file
- Clean data in Power Query
- Create 3 measures
- Build one dashboard page
Power BI Resources: https://whatsapp.com/channel/0029Vai1xKf1dAvuk6s1v22c
Double Tap ♥️ For More
Power BI Basics for Data Analytics ✅
What Power BI Does
- Connects to data sources
- Transforms data
- Builds dashboards
- Shares insights
Core Components
- Power BI Desktop: main tool for reports, modeling, and visuals
- Power BI Service: cloud sharing and collaboration
Data Sources
- Excel
- CSV
- SQL Server
- MySQL, PostgreSQL
- Web APIs
Data Loading
- Home → Get Data
- Choose source
- Load or Transform
Power Query Basics
- Clean data before analysis
- Remove duplicates
- Change data types
- Split columns
- Rename columns
- Filter rows
Data Model
- Tables connect using relationships
- One to many is standard
- Avoid many to many early
- Use proper keys
DAX Basics
- Measures run at report level
- Calculated columns run row by row
- Common DAX measures:
- Total Sales = SUM(Sales[Amount])
- Total Orders = COUNT(Sales[OrderID])
- Average Sales = AVERAGE(Sales[Amount])
Time Intelligence Basics
- YTD sales
- MTD sales
- Previous month comparison
Visuals You Must Know
- Table
- Matrix
- Bar chart
- Line chart
- KPI card
- Pie chart
Filters and Slicers
- Page level filters
- Visual level filters
- Slicers for user interaction
Dashboard Design Rules
- One page focus
- Use consistent colors
- Show KPIs on top
- Avoid clutter
Daily Practice Task
- Load a sales Excel file
- Clean data in Power Query
- Create 3 measures
- Build one dashboard page
Power BI Resources: https://whatsapp.com/channel/0029Vai1xKf1dAvuk6s1v22c
Double Tap ♥️ For More
❤18
Now, let's move to the next topic of data analytics roadmap:
Statistics Basics for Data Analysts ✅
Why Statistics Matters
- Explain trends
- Compare performance
- Avoid wrong conclusions
Descriptive Statistics
- Mean: Average value. Example: Average monthly sales ₹45,000.
- Median: Middle value. Handles outliers better than mean. Example: Typical salary in a team.
- Mode: Most frequent value. Example: Most sold product.
Spread of Data
- Range: Max minus min.
- Variance: Spread from the mean.
- Standard Deviation: How far values move from average. Low value means stable data.
Example: Avg sales ₹10,000. Std dev ₹500 means stable. Std dev ₹5,000 means volatile.
Percentages and Ratios
- Growth Rate: (Current - Previous) / Previous
- Conversion Rate: Leads to customers.
Correlation
- Relationship between two variables. Range: -1 to +1.
- Positive: Move together. Negative: Move opposite.
Example: Ad spend vs sales correlation 0.8.
Outliers
- Extreme values. Skew averages. Identify using sorting or box plots.
Sampling
- Small part of data. Saves time and cost.
- Full data often large. Samples give direction.
Common Mistakes
- Trusting averages only.
- Ignoring outliers.
- Confusing correlation with causation.
Mini Task
Take any sales data. Calculate mean, median, std dev. Check for outliers.
Statistics Resources: https://whatsapp.com/channel/0029Vat3Dc4KAwEcfFbNnZ3O
Double Tap ♥️ For More
Statistics Basics for Data Analysts ✅
Why Statistics Matters
- Explain trends
- Compare performance
- Avoid wrong conclusions
Descriptive Statistics
- Mean: Average value. Example: Average monthly sales ₹45,000.
- Median: Middle value. Handles outliers better than mean. Example: Typical salary in a team.
- Mode: Most frequent value. Example: Most sold product.
Spread of Data
- Range: Max minus min.
- Variance: Spread from the mean.
- Standard Deviation: How far values move from average. Low value means stable data.
Example: Avg sales ₹10,000. Std dev ₹500 means stable. Std dev ₹5,000 means volatile.
Percentages and Ratios
- Growth Rate: (Current - Previous) / Previous
- Conversion Rate: Leads to customers.
Correlation
- Relationship between two variables. Range: -1 to +1.
- Positive: Move together. Negative: Move opposite.
Example: Ad spend vs sales correlation 0.8.
Outliers
- Extreme values. Skew averages. Identify using sorting or box plots.
Sampling
- Small part of data. Saves time and cost.
- Full data often large. Samples give direction.
Common Mistakes
- Trusting averages only.
- Ignoring outliers.
- Confusing correlation with causation.
Mini Task
Take any sales data. Calculate mean, median, std dev. Check for outliers.
Statistics Resources: https://whatsapp.com/channel/0029Vat3Dc4KAwEcfFbNnZ3O
Double Tap ♥️ For More
❤15
Business Metrics Every Data Analyst Must Know ✅
Revenue Metrics
- Revenue: Total income from sales (e.g., monthly revenue ₹25 lakh)
- Gross Revenue vs Net Revenue: Gross (before costs), Net (after discounts and returns)
- Average Order Value: Revenue ÷ number of orders (e.g., ₹1,200 per order)
Growth Metrics
- Growth Rate: (Current − Previous) ÷ Previous (e.g., 15% month-over-month)
- Year-over-Year Growth: Compare same period last year
Customer Metrics
- Customer Count: Total active customers
- New vs Returning Customers: Shows retention strength
- Customer Acquisition Cost: Total marketing spend ÷ new customers
- Customer Lifetime Value: Total revenue from one customer over time
Retention and Churn
- Retention Rate: Customers who stayed ÷ total customers
- Churn Rate: Customers lost ÷ total customers (e.g., 1,000 customers, lost 50, churn rate 5%)
Marketing Metrics
- Conversion Rate: Conversions ÷ visitors
- Click-Through Rate: Clicks ÷ impressions
- Return on Ad Spend: Revenue ÷ ad spend
Product Metrics
- Daily Active Users: Users active per day
- Monthly Active Users: Users active per month
- DAU to MAU Ratio: Engagement strength
Operations Metrics
- Order Fulfillment Time: Time to deliver order
- Defect Rate: Defective units ÷ total units
Mini Task
Pick one business (E-commerce or EdTech). List 5 metrics it should track. Write one question each metric answers.
Let's take E-commerce:
1. Revenue: What's our total sales this month?
2. Customer Acquisition Cost: How much are we spending to acquire each new customer?
3. Retention Rate: How many customers are coming back to shop?
4. Average Order Value: What's the average amount customers are spending per order?
5. Order Fulfillment Time: How quickly are we delivering orders?
Double Tap ♥️ For More
Revenue Metrics
- Revenue: Total income from sales (e.g., monthly revenue ₹25 lakh)
- Gross Revenue vs Net Revenue: Gross (before costs), Net (after discounts and returns)
- Average Order Value: Revenue ÷ number of orders (e.g., ₹1,200 per order)
Growth Metrics
- Growth Rate: (Current − Previous) ÷ Previous (e.g., 15% month-over-month)
- Year-over-Year Growth: Compare same period last year
Customer Metrics
- Customer Count: Total active customers
- New vs Returning Customers: Shows retention strength
- Customer Acquisition Cost: Total marketing spend ÷ new customers
- Customer Lifetime Value: Total revenue from one customer over time
Retention and Churn
- Retention Rate: Customers who stayed ÷ total customers
- Churn Rate: Customers lost ÷ total customers (e.g., 1,000 customers, lost 50, churn rate 5%)
Marketing Metrics
- Conversion Rate: Conversions ÷ visitors
- Click-Through Rate: Clicks ÷ impressions
- Return on Ad Spend: Revenue ÷ ad spend
Product Metrics
- Daily Active Users: Users active per day
- Monthly Active Users: Users active per month
- DAU to MAU Ratio: Engagement strength
Operations Metrics
- Order Fulfillment Time: Time to deliver order
- Defect Rate: Defective units ÷ total units
Mini Task
Pick one business (E-commerce or EdTech). List 5 metrics it should track. Write one question each metric answers.
Let's take E-commerce:
1. Revenue: What's our total sales this month?
2. Customer Acquisition Cost: How much are we spending to acquire each new customer?
3. Retention Rate: How many customers are coming back to shop?
4. Average Order Value: What's the average amount customers are spending per order?
5. Order Fulfillment Time: How quickly are we delivering orders?
Double Tap ♥️ For More
❤25👍1
What does ORDER BY do in SQL
Anonymous Quiz
20%
A. Filters rows
72%
B. Sorts rows
5%
C. Limits rows
3%
D. Removes duplicates
❤7
What is the default sort order in ORDER BY
Anonymous Quiz
23%
A. DESC
12%
B. RANDOM
61%
C. ASC
4%
D. NONE
❤7
What does this query return
SELECT name FROM customers ORDER BY signup_date DESC LIMIT 1;
SELECT name FROM customers ORDER BY signup_date DESC LIMIT 1;
Anonymous Quiz
31%
A. Oldest customer
6%
B. Random customer
55%
C. Latest signed up customer
8%
D. All customers
❤8
What happens if you use LIMIT without ORDER BY
Anonymous Quiz
22%
A. Data is sorted automatically
49%
B. Rows returned have no guaranteed order
14%
C. Query fails
15%
D. Only one row is returned
❤8
What does this query do
SELECT order_id, amount FROM orders ORDER BY amount DESC LIMIT 5;
SELECT order_id, amount FROM orders ORDER BY amount DESC LIMIT 5;
Anonymous Quiz
6%
A. Returns 5 random orders
20%
B. Returns 5 smallest orders
10%
C. Returns all orders sorted by amount
64%
D. Returns top 5 highest value orders
❤8
SQL vs NoSQL Databases: Quick Comparison ✅
SQL Databases
- Structured data
- Fixed schema
- Table-based storage
- Strong consistency
- Popular tools: MySQL, PostgreSQL, SQL Server, Oracle
- Best use cases: Banking systems, ERP and CRM, transaction-heavy apps, reporting and analytics
- Job roles: Data Analyst, Backend Developer, Database Engineer, BI Developer
- Hiring reality: Mandatory in enterprises, core skill for analytics roles, used in almost every company
- India salary range: Fresher (4-7 LPA), Mid-level (8-18 LPA)
- Real tasks: Write complex queries, join multiple tables, build reports, ensure data integrity
NoSQL Databases
- Semi-structured or unstructured data
- Flexible schema
- Document, key-value, or graph based
- High scalability
- Popular tools: MongoDB, Cassandra, DynamoDB, Redis
- Best use cases: Real-time apps, big data systems, IoT platforms, rapidly changing products
- Job roles: Backend Developer, Data Engineer, Cloud Engineer, Platform Engineer
- Hiring reality: Strong demand in startups, common in cloud-native systems, often paired with SQL
- India salary range: Fresher (5-8 LPA), Mid-level (10-22 LPA)
- Real tasks: Store JSON documents, handle large traffic, design scalable schemas, optimize read and write speed
Quick Comparison
- Schema: SQL (fixed), NoSQL (flexible)
- Scaling: SQL (vertical), NoSQL (horizontal)
- Consistency: SQL (strong), NoSQL (eventual)
- Queries: SQL (powerful), NoSQL (simpler)
Role-based Choice
- Data Analyst: SQL required
- Backend Developer: Both useful
- Data Engineer: SQL + NoSQL
- Startup products: NoSQL preferred
Best Career Move
- Learn SQL first
- Add NoSQL for modern systems
- Use both in real projects
Which one do you prefer?
SQL ❤️
NoSQL 👍
Both 🙏
None 😮
SQL Databases
- Structured data
- Fixed schema
- Table-based storage
- Strong consistency
- Popular tools: MySQL, PostgreSQL, SQL Server, Oracle
- Best use cases: Banking systems, ERP and CRM, transaction-heavy apps, reporting and analytics
- Job roles: Data Analyst, Backend Developer, Database Engineer, BI Developer
- Hiring reality: Mandatory in enterprises, core skill for analytics roles, used in almost every company
- India salary range: Fresher (4-7 LPA), Mid-level (8-18 LPA)
- Real tasks: Write complex queries, join multiple tables, build reports, ensure data integrity
NoSQL Databases
- Semi-structured or unstructured data
- Flexible schema
- Document, key-value, or graph based
- High scalability
- Popular tools: MongoDB, Cassandra, DynamoDB, Redis
- Best use cases: Real-time apps, big data systems, IoT platforms, rapidly changing products
- Job roles: Backend Developer, Data Engineer, Cloud Engineer, Platform Engineer
- Hiring reality: Strong demand in startups, common in cloud-native systems, often paired with SQL
- India salary range: Fresher (5-8 LPA), Mid-level (10-22 LPA)
- Real tasks: Store JSON documents, handle large traffic, design scalable schemas, optimize read and write speed
Quick Comparison
- Schema: SQL (fixed), NoSQL (flexible)
- Scaling: SQL (vertical), NoSQL (horizontal)
- Consistency: SQL (strong), NoSQL (eventual)
- Queries: SQL (powerful), NoSQL (simpler)
Role-based Choice
- Data Analyst: SQL required
- Backend Developer: Both useful
- Data Engineer: SQL + NoSQL
- Startup products: NoSQL preferred
Best Career Move
- Learn SQL first
- Add NoSQL for modern systems
- Use both in real projects
Which one do you prefer?
SQL ❤️
NoSQL 👍
Both 🙏
None 😮
❤26🔥1
✅ End to End Data Analytics Project Roadmap
Step 1. Define the business problem
Start with a clear question.
Example: Why did sales drop last quarter?
Decide success metric.
Example: Revenue, growth rate.
Step 2. Understand the data
Identify data sources.
Example: Sales table, customers table.
Check rows, columns, data types.
Spot missing values.
Step 3. Clean the data
Remove duplicates.
Handle missing values.
Fix data types.
Standardize text.
Tools: Excel or Power Query SQL for large datasets.
Step 4. Explore the data
Basic summaries.
Trends over time.
Top and bottom performers.
Examples: Monthly sales trend, top 10 products, region-wise revenue.
Step 5. Analyze and find insights
Compare periods.
Segment data.
Identify drivers.
Examples: Sales drop in one region, high churn in one customer segment.
Step 6. Create visuals and dashboard
KPIs on top.
Trends in middle.
Breakdown charts below.
Tools: Power BI or Tableau.
Step 7. Interpret results
What changed?
Why it changed?
Business impact.
Step 8. Give recommendations
Actionable steps.
Example: Increase ads in high margin regions.
Step 9. Validate and iterate
Cross-check numbers.
Ask stakeholder questions.
Step 10. Present clearly
One-page summary.
Simple language.
Focus on impact.
Sample project ideas
• Sales performance analysis.
• Customer churn analysis.
• Marketing campaign analysis.
• HR attrition dashboard.
Mini task
• Choose one project idea.
• Write the business question.
• List 3 metrics you will track.
Example: For Sales Performance Analysis
Business Question: Why did sales drop last quarter?
Metrics:
1. Revenue growth rate
2. Sales target achievement (%)
3. Customer acquisition cost (CAC)
Double Tap ♥️ For More
Step 1. Define the business problem
Start with a clear question.
Example: Why did sales drop last quarter?
Decide success metric.
Example: Revenue, growth rate.
Step 2. Understand the data
Identify data sources.
Example: Sales table, customers table.
Check rows, columns, data types.
Spot missing values.
Step 3. Clean the data
Remove duplicates.
Handle missing values.
Fix data types.
Standardize text.
Tools: Excel or Power Query SQL for large datasets.
Step 4. Explore the data
Basic summaries.
Trends over time.
Top and bottom performers.
Examples: Monthly sales trend, top 10 products, region-wise revenue.
Step 5. Analyze and find insights
Compare periods.
Segment data.
Identify drivers.
Examples: Sales drop in one region, high churn in one customer segment.
Step 6. Create visuals and dashboard
KPIs on top.
Trends in middle.
Breakdown charts below.
Tools: Power BI or Tableau.
Step 7. Interpret results
What changed?
Why it changed?
Business impact.
Step 8. Give recommendations
Actionable steps.
Example: Increase ads in high margin regions.
Step 9. Validate and iterate
Cross-check numbers.
Ask stakeholder questions.
Step 10. Present clearly
One-page summary.
Simple language.
Focus on impact.
Sample project ideas
• Sales performance analysis.
• Customer churn analysis.
• Marketing campaign analysis.
• HR attrition dashboard.
Mini task
• Choose one project idea.
• Write the business question.
• List 3 metrics you will track.
Example: For Sales Performance Analysis
Business Question: Why did sales drop last quarter?
Metrics:
1. Revenue growth rate
2. Sales target achievement (%)
3. Customer acquisition cost (CAC)
Double Tap ♥️ For More
❤27👏2
Data Analyst Interview Preparation Roadmap ✅
Technical skills to revise
- SQL
Write queries from scratch.
Practice joins, group by, subqueries.
Handle duplicates and NULLs.
Window functions basics.
- Excel
Pivot tables without help.
XLOOKUP and IF confidently.
Data cleaning steps.
- Power BI or Tableau
Explain data model.
Write basic DAX.
Explain one dashboard end to end.
- Statistics
Mean vs median.
Standard deviation meaning.
Correlation vs causation.
- Python. If required
Pandas basics.
Groupby and filtering.
Interview question types
- SQL questions
Top N per group.
Running totals.
Duplicate records.
Date based queries.
- Business case questions
Why did sales drop.
Which metric matters most and why.
- Dashboard questions
Explain one KPI.
How users will use this report.
- Project questions
Data source.
Cleaning logic.
Key insight.
Business action.
Resume preparation
- Must have Tools section.
- One strong project.
- Metrics driven points.
Example: Improved reporting time by 30 percent using Power BI.
Mock interviews
- Practice explaining out loud.
- Time your answers.
- Use real datasets.
Daily prep plan
1 SQL problem.
1 dashboard review.
10 interview questions.
- Common mistakes
Memorizing queries.
No project explanation.
Weak business reasoning.
- Final task
- Prepare one project story.
- Prepare one SQL solution on paper.
- Prepare one business metric explanation.
Double Tap ♥️ For More
Technical skills to revise
- SQL
Write queries from scratch.
Practice joins, group by, subqueries.
Handle duplicates and NULLs.
Window functions basics.
- Excel
Pivot tables without help.
XLOOKUP and IF confidently.
Data cleaning steps.
- Power BI or Tableau
Explain data model.
Write basic DAX.
Explain one dashboard end to end.
- Statistics
Mean vs median.
Standard deviation meaning.
Correlation vs causation.
- Python. If required
Pandas basics.
Groupby and filtering.
Interview question types
- SQL questions
Top N per group.
Running totals.
Duplicate records.
Date based queries.
- Business case questions
Why did sales drop.
Which metric matters most and why.
- Dashboard questions
Explain one KPI.
How users will use this report.
- Project questions
Data source.
Cleaning logic.
Key insight.
Business action.
Resume preparation
- Must have Tools section.
- One strong project.
- Metrics driven points.
Example: Improved reporting time by 30 percent using Power BI.
Mock interviews
- Practice explaining out loud.
- Time your answers.
- Use real datasets.
Daily prep plan
1 SQL problem.
1 dashboard review.
10 interview questions.
- Common mistakes
Memorizing queries.
No project explanation.
Weak business reasoning.
- Final task
- Prepare one project story.
- Prepare one SQL solution on paper.
- Prepare one business metric explanation.
Double Tap ♥️ For More
❤27
What is the main purpose of WHERE in SQL
Anonymous Quiz
12%
A. Sort rows
32%
B. Filter groups
51%
C. Filter individual rows
5%
D. Limit rows
❤6
When should you use HAVING instead of WHERE
Anonymous Quiz
9%
A. When filtering text values
15%
B. When filtering before SELECT
69%
C. When filtering aggregated results
7%
D. When filtering columns
❤7
What will this query return
SELECT customer_id, SUM(amount) FROM orders GROUP BY customer_id HAVING SUM(amount) > 10000;
SELECT customer_id, SUM(amount) FROM orders GROUP BY customer_id HAVING SUM(amount) > 10000;
Anonymous Quiz
4%
C. All customers and their totals
78%
B. Customers with total spend above 10,000
16%
A. Orders above 10,000
3%
D. Orders grouped by amount
❤6
In which order does SQL process these clauses
Anonymous Quiz
47%
A. SELECT → WHERE → GROUP BY → HAVING
13%
B. WHERE → FROM → GROUP BY → HAVING
35%
C. FROM → WHERE → GROUP BY → HAVING
6%
D. FROM → GROUP BY → WHERE → HAVING
❤14
Top 100 Data Analyst Interview Questions
✅ Data Analytics Basics
1. What is data analytics?
2. Difference between data analytics and data science?
3. What problems does a data analyst solve?
4. What are the types of data analytics?
5. What tools do data analysts use daily?
6. What is a KPI?
7. What is a metric vs KPI?
8. What is descriptive analytics?
9. What is diagnostic analytics?
10. What does a typical day of a data analyst look like?
Data and Databases
11. What is structured data?
12. What is semi-structured data?
13. What is unstructured data?
14. What is a database?
15. Difference between OLTP and OLAP?
16. What is a primary key?
17. What is a foreign key?
18. What is a fact table?
19. What is a dimension table?
20. What is a data warehouse?
SQL for Data Analysts
21. What is SELECT used for?
22. Difference between WHERE and HAVING?
23. What is GROUP BY?
24. What are aggregate functions?
25. Difference between INNER and LEFT JOIN?
26. What are subqueries?
27. What is a CTE?
28. How do you handle duplicates in SQL?
29. How do you handle NULL values?
30. What are window functions?
Excel for Data Analysis
31. What are pivot tables?
32. Difference between VLOOKUP and XLOOKUP?
33. What is conditional formatting?
34. What are COUNTIFS and SUMIFS?
35. What is data validation?
36. How do you remove duplicates in Excel?
37. What is IF formula used for?
38. Difference between relative and absolute reference?
39. How do you clean data in Excel?
40. What are common Excel mistakes analysts make?
Data Cleaning and Preparation
41. What is data cleaning?
42. How do you handle missing data?
43. How do you treat outliers?
44. What is data normalization?
45. What is data standardization?
46. How do you check data quality?
47. What is duplicate data?
48. How do you validate source data?
49. What is data transformation?
50. Why is data preparation important?
Statistics for Data Analysts
51. Difference between mean and median?
52. What is standard deviation?
53. What is variance?
54. What is correlation?
55. Difference between correlation and causation?
56. What is an outlier?
57. What is sampling?
58. What is distribution?
59. What is skewness?
60. When do you use median over mean?
Data Visualization
61. Why is data visualization important?
62. Difference between bar and line chart?
63. When do you use a pie chart?
64. What is a dashboard?
65. What makes a good dashboard?
66. What is a KPI card?
67. Common visualization mistakes?
68. How do you choose the right chart?
69. What is drill down?
70. What is data storytelling?
Power BI or Tableau
71. What is Power BI or Tableau used for?
72. What is a data model?
73. What is a relationship?
74. What is DAX?
75. Difference between measure and calculated column?
76. What is Power Query?
77. What are filters and slicers?
78. What is row level security?
79. What is refresh schedule?
80. How do you optimize reports?
Business and Case Questions
81. How do you analyze a sales drop?
82. How do you define success metrics?
83. What business metrics have you worked on?
84. How do you prioritize insights?
85. How do you validate insights?
86. What questions do you ask stakeholders?
87. How do you handle vague requirements?
88. How do you measure business impact?
89. How do you explain numbers to managers?
90. How do you recommend actions?
Projects and Real World
91. Explain your best project.
92. What data sources did you use?
93. How did you clean the data?
94. What insight had the most impact?
95. What challenge did you face?
96. How did you solve it?
97. How did stakeholders use your dashboard?
98. What would you improve in your project?
99. How do you handle tight deadlines?
100. Why should we hire you as a data analyst?
Double Tap ♥️ For Detailed Answers
✅ Data Analytics Basics
1. What is data analytics?
2. Difference between data analytics and data science?
3. What problems does a data analyst solve?
4. What are the types of data analytics?
5. What tools do data analysts use daily?
6. What is a KPI?
7. What is a metric vs KPI?
8. What is descriptive analytics?
9. What is diagnostic analytics?
10. What does a typical day of a data analyst look like?
Data and Databases
11. What is structured data?
12. What is semi-structured data?
13. What is unstructured data?
14. What is a database?
15. Difference between OLTP and OLAP?
16. What is a primary key?
17. What is a foreign key?
18. What is a fact table?
19. What is a dimension table?
20. What is a data warehouse?
SQL for Data Analysts
21. What is SELECT used for?
22. Difference between WHERE and HAVING?
23. What is GROUP BY?
24. What are aggregate functions?
25. Difference between INNER and LEFT JOIN?
26. What are subqueries?
27. What is a CTE?
28. How do you handle duplicates in SQL?
29. How do you handle NULL values?
30. What are window functions?
Excel for Data Analysis
31. What are pivot tables?
32. Difference between VLOOKUP and XLOOKUP?
33. What is conditional formatting?
34. What are COUNTIFS and SUMIFS?
35. What is data validation?
36. How do you remove duplicates in Excel?
37. What is IF formula used for?
38. Difference between relative and absolute reference?
39. How do you clean data in Excel?
40. What are common Excel mistakes analysts make?
Data Cleaning and Preparation
41. What is data cleaning?
42. How do you handle missing data?
43. How do you treat outliers?
44. What is data normalization?
45. What is data standardization?
46. How do you check data quality?
47. What is duplicate data?
48. How do you validate source data?
49. What is data transformation?
50. Why is data preparation important?
Statistics for Data Analysts
51. Difference between mean and median?
52. What is standard deviation?
53. What is variance?
54. What is correlation?
55. Difference between correlation and causation?
56. What is an outlier?
57. What is sampling?
58. What is distribution?
59. What is skewness?
60. When do you use median over mean?
Data Visualization
61. Why is data visualization important?
62. Difference between bar and line chart?
63. When do you use a pie chart?
64. What is a dashboard?
65. What makes a good dashboard?
66. What is a KPI card?
67. Common visualization mistakes?
68. How do you choose the right chart?
69. What is drill down?
70. What is data storytelling?
Power BI or Tableau
71. What is Power BI or Tableau used for?
72. What is a data model?
73. What is a relationship?
74. What is DAX?
75. Difference between measure and calculated column?
76. What is Power Query?
77. What are filters and slicers?
78. What is row level security?
79. What is refresh schedule?
80. How do you optimize reports?
Business and Case Questions
81. How do you analyze a sales drop?
82. How do you define success metrics?
83. What business metrics have you worked on?
84. How do you prioritize insights?
85. How do you validate insights?
86. What questions do you ask stakeholders?
87. How do you handle vague requirements?
88. How do you measure business impact?
89. How do you explain numbers to managers?
90. How do you recommend actions?
Projects and Real World
91. Explain your best project.
92. What data sources did you use?
93. How did you clean the data?
94. What insight had the most impact?
95. What challenge did you face?
96. How did you solve it?
97. How did stakeholders use your dashboard?
98. What would you improve in your project?
99. How do you handle tight deadlines?
100. Why should we hire you as a data analyst?
Double Tap ♥️ For Detailed Answers
❤96🔥7👍3
7 Misconceptions About Data Analytics (and What’s Actually True): 📊🚀
❌ You need to be a math or statistics genius
✅ Basic math + logical thinking is enough. Most real-world analytics is about understanding data, not complex formulas.
❌ You must learn every tool before applying for jobs
✅ Start with core tools (Excel, SQL, one BI tool). Master fundamentals — tools can be learned on the job.
❌ Data analytics is only about numbers
✅ It’s about storytelling with data — explaining insights clearly to non-technical stakeholders.
❌ You need coding skills like a software developer
✅ Not required. SQL + basic Python/R is enough for most analyst roles. Deep coding is optional, not mandatory.
❌ Analysts just make dashboards all day
✅ Dashboards are just one part. Real work includes data cleaning, business understanding, ad-hoc analysis, and decision support.
❌ You need huge datasets to be a “real” data analyst
✅ Even small datasets can provide powerful insights if the questions are right.
❌ Once you learn analytics, your learning is done
✅ Data analytics evolves constantly — new tools, business problems, and techniques mean continuous learning.
💬 Tap ❤️ if you agree
❌ You need to be a math or statistics genius
✅ Basic math + logical thinking is enough. Most real-world analytics is about understanding data, not complex formulas.
❌ You must learn every tool before applying for jobs
✅ Start with core tools (Excel, SQL, one BI tool). Master fundamentals — tools can be learned on the job.
❌ Data analytics is only about numbers
✅ It’s about storytelling with data — explaining insights clearly to non-technical stakeholders.
❌ You need coding skills like a software developer
✅ Not required. SQL + basic Python/R is enough for most analyst roles. Deep coding is optional, not mandatory.
❌ Analysts just make dashboards all day
✅ Dashboards are just one part. Real work includes data cleaning, business understanding, ad-hoc analysis, and decision support.
❌ You need huge datasets to be a “real” data analyst
✅ Even small datasets can provide powerful insights if the questions are right.
❌ Once you learn analytics, your learning is done
✅ Data analytics evolves constantly — new tools, business problems, and techniques mean continuous learning.
💬 Tap ❤️ if you agree
❤31🔥1
✅ Data Analyst Interview Questions with Answers
1. What is data analytics?
Data analytics is the process of collecting, cleaning, analyzing, and interpreting data to support business decisions. The goal is to turn raw data into meaningful insights.
2. Difference between data analytics and data science?
Data analytics focuses on analyzing historical data to answer what happened and why. Data science focuses on building predictive models to answer what will happen next using machine learning.
3. What problems does a data analyst solve?
- Identifying trends and patterns
- Explaining business performance
- Finding reasons behind growth or decline
- Supporting decision-making with data
4. What are the types of data analytics?
- Descriptive – What happened
- Diagnostic – Why it happened
- Predictive – What may happen
- Prescriptive – What action to take
5. What tools do data analysts use daily?
- Excel for quick analysis
- SQL for querying databases
- Power BI or Tableau for dashboards
- Python (sometimes) for automation
- Statistics for interpretation
6. What is a KPI?
A KPI (Key Performance Indicator) is a measurable value that shows how well a business or team is achieving its objectives. Example: Monthly revenue, churn rate.
7. Difference between a metric and a KPI?
Metric: Any measurable value (page views, clicks).
KPI: A critical metric directly linked to business goals (conversion rate, revenue growth).
8. What is descriptive analytics?
Descriptive analytics summarizes historical data to understand past performance. Example: Total sales last month, average order value.
9. What is diagnostic analytics?
Diagnostic analytics explains why something happened by comparing data and identifying root causes. Example: Sales dropped because website traffic decreased.
10. What does a typical day of a data analyst look like?
- Pull data using SQL
- Clean data in Excel or Power Query
- Build or update dashboards
- Analyze trends and metrics
- Share insights with stakeholders
Double Tap ♥️ For Part-2
1. What is data analytics?
Data analytics is the process of collecting, cleaning, analyzing, and interpreting data to support business decisions. The goal is to turn raw data into meaningful insights.
2. Difference between data analytics and data science?
Data analytics focuses on analyzing historical data to answer what happened and why. Data science focuses on building predictive models to answer what will happen next using machine learning.
3. What problems does a data analyst solve?
- Identifying trends and patterns
- Explaining business performance
- Finding reasons behind growth or decline
- Supporting decision-making with data
4. What are the types of data analytics?
- Descriptive – What happened
- Diagnostic – Why it happened
- Predictive – What may happen
- Prescriptive – What action to take
5. What tools do data analysts use daily?
- Excel for quick analysis
- SQL for querying databases
- Power BI or Tableau for dashboards
- Python (sometimes) for automation
- Statistics for interpretation
6. What is a KPI?
A KPI (Key Performance Indicator) is a measurable value that shows how well a business or team is achieving its objectives. Example: Monthly revenue, churn rate.
7. Difference between a metric and a KPI?
Metric: Any measurable value (page views, clicks).
KPI: A critical metric directly linked to business goals (conversion rate, revenue growth).
8. What is descriptive analytics?
Descriptive analytics summarizes historical data to understand past performance. Example: Total sales last month, average order value.
9. What is diagnostic analytics?
Diagnostic analytics explains why something happened by comparing data and identifying root causes. Example: Sales dropped because website traffic decreased.
10. What does a typical day of a data analyst look like?
- Pull data using SQL
- Clean data in Excel or Power Query
- Build or update dashboards
- Analyze trends and metrics
- Share insights with stakeholders
Double Tap ♥️ For Part-2
❤55👍2🔥1
✅ Data Analyst Interview Questions with Answers: Part-2
11. What is structured data?
Structured data is organized in rows and columns with a fixed schema, making it easy to store and query using SQL. Example: Sales tables, customer databases.
12. What is semi-structured data?
Semi-structured data does not follow a strict table format but contains tags or keys. Example: JSON files, XML data, API responses.
13. What is unstructured data?
Unstructured data has no predefined format. Example: Emails, images, videos, customer reviews text.
14. What is a database?
A database is an organized system used to store, manage, and retrieve data efficiently. Example: MySQL, PostgreSQL, SQL Server.
15. Difference between OLTP and OLAP?
OLTP (Online Transaction Processing) → Handles daily transactions (e.g., orders, payments).
OLAP (Online Analytical Processing) → Used for reporting and analysis.
16. What is a primary key?
A primary key uniquely identifies each record in a table. Example: Customer_ID in a customer table.
17. What is a foreign key?
A foreign key links one table to another using the primary key of another table. Example: Customer_ID in Orders table linking to Customers table.
18. What is a fact table?
Fact table contains measurable business data like sales, revenue, or quantity.
19. What is a dimension table?
Dimension table contains descriptive details like customer name, region, product category.
20. What is a data warehouse?
A data warehouse is a centralized system that stores large volumes of historical data for analysis and reporting.
Double Tap ♥️ For Part-3
11. What is structured data?
Structured data is organized in rows and columns with a fixed schema, making it easy to store and query using SQL. Example: Sales tables, customer databases.
12. What is semi-structured data?
Semi-structured data does not follow a strict table format but contains tags or keys. Example: JSON files, XML data, API responses.
13. What is unstructured data?
Unstructured data has no predefined format. Example: Emails, images, videos, customer reviews text.
14. What is a database?
A database is an organized system used to store, manage, and retrieve data efficiently. Example: MySQL, PostgreSQL, SQL Server.
15. Difference between OLTP and OLAP?
OLTP (Online Transaction Processing) → Handles daily transactions (e.g., orders, payments).
OLAP (Online Analytical Processing) → Used for reporting and analysis.
16. What is a primary key?
A primary key uniquely identifies each record in a table. Example: Customer_ID in a customer table.
17. What is a foreign key?
A foreign key links one table to another using the primary key of another table. Example: Customer_ID in Orders table linking to Customers table.
18. What is a fact table?
Fact table contains measurable business data like sales, revenue, or quantity.
19. What is a dimension table?
Dimension table contains descriptive details like customer name, region, product category.
20. What is a data warehouse?
A data warehouse is a centralized system that stores large volumes of historical data for analysis and reporting.
Double Tap ♥️ For Part-3
❤32👍3🔥1