β
SQL Aggregate Functions Questions with Answers Part-2 ππ
π Q1. Find departments where the average salary is greater than 70,000.
ποΈ Table: "employees(emp_id, name, department_id, salary)"
β Answer:
π Q2. Count employees in each department having more than 5 employees.
ποΈ Table: "employees(emp_id, name, department_id)"
β Answer:
π Q3. Find the department with the highest total salary.
ποΈ Table: "employees(emp_id, department_id, salary)"
β Answer:
π Q4. Get departments where the minimum salary is greater than 30,000.
ποΈ Table: "employees(emp_id, department_id, salary)"
β Answer:
π Q5. Find the difference between highest and lowest salary in each department.
ποΈ Table: "employees(emp_id, department_id, salary)"
β Answer:
Double Tap β₯οΈ For More
π Q1. Find departments where the average salary is greater than 70,000.
ποΈ Table: "employees(emp_id, name, department_id, salary)"
β Answer:
SELECT department_id, AVG(salary) AS avg_salary
FROM employees
GROUP BY department_id
HAVING AVG(salary) > 70000;
π Q2. Count employees in each department having more than 5 employees.
ποΈ Table: "employees(emp_id, name, department_id)"
β Answer:
SELECT department_id, COUNT(*) AS total_employees
FROM employees
GROUP BY department_id
HAVING COUNT(*) > 5;
π Q3. Find the department with the highest total salary.
ποΈ Table: "employees(emp_id, department_id, salary)"
β Answer:
SELECT department_id
FROM employees
GROUP BY department_id
ORDER BY SUM(salary) DESC
LIMIT 1;
π Q4. Get departments where the minimum salary is greater than 30,000.
ποΈ Table: "employees(emp_id, department_id, salary)"
β Answer:
SELECT department_id, MIN(salary) AS min_salary
FROM employees
GROUP BY department_id
HAVING MIN(salary) > 30000;
π Q5. Find the difference between highest and lowest salary in each department.
ποΈ Table: "employees(emp_id, department_id, salary)"
β Answer:
SELECT department_id, MAX(salary) - MIN(salary) AS salary_difference
FROM employees
GROUP BY department_id;
Double Tap β₯οΈ For More
β€23
Most Asked SQL Interview Questions at MAANG Companiesπ₯π₯
Preparing for an SQL Interview at MAANG Companies? Here are some crucial SQL Questions you should be ready to tackle:
1. How do you retrieve all columns from a table?
SELECT * FROM table_name;
2. What SQL statement is used to filter records?
SELECT * FROM table_name
WHERE condition;
The WHERE clause is used to filter records based on a specified condition.
3. How can you join multiple tables? Describe different types of JOINs.
SELECT columns
FROM table1
JOIN table2 ON table1.column = table2.column
JOIN table3 ON table2.column = table3.column;
Types of JOINs:
1. INNER JOIN: Returns records with matching values in both tables
SELECT * FROM table1
INNER JOIN table2 ON table1.column = table2.column;
2. LEFT JOIN: Returns all records from the left table & matched records from the right table. Unmatched records will have NULL values.
SELECT * FROM table1
LEFT JOIN table2 ON table1.column = table2.column;
3. RIGHT JOIN: Returns all records from the right table & matched records from the left table. Unmatched records will have NULL values.
SELECT * FROM table1
RIGHT JOIN table2 ON table1.column = table2.column;
4. FULL JOIN: Returns records when there is a match in either left or right table. Unmatched records will have NULL values.
SELECT * FROM table1
FULL JOIN table2 ON table1.column = table2.column;
4. What is the difference between WHERE & HAVING clauses?
WHERE: Filters records before any groupings are made.
SELECT * FROM table_name
WHERE condition;
HAVING: Filters records after groupings are made.
SELECT column, COUNT(*)
FROM table_name
GROUP BY column
HAVING COUNT(*) > value;
5. How do you calculate average, sum, minimum & maximum values in a column?
Average: SELECT AVG(column_name) FROM table_name;
Sum: SELECT SUM(column_name) FROM table_name;
Minimum: SELECT MIN(column_name) FROM table_name;
Maximum: SELECT MAX(column_name) FROM table_name;
Here you can find essential SQL Interview Resourcesπ
https://t.me/mysqldata
Like this post if you need more πβ€οΈ
Hope it helps :)
Preparing for an SQL Interview at MAANG Companies? Here are some crucial SQL Questions you should be ready to tackle:
1. How do you retrieve all columns from a table?
SELECT * FROM table_name;
2. What SQL statement is used to filter records?
SELECT * FROM table_name
WHERE condition;
The WHERE clause is used to filter records based on a specified condition.
3. How can you join multiple tables? Describe different types of JOINs.
SELECT columns
FROM table1
JOIN table2 ON table1.column = table2.column
JOIN table3 ON table2.column = table3.column;
Types of JOINs:
1. INNER JOIN: Returns records with matching values in both tables
SELECT * FROM table1
INNER JOIN table2 ON table1.column = table2.column;
2. LEFT JOIN: Returns all records from the left table & matched records from the right table. Unmatched records will have NULL values.
SELECT * FROM table1
LEFT JOIN table2 ON table1.column = table2.column;
3. RIGHT JOIN: Returns all records from the right table & matched records from the left table. Unmatched records will have NULL values.
SELECT * FROM table1
RIGHT JOIN table2 ON table1.column = table2.column;
4. FULL JOIN: Returns records when there is a match in either left or right table. Unmatched records will have NULL values.
SELECT * FROM table1
FULL JOIN table2 ON table1.column = table2.column;
4. What is the difference between WHERE & HAVING clauses?
WHERE: Filters records before any groupings are made.
SELECT * FROM table_name
WHERE condition;
HAVING: Filters records after groupings are made.
SELECT column, COUNT(*)
FROM table_name
GROUP BY column
HAVING COUNT(*) > value;
5. How do you calculate average, sum, minimum & maximum values in a column?
Average: SELECT AVG(column_name) FROM table_name;
Sum: SELECT SUM(column_name) FROM table_name;
Minimum: SELECT MIN(column_name) FROM table_name;
Maximum: SELECT MAX(column_name) FROM table_name;
Here you can find essential SQL Interview Resourcesπ
https://t.me/mysqldata
Like this post if you need more πβ€οΈ
Hope it helps :)
β€17
β
π€ AβZ of Data Analyst ππΌ
A β Analytics
The process of analyzing data to discover insights and support decision-making.
B β Business Intelligence (BI)
Technologies and tools used to analyze business data (Power BI, Tableau).
C β Cleaning (Data Cleaning)
Removing errors, duplicates, and inconsistencies from data.
D β Dashboard
A visual display of key metrics and insights.
E β ETL (Extract, Transform, Load)
Process of collecting, cleaning, and storing data for analysis.
F β Forecasting
Predicting future trends using historical data.
G β Group By
A method to organize data into categories for analysis.
H β Hypothesis Testing
Testing assumptions using statistical methods.
I β Insight
Meaningful information derived from data analysis.
J β Join
Combining data from multiple tables (SQL concept).
K β KPI (Key Performance Indicator)
A measurable value showing business performance.
L β Linear Regression
A statistical method used to predict relationships between variables.
M β Metrics
Quantifiable measures used to track performance.
N β Normalization
Organizing data to reduce redundancy and improve efficiency.
O β Outlier
A data point significantly different from others.
P β Pivot Table
A tool used to summarize and analyze data quickly.
Q β Query
A request to retrieve data from a database.
R β Reporting
Presenting data insights through charts and summaries.
S β SQL
Language used to manage and analyze structured data.
T β Trend Analysis
Identifying patterns or changes over time.
U β Unstructured Data
Data without predefined format (text, images).
V β Visualization
Representing data using charts or graphs.
W β Warehousing (Data Warehouse)
Central storage of large structured datasets.
X β X-axis
Horizontal axis in charts representing variables.
Y β YoY (Year-over-Year)
Comparing data from one year to another.
Z β Z-Score
Statistical measure showing how far a value is from the mean.
Double Tap β₯οΈ For More
A β Analytics
The process of analyzing data to discover insights and support decision-making.
B β Business Intelligence (BI)
Technologies and tools used to analyze business data (Power BI, Tableau).
C β Cleaning (Data Cleaning)
Removing errors, duplicates, and inconsistencies from data.
D β Dashboard
A visual display of key metrics and insights.
E β ETL (Extract, Transform, Load)
Process of collecting, cleaning, and storing data for analysis.
F β Forecasting
Predicting future trends using historical data.
G β Group By
A method to organize data into categories for analysis.
H β Hypothesis Testing
Testing assumptions using statistical methods.
I β Insight
Meaningful information derived from data analysis.
J β Join
Combining data from multiple tables (SQL concept).
K β KPI (Key Performance Indicator)
A measurable value showing business performance.
L β Linear Regression
A statistical method used to predict relationships between variables.
M β Metrics
Quantifiable measures used to track performance.
N β Normalization
Organizing data to reduce redundancy and improve efficiency.
O β Outlier
A data point significantly different from others.
P β Pivot Table
A tool used to summarize and analyze data quickly.
Q β Query
A request to retrieve data from a database.
R β Reporting
Presenting data insights through charts and summaries.
S β SQL
Language used to manage and analyze structured data.
T β Trend Analysis
Identifying patterns or changes over time.
U β Unstructured Data
Data without predefined format (text, images).
V β Visualization
Representing data using charts or graphs.
W β Warehousing (Data Warehouse)
Central storage of large structured datasets.
X β X-axis
Horizontal axis in charts representing variables.
Y β YoY (Year-over-Year)
Comparing data from one year to another.
Z β Z-Score
Statistical measure showing how far a value is from the mean.
Double Tap β₯οΈ For More
β€32
π Top 50 Data Analyst Interview Questions ππΌ
βπ EXCEL Questions
1. Can you show me how you'd clean this messy dataset in Excel? What functions like TRIM or Remove Duplicates would you use?
2. What's the difference between absolute ($A$1) and relative (A1) references? When do you use each?
3. Walk me through creating a PivotTable to analyze sales by region and product. What are the exact steps?
4. Write a VLOOKUP formula right now. What if you get #N/A? How do you fix it?
5. Why use INDEX-MATCH over VLOOKUP? Show me both formulas for this lookup.
6. What's COUNTIF vs SUMIF vs COUNTIFS? Write formulas for conditional sales totals.
7. How does Goal Seek work? Demo target revenue scenario on this data.
8. Apply conditional formatting to highlight top 10% sales performers. Which rule?
9. Build me a dynamic dashboard. How do slicers and timelines work together?
10. Explain SUMPRODUCT. Write formula for multi-condition sales sum.
11. What's Power Query? Show basic ETL steps for cleaning data.
12. Freeze panes vs split panesβwhen do you use each?
13. XLOOKUP vs VLOOKUP advantages? Write both for this example.
14. How do you find and fix circular references in formulas?
15. Create data validation dropdown + named ranges. Demo it.
βποΈ SQL Questions
16. Write query for 2nd highest salary from Employee table. Use subquery OR window function.
17. INNER JOIN vs LEFT JOIN vs FULL JOIN? Write examples for employees + departments.
18. Find and remove duplicate records. Use CTE + ROW_NUMBER() or GROUP BY.
19. WHERE vs HAVING with GROUP BY? Show department-wise avg salary > 50k.
20. RANK() vs DENSE_RANK() vs ROW_NUMBER()? Partition by dept, order by salary.
21. Top 5 products by total sales. Write complete query with GROUP BY + LIMIT.
22. Self-join for employee-manager hierarchy. Show employee name + manager name.
23. Handle NULL salaries. Use COALESCE, IS NULL, IFNULL examples.
24. Pivot sales data by month using CASE statements. Write query.
25. Subquery vs JOINβwhich is faster for this scenario? Why?
26. Recursive CTE for company hierarchy (CEO β managers β employees).
27. Clustered vs non-clustered indexes? When does each improve performance?
βπ¨ Tableau Questions
28. {FIXED [Region]: SUM([Sales])}βwhat's this LOD doing? Write region total ignoring filters.
29. Create dual-axis chart comparing sales vs profit trends. Exact steps?
30. Data blending vs joining? When do you use each approach?
31. Parameters vs filters? Write calculated field using parameter.
32. Build dashboard with filter action + highlight action. Demo flow.
33. % of total calculated field? Write formula for region sales %.
34. FIXED vs INCLUDE vs EXCLUDE LOD? Give 3 examples.
35. Tableau Extracts vs Live connection? Performance + refresh differences?
ββ‘ Power BI Questions
36. CALCULATE(SUM(Sales), SAMEPERIODLASTYEAR())βexplain this DAX. YoY growth?
37. Measures vs Calculated Columns? When do you use each? Write both.
38. Star schema vs Snowflake? Draw relationships for sales β products β customers.
39. Power Query: Write M code for custom column parsing dates.
40. Implement Row-Level Security (RLS). Show DAX for region manager filter.
41. DirectQuery vs Import mode? Pros/cons + when to choose each?
42. TOTALYTD(SUM(Sales))βexplain time intelligence DAX.
43. Dashboard loads slow. Optimization steps? Aggregations + query folding?
βπ Python/Pandas Questions
44. Group sales by region and sum: write pandas code. .reset_index()
45. pd.merge(df1, df2, on='ID', how='inner')βexplain all merge types.
46. Three ways to handle NaN values: fillna(), dropna(), interpolate().
47. loc[] vs iloc[]? Filter sales > 1000 by region vs first 5 rows.
48. pivot_table() vs groupby()? Reshape sales by month/product.
49. Read 1GB CSV without crashing: chunksize=10000 example.
50. df['New'] = df['Sales'].apply(lambda x: x*1.1)βalternatives to apply?
Double Tap β₯οΈ For More
βπ EXCEL Questions
1. Can you show me how you'd clean this messy dataset in Excel? What functions like TRIM or Remove Duplicates would you use?
2. What's the difference between absolute ($A$1) and relative (A1) references? When do you use each?
3. Walk me through creating a PivotTable to analyze sales by region and product. What are the exact steps?
4. Write a VLOOKUP formula right now. What if you get #N/A? How do you fix it?
5. Why use INDEX-MATCH over VLOOKUP? Show me both formulas for this lookup.
6. What's COUNTIF vs SUMIF vs COUNTIFS? Write formulas for conditional sales totals.
7. How does Goal Seek work? Demo target revenue scenario on this data.
8. Apply conditional formatting to highlight top 10% sales performers. Which rule?
9. Build me a dynamic dashboard. How do slicers and timelines work together?
10. Explain SUMPRODUCT. Write formula for multi-condition sales sum.
11. What's Power Query? Show basic ETL steps for cleaning data.
12. Freeze panes vs split panesβwhen do you use each?
13. XLOOKUP vs VLOOKUP advantages? Write both for this example.
14. How do you find and fix circular references in formulas?
15. Create data validation dropdown + named ranges. Demo it.
βποΈ SQL Questions
16. Write query for 2nd highest salary from Employee table. Use subquery OR window function.
17. INNER JOIN vs LEFT JOIN vs FULL JOIN? Write examples for employees + departments.
18. Find and remove duplicate records. Use CTE + ROW_NUMBER() or GROUP BY.
19. WHERE vs HAVING with GROUP BY? Show department-wise avg salary > 50k.
20. RANK() vs DENSE_RANK() vs ROW_NUMBER()? Partition by dept, order by salary.
21. Top 5 products by total sales. Write complete query with GROUP BY + LIMIT.
22. Self-join for employee-manager hierarchy. Show employee name + manager name.
23. Handle NULL salaries. Use COALESCE, IS NULL, IFNULL examples.
24. Pivot sales data by month using CASE statements. Write query.
25. Subquery vs JOINβwhich is faster for this scenario? Why?
26. Recursive CTE for company hierarchy (CEO β managers β employees).
27. Clustered vs non-clustered indexes? When does each improve performance?
βπ¨ Tableau Questions
28. {FIXED [Region]: SUM([Sales])}βwhat's this LOD doing? Write region total ignoring filters.
29. Create dual-axis chart comparing sales vs profit trends. Exact steps?
30. Data blending vs joining? When do you use each approach?
31. Parameters vs filters? Write calculated field using parameter.
32. Build dashboard with filter action + highlight action. Demo flow.
33. % of total calculated field? Write formula for region sales %.
34. FIXED vs INCLUDE vs EXCLUDE LOD? Give 3 examples.
35. Tableau Extracts vs Live connection? Performance + refresh differences?
ββ‘ Power BI Questions
36. CALCULATE(SUM(Sales), SAMEPERIODLASTYEAR())βexplain this DAX. YoY growth?
37. Measures vs Calculated Columns? When do you use each? Write both.
38. Star schema vs Snowflake? Draw relationships for sales β products β customers.
39. Power Query: Write M code for custom column parsing dates.
40. Implement Row-Level Security (RLS). Show DAX for region manager filter.
41. DirectQuery vs Import mode? Pros/cons + when to choose each?
42. TOTALYTD(SUM(Sales))βexplain time intelligence DAX.
43. Dashboard loads slow. Optimization steps? Aggregations + query folding?
βπ Python/Pandas Questions
44. Group sales by region and sum: write pandas code. .reset_index()
45. pd.merge(df1, df2, on='ID', how='inner')βexplain all merge types.
46. Three ways to handle NaN values: fillna(), dropna(), interpolate().
47. loc[] vs iloc[]? Filter sales > 1000 by region vs first 5 rows.
48. pivot_table() vs groupby()? Reshape sales by month/product.
49. Read 1GB CSV without crashing: chunksize=10000 example.
50. df['New'] = df['Sales'].apply(lambda x: x*1.1)βalternatives to apply?
Double Tap β₯οΈ For More
β€25
SQL CHEAT SHEETπ©βπ»
Here is a quick cheat sheet of some of the most essential SQL commands:
SELECT - Retrieves data from a database
UPDATE - Updates existing data in a database
DELETE - Removes data from a database
INSERT - Adds data to a database
CREATE - Creates an object such as a database or table
ALTER - Modifies an existing object in a database
DROP -Deletes an entire table or database
ORDER BY - Sorts the selected data in an ascending or descending order
WHERE β Condition used to filter a specific set of records from the database
GROUP BY - Groups a set of data by a common parameter
HAVING - Allows the use of aggregate functions within the query
JOIN - Joins two or more tables together to retrieve data
INDEX - Creates an index on a table, to speed up search times.
Here is a quick cheat sheet of some of the most essential SQL commands:
SELECT - Retrieves data from a database
UPDATE - Updates existing data in a database
DELETE - Removes data from a database
INSERT - Adds data to a database
CREATE - Creates an object such as a database or table
ALTER - Modifies an existing object in a database
DROP -Deletes an entire table or database
ORDER BY - Sorts the selected data in an ascending or descending order
WHERE β Condition used to filter a specific set of records from the database
GROUP BY - Groups a set of data by a common parameter
HAVING - Allows the use of aggregate functions within the query
JOIN - Joins two or more tables together to retrieve data
INDEX - Creates an index on a table, to speed up search times.
β€5
π Interviewer: How do you remove duplicate records in SQL?
π Me: We can remove duplicates using DISTINCT, GROUP BY, or delete duplicate rows using ROW_NUMBER().
β 1οΈβ£ Using DISTINCT (to fetch unique values)
π Returns unique records but does not delete duplicates.
β 2οΈβ£ Using GROUP BY (to identify duplicates)
π Helps find duplicate records.
β 3οΈβ£ Delete Duplicates Using ROW_NUMBER() (Most Important β)
(Keeps one record and deletes others)
π§ Logic Breakdown:
- DISTINCT β shows unique records
- GROUP BY β identifies duplicates
- ROW_NUMBER() β removes duplicates safely
β Use Case: Data cleaning, ETL processes, data quality checks.
π‘ Tip: Always take a backup before deleting duplicate records.
π¬ Tap β€οΈ for more!
π Me: We can remove duplicates using DISTINCT, GROUP BY, or delete duplicate rows using ROW_NUMBER().
β 1οΈβ£ Using DISTINCT (to fetch unique values)
SELECT DISTINCT column_name
FROM employees;
π Returns unique records but does not delete duplicates.
β 2οΈβ£ Using GROUP BY (to identify duplicates)
SELECT name, COUNT(*)
FROM employees
GROUP BY name
HAVING COUNT(*) > 1;
π Helps find duplicate records.
β 3οΈβ£ Delete Duplicates Using ROW_NUMBER() (Most Important β)
(Keeps one record and deletes others)
DELETE FROM employees
WHERE id IN (
SELECT id FROM (
SELECT id,
ROW_NUMBER() OVER (
PARTITION BY name, salary
ORDER BY id
) AS rn
FROM employees
) t
WHERE rn > 1
);
π§ Logic Breakdown:
- DISTINCT β shows unique records
- GROUP BY β identifies duplicates
- ROW_NUMBER() β removes duplicates safely
β Use Case: Data cleaning, ETL processes, data quality checks.
π‘ Tip: Always take a backup before deleting duplicate records.
π¬ Tap β€οΈ for more!
β€15
β
Excel Interview Questions with Answers ππΌ
1οΈβ£ How do you clean a messy dataset in Excel?
Steps:
- TRIM() β removes extra spaces
- CLEAN() β removes non-printable characters
- Remove Duplicates β Data β Remove Duplicates
- Text to Columns β split data
- Find & Replace (Ctrl+H) β fix values
- Filter β remove blanks or errors
2οΈβ£ Absolute vs Relative References
Relative (A1) β changes when copied
Absolute ($A$1) β stays fixed
When to use:
- Relative β normal calculations
- Absolute β fixed values (tax rate, constants)
3οΈβ£ Create PivotTable for Sales Analysis
Steps:
1. Select data
2. Insert β PivotTable
3. Drag: Region β Rows, Product β Columns, Sales β Values
Used for fast data summarization.
4οΈβ£ VLOOKUP Formula + #N/A Fix
Formula:
Fix #N/A:
- Check lookup value exists
- Match data types
Use:
5οΈβ£ INDEX-MATCH vs VLOOKUP
VLOOKUP:
INDEX-MATCH:
β Why INDEX-MATCH?
- Faster for large data
- Works left lookup
- More flexible
6οΈβ£ COUNTIF vs SUMIF vs COUNTIFS
COUNTIF β count condition
SUMIF β sum condition
COUNTIFS β multiple conditions
7οΈβ£ Goal Seek
Used for what-if analysis.
Steps:
1. Data β What-if Analysis β Goal Seek
2. Set cell β target value
3. Change variable cell
Example: target revenue calculation.
8οΈβ£ Conditional Formatting Top 10%
Steps: Select data
Home β Conditional Formatting
Top/Bottom Rules β Top 10%
9οΈβ£ Dynamic Dashboard + Slicers
Create PivotTable
Insert β Slicer
Insert β Timeline (for dates)
Connect slicers to multiple visuals
Used for interactive dashboards.
π SUMPRODUCT (Multi-condition sum)
Used for weighted or multiple-condition calculations.
1οΈβ£1οΈβ£ What is Power Query?
Excelβs ETL tool.
Steps:
- Get Data β Load data
- Remove columns
- Change types
- Remove duplicates
- Load cleaned data
Used for automation and transformation.
1οΈβ£2οΈβ£ Freeze Panes vs Split Panes
Freeze Panes β lock rows/columns while scrolling
Split Panes β divide screen into sections
1οΈβ£3οΈβ£ XLOOKUP vs VLOOKUP
XLOOKUP:
β Advantages:
- Left lookup
- No column index
- Default exact match
- Handles errors
1οΈβ£4οΈβ£ Circular References Fix
Occurs when formula refers to itself.
Fix:
Formulas β Error Checking β Circular References
Correct formula logic
1οΈβ£5οΈβ£ Data Validation + Named Range
Steps:
1. Formulas β Define Name
2. Data β Data Validation β List
3. Select named range
Used for dropdown lists.
Excel Resources: https://whatsapp.com/channel/0029VaifY548qIzv0u1AHz3i
Double Tap β₯οΈ For More
1οΈβ£ How do you clean a messy dataset in Excel?
Steps:
- TRIM() β removes extra spaces
=TRIM(A1)- CLEAN() β removes non-printable characters
=CLEAN(A1)- Remove Duplicates β Data β Remove Duplicates
- Text to Columns β split data
- Find & Replace (Ctrl+H) β fix values
- Filter β remove blanks or errors
2οΈβ£ Absolute vs Relative References
Relative (A1) β changes when copied
Absolute ($A$1) β stays fixed
When to use:
- Relative β normal calculations
- Absolute β fixed values (tax rate, constants)
3οΈβ£ Create PivotTable for Sales Analysis
Steps:
1. Select data
2. Insert β PivotTable
3. Drag: Region β Rows, Product β Columns, Sales β Values
Used for fast data summarization.
4οΈβ£ VLOOKUP Formula + #N/A Fix
Formula:
=VLOOKUP(A2, Sheet2!A:B, 2, FALSE)Fix #N/A:
- Check lookup value exists
- Match data types
Use:
=IFERROR(VLOOKUP(A2, A:B, 2, FALSE),"Not Found")5οΈβ£ INDEX-MATCH vs VLOOKUP
VLOOKUP:
=VLOOKUP(A2,A:B,2,FALSE)INDEX-MATCH:
=INDEX(B:B, MATCH(A2,A:A,0))β Why INDEX-MATCH?
- Faster for large data
- Works left lookup
- More flexible
6οΈβ£ COUNTIF vs SUMIF vs COUNTIFS
COUNTIF β count condition
=COUNTIF(A:A,"East")SUMIF β sum condition
=SUMIF(A:A,"East",B:B)COUNTIFS β multiple conditions
=COUNTIFS(A:A,"East",B:B,">500")7οΈβ£ Goal Seek
Used for what-if analysis.
Steps:
1. Data β What-if Analysis β Goal Seek
2. Set cell β target value
3. Change variable cell
Example: target revenue calculation.
8οΈβ£ Conditional Formatting Top 10%
Steps: Select data
Home β Conditional Formatting
Top/Bottom Rules β Top 10%
9οΈβ£ Dynamic Dashboard + Slicers
Create PivotTable
Insert β Slicer
Insert β Timeline (for dates)
Connect slicers to multiple visuals
Used for interactive dashboards.
π SUMPRODUCT (Multi-condition sum)
=SUMPRODUCT((A2:A10="East")(B2:B10>500)C2:C10)Used for weighted or multiple-condition calculations.
1οΈβ£1οΈβ£ What is Power Query?
Excelβs ETL tool.
Steps:
- Get Data β Load data
- Remove columns
- Change types
- Remove duplicates
- Load cleaned data
Used for automation and transformation.
1οΈβ£2οΈβ£ Freeze Panes vs Split Panes
Freeze Panes β lock rows/columns while scrolling
Split Panes β divide screen into sections
1οΈβ£3οΈβ£ XLOOKUP vs VLOOKUP
XLOOKUP:
=XLOOKUP(A2,A:A,B:B)β Advantages:
- Left lookup
- No column index
- Default exact match
- Handles errors
1οΈβ£4οΈβ£ Circular References Fix
Occurs when formula refers to itself.
Fix:
Formulas β Error Checking β Circular References
Correct formula logic
1οΈβ£5οΈβ£ Data Validation + Named Range
Steps:
1. Formulas β Define Name
2. Data β Data Validation β List
3. Select named range
Used for dropdown lists.
Excel Resources: https://whatsapp.com/channel/0029VaifY548qIzv0u1AHz3i
Double Tap β₯οΈ For More
β€13π1
SQL Interview Questions with Answers
β 16. Write a query to find the 2nd highest salary from Employee table using subquery OR window function.
β Using Subquery
β INNER JOIN β Only matching records
β Find Duplicates
π Difference
WHERE β filter before grouping
HAVING β filter after grouping
β Using COALESCE
JOIN is usually faster, subquery is easier to read.
β 26. Write a recursive CTE for company hierarchy (CEO β managers β employees).
β Clustered Index: physically sorts table data
β Non-Clustered Index: separate structure pointing to data
SQL Resources: https://whatsapp.com/channel/0029VanC5rODzgT6TiTGoa1v
Double Tap β₯οΈ For More
β 16. Write a query to find the 2nd highest salary from Employee table using subquery OR window function.
β Using Subquery
SELECT MAX(salary) AS second_highest_salaryβ Using Window Function
FROM employees
WHERE salary < (SELECT MAX(salary) FROM employees);
SELECT salaryβ 17. Explain INNER JOIN vs LEFT JOIN vs FULL JOIN with examples for employees and departments.
FROM (
SELECT salary, DENSE_RANK() OVER (ORDER BY salary DESC) AS rnk
FROM employees
) t
WHERE rnk = 2;
β INNER JOIN β Only matching records
SELECT e.name, d.department_nameβ LEFT JOIN β All employees + matching departments
FROM employees e
INNER JOIN departments d ON e.department_id = d.id;
SELECT e.name, d.department_nameβ FULL JOIN β All records from both tables
FROM employees e
LEFT JOIN departments d ON e.department_id = d.id;
SELECT e.name, d.department_nameβ 18. Find and remove duplicate records using CTE + ROW_NUMBER().
FROM employees e
FULL JOIN departments d ON e.department_id = d.id;
β Find Duplicates
WITH cte AS (β Remove Duplicates
SELECT *, ROW_NUMBER() OVER(PARTITION BY email ORDER BY id) rn
FROM employees
)
SELECT * FROM cte WHERE rn > 1;
WITH cte AS (β 19. Explain WHERE vs HAVING with GROUP BY. Show department-wise avg salary > 50k.
SELECT *, ROW_NUMBER() OVER(PARTITION BY email ORDER BY id) rn
FROM employees
)
DELETE FROM cte WHERE rn > 1;
π Difference
WHERE β filter before grouping
HAVING β filter after grouping
SELECT department_id, AVG(salary) AS avg_salaryβ 20. Explain RANK vs DENSE_RANK vs ROW_NUMBER partitioned by department ordered by salary.
FROM employees
GROUP BY department_id
HAVING AVG(salary) > 50000;
SELECT name, department_id, salary,β 21. Find top 5 products by total sales using GROUP BY + LIMIT.
ROW_NUMBER() OVER(PARTITION BY department_id ORDER BY salary DESC) rn,
RANK() OVER(PARTITION BY department_id ORDER BY salary DESC) rnk,
DENSE_RANK() OVER(PARTITION BY department_id ORDER BY salary DESC) drnk
FROM employees;
SELECT product_id, SUM(sales_amount) AS total_salesβ 22. Write a self join to show employee name and manager name.
FROM sales
GROUP BY product_id
ORDER BY total_sales DESC
LIMIT 5;
SELECT e.name AS employee, m.name AS managerβ 23. Handle NULL salaries using COALESCE, IS NULL, IFNULL.
FROM employees e
LEFT JOIN employees m ON e.manager_id = m.employee_id;
β Using COALESCE
SELECT name, COALESCE(salary, 0) AS salaryβ Using IS NULL
FROM employees;
SELECT * FROM employees WHERE salary IS NULL;β 24. Pivot sales data by month using CASE statement.
SELECTβ 25. Subquery vs JOIN β which is faster? Why?
SUM(CASE WHEN month = 'Jan' THEN sales ELSE 0 END) AS Jan,
SUM(CASE WHEN month = 'Feb' THEN sales ELSE 0 END) AS Feb,
SUM(CASE WHEN month = 'Mar' THEN sales ELSE 0 END) AS Mar
FROM sales;
JOIN is usually faster, subquery is easier to read.
β 26. Write a recursive CTE for company hierarchy (CEO β managers β employees).
WITH RECURSIVE emp_hierarchy AS (β 27. Explain clustered vs non-clustered indexes. When to use each?
SELECT employee_id, name, manager_id
FROM employees
WHERE manager_id IS NULL
UNION ALL
SELECT e.employee_id, e.name, e.manager_id
FROM employees e
JOIN emp_hierarchy h ON e.manager_id = h.employee_id
)
SELECT * FROM emp_hierarchy;
β Clustered Index: physically sorts table data
β Non-Clustered Index: separate structure pointing to data
SQL Resources: https://whatsapp.com/channel/0029VanC5rODzgT6TiTGoa1v
Double Tap β₯οΈ For More
β€14
β
π€ AβZ of Data Analyst Terms ππ»π
A β A/B Testing
Experiment comparing two versions to see which performs better.
B β Business Intelligence (BI)
Technologies and processes for analyzing business data.
C β Correlation
Measure of relationship between two variables.
D β Data Cleaning
Process of fixing or removing incorrect/incomplete data.
E β ETL (Extract, Transform, Load)
Process of moving and preparing data for analysis.
F β Forecasting
Predicting future trends based on historical data.
G β Granularity
Level of detail in data (daily, monthly, yearly).
H β Hypothesis
Assumption made for testing using data.
I β Insight
Meaningful interpretation derived from data analysis.
J β Join
Combining data from multiple tables.
K β KPI (Key Performance Indicator)
Metric used to measure performance.
L β Linear Regression
Statistical method to model relationship between variables.
M β Metrics
Quantifiable measures used to track performance.
N β Normalization
Organizing data to reduce redundancy.
O β Outlier
Data point significantly different from others.
P β Pivot Table
Tool to summarize and analyze data.
Q β Query
Request to retrieve specific data.
R β Regression Analysis
Technique for predicting relationships between variables.
S β Segmentation
Dividing data into groups for analysis.
T β Trend Analysis
Identifying patterns over time.
U β Unstructured Data
Data without predefined format (text, images).
V β Visualization
Presenting data graphically (charts, dashboards).
W β Warehouse (Data Warehouse)
Central repository for integrated data.
X β X-Axis
Horizontal axis in charts.
Y β YoY (Year-over-Year)
Comparison of metrics from one year to another.
Z β Z-Score
Statistical measurement of how far a value is from mean.
β€οΈ Double Tap for More
A β A/B Testing
Experiment comparing two versions to see which performs better.
B β Business Intelligence (BI)
Technologies and processes for analyzing business data.
C β Correlation
Measure of relationship between two variables.
D β Data Cleaning
Process of fixing or removing incorrect/incomplete data.
E β ETL (Extract, Transform, Load)
Process of moving and preparing data for analysis.
F β Forecasting
Predicting future trends based on historical data.
G β Granularity
Level of detail in data (daily, monthly, yearly).
H β Hypothesis
Assumption made for testing using data.
I β Insight
Meaningful interpretation derived from data analysis.
J β Join
Combining data from multiple tables.
K β KPI (Key Performance Indicator)
Metric used to measure performance.
L β Linear Regression
Statistical method to model relationship between variables.
M β Metrics
Quantifiable measures used to track performance.
N β Normalization
Organizing data to reduce redundancy.
O β Outlier
Data point significantly different from others.
P β Pivot Table
Tool to summarize and analyze data.
Q β Query
Request to retrieve specific data.
R β Regression Analysis
Technique for predicting relationships between variables.
S β Segmentation
Dividing data into groups for analysis.
T β Trend Analysis
Identifying patterns over time.
U β Unstructured Data
Data without predefined format (text, images).
V β Visualization
Presenting data graphically (charts, dashboards).
W β Warehouse (Data Warehouse)
Central repository for integrated data.
X β X-Axis
Horizontal axis in charts.
Y β YoY (Year-over-Year)
Comparison of metrics from one year to another.
Z β Z-Score
Statistical measurement of how far a value is from mean.
β€οΈ Double Tap for More
β€17π1
π Donβt Overwhelm to Learn Data Analytics β Data Analytics is Only This Much π
πΉ FOUNDATIONS
1οΈβ£ What is Data Analytics
- Collecting data
- Cleaning data
- Analyzing data
- Finding insights
- Supporting decision-making
2οΈβ£ Excel (Basic Tool)
- Formulas (SUM, IF, VLOOKUP, INDEX-MATCH)
- Pivot Tables
- Charts
- Data cleaning
- Conditional formatting
π₯ Still heavily used in companies
3οΈβ£ SQL (Most Important β)
- SELECT, WHERE
- GROUP BY, HAVING
- JOINS (INNER, LEFT, RIGHT)
- Subqueries
- CTE
- Window functions
- Indexing basics
π₯ If you practice SQL daily β big advantage
4οΈβ£ Statistics Basics
- Mean, median, mode
- Variance & standard deviation
- Probability basics
- Distribution concepts
- Correlation
π₯ CORE DATA ANALYTICS SKILLS
5οΈβ£ Python for Data Analysis
- NumPy
- Pandas
- Data cleaning
- Handling missing values
- Data transformation
6οΈβ£ Data Visualization
- Matplotlib
- Seaborn
- Power BI
- Tableau
π₯ Storytelling with data is key
7οΈβ£ Data Cleaning (Very Important β)
- Handling null values
- Removing duplicates
- Data standardization
- Outlier detection
8οΈβ£ Exploratory Data Analysis (EDA)
- Understanding patterns
- Finding trends
- Correlation analysis
- Feature understanding
9οΈβ£ Business Understanding
- KPIs
- Metrics
- Business problems
- Stakeholder communication
π₯ What separates analyst from report generator
π ADVANCED ANALYTICS
π Dashboard Development
- Power BI dashboards
- Tableau dashboards
- Interactive reports
- Drill-down analysis
1οΈβ£1οΈβ£ Data Storytelling
- Presenting insights
- Creating reports
- Communicating findings clearly
1οΈβ£2οΈβ£ Basic Machine Learning (Optional)
- Regression
- Classification
- Forecasting
(Helpful but not mandatory for analyst role)
1οΈβ£3οΈβ£ A/B Testing
- Hypothesis testing
- Statistical significance
- Business experiments
1οΈβ£4οΈβ£ Data Warehousing Concepts
- Fact & dimension tables
- Star schema
- ETL basics
βοΈ INDUSTRY SKILLS
1οΈβ£5οΈβ£ Data Pipelines
- Extract β Transform β Load
- Data automation
1οΈβ£6οΈβ£ Automation
- Python scripts
- Scheduled reports
1οΈβ£7οΈβ£ Soft Skills
- Communication
- Presentation skills
- Explaining technical results simply
π₯ Extremely important in interviews
β TOOLS TO MASTER
- Excel
- SQL β
- Python
- Power BI / Tableau
- Basic statistics
Double Tap β₯οΈ For Detailed Explanation
πΉ FOUNDATIONS
1οΈβ£ What is Data Analytics
- Collecting data
- Cleaning data
- Analyzing data
- Finding insights
- Supporting decision-making
2οΈβ£ Excel (Basic Tool)
- Formulas (SUM, IF, VLOOKUP, INDEX-MATCH)
- Pivot Tables
- Charts
- Data cleaning
- Conditional formatting
π₯ Still heavily used in companies
3οΈβ£ SQL (Most Important β)
- SELECT, WHERE
- GROUP BY, HAVING
- JOINS (INNER, LEFT, RIGHT)
- Subqueries
- CTE
- Window functions
- Indexing basics
π₯ If you practice SQL daily β big advantage
4οΈβ£ Statistics Basics
- Mean, median, mode
- Variance & standard deviation
- Probability basics
- Distribution concepts
- Correlation
π₯ CORE DATA ANALYTICS SKILLS
5οΈβ£ Python for Data Analysis
- NumPy
- Pandas
- Data cleaning
- Handling missing values
- Data transformation
6οΈβ£ Data Visualization
- Matplotlib
- Seaborn
- Power BI
- Tableau
π₯ Storytelling with data is key
7οΈβ£ Data Cleaning (Very Important β)
- Handling null values
- Removing duplicates
- Data standardization
- Outlier detection
8οΈβ£ Exploratory Data Analysis (EDA)
- Understanding patterns
- Finding trends
- Correlation analysis
- Feature understanding
9οΈβ£ Business Understanding
- KPIs
- Metrics
- Business problems
- Stakeholder communication
π₯ What separates analyst from report generator
π ADVANCED ANALYTICS
π Dashboard Development
- Power BI dashboards
- Tableau dashboards
- Interactive reports
- Drill-down analysis
1οΈβ£1οΈβ£ Data Storytelling
- Presenting insights
- Creating reports
- Communicating findings clearly
1οΈβ£2οΈβ£ Basic Machine Learning (Optional)
- Regression
- Classification
- Forecasting
(Helpful but not mandatory for analyst role)
1οΈβ£3οΈβ£ A/B Testing
- Hypothesis testing
- Statistical significance
- Business experiments
1οΈβ£4οΈβ£ Data Warehousing Concepts
- Fact & dimension tables
- Star schema
- ETL basics
βοΈ INDUSTRY SKILLS
1οΈβ£5οΈβ£ Data Pipelines
- Extract β Transform β Load
- Data automation
1οΈβ£6οΈβ£ Automation
- Python scripts
- Scheduled reports
1οΈβ£7οΈβ£ Soft Skills
- Communication
- Presentation skills
- Explaining technical results simply
π₯ Extremely important in interviews
β TOOLS TO MASTER
- Excel
- SQL β
- Python
- Power BI / Tableau
- Basic statistics
Double Tap β₯οΈ For Detailed Explanation
β€46π2
π Data Analytics Fundamentals β Part:1
Data Analytics is the process of collecting, cleaning, transforming, and analyzing data to find useful insights that help businesses make better decisions.
π In simple words:
Data Analytics = Turning raw data into meaningful information.
Companies generate huge amounts of data daily (sales, customers, website visits, transactions). A data analyst converts this raw data into insights that improve performance and solve business problems.
β Why Data Analytics is Important
- Helps companies make data-driven decisions
- Improves business performance
- Identifies trends and patterns
- Predicts future outcomes
- Reduces risks
- Improves customer experience
π Example:
- Amazon recommends products β data analytics
- Netflix suggests movies β data analytics
- Companies track sales performance β data analytics
π Data Analytics Process (Step-by-Step)
1οΈβ£ Data Collection
Gathering data from different sources.
Sources include:
- Databases
- Excel files
- Websites
- Surveys
- Business applications
- APIs
π Example: Sales data, customer data, website traffic.
2οΈβ£ Data Cleaning (Most Time-Consuming Step β)
Raw data is messy and contains errors. Cleaning includes:
- Removing duplicates
- Handling missing values
- Fixing incorrect data
- Standardizing formats
π Example: Fixing names like βRahulβ, βrahulβ, βRAHULβ into one format.
π‘ Fun Fact: Data analysts spend ~70β80% of time cleaning data.
3οΈβ£ Data Analysis
Applying techniques to understand data. Includes:
- Finding trends
- Comparing values
- Calculating metrics
- Identifying patterns
π Example: Finding which product sells the most.
4οΈβ£ Finding Insights
Converting analysis into meaningful conclusions.
π Example:
- Sales drop on weekends
- Customers prefer online payments
- Certain regions generate more profit
Insights answer βWhy is this happening?β
5οΈβ£ Supporting Decision Making (Final Goal β)
Using insights to help businesses take action.
π Example:
- Increase marketing in high-performing regions
- Improve weak products
- Optimize pricing strategy
π‘ Final purpose of data analytics = Better decisions.
π§ Types of Data Analytics (Interview Important)
1οΈβ£ Descriptive Analytics β What happened?
- Past data analysis
- Reports and dashboards
π Example: Monthly sales report.
2οΈβ£ Diagnostic Analytics β Why it happened?
- Root cause analysis
π Example: Why sales dropped last month.
3οΈβ£ Predictive Analytics β What will happen?
- Forecasting future trends
π Example: Next month sales prediction.
4οΈβ£ Prescriptive Analytics β What should we do?
- Suggests best actions
π Example: Best pricing strategy.
πΌ Real-Life Example of Data Analytics
π E-commerce Company
- Collect customer purchase data
- Clean incorrect records
- Analyze buying patterns
- Find popular products
- Recommend products to customers
Result β More sales.
β Role of a Data Analyst
A data analyst:
β Collects data
β Cleans data
β Analyzes data
β Finds patterns
β Builds reports/dashboards
β Communicates insights
π Not just numbers β solving business problems.
Double Tap β₯οΈ For Part-2
Data Analytics is the process of collecting, cleaning, transforming, and analyzing data to find useful insights that help businesses make better decisions.
π In simple words:
Data Analytics = Turning raw data into meaningful information.
Companies generate huge amounts of data daily (sales, customers, website visits, transactions). A data analyst converts this raw data into insights that improve performance and solve business problems.
β Why Data Analytics is Important
- Helps companies make data-driven decisions
- Improves business performance
- Identifies trends and patterns
- Predicts future outcomes
- Reduces risks
- Improves customer experience
π Example:
- Amazon recommends products β data analytics
- Netflix suggests movies β data analytics
- Companies track sales performance β data analytics
π Data Analytics Process (Step-by-Step)
1οΈβ£ Data Collection
Gathering data from different sources.
Sources include:
- Databases
- Excel files
- Websites
- Surveys
- Business applications
- APIs
π Example: Sales data, customer data, website traffic.
2οΈβ£ Data Cleaning (Most Time-Consuming Step β)
Raw data is messy and contains errors. Cleaning includes:
- Removing duplicates
- Handling missing values
- Fixing incorrect data
- Standardizing formats
π Example: Fixing names like βRahulβ, βrahulβ, βRAHULβ into one format.
π‘ Fun Fact: Data analysts spend ~70β80% of time cleaning data.
3οΈβ£ Data Analysis
Applying techniques to understand data. Includes:
- Finding trends
- Comparing values
- Calculating metrics
- Identifying patterns
π Example: Finding which product sells the most.
4οΈβ£ Finding Insights
Converting analysis into meaningful conclusions.
π Example:
- Sales drop on weekends
- Customers prefer online payments
- Certain regions generate more profit
Insights answer βWhy is this happening?β
5οΈβ£ Supporting Decision Making (Final Goal β)
Using insights to help businesses take action.
π Example:
- Increase marketing in high-performing regions
- Improve weak products
- Optimize pricing strategy
π‘ Final purpose of data analytics = Better decisions.
π§ Types of Data Analytics (Interview Important)
1οΈβ£ Descriptive Analytics β What happened?
- Past data analysis
- Reports and dashboards
π Example: Monthly sales report.
2οΈβ£ Diagnostic Analytics β Why it happened?
- Root cause analysis
π Example: Why sales dropped last month.
3οΈβ£ Predictive Analytics β What will happen?
- Forecasting future trends
π Example: Next month sales prediction.
4οΈβ£ Prescriptive Analytics β What should we do?
- Suggests best actions
π Example: Best pricing strategy.
πΌ Real-Life Example of Data Analytics
π E-commerce Company
- Collect customer purchase data
- Clean incorrect records
- Analyze buying patterns
- Find popular products
- Recommend products to customers
Result β More sales.
β Role of a Data Analyst
A data analyst:
β Collects data
β Cleans data
β Analyzes data
β Finds patterns
β Builds reports/dashboards
β Communicates insights
π Not just numbers β solving business problems.
Double Tap β₯οΈ For Part-2
β€37
π Data Analytics Fundamentals β Part:2
π Excel in Data Analytics
β’ Microsoft Excel is a spreadsheet tool used for data cleaning, analysis, and visualization using formulas, pivot tables, and charts.
β’ Companies use Excel daily for reporting, dashboards, and quick analysis.
β Why Excel is Important for Data Analysts
β’ Used in almost every organization
β’ Best tool for quick analysis
β’ Helps clean messy data
β’ Creates reports and dashboards
β’ Used in interviews and real jobs
β’ Many companies expect strong Excel skills before SQL/Python.
π Core Excel Skills for Data Analytics
1οΈβ£ Formulas Functions (Most Important β)
β’ Formulas help perform calculations automatically.
β’ Common formulas:
β SUM() β Adds numbers
β AVERAGE() β Finds average
β IF() β Conditional logic
β VLOOKUP() β Search data vertically
β INDEX + MATCH β Advanced lookup
β COUNT() / COUNTIF() β Count values
β’ Examples:
β Find total sales
β Check pass/fail results
β Merge data from two sheets
2οΈβ£ Pivot Tables (Very Important β)
β’ Summarize large data quickly
β’ Used for:
β Grouping data
β Calculating totals
β Comparing categories
β Creating reports
β’ Examples:
β Total sales by region
β Employee count by department
β Monthly revenue summary
3οΈβ£ Data Cleaning in Excel
β’ Raw data contains errors β Excel helps fix them.
β’ Common cleaning tasks:
β Remove duplicates
β Handle missing values
β Trim extra spaces
β Split text into columns
β Standardize formats
β’ Tools used:
β Remove Duplicates
β Text to Columns
β Find Replace
β TRIM function
4οΈβ£ Sorting Filtering
β’ Helps explore and understand data.
β’ Used for:
β Finding top values
β Filtering specific records
β Organizing data logically
β’ Examples:
β Top 10 customers
β Filter sales above βΉ50,000
5οΈβ£ Conditional Formatting
β’ Highlights important data visually.
β’ Examples:
β Highlight highest sales
β Mark low performance
β Show trends using color
6οΈβ£ Charts Visualization
β’ Excel creates visual reports.
β’ Common charts:
β Bar chart
β Line chart
β Pie chart
β Histogram
β’ Used for:
β Showing trends
β Comparing performance
β Presenting insights
π How Excel is Used in Real Data Analyst Workflow
β’ Step 1 β Import data
β’ Step 2 β Clean data
β’ Step 3 β Analyze using formulas/pivot tables
β’ Step 4 β Create charts
β’ Step 5 β Share report
πΌ Real-World Example π Sales Analysis
β’ Import sales data
β’ Remove duplicate records
β’ Use pivot table for total sales
β’ Create chart for trends
β’ Share report with manager
π― Excel vs SQL vs Python
β’ Excel β Small/medium data, quick analysis
β’ SQL β Large database queries
β’ Python β Advanced analysis automation
β Excel Topics in Interviews
β’ VLOOKUP vs INDEX MATCH
β’ Pivot tables
β’ Conditional formatting
β’ Removing duplicates
β’ Data cleaning techniques
β’ Charts dashboards
Excel Resources: https://whatsapp.com/channel/0029VaifY548qIzv0u1AHz3i
Double Tap β₯οΈ For Part-3
π Excel in Data Analytics
β’ Microsoft Excel is a spreadsheet tool used for data cleaning, analysis, and visualization using formulas, pivot tables, and charts.
β’ Companies use Excel daily for reporting, dashboards, and quick analysis.
β Why Excel is Important for Data Analysts
β’ Used in almost every organization
β’ Best tool for quick analysis
β’ Helps clean messy data
β’ Creates reports and dashboards
β’ Used in interviews and real jobs
β’ Many companies expect strong Excel skills before SQL/Python.
π Core Excel Skills for Data Analytics
1οΈβ£ Formulas Functions (Most Important β)
β’ Formulas help perform calculations automatically.
β’ Common formulas:
β SUM() β Adds numbers
β AVERAGE() β Finds average
β IF() β Conditional logic
β VLOOKUP() β Search data vertically
β INDEX + MATCH β Advanced lookup
β COUNT() / COUNTIF() β Count values
β’ Examples:
β Find total sales
β Check pass/fail results
β Merge data from two sheets
2οΈβ£ Pivot Tables (Very Important β)
β’ Summarize large data quickly
β’ Used for:
β Grouping data
β Calculating totals
β Comparing categories
β Creating reports
β’ Examples:
β Total sales by region
β Employee count by department
β Monthly revenue summary
3οΈβ£ Data Cleaning in Excel
β’ Raw data contains errors β Excel helps fix them.
β’ Common cleaning tasks:
β Remove duplicates
β Handle missing values
β Trim extra spaces
β Split text into columns
β Standardize formats
β’ Tools used:
β Remove Duplicates
β Text to Columns
β Find Replace
β TRIM function
4οΈβ£ Sorting Filtering
β’ Helps explore and understand data.
β’ Used for:
β Finding top values
β Filtering specific records
β Organizing data logically
β’ Examples:
β Top 10 customers
β Filter sales above βΉ50,000
5οΈβ£ Conditional Formatting
β’ Highlights important data visually.
β’ Examples:
β Highlight highest sales
β Mark low performance
β Show trends using color
6οΈβ£ Charts Visualization
β’ Excel creates visual reports.
β’ Common charts:
β Bar chart
β Line chart
β Pie chart
β Histogram
β’ Used for:
β Showing trends
β Comparing performance
β Presenting insights
π How Excel is Used in Real Data Analyst Workflow
β’ Step 1 β Import data
β’ Step 2 β Clean data
β’ Step 3 β Analyze using formulas/pivot tables
β’ Step 4 β Create charts
β’ Step 5 β Share report
πΌ Real-World Example π Sales Analysis
β’ Import sales data
β’ Remove duplicate records
β’ Use pivot table for total sales
β’ Create chart for trends
β’ Share report with manager
π― Excel vs SQL vs Python
β’ Excel β Small/medium data, quick analysis
β’ SQL β Large database queries
β’ Python β Advanced analysis automation
β Excel Topics in Interviews
β’ VLOOKUP vs INDEX MATCH
β’ Pivot tables
β’ Conditional formatting
β’ Removing duplicates
β’ Data cleaning techniques
β’ Charts dashboards
Excel Resources: https://whatsapp.com/channel/0029VaifY548qIzv0u1AHz3i
Double Tap β₯οΈ For Part-3
β€35
Which SQL command is used to retrieve data from a database?
Anonymous Quiz
7%
A) INSERT
79%
B) SELECT
11%
C) UPDATE
2%
D) DELETE
β€3
Which SQL clause is used to filter records based on conditions?
Anonymous Quiz
15%
A) GROUP BY
15%
B) ORDER BY
68%
C) WHERE
2%
D) JOIN
β€3
Which SQL clause is used to group rows that have the same values?
Anonymous Quiz
80%
A) GROUP BY
7%
B) WHERE
4%
C) SELECT
9%
D) ORDER BY
β€3π₯°2
Which SQL operation is used to combine data from two or more tables?
Anonymous Quiz
2%
A) FILTER
20%
B) MERGE
68%
C) JOIN
11%
D) UNION
β€4
Which SQL function is used to assign ranking to rows in window functions?
Anonymous Quiz
4%
A) SUM()
7%
B) COUNT()
87%
C) RANK()
2%
D) AVG()
β€7
7 Underrated WhatsApp channels which will teach you life lessons:
48 Laws of Power: https://whatsapp.com/channel/0029Vb5vcEm1CYoM5rd6Ob3f
Broken by Books: https://whatsapp.com/channel/0029VbAi7WJId7nFFfTeTm21
Alpha Mode: https://whatsapp.com/channel/0029VbAV7CRId7nNluCbrb20
Psychology: https://whatsapp.com/channel/0029Vb62WgKG8l5KlJpcIe2r
Science Facts: https://whatsapp.com/channel/0029Vb5m9UR6xCSQo1YXTA0O
Learn English Speaking & Communication Skills: https://whatsapp.com/channel/0029VbApHoXJUM2eLAE62u3R
1% Money Mindset: https://whatsapp.com/channel/0029Vb4z8pbGZNCziNy4803J
Double Tap β€οΈ For More
48 Laws of Power: https://whatsapp.com/channel/0029Vb5vcEm1CYoM5rd6Ob3f
Broken by Books: https://whatsapp.com/channel/0029VbAi7WJId7nFFfTeTm21
Alpha Mode: https://whatsapp.com/channel/0029VbAV7CRId7nNluCbrb20
Psychology: https://whatsapp.com/channel/0029Vb62WgKG8l5KlJpcIe2r
Science Facts: https://whatsapp.com/channel/0029Vb5m9UR6xCSQo1YXTA0O
Learn English Speaking & Communication Skills: https://whatsapp.com/channel/0029VbApHoXJUM2eLAE62u3R
1% Money Mindset: https://whatsapp.com/channel/0029Vb4z8pbGZNCziNy4803J
Double Tap β€οΈ For More
β€5π₯°2
Which function is used to count the total number of rows in a table?
Anonymous Quiz
10%
A) SUM()
82%
B) COUNT()
2%
C) AVG()
6%
D) TOTAL()
β€6
What is the difference between COUNT(*) and COUNT(column_name)?
Anonymous Quiz
15%
A) Both behave the same
30%
B) COUNT(*) ignores NULL values
36%
C) COUNT(column_name) ignores NULL values
19%
D) COUNT(column_name) counts all rows
β€2
Which clause is used to group rows with the same values?
Anonymous Quiz
11%
A) ORDER BY
9%
B) WHERE
70%
C) GROUP BY
10%
D) HAVING