πΉ Top 10 SQL Functions/Commands Commonly Used in Data Analysis π
1οΈβ£ SELECT
β Used to retrieve specific columns from a table.
2οΈβ£ WHERE
β Filters rows based on a condition.
3οΈβ£ GROUP BY
β Groups rows that have the same values into summary rows.
4οΈβ£ ORDER BY
β Sorts the result by one or more columns.
5οΈβ£ JOIN
β Combines rows from two or more tables based on a related column.
6οΈβ£ COUNT() / SUM() / AVG() / MIN() / MAX()
β Common aggregate functions for metrics and summaries.
7οΈβ£ HAVING
β Filters after a GROUP BY (unlike WHERE, which filters before).
8οΈβ£ LIMIT
β Restricts number of rows returned.
9οΈβ£ CASE
β Implements conditional logic in queries.
π DATE functions (NOW(), DATE_PART(), DATEDIFF(), etc.)
β Handle and extract info from dates.
Join our WhatsApp channel: https://whatsapp.com/channel/0029VbAbnvPLSmbeFYNdNA29
π Tap β€οΈ for more!
1οΈβ£ SELECT
β Used to retrieve specific columns from a table.
SELECT name, age FROM users;
2οΈβ£ WHERE
β Filters rows based on a condition.
SELECT * FROM sales WHERE region = 'North';
3οΈβ£ GROUP BY
β Groups rows that have the same values into summary rows.
SELECT region, SUM(sales) FROM sales GROUP BY region;
4οΈβ£ ORDER BY
β Sorts the result by one or more columns.
SELECT * FROM customers ORDER BY created_at DESC;
5οΈβ£ JOIN
β Combines rows from two or more tables based on a related column.
SELECT a.name, b.salary
FROM employees a
JOIN salaries b ON a.id = b.emp_id;
6οΈβ£ COUNT() / SUM() / AVG() / MIN() / MAX()
β Common aggregate functions for metrics and summaries.
SELECT COUNT(*) FROM orders WHERE status = 'completed';
7οΈβ£ HAVING
β Filters after a GROUP BY (unlike WHERE, which filters before).
SELECT department, COUNT(*) FROM employees GROUP BY department HAVING COUNT(*) > 10;
8οΈβ£ LIMIT
β Restricts number of rows returned.
SELECT * FROM products LIMIT 5;
9οΈβ£ CASE
β Implements conditional logic in queries.
SELECT name,
CASE
WHEN score >= 90 THEN 'A'
WHEN score >= 75 THEN 'B'
ELSE 'C'
END AS grade
FROM students;
π DATE functions (NOW(), DATE_PART(), DATEDIFF(), etc.)
β Handle and extract info from dates.
SELECT DATE_PART('year', order_date) FROM orders;Join our WhatsApp channel: https://whatsapp.com/channel/0029VbAbnvPLSmbeFYNdNA29
π Tap β€οΈ for more!
β€13π4π3π₯1
β
7 Habits That Make You a Better Data Analyst ππ§
1οΈβ£ Explore Real Datasets Regularly
β Use Kaggle, Data.gov, or Google Dataset Search
β Focus on different domains: sales, HR, marketing, etc.
2οΈβ£ Master the Art of Asking Questions
β Start with: What do we want to know?
β Then: What data do we need to answer it?
3οΈβ£ Use SQL & Excel Daily
β Practice joins, window functions, pivot tables, formulas
β Aim to solve 1 real-world query per day
4οΈβ£ Visualize Everything
β Use Power BI, Tableau, or Matplotlib
β Keep charts simple, clear, and insight-driven
5οΈβ£ Storytelling > Just Reporting
β Always add βSo what?β to your analysis
β Help stakeholders take action, not just read numbers
6οΈβ£ Document Your Work
β Use Notion, Google Docs, or GitHub
β Write what you did, how, and whyβitβll save time later
7οΈβ£ Review & Reflect Weekly
β What did you learn? What confused you?
β Track mistakes + insights in a learning journal
π‘ Pro Tip: Join data communities (Reddit, LinkedIn, Slack groups) to grow faster.
π Tap for more
1οΈβ£ Explore Real Datasets Regularly
β Use Kaggle, Data.gov, or Google Dataset Search
β Focus on different domains: sales, HR, marketing, etc.
2οΈβ£ Master the Art of Asking Questions
β Start with: What do we want to know?
β Then: What data do we need to answer it?
3οΈβ£ Use SQL & Excel Daily
β Practice joins, window functions, pivot tables, formulas
β Aim to solve 1 real-world query per day
4οΈβ£ Visualize Everything
β Use Power BI, Tableau, or Matplotlib
β Keep charts simple, clear, and insight-driven
5οΈβ£ Storytelling > Just Reporting
β Always add βSo what?β to your analysis
β Help stakeholders take action, not just read numbers
6οΈβ£ Document Your Work
β Use Notion, Google Docs, or GitHub
β Write what you did, how, and whyβitβll save time later
7οΈβ£ Review & Reflect Weekly
β What did you learn? What confused you?
β Track mistakes + insights in a learning journal
π‘ Pro Tip: Join data communities (Reddit, LinkedIn, Slack groups) to grow faster.
π Tap for more
β€26π4π2π₯°1π1
Which SQL command is used to add new records into a table?*
Anonymous Quiz
26%
a) UPDATE
2%
b) DELETE
70%
c) INSERT
2%
d) SELECT
β€10
What does a correlated subquery mean?
Anonymous Quiz
5%
a) A subquery executed only once
78%
b) A subquery that depends on the outer query for its values
10%
c) A subquery that returns multiple rows
7%
d) A subquery with UNION operation
β€4π€©1
Which of the following is used to combine the results of two SELECT statements and removes duplicates?
Anonymous Quiz
70%
UNION
30%
UNION ALL
β€5π₯°1
Which SQL function would you use to find the number of days between two dates?
Anonymous Quiz
2%
a) NOW()
83%
b) DATEDIFF()
5%
c) SUBSTRING()
9%
d) COUNT()
β€5
What does the following SQL command do?
ALTER TABLE employees ADD COLUMN salary INT;
ALTER TABLE employees ADD COLUMN salary INT;
Anonymous Quiz
2%
a) Deletes the salary column
88%
b) Adds a new column named salary of type integer
9%
c) Changes salary column to integer
2%
d) Drops the table employees
β€3π₯°1
Which constraint ensures that a column cannot have NULL values?
Anonymous Quiz
29%
UNIQUE
71%
NOT NULL
β€5π₯°1
Which of the following statements about Views is TRUE?
Anonymous Quiz
8%
a) Views store data physically
8%
b) Views cannot be updated
76%
c) Views are virtual tables created by a query
8%
d) Views automatically index the data
β€8π₯2
β° Quick Reminder!
π Agent.ai Challenge is LIVE!
π° Win up to $50,000 β no code needed!
π₯ Open to all. Limited time!
π Register now β
https://shorturl.at/lSfTv
Double Tap β€οΈ for more AI Resources
π Agent.ai Challenge is LIVE!
π° Win up to $50,000 β no code needed!
π₯ Open to all. Limited time!
π Register now β
https://shorturl.at/lSfTv
Double Tap β€οΈ for more AI Resources
β€6π2π1
π Data Analyst Interview Cheat Sheet (2025 Edition)
β 1. SQL Essentials
Key Concepts:
β’ SELECT, WHERE, GROUP BY, HAVING
β’ JOINs (INNER, LEFT, RIGHT, FULL)
β’ Window Functions (ROW_NUMBER, RANK, LEAD/LAG)
β’ Subqueries & CTEs
β’ Aggregations & Filtering
Practice Queries:
β’ Top 3 customers by revenue
β’ Monthly active users
β’ Running total or moving average
β’ Products never sold
β 2. Excel/Spreadsheet Skills
Key Concepts:
β’ VLOOKUP, XLOOKUP, INDEX-MATCH
β’ IF, AND, OR logic
β’ Pivot Tables & Charts
β’ Conditional Formatting
β’ Data Cleaning Functions (TRIM, CLEAN, TEXTSPLIT)
β 3. Data Visualization
Tools: Tableau, Power BI, Excel
Key Charts:
β’ Line chart β Trend
β’ Bar chart β Comparison
β’ Pie chart β Distribution
β’ Scatter plot β Correlation
β’ Heatmaps
Best Practices:
β’ Keep visuals simple & clear
β’ Use color intentionally
β’ Add titles, labels, tooltips
β 4. Statistics & Analytics Concepts
Key Concepts:
β’ Mean, Median, Mode
β’ Standard Deviation, Variance
β’ Correlation vs Causation
β’ Hypothesis Testing (p-value, t-test)
β’ A/B Testing basics
β’ Confidence Intervals
β 5. Python for Data Analysis
Key Libraries:
β’ Pandas β data manipulation
β’ NumPy β numerical ops
β’ Matplotlib/Seaborn β visualization
β’ SQLAlchemy β database access
Common Tasks:
β’ Read CSV/excel files
β’ GroupBy and aggregations
β’ Handling missing data
β’ Merge/join datasets
β’ Create charts
β 6. Business Acumen & Communication
Key Skills:
β’ Ask the right questions
β’ Translate data into insights
β’ Storytelling with data
β’ Build dashboards with KPIs
β’ Communicate with non-tech stakeholders
β 7. Tools to Know
β’ Excel / Google Sheets
β’ SQL (MySQL, PostgreSQL, etc.)
β’ Tableau / Power BI
β’ Python / R
β’ Jupyter / VS Code
π Tap β€οΈ for more!
β 1. SQL Essentials
Key Concepts:
β’ SELECT, WHERE, GROUP BY, HAVING
β’ JOINs (INNER, LEFT, RIGHT, FULL)
β’ Window Functions (ROW_NUMBER, RANK, LEAD/LAG)
β’ Subqueries & CTEs
β’ Aggregations & Filtering
Practice Queries:
β’ Top 3 customers by revenue
β’ Monthly active users
β’ Running total or moving average
β’ Products never sold
β 2. Excel/Spreadsheet Skills
Key Concepts:
β’ VLOOKUP, XLOOKUP, INDEX-MATCH
β’ IF, AND, OR logic
β’ Pivot Tables & Charts
β’ Conditional Formatting
β’ Data Cleaning Functions (TRIM, CLEAN, TEXTSPLIT)
β 3. Data Visualization
Tools: Tableau, Power BI, Excel
Key Charts:
β’ Line chart β Trend
β’ Bar chart β Comparison
β’ Pie chart β Distribution
β’ Scatter plot β Correlation
β’ Heatmaps
Best Practices:
β’ Keep visuals simple & clear
β’ Use color intentionally
β’ Add titles, labels, tooltips
β 4. Statistics & Analytics Concepts
Key Concepts:
β’ Mean, Median, Mode
β’ Standard Deviation, Variance
β’ Correlation vs Causation
β’ Hypothesis Testing (p-value, t-test)
β’ A/B Testing basics
β’ Confidence Intervals
β 5. Python for Data Analysis
Key Libraries:
β’ Pandas β data manipulation
β’ NumPy β numerical ops
β’ Matplotlib/Seaborn β visualization
β’ SQLAlchemy β database access
Common Tasks:
β’ Read CSV/excel files
β’ GroupBy and aggregations
β’ Handling missing data
β’ Merge/join datasets
β’ Create charts
β 6. Business Acumen & Communication
Key Skills:
β’ Ask the right questions
β’ Translate data into insights
β’ Storytelling with data
β’ Build dashboards with KPIs
β’ Communicate with non-tech stakeholders
β 7. Tools to Know
β’ Excel / Google Sheets
β’ SQL (MySQL, PostgreSQL, etc.)
β’ Tableau / Power BI
β’ Python / R
β’ Jupyter / VS Code
π Tap β€οΈ for more!
β€19π6π₯°2π2
β
20 Data Analyst Interview Questions
1. What is data analysis
The process of inspecting, cleaning, transforming, and modeling data to discover useful information and support decision-making.
2. What tools do data analysts commonly use
Excel, SQL, Python, R, Tableau, Power BI, SAS, and Google Sheets. Each tool serves different purposes like querying, visualization, or statistical analysis.
3. What is the difference between data analyst and data scientist
β’ Data Analyst: Focuses on interpreting existing data and generating reports
β’ Data Scientist: Builds predictive models and algorithms using advanced techniques
4. How do you handle missing data
β’ Remove rows
β’ Impute values (mean, median, mode)
β’ Use algorithms that handle missing data
β’ Flag missing values for analysis
5. What is the difference between INNER JOIN and LEFT JOIN in SQL
β’ INNER JOIN: Returns only matching rows
β’ LEFT JOIN: Returns all rows from the left table and matching rows from the right
6. What is normalization in databases
Organizing data to reduce redundancy and improve integrity. Common forms: 1NF, 2NF, 3NF.
7. How do you ensure data quality
β’ Validate data sources
β’ Check for duplicates and missing values
β’ Use consistency checks
β’ Automate data cleaning pipelines
8. What is the difference between structured and unstructured data
β’ Structured: Organized in rows and columns (e.g., SQL tables)
β’ Unstructured: No fixed format (e.g., images, emails, social media)
9. What is exploratory data analysis (EDA)
Initial investigation of data using visualizations and statistics to uncover patterns, anomalies, and relationships.
10. How do you visualize data effectively
Choose the right chart type (bar, line, pie, scatter), use clear labels, avoid clutter, and highlight key insights.
11. What is the difference between COUNT, COUNT(*) and COUNT(column) in SQL
β’ COUNT(*): Counts all rows
β’ COUNT(column): Counts non-null values in that column
12. What is a pivot table
A tool in Excel or BI platforms that summarizes data by grouping and aggregating values dynamically.
13. How do you calculate correlation between two variables
Use Pearson correlation coefficient in Python (df.corr()), R, or Excel. Values range from -1 to +1.
14. What is the difference between a dashboard and a report
β’ Dashboard: Interactive, real-time visual summary
β’ Report: Static or scheduled document with detailed analysis
15. What is the purpose of GROUP BY in SQL
Used to aggregate data across rows that share a common value in one or more columns.
16. What is the difference between WHERE and HAVING in SQL
β’ WHERE: Filters rows before aggregation
β’ HAVING: Filters groups after aggregation
17. How do you handle outliers in data
β’ Remove or cap them
β’ Use robust statistical methods
β’ Transform data (e.g., log scale)
18. What is the difference between mean, median, and mode
β’ Mean: Average
β’ Median: Middle value
β’ Mode: Most frequent value
19. What is time series analysis
Analyzing data points collected over time to identify trends, seasonality, and make forecasts.
20. How do you communicate insights to non-technical stakeholders
Use simple language, visualizations, storytelling, and focus on business impact rather than technical jargon.
π React for more Interview Resources
1. What is data analysis
The process of inspecting, cleaning, transforming, and modeling data to discover useful information and support decision-making.
2. What tools do data analysts commonly use
Excel, SQL, Python, R, Tableau, Power BI, SAS, and Google Sheets. Each tool serves different purposes like querying, visualization, or statistical analysis.
3. What is the difference between data analyst and data scientist
β’ Data Analyst: Focuses on interpreting existing data and generating reports
β’ Data Scientist: Builds predictive models and algorithms using advanced techniques
4. How do you handle missing data
β’ Remove rows
β’ Impute values (mean, median, mode)
β’ Use algorithms that handle missing data
β’ Flag missing values for analysis
5. What is the difference between INNER JOIN and LEFT JOIN in SQL
β’ INNER JOIN: Returns only matching rows
β’ LEFT JOIN: Returns all rows from the left table and matching rows from the right
6. What is normalization in databases
Organizing data to reduce redundancy and improve integrity. Common forms: 1NF, 2NF, 3NF.
7. How do you ensure data quality
β’ Validate data sources
β’ Check for duplicates and missing values
β’ Use consistency checks
β’ Automate data cleaning pipelines
8. What is the difference between structured and unstructured data
β’ Structured: Organized in rows and columns (e.g., SQL tables)
β’ Unstructured: No fixed format (e.g., images, emails, social media)
9. What is exploratory data analysis (EDA)
Initial investigation of data using visualizations and statistics to uncover patterns, anomalies, and relationships.
10. How do you visualize data effectively
Choose the right chart type (bar, line, pie, scatter), use clear labels, avoid clutter, and highlight key insights.
11. What is the difference between COUNT, COUNT(*) and COUNT(column) in SQL
β’ COUNT(*): Counts all rows
β’ COUNT(column): Counts non-null values in that column
12. What is a pivot table
A tool in Excel or BI platforms that summarizes data by grouping and aggregating values dynamically.
13. How do you calculate correlation between two variables
Use Pearson correlation coefficient in Python (df.corr()), R, or Excel. Values range from -1 to +1.
14. What is the difference between a dashboard and a report
β’ Dashboard: Interactive, real-time visual summary
β’ Report: Static or scheduled document with detailed analysis
15. What is the purpose of GROUP BY in SQL
Used to aggregate data across rows that share a common value in one or more columns.
16. What is the difference between WHERE and HAVING in SQL
β’ WHERE: Filters rows before aggregation
β’ HAVING: Filters groups after aggregation
17. How do you handle outliers in data
β’ Remove or cap them
β’ Use robust statistical methods
β’ Transform data (e.g., log scale)
18. What is the difference between mean, median, and mode
β’ Mean: Average
β’ Median: Middle value
β’ Mode: Most frequent value
19. What is time series analysis
Analyzing data points collected over time to identify trends, seasonality, and make forecasts.
20. How do you communicate insights to non-technical stakeholders
Use simple language, visualizations, storytelling, and focus on business impact rather than technical jargon.
π React for more Interview Resources
β€19π4
Top Excel Formulas Every Data Analyst Should Know
SUM():
Purpose: Adds up a range of numbers.
Example: =SUM(A1:A10)
AVERAGE():
Purpose: Calculates the average of a range of numbers.
Example: =AVERAGE(B1:B10)
COUNT():
Purpose: Counts the number of cells containing numbers.
Example: =COUNT(C1:C10)
IF():
Purpose: Returns one value if a condition is true, and another if false.
Example: =IF(A1 > 10, "Yes", "No")
VLOOKUP():
Purpose: Searches for a value in the first column and returns a value in the same row from another column.
Example: =VLOOKUP(D1, A1:B10, 2, FALSE)
HLOOKUP():
Purpose: Searches for a value in the first row and returns a value in the same column from another row.
Example: =HLOOKUP("Sales", A1:F5, 3, FALSE)
INDEX():
Purpose: Returns the value of a cell based on row and column numbers.
Example: =INDEX(A1:C10, 2, 3)
MATCH():
Purpose: Searches for a value and returns its position in a range.
Example: =MATCH("Product B", A1:A10, 0)
CONCATENATE() or CONCAT():
Purpose: Joins multiple text strings into one.
Example: =CONCATENATE(A1, " ", B1)
TEXT():
Purpose: Formats numbers or dates as text.
Example: =TEXT(A1, "dd/mm/yyyy")
Excel Resources: t.me/excel_data
I have curated Data Analytics Resources ππ
https://whatsapp.com/channel/0029VaGgzAk72WTmQFERKh02
SUM():
Purpose: Adds up a range of numbers.
Example: =SUM(A1:A10)
AVERAGE():
Purpose: Calculates the average of a range of numbers.
Example: =AVERAGE(B1:B10)
COUNT():
Purpose: Counts the number of cells containing numbers.
Example: =COUNT(C1:C10)
IF():
Purpose: Returns one value if a condition is true, and another if false.
Example: =IF(A1 > 10, "Yes", "No")
VLOOKUP():
Purpose: Searches for a value in the first column and returns a value in the same row from another column.
Example: =VLOOKUP(D1, A1:B10, 2, FALSE)
HLOOKUP():
Purpose: Searches for a value in the first row and returns a value in the same column from another row.
Example: =HLOOKUP("Sales", A1:F5, 3, FALSE)
INDEX():
Purpose: Returns the value of a cell based on row and column numbers.
Example: =INDEX(A1:C10, 2, 3)
MATCH():
Purpose: Searches for a value and returns its position in a range.
Example: =MATCH("Product B", A1:A10, 0)
CONCATENATE() or CONCAT():
Purpose: Joins multiple text strings into one.
Example: =CONCATENATE(A1, " ", B1)
TEXT():
Purpose: Formats numbers or dates as text.
Example: =TEXT(A1, "dd/mm/yyyy")
Excel Resources: t.me/excel_data
I have curated Data Analytics Resources ππ
https://whatsapp.com/channel/0029VaGgzAk72WTmQFERKh02
β€16π2π1
β SQL alone wonβt make you a Data Analyst
β SQL wonβt guarantee you a 20 LPA job
β SQL cannot be mastered in one weekend
β SQL is not just βSELECT * FROM tableβ
β SQL isnβt only for technical people
β SQL is not outdated or getting replaced
But hereβs what SQL *can* do:
βοΈ SQL helps you handle millions of rows with ease
βοΈ SQL empowers you to extract real insights from raw data
βοΈ SQL makes you independent of Excel limitations
βοΈ SQL lets you ask deep, complex business questions
βοΈ SQL is the foundation of most data tools (Power BI, Tableau, Python, etc.)
βοΈ SQL is a must-have skill for data professionals
βοΈ SQL is trusted by companies across the globe
Right mindset = Right learning path
React β€οΈ if you agree
β SQL wonβt guarantee you a 20 LPA job
β SQL cannot be mastered in one weekend
β SQL is not just βSELECT * FROM tableβ
β SQL isnβt only for technical people
β SQL is not outdated or getting replaced
But hereβs what SQL *can* do:
βοΈ SQL helps you handle millions of rows with ease
βοΈ SQL empowers you to extract real insights from raw data
βοΈ SQL makes you independent of Excel limitations
βοΈ SQL lets you ask deep, complex business questions
βοΈ SQL is the foundation of most data tools (Power BI, Tableau, Python, etc.)
βοΈ SQL is a must-have skill for data professionals
βοΈ SQL is trusted by companies across the globe
Right mindset = Right learning path
React β€οΈ if you agree
β€41π8
β
π Top 10 Data Analyst Interview Questions
1οΈβ£ What is Data Wrangling?
Answer: It's the process of cleaning, structuring, and enriching raw data into a desired format for analysis. It includes handling nulls, removing duplicates, and standardizing formats.
2οΈβ£ How is Excel used in Data Analysis?
Answer: Excel is used for quick data cleaning, pivot tables, basic stats, visualizations, and what-if analysis.
3οΈβ£ What are the different types of data?
Answer:
- Structured: Organized in rows/columns (e.g. databases)
- Unstructured: No format (e.g. text, images)
- Semi-structured: Tags or markers (e.g. JSON, XML)
4οΈβ£ Define Normalization. Why is it important?
Answer: It's the process of organizing data to reduce redundancy. It ensures consistency and optimizes storage.
5οΈβ£ What is the difference between WHERE and HAVING in SQL?
Answer:
- WHERE: Filters rows before aggregation
- HAVING: Filters groups after aggregation
6οΈβ£ What is the use of GROUP BY in SQL?
Answer: It groups rows with the same values in specified columns, often used with aggregate functions like COUNT(), SUM(), AVG().
7οΈβ£ What is an Outlier? How do you detect it?
Answer: An outlier is a data point that differs significantly from others. Detection methods: IQR, Z-score, boxplots.
8οΈβ£ How do you prioritize tasks when handling multiple projects?
Answer: By assessing deadlines, impact, complexity, and using tools like Trello, Notion, or Excel trackers.
9οΈβ£ What are Data Dashboards?
Answer: Visual interfaces that display key metrics and KPIs in real-time, used for quick business decision-making.
π Whatβs the difference between OLAP and OLTP?
Answer:
- OLAP (Analytical): Used for complex queries & reporting
- OLTP (Transactional): Used for real-time data processing (e.g. banking systems)
π‘ Pro Tip: Be ready to explain your thought process with real-life projects or case studies during interviews!
π React β€οΈ if this helped!
1οΈβ£ What is Data Wrangling?
Answer: It's the process of cleaning, structuring, and enriching raw data into a desired format for analysis. It includes handling nulls, removing duplicates, and standardizing formats.
2οΈβ£ How is Excel used in Data Analysis?
Answer: Excel is used for quick data cleaning, pivot tables, basic stats, visualizations, and what-if analysis.
3οΈβ£ What are the different types of data?
Answer:
- Structured: Organized in rows/columns (e.g. databases)
- Unstructured: No format (e.g. text, images)
- Semi-structured: Tags or markers (e.g. JSON, XML)
4οΈβ£ Define Normalization. Why is it important?
Answer: It's the process of organizing data to reduce redundancy. It ensures consistency and optimizes storage.
5οΈβ£ What is the difference between WHERE and HAVING in SQL?
Answer:
- WHERE: Filters rows before aggregation
- HAVING: Filters groups after aggregation
6οΈβ£ What is the use of GROUP BY in SQL?
Answer: It groups rows with the same values in specified columns, often used with aggregate functions like COUNT(), SUM(), AVG().
7οΈβ£ What is an Outlier? How do you detect it?
Answer: An outlier is a data point that differs significantly from others. Detection methods: IQR, Z-score, boxplots.
8οΈβ£ How do you prioritize tasks when handling multiple projects?
Answer: By assessing deadlines, impact, complexity, and using tools like Trello, Notion, or Excel trackers.
9οΈβ£ What are Data Dashboards?
Answer: Visual interfaces that display key metrics and KPIs in real-time, used for quick business decision-making.
π Whatβs the difference between OLAP and OLTP?
Answer:
- OLAP (Analytical): Used for complex queries & reporting
- OLTP (Transactional): Used for real-time data processing (e.g. banking systems)
π‘ Pro Tip: Be ready to explain your thought process with real-life projects or case studies during interviews!
π React β€οΈ if this helped!
β€24
β
π Essential SQL Commands & Functions Cheatsheet π§βπ»
Whether beginner or prepping for data roles, mastering these essentials helps a lot! π‘
β¬οΈ Quick SQL reference:
1) SELECT β Retrieve data
2) WHERE β Filter rows by condition
3) GROUP BY β Aggregate by column(s)
4) HAVING β Filter aggregated groups
5) ORDER BY β Sort results
6) JOIN β Combine tables
7) UNION β Merge query results
8) INSERT INTO β Add new records
9) UPDATE β Modify records
10) DELETE β Remove records
11) CREATE TABLE β Make a new table
12) ALTER TABLE β Modify table structure
13) DROP TABLE β Delete a table
14) TRUNCATE TABLE β Remove all rows
15) DISTINCT β Get unique values
16) LIMIT β Restrict result count
17) IN / BETWEEN β Filter by multiple values/ranges
18) LIKE β Pattern match
19) IS NULL β Filter NULLs
20) COUNT()/SUM()/AVG() β Aggregate functions
β Save & save time in your next SQL task! π
Data Analytics Resources: https://whatsapp.com/channel/0029VaGgzAk72WTmQFERKh02
π React β₯οΈ for more
Whether beginner or prepping for data roles, mastering these essentials helps a lot! π‘
β¬οΈ Quick SQL reference:
1) SELECT β Retrieve data
2) WHERE β Filter rows by condition
3) GROUP BY β Aggregate by column(s)
4) HAVING β Filter aggregated groups
5) ORDER BY β Sort results
6) JOIN β Combine tables
7) UNION β Merge query results
8) INSERT INTO β Add new records
9) UPDATE β Modify records
10) DELETE β Remove records
11) CREATE TABLE β Make a new table
12) ALTER TABLE β Modify table structure
13) DROP TABLE β Delete a table
14) TRUNCATE TABLE β Remove all rows
15) DISTINCT β Get unique values
16) LIMIT β Restrict result count
17) IN / BETWEEN β Filter by multiple values/ranges
18) LIKE β Pattern match
19) IS NULL β Filter NULLs
20) COUNT()/SUM()/AVG() β Aggregate functions
β Save & save time in your next SQL task! π
Data Analytics Resources: https://whatsapp.com/channel/0029VaGgzAk72WTmQFERKh02
π React β₯οΈ for more
β€21
β
Core Data Analytics Concepts You Should Know:
1. Excel & Spreadsheets (Basics)
- Data entry, sorting, filtering
- Basic formulas: SUM, AVERAGE, IF, VLOOKUP, COUNTIF
- Pivot tables & charts
2. Statistics & Math Basics
- Mean, Median, Mode
- Standard Deviation, Variance
- Correlation & Regression
- Probability basics
3. SQL (Data Extraction)
- SELECT, WHERE, GROUP BY, HAVING
- JOINs (INNER, LEFT, RIGHT)
- Subqueries & CTEs
- Window functions (ROW_NUMBER, RANK, etc.)
4. Data Cleaning & Wrangling
- Handling missing values
- Removing duplicates
- Formatting and standardization
5. Data Visualization
- Tools: Excel, Power BI, Tableau
- Charts: Bar, Line, Pie, Histogram
- Dashboards & storytelling with data
6. Programming with Python (Optional but recommended)
- Pandas, NumPy for data manipulation
- Matplotlib, Seaborn for visualization
- Jupyter Notebooks for analysis
7. Business Understanding
- Asking the right questions
- KPI understanding
- Domain knowledge
8. Projects & Case Studies
- Sales analysis, Customer retention, Market trends
- Use real-world datasets (Kaggle, Google Data Studio)
9. Reporting & Communication
- Presenting insights clearly.
- Visual storytelling
- Report automation basics (Excel, PowerPoint)
10. Tools Knowledge
- Power BI / Tableau
- SQL Workbench / BigQuery
- Excel / Google Sheets
π React β€οΈ for more
1. Excel & Spreadsheets (Basics)
- Data entry, sorting, filtering
- Basic formulas: SUM, AVERAGE, IF, VLOOKUP, COUNTIF
- Pivot tables & charts
2. Statistics & Math Basics
- Mean, Median, Mode
- Standard Deviation, Variance
- Correlation & Regression
- Probability basics
3. SQL (Data Extraction)
- SELECT, WHERE, GROUP BY, HAVING
- JOINs (INNER, LEFT, RIGHT)
- Subqueries & CTEs
- Window functions (ROW_NUMBER, RANK, etc.)
4. Data Cleaning & Wrangling
- Handling missing values
- Removing duplicates
- Formatting and standardization
5. Data Visualization
- Tools: Excel, Power BI, Tableau
- Charts: Bar, Line, Pie, Histogram
- Dashboards & storytelling with data
6. Programming with Python (Optional but recommended)
- Pandas, NumPy for data manipulation
- Matplotlib, Seaborn for visualization
- Jupyter Notebooks for analysis
7. Business Understanding
- Asking the right questions
- KPI understanding
- Domain knowledge
8. Projects & Case Studies
- Sales analysis, Customer retention, Market trends
- Use real-world datasets (Kaggle, Google Data Studio)
9. Reporting & Communication
- Presenting insights clearly.
- Visual storytelling
- Report automation basics (Excel, PowerPoint)
10. Tools Knowledge
- Power BI / Tableau
- SQL Workbench / BigQuery
- Excel / Google Sheets
π React β€οΈ for more
β€18
β
Top 10 SQL Statements & Functions for Data Analysis ππ»
Mastering SQL is essential for data analysts. Here are the most commonly used SQL commands and functions that help extract, manipulate, and summarize data efficiently.
1οΈβ£ SELECT β Retrieve Data
Use it to fetch specific columns from a table.
2οΈβ£ FROM β Specify Table
Tells SQL where to pull the data from.
3οΈβ£ WHERE β Filter Data
Applies conditions to filter rows.
4οΈβ£ GROUP BY β Aggregate by Categories
Groups rows based on one or more columns for aggregation.
5οΈβ£ HAVING β Filter After Grouping
Filters groups after GROUP BY (unlike WHERE, which filters rows).
6οΈβ£ ORDER BY β Sort Results
Sorts the result set in ascending or descending order.
7οΈβ£ COUNT() β Count Records
Counts number of rows or non-null values.
8οΈβ£ SUM() β Total Values
Calculates the sum of numeric values.
9οΈβ£ AVG() β Average Values
Returns the average of numeric values.
π JOIN β Combine Tables
Combines rows from multiple tables based on related columns.
SQL Resources: https://whatsapp.com/channel/0029VanC5rODzgT6TiTGoa1v
π¬ Tap β€οΈ for more!
Mastering SQL is essential for data analysts. Here are the most commonly used SQL commands and functions that help extract, manipulate, and summarize data efficiently.
1οΈβ£ SELECT β Retrieve Data
Use it to fetch specific columns from a table.
SELECT name, age FROM employees;
2οΈβ£ FROM β Specify Table
Tells SQL where to pull the data from.
SELECT * FROM sales_data;
3οΈβ£ WHERE β Filter Data
Applies conditions to filter rows.
SELECT * FROM customers WHERE city = 'Delhi';
4οΈβ£ GROUP BY β Aggregate by Categories
Groups rows based on one or more columns for aggregation.
SELECT department, COUNT(*) FROM employees GROUP BY department;
5οΈβ£ HAVING β Filter After Grouping
Filters groups after GROUP BY (unlike WHERE, which filters rows).
SELECT category, SUM(sales)
FROM orders
GROUP BY category
HAVING SUM(sales) > 10000;
6οΈβ£ ORDER BY β Sort Results
Sorts the result set in ascending or descending order.
SELECT name, salary FROM employees ORDER BY salary DESC;
7οΈβ£ COUNT() β Count Records
Counts number of rows or non-null values.
SELECT COUNT(*) FROM products;
8οΈβ£ SUM() β Total Values
Calculates the sum of numeric values.
SELECT SUM(amount) FROM transactions;
9οΈβ£ AVG() β Average Values
Returns the average of numeric values.
SELECT AVG(price) FROM items;
π JOIN β Combine Tables
Combines rows from multiple tables based on related columns.
SELECT a.name, b.order_date
FROM customers a
JOIN orders b ON a.id = b.customer_id;
SQL Resources: https://whatsapp.com/channel/0029VanC5rODzgT6TiTGoa1v
π¬ Tap β€οΈ for more!
β€10
π§ SQL Basics Cheatsheet ππ οΈ
1. What is SQL?
SQL (Structured Query Language) is used to store, retrieve, update, and delete data in relational databases.
2. Common SQL Commands:
- SELECT β Retrieves data
- INSERT INTO β Adds new data
- UPDATE β Modifies existing data
- DELETE β Removes data
- WHERE β Filters records
- ORDER BY β Sorts results
- GROUP BY β Aggregates data
- JOIN β Combines data from multiple tables
3. Data Types (Examples):
- INT, FLOAT, VARCHAR(n), DATE, BOOLEAN
4. Clauses to Know:
- WHERE β Filters rows
- LIKE, BETWEEN, IN, IS NULL β Conditional filters
- DISTINCT β Removes duplicates
- LIMIT β Restricts row count
- AS β Rename columns
5. SQL JOINS (Very Important):
- INNER JOIN β Matching rows in both tables
- LEFT JOIN β All from left + matches from right
- RIGHT JOIN β All from right + matches from left
- FULL OUTER JOIN β All rows from both tables
6. Aggregate Functions:
- COUNT(), SUM(), AVG(), MIN(), MAX()
7. Example Query:
SELECT name, AVG(score)
FROM students
WHERE grade = 'A'
GROUP BY name
ORDER BY AVG(score) DESC;
8. Constraints:
- PRIMARY KEY, FOREIGN KEY, NOT NULL, UNIQUE, CHECK
9. Indexing & Optimization:
- Use INDEX to speed up queries
- Avoid SELECT * in production
- Use EXPLAIN to analyze query plans
10. Popular SQL Databases:
- MySQL, PostgreSQL, SQLite, Microsoft SQL Server, Oracle
Double Tap β₯οΈ For More
1. What is SQL?
SQL (Structured Query Language) is used to store, retrieve, update, and delete data in relational databases.
2. Common SQL Commands:
- SELECT β Retrieves data
- INSERT INTO β Adds new data
- UPDATE β Modifies existing data
- DELETE β Removes data
- WHERE β Filters records
- ORDER BY β Sorts results
- GROUP BY β Aggregates data
- JOIN β Combines data from multiple tables
3. Data Types (Examples):
- INT, FLOAT, VARCHAR(n), DATE, BOOLEAN
4. Clauses to Know:
- WHERE β Filters rows
- LIKE, BETWEEN, IN, IS NULL β Conditional filters
- DISTINCT β Removes duplicates
- LIMIT β Restricts row count
- AS β Rename columns
5. SQL JOINS (Very Important):
- INNER JOIN β Matching rows in both tables
- LEFT JOIN β All from left + matches from right
- RIGHT JOIN β All from right + matches from left
- FULL OUTER JOIN β All rows from both tables
6. Aggregate Functions:
- COUNT(), SUM(), AVG(), MIN(), MAX()
7. Example Query:
SELECT name, AVG(score)
FROM students
WHERE grade = 'A'
GROUP BY name
ORDER BY AVG(score) DESC;
8. Constraints:
- PRIMARY KEY, FOREIGN KEY, NOT NULL, UNIQUE, CHECK
9. Indexing & Optimization:
- Use INDEX to speed up queries
- Avoid SELECT * in production
- Use EXPLAIN to analyze query plans
10. Popular SQL Databases:
- MySQL, PostgreSQL, SQLite, Microsoft SQL Server, Oracle
Double Tap β₯οΈ For More
β€21π₯1π1
π§ Top 10 Real-World SQL Scenarios with Sample Answers ππ»
1. Find Duplicate Records in a Table
2. Find the Second Highest Salary
3. Customers with More Than 3 Orders in Last 30 Days
4. Calculate Monthly Revenue
5. Find Employees Without Managers
6. Join Two Tables and Filter by Amount
7. Use CASE for Conditional Logic
8. Find Top-Selling Products
9. Identify Inactive Users
π Calculate Conversion Rate
π‘ Pro Tip: Practice these with real datasets and explain your logic clearly in interviews.
π¬ Tap β€οΈ if this helped you prep smarter!
1. Find Duplicate Records in a Table
SELECT email, COUNT(*)
FROM customers
GROUP BY email
HAVING COUNT(*) > 1;
2. Find the Second Highest Salary
SELECT MAX(salary)
FROM employees
WHERE salary < (SELECT MAX(salary) FROM employees);
3. Customers with More Than 3 Orders in Last 30 Days
SELECT customer_id
FROM orders
WHERE order_date >= CURRENT_DATE - INTERVAL '30 days'
GROUP BY customer_id
HAVING COUNT(*) > 3;
4. Calculate Monthly Revenue
SELECT DATE_TRUNC('month', sale_date) AS month,
SUM(amount) AS monthly_revenue
FROM sales
GROUP BY month
ORDER BY month;5. Find Employees Without Managers
SELECT *
FROM employees
WHERE manager_id IS NULL;
6. Join Two Tables and Filter by Amount
SELECT o.order_id, c.name, o.amount
FROM orders o
JOIN customers c ON o.customer_id = c.customer_id
WHERE o.amount > 100;
7. Use CASE for Conditional Logic
SELECT name,
CASE
WHEN score >= 90 THEN 'Excellent'
WHEN score >= 75 THEN 'Good'
ELSE 'Needs Improvement'
END AS rating
FROM students;
8. Find Top-Selling Products
SELECT product_id, SUM(quantity) AS total_sold
FROM sales
GROUP BY product_id
ORDER BY total_sold DESC
LIMIT 5;
9. Identify Inactive Users
SELECT user_id
FROM users
WHERE last_login < CURRENT_DATE - INTERVAL '90 days';
π Calculate Conversion Rate
SELECT COUNT(*) FILTER (WHERE status = 'converted') * 100.0 / COUNT(*) AS conversion_rate
FROM leads;
π‘ Pro Tip: Practice these with real datasets and explain your logic clearly in interviews.
π¬ Tap β€οΈ if this helped you prep smarter!
β€25π5π₯°2π2
π 15 Data Analyst Interview Questions for Freshers (with Answers)
β¦ Who is a Data Analyst?
Ans: A professional who collects, processes, and analyzes data to help organizations make informed decisions.
β¦ What tools do data analysts commonly use?
Ans: Excel, SQL, Power BI, Tableau, Python, R, and Google Sheets.
β¦ What is data cleaning?
Ans: The process of fixing or removing incorrect, corrupted, duplicate, or incomplete data.
β¦ What is the difference between data and information?
Ans: Data is raw, unorganized facts. Information is processed data that has meaning.
β¦ What are the types of data?
Ans: Qualitative (categorical) and Quantitative (numerical), further split into discrete and continuous.
β¦ What is exploratory data analysis (EDA)?
Ans: A technique to understand data patterns using visualization and statistics before building models.
β¦ What is the difference between Excel and SQL?
Ans: Excel is good for small-scale data analysis. SQL is better for querying large databases efficiently.
β¦ What is data visualization?
Ans: Representing data using charts, graphs, dashboards, etc., to make insights clearer.
β¦ Name a few types of charts used in data analysis.
Ans: Bar chart, Line chart, Pie chart, Histogram, Box plot, Scatter plot.
β¦ What is the difference between INNER JOIN and OUTER JOIN?
Ans: INNER JOIN returns only matched rows; OUTER JOIN returns matched + unmatched rows from one or both tables.
β¦ What is a pivot table in Excel?
Ans: A tool to summarize, sort, and analyze large data sets dynamically.
β¦ How do you handle missing data?
Ans: Techniques include removing rows, filling with mean/median, or using predictive models.
β¦ What is correlation?
Ans: A statistical measure that expresses the extent to which two variables are related.
β¦ What is the difference between structured and unstructured data?
Ans: Structured data is organized (e.g., tables); unstructured is not (e.g., text, images).
β¦ What are KPIs?
Ans: Key Performance Indicators β measurable values that show how effectively objectives are being achieved.
π‘ Tip: Be clear with your basics, tools, and communication!
π¬ React with β€οΈ for more!
β¦ Who is a Data Analyst?
Ans: A professional who collects, processes, and analyzes data to help organizations make informed decisions.
β¦ What tools do data analysts commonly use?
Ans: Excel, SQL, Power BI, Tableau, Python, R, and Google Sheets.
β¦ What is data cleaning?
Ans: The process of fixing or removing incorrect, corrupted, duplicate, or incomplete data.
β¦ What is the difference between data and information?
Ans: Data is raw, unorganized facts. Information is processed data that has meaning.
β¦ What are the types of data?
Ans: Qualitative (categorical) and Quantitative (numerical), further split into discrete and continuous.
β¦ What is exploratory data analysis (EDA)?
Ans: A technique to understand data patterns using visualization and statistics before building models.
β¦ What is the difference between Excel and SQL?
Ans: Excel is good for small-scale data analysis. SQL is better for querying large databases efficiently.
β¦ What is data visualization?
Ans: Representing data using charts, graphs, dashboards, etc., to make insights clearer.
β¦ Name a few types of charts used in data analysis.
Ans: Bar chart, Line chart, Pie chart, Histogram, Box plot, Scatter plot.
β¦ What is the difference between INNER JOIN and OUTER JOIN?
Ans: INNER JOIN returns only matched rows; OUTER JOIN returns matched + unmatched rows from one or both tables.
β¦ What is a pivot table in Excel?
Ans: A tool to summarize, sort, and analyze large data sets dynamically.
β¦ How do you handle missing data?
Ans: Techniques include removing rows, filling with mean/median, or using predictive models.
β¦ What is correlation?
Ans: A statistical measure that expresses the extent to which two variables are related.
β¦ What is the difference between structured and unstructured data?
Ans: Structured data is organized (e.g., tables); unstructured is not (e.g., text, images).
β¦ What are KPIs?
Ans: Key Performance Indicators β measurable values that show how effectively objectives are being achieved.
π‘ Tip: Be clear with your basics, tools, and communication!
π¬ React with β€οΈ for more!
β€19π2π₯1