Data Analyst Interview Resources
51.8K subscribers
338 photos
1 video
53 files
396 links
Join our telegram channel to learn how data analysis can reveal fascinating patterns, trends, and stories hidden within the numbers! ๐Ÿ“Š

For ads & suggestions: @love_data
Download Telegram
๐——๐—ฎ๐˜๐—ฎ ๐—ฆ๐—ฐ๐—ถ๐—ฒ๐—ป๐—ฐ๐—ฒ & ๐—™๐˜‚๐—น๐—น๐˜€๐˜๐—ฎ๐—ฐ๐—ธ ๐——๐—ฒ๐˜ƒ๐—ฒ๐—น๐—ผ๐—ฝ๐—บ๐—ฒ๐—ป๐˜ ๐—”๐—ฟ๐—ฒ ๐—›๐—ถ๐—ด๐—ต๐—น๐˜† ๐——๐—ฒ๐—บ๐—ฎ๐—ป๐—ฑ๐—ถ๐—ป๐—ด ๐—œ๐—ป ๐Ÿฎ๐Ÿฌ๐Ÿฎ๐Ÿฒ๐Ÿ˜

Learn these skills from the Top 1% of the tech industry

๐ŸŒŸ Trusted by 7500+ Students
๐Ÿค 500+ Hiring Partners

๐—™๐˜‚๐—น๐—น๐˜€๐˜๐—ฎ๐—ฐ๐—ธ :-  https://pdlink.in/4hO7rWY

๐——๐—ฎ๐˜๐—ฎ ๐—ฆ๐—ฐ๐—ถ๐—ฒ๐—ป๐—ฐ๐—ฒ :-  https://pdlink.in/4fdWxJB

Hurry Up, Limited seats available!
โค1
1. What are the ways to detect outliers?

Outliers are detected using two methods:

Box Plot Method: According to this method, the value is considered an outlier if it exceeds or falls below 1.5*IQR (interquartile range), that is, if it lies above the top quartile (Q3) or below the bottom quartile (Q1).

Standard Deviation Method: According to this method, an outlier is defined as a value that is greater or lower than the mean ยฑ (3*standard deviation).


2. What is a Recursive Stored Procedure?

A stored procedure that calls itself until a boundary condition is reached, is called a recursive stored procedure. This recursive function helps the programmers to deploy the same set of code several times as and when required.


3. What is the shortcut to add a filter to a table in EXCEL?

The filter mechanism is used when you want to display only specific data from the entire dataset. By doing so, there is no change being made to the data. The shortcut to add a filter to a table is Ctrl+Shift+L.

4. What is DAX in Power BI?

DAX stands for Data Analysis Expressions. It's a collection of functions, operators, and constants used in formulas to calculate and return values. In other words, it helps you create new info from data you already have.
โค5
๐ŸŽ“ ๐—–๐—ถ๐˜€๐—ฐ๐—ผ ๐—™๐—ฅ๐—˜๐—˜ ๐—–๐—ฒ๐—ฟ๐˜๐—ถ๐—ณ๐—ถ๐—ฐ๐—ฎ๐˜๐—ถ๐—ผ๐—ป ๐—–๐—ผ๐˜‚๐—ฟ๐˜€๐—ฒ๐˜€ โ€“ ๐—Ÿ๐—ถ๐—บ๐—ถ๐˜๐—ฒ๐—ฑ ๐—ง๐—ถ๐—บ๐—ฒ! ๐Ÿ˜

Upskill in todayโ€™s most in-demand tech domains and boost your career ๐Ÿš€

โœ… FREE Courses Offered:
๐Ÿ’ซ Modern AI
๐Ÿ” Cyber Security
๐ŸŒ Networking
๐Ÿ“ฒ Internet of Things (IoT)

๐Ÿ’ซPerfect for students, freshers, and tech enthusiasts.

๐—˜๐—ป๐—ฟ๐—ผ๐—น๐—น ๐—™๐—ผ๐—ฟ ๐—™๐—ฅ๐—˜๐—˜๐Ÿ‘‡:- 

https://pdlink.in/4qgtrxU

๐ŸŽ“ Get Certified by Cisco โ€“ 100% Free!
โค1
โœ… Data Analytics Essentials

TECH SKILLS (NON-NEGOTIABLE)

1๏ธโƒฃ SQL
โ€ข Joins, Group by, Window functions
โ€ข Handle NULLs and duplicates
Example: LEFT JOIN fits a churn query to include non-churned users

2๏ธโƒฃ Excel
โ€ข Pivot tables, Lookups, IF logic
โ€ข Clean raw data fast
Example: Reconcile 50k rows in minutes using Pivot tables

3๏ธโƒฃ Power BI or Tableau
โ€ข Data modeling, Measures, Filters
โ€ข One dashboard, One question
Example: Sales drop by region and month dashboard

4๏ธโƒฃ Python
โ€ข pandas for cleaning and analysis
โ€ข matplotlib or seaborn for quick visuals
Example: Groupby revenue by cohort

5๏ธโƒฃ Statistics Basics
โ€ข Mean vs median, Variance, Correlation
โ€ข Know when averages lie
Example: Median salary explains skewed data

 

SOFT SKILLS (DEAL BREAKERS)

1๏ธโƒฃ Business Thinking
โ€ข Ask why before how
โ€ข Tie insights to decisions
Example: High churn points to onboarding gaps

2๏ธโƒฃ Communication
โ€ข Explain insights without jargon
โ€ข One slide, One takeaway
Example: Revenue fell due to fewer repeat users

3๏ธโƒฃ Problem Framing
โ€ข Convert vague asks into clear questions
โ€ข Define metrics early
Example: What defines an active user?

4๏ธโƒฃ Attention to Detail
โ€ข Validate numbers
โ€ข Double check logic
โ€ข Small errors kill trust

5๏ธโƒฃ Stakeholder Handling
โ€ข Listen first
โ€ข Clarify scope
โ€ข Push back with data

๐ŸŽฏ Balance both tech and soft skills to grow faster as an analyst

Double Tap โ™ฅ๏ธ For More
โค7
๐—”๐—œ & ๐— ๐—Ÿ ๐—–๐—ฒ๐—ฟ๐˜๐—ถ๐—ณ๐—ถ๐—ฐ๐—ฎ๐˜๐—ถ๐—ผ๐—ป ๐—•๐˜† ๐—œ๐—œ๐—ง ๐—ฃ๐—ฎ๐˜๐—ป๐—ฎ ๐Ÿ˜

Placement Assistance With 5000+ companies.

Companies are actively hiring candidates with AI & ML skills.

๐ŸŽ“ Prestigious IIT certificate
๐Ÿ”ฅ Hands-on industry projects
๐Ÿ“ˆ Career-ready skills for AI & ML jobs

Deadline :- March 1, 2026
 
๐—ฅ๐—ฒ๐—ด๐—ถ๐˜€๐˜๐—ฒ๐—ฟ ๐—™๐—ผ๐—ฟ ๐—ฆ๐—ฐ๐—ต๐—ผ๐—น๐—ฎ๐—ฟ๐˜€๐—ต๐—ถ๐—ฝ ๐—ง๐—ฒ๐˜€๐˜ ๐Ÿ‘‡ :- 

https://pdlink.in/4pBNxkV

โœ… Limited seats only
Complete Syllabus for Data Analytics interview:

SQL:
1. Basic
  - SELECT statements with WHERE, ORDER BY, GROUP BY, HAVING
  - Basic JOINS (INNER, LEFT, RIGHT, FULL)
  - Creating and using simple databases and tables

2. Intermediate
  - Aggregate functions (COUNT, SUM, AVG, MAX, MIN)
  - Subqueries and nested queries
  - Common Table Expressions (WITH clause)
  - CASE statements for conditional logic in queries

3. Advanced
  - Advanced JOIN techniques (self-join, non-equi join)
  - Window functions (OVER, PARTITION BY, ROW_NUMBER, RANK, DENSE_RANK, lead, lag)
  - optimization with indexing
  - Data manipulation (INSERT, UPDATE, DELETE)

Python:
1. Basic
  - Syntax, variables, data types (integers, floats, strings, booleans)
  - Control structures (if-else, for and while loops)
  - Basic data structures (lists, dictionaries, sets, tuples)
  - Functions, lambda functions, error handling (try-except)
  - Modules and packages

2. Pandas & Numpy
  - Creating and manipulating DataFrames and Series
  - Indexing, selecting, and filtering data
  - Handling missing data (fillna, dropna)
  - Data aggregation with groupby, summarizing data
  - Merging, joining, and concatenating datasets

3. Basic Visualization
  - Basic plotting with Matplotlib (line plots, bar plots, histograms)
  - Visualization with Seaborn (scatter plots, box plots, pair plots)
  - Customizing plots (sizes, labels, legends, color palettes)
  - Introduction to interactive visualizations (e.g., Plotly)

Excel:
1. Basic
  - Cell operations, basic formulas (SUMIFS, COUNTIFS, AVERAGEIFS, IF, AND, OR, NOT & Nested Functions etc.)
  - Introduction to charts and basic data visualization
  - Data sorting and filtering
  - Conditional formatting

2. Intermediate
  - Advanced formulas (V/XLOOKUP, INDEX-MATCH, nested IF)
  - PivotTables and PivotCharts for summarizing data
  - Data validation tools
  - What-if analysis tools (Data Tables, Goal Seek)

3. Advanced
  - Array formulas and advanced functions
  - Data Model & Power Pivot
- Advanced Filter
- Slicers and Timelines in Pivot Tables
  - Dynamic charts and interactive dashboards

Power BI:
1. Data Modeling
  - Importing data from various sources
  - Creating and managing relationships between different datasets
  - Data modeling basics (star schema, snowflake schema)

2. Data Transformation
  - Using Power Query for data cleaning and transformation
  - Advanced data shaping techniques
  - Calculated columns and measures using DAX

3. Data Visualization and Reporting
  - Creating interactive reports and dashboards
  - Visualizations (bar, line, pie charts, maps)
  - Publishing and sharing reports, scheduling data refreshes

Statistics Fundamentals:
Mean, Median, Mode, Standard Deviation, Variance, Probability Distributions, Hypothesis Testing, P-values, Confidence Intervals, Correlation, Simple Linear Regression, Normal Distribution, Binomial Distribution, Poisson Distribution.
โค9
๐—ฃ๐—ฎ๐˜† ๐—”๐—ณ๐˜๐—ฒ๐—ฟ ๐—ฃ๐—น๐—ฎ๐—ฐ๐—ฒ๐—บ๐—ฒ๐—ป๐˜ ๐—ง๐—ฟ๐—ฎ๐—ถ๐—ป๐—ถ๐—ป๐—ด ๐Ÿ˜

๐—Ÿ๐—ฒ๐—ฎ๐—ฟ๐—ป ๐—–๐—ผ๐—ฑ๐—ถ๐—ป๐—ด & ๐—š๐—ฒ๐˜ ๐—ฃ๐—น๐—ฎ๐—ฐ๐—ฒ๐—ฑ ๐—œ๐—ป ๐—ง๐—ผ๐—ฝ ๐— ๐—ก๐—–๐˜€

 Eligibility:- BE/BTech / BCA / BSc

๐ŸŒŸ 2000+ Students Placed
๐Ÿค 500+ Hiring Partners
๐Ÿ’ผ Avg. Rs. 7.4 LPA
๐Ÿš€ 41 LPA Highest Package

๐—•๐—ผ๐—ผ๐—ธ ๐—ฎ ๐—™๐—ฅ๐—˜๐—˜ ๐——๐—ฒ๐—บ๐—ผ๐Ÿ‘‡:-

https://pdlink.in/4hO7rWY

( Hurry Up ๐Ÿƒโ€โ™‚๏ธLimited Slots )
๐Ÿง  SQL Interview Question (Commonly Asked)
๐Ÿ“Œ

orders(order_id, customer_id, order_date, order_amount)

โ“ Ques :

๐Ÿ‘‰ Find customers whose order amount strictly increased compared to their previous order.

๐Ÿงฉ How Interviewers Expect You to Think

โ€ข Order data correctly using dates
โ€ข Compare current row with previous row
โ€ข Use window functions for self-comparison
โ€ข Avoid self-joins when window functions fit better
โ€ข Filter only after comparison logic

๐Ÿ’ก SQL Solution

WITH ranked_orders AS (
SELECT
customer_id,
order_date,
order_amount,
LAG(order_amount) OVER (
PARTITION BY customer_id
ORDER BY order_date
) AS prev_order_amount
FROM orders
)
SELECT DISTINCT
customer_id
FROM ranked_orders
WHERE order_amount > prev_order_amount;

๐Ÿ”ฅ React โ™ฅ๏ธ if you want more moderate-to-advanced SQL interview questions
โค4๐Ÿ‘Œ1
๐—ง๐—ผ๐—ฝ ๐—–๐—ฒ๐—ฟ๐˜๐—ถ๐—ณ๐—ถ๐—ฐ๐—ฎ๐˜๐—ถ๐—ผ๐—ป๐˜€ ๐—ข๐—ณ๐—ณ๐—ฒ๐—ฟ๐—ฒ๐—ฑ ๐—•๐˜† ๐—œ๐—œ๐—ง'๐˜€ & ๐—œ๐—œ๐—  ๐Ÿ˜ 

Placement Assistance With 5000+ companies.

Companies are actively hiring candidates with AI & ML skills.

โณ Deadline: 28th Feb 2026

๐—”๐—œ & ๐——๐—ฎ๐˜๐—ฎ ๐—ฆ๐—ฐ๐—ถ๐—ฒ๐—ป๐—ฐ๐—ฒ :- https://pdlink.in/4kucM7E

๐—”๐—œ & ๐— ๐—ฎ๐—ฐ๐—ต๐—ถ๐—ป๐—ฒ ๐—Ÿ๐—ฒ๐—ฎ๐—ฟ๐—ป๐—ถ๐—ป๐—ด :- https://pdlink.in/4rMivIA

๐——๐—ฎ๐˜๐—ฎ ๐—”๐—ป๐—ฎ๐—น๐˜†๐˜๐—ถ๐—ฐ๐˜€ ๐—ช๐—ถ๐˜๐—ต ๐—”๐—œ :- https://pdlink.in/4ay4wPG

๐—•๐˜‚๐˜€๐—ถ๐—ป๐—ฒ๐˜€๐˜€ ๐—”๐—ป๐—ฎ๐—น๐˜†๐˜๐—ถ๐—ฐ๐˜€ ๐—ช๐—ถ๐˜๐—ต ๐—”๐—œ :- https://pdlink.in/3ZtIZm9

๐— ๐—Ÿ ๐—ช๐—ถ๐˜๐—ต ๐—ฃ๐˜†๐˜๐—ต๐—ผ๐—ป :- https://pdlink.in/3OD9jI1

โœ… Hurry Up...Limited seats only
๐Ÿง  SQL Interview Question (Commonly Asked)
๐Ÿ“Œ

products(product_id, product_name, category_id, price)

โ“ Ques :

๐Ÿ‘‰ Find the second highest priced product in each category.

๐Ÿงฉ How Interviewers Expect You to Think

โ€ข Partition data by category
โ€ข Rank products based on price (descending)
โ€ข Understand difference between RANK, DENSE_RANK, and ROW_NUMBER
โ€ข Handle ties properly
โ€ข Filter after ranking logic

๐Ÿ’ก SQL Solution

WITH ranked_products AS (
SELECT
product_id,
product_name,
category_id,
price,
DENSE_RANK() OVER (
PARTITION BY category_id
ORDER BY price DESC
) AS price_rank
FROM products
)

SELECT
product_id,
product_name,
category_id,
price
FROM ranked_products
WHERE price_rank = 2;

๐Ÿ”ฅ Why this question is powerful:

โ€ข Tests window functions deeply
โ€ข Checks ranking logic understanding
โ€ข Very common in Data Analyst interviews

โค๏ธ React if you want more scenario-based SQL questions
โค4
๐Ÿง  SQL Interview Question (Tricky & Logic-Based)
๐Ÿ“Œ

logins(user_id, login_date)

โ“ Ques :

๐Ÿ‘‰ Find users who logged in for 3 or more consecutive days.

๐Ÿงฉ How Interviewers Expect You to Think

โ€ข Understand consecutive date logic
โ€ข Use date arithmetic smartly
โ€ข Create groups using row-number difference trick
โ€ข Avoid complex self-joins
โ€ข Aggregate after forming streak groups

๐Ÿ’ก SQL Solution

WITH numbered_logins AS (
SELECT
user_id,
login_date,
ROW_NUMBER() OVER (
PARTITION BY user_id
ORDER BY login_date
) AS rn
FROM logins
),
grouped_logins AS (
SELECT
user_id,
login_date,
DATE_SUB(login_date, INTERVAL rn DAY) AS grp
FROM numbered_logins
)

SELECT
user_id
FROM grouped_logins
GROUP BY user_id, grp
HAVING COUNT(*) >= 3;

๐Ÿ”ฅ Why this question is powerful:

โ€ข Tests advanced window function usage
โ€ข Checks understanding of gaps & islands concept
โ€ข Evaluates real-world product analytics thinking
โ€ข Very common in growth / engagement analytics interviews

โค๏ธ React if you want more scenario-based SQL questions
โค6
๐Ÿง  SQL Interview Question (Moderate & Analytical)
๐Ÿ“Œ

events(user_id, event_name, event_date)

-- event_name values: 'Visited', 'Added_to_Cart', 'Purchased'

โ“ Ques :

๐Ÿ‘‰ Find users who added a product to cart but never completed the purchase.

๐Ÿงฉ How Interviewers Expect You to Think

โ€ข Understand funnel stage logic
โ€ข Apply conditional aggregation correctly
โ€ข Ensure absence of a specific event
โ€ข Avoid double counting users

๐Ÿ’ก SQL Solution

SELECT
user_id
FROM events
GROUP BY user_id
HAVING
SUM(CASE WHEN event_name = 'Added_to_Cart' THEN 1 ELSE 0 END) > 0
AND SUM(CASE WHEN event_name = 'Purchased' THEN 1 ELSE 0 END) = 0;

๐Ÿ”ฅ Why This Question Is Powerful

โ€ข Tests real business thinking (conversion funnel analysis)
โ€ข Checks ability to detect missing conditions
โ€ข Common in product & e-commerce analytics interviews
โ€ข Evaluates aggregation + logical filtering skills together

โค๏ธ React if you want more real interview-level SQL questions
โค6
๐—”๐—œ & ๐——๐—ฎ๐˜๐—ฎ ๐—ฆ๐—ฐ๐—ถ๐—ฒ๐—ป๐—ฐ๐—ฒ ๐—–๐—ฒ๐—ฟ๐˜๐—ถ๐—ณ๐—ถ๐—ฐ๐—ฎ๐˜๐—ถ๐—ผ๐—ป ๐—ฃ๐—ฟ๐—ผ๐—ด๐—ฟ๐—ฎ๐—บ ๐—•๐˜† ๐—œ๐—œ๐—ง ๐—ฅ๐—ผ๐—ผ๐—ฟ๐—ธ๐—ฒ๐—ฒ ๐Ÿ˜

๐Ÿ‘‰Learn from IIT faculty and industry experts
๐Ÿ”ฅ100% Online | 6 Months
๐ŸŽ“Get Prestigious Certificate

 ๐Ÿ’ซCompanies are actively hiring candidates with Data Science & AI skills.

 Deadline: 8th March 2026

๐—ฅ๐—ฒ๐—ด๐—ถ๐˜€๐˜๐—ฒ๐—ฟ ๐—™๐—ผ๐—ฟ ๐—ฆ๐—ฐ๐—ต๐—ผ๐—น๐—ฎ๐—ฟ๐˜€๐—ต๐—ถ๐—ฝ ๐—ง๐—ฒ๐˜€๐˜ ๐Ÿ‘‡ :- 

https://pdlink.in/4kucM7E

โœ… Limited seats only
1. What is the difference between the RANK() and DENSE_RANK() functions?

The RANK() function in the result set defines the rank of each row within your ordered partition. If both rows have the same rank, the next number in the ranking will be the previous rank plus a number of duplicates. If we have three records at rank 4, for example, the next level indicated is 7. The DENSE_RANK() function assigns a distinct rank to each row within a partition based on the provided column value, with no gaps. If we have three records at rank 4, for example, the next level indicated is 5.

2. Explain One-hot encoding and Label Encoding. How do they affect the dimensionality of the given dataset?

One-hot encoding is the representation of categorical variables as binary vectors. Label Encoding is converting labels/words into numeric form. Using one-hot encoding increases the dimensionality of the data set. Label encoding doesnโ€™t affect the dimensionality of the data set. One-hot encoding creates a new variable for each level in the variable whereas, in Label encoding, the levels of a variable get encoded as 1 and 0.

3. Explain the Difference Between Tableau Worksheet, Dashboard, Story, and Workbook in Tableau?

Tableau uses a workbook and sheet file structure, much like Microsoft Excel.
A workbook contains sheets, which can be a worksheet, dashboard, or a story.
A worksheet contains a single view along with shelves, legends, and the Data pane.
A dashboard is a collection of views from multiple worksheets.
A story contains a sequence of worksheets or dashboards that work together to convey information.

4. How can you split a column into 2 or more columns?

You can split a column into 2 or more columns by following the below steps:
1. Select the cell that you want to split. Then, navigate to the Data tab, after that, select Text to Columns. 2. Select the delimiter. 3. Choose the column data format and select the destination you want to display the split. 4. The final output will look like below where the text is split into multiple columns.

5. Do you wanna make your career in Data Science & Analytics but don't know how to start ?

https://t.me/sqlspecialist/851

Here are free resources that will make you technically strong enough to crack any Data Analyst and also learn Pro Career Growth Hacks to land on your Dream Job.
โค4
๐Ÿง  SQL Interview Question (Moderate & Revenue Analysis)
๐Ÿ“Œ

orders(order_id, customer_id, order_amount)

โ“ Ques :

๐Ÿ‘‰ Find customers who contribute more than 30% of the total company revenue.

๐Ÿงฉ How Interviewers Expect You to Think

โ€ข Calculate overall total revenue
โ€ข Aggregate revenue at customer level
โ€ข Compare individual contribution against total
โ€ข Avoid recalculating total multiple times inefficiently

๐Ÿ’ก SQL Solution

WITH total_revenue AS (
SELECT SUM(order_amount) AS total_rev
FROM orders
),
customer_revenue AS (
SELECT
customer_id,
SUM(order_amount) AS cust_rev
FROM orders
GROUP BY customer_id
)
SELECT c.customer_id
FROM customer_revenue c
CROSS JOIN total_revenue t
WHERE c.cust_rev > 0.30 * t.total_rev;

๐Ÿ”ฅ Why This Question Is Powerful

โ€ข Tests percentage-based business logic
โ€ข Evaluates ability to combine multiple aggregations
โ€ข Reflects real-world Pareto (80/20) analysis scenarios
โ€ข Common in product, growth & revenue analytics interviews

โค๏ธ React if you want more real interview-level SQL questions
โค1
SQL beginner to advanced level