Data Analyst Interview Resources
51.8K subscribers
338 photos
1 video
53 files
396 links
Join our telegram channel to learn how data analysis can reveal fascinating patterns, trends, and stories hidden within the numbers! ๐Ÿ“Š

For ads & suggestions: @love_data
Download Telegram
๐Ÿง  SQL Interview Question (Commonly Asked)
๐Ÿ“Œ

products(product_id, product_name, category_id, price)

โ“ Ques :

๐Ÿ‘‰ Find the second highest priced product in each category.

๐Ÿงฉ How Interviewers Expect You to Think

โ€ข Partition data by category
โ€ข Rank products based on price (descending)
โ€ข Understand difference between RANK, DENSE_RANK, and ROW_NUMBER
โ€ข Handle ties properly
โ€ข Filter after ranking logic

๐Ÿ’ก SQL Solution

WITH ranked_products AS (
SELECT
product_id,
product_name,
category_id,
price,
DENSE_RANK() OVER (
PARTITION BY category_id
ORDER BY price DESC
) AS price_rank
FROM products
)

SELECT
product_id,
product_name,
category_id,
price
FROM ranked_products
WHERE price_rank = 2;

๐Ÿ”ฅ Why this question is powerful:

โ€ข Tests window functions deeply
โ€ข Checks ranking logic understanding
โ€ข Very common in Data Analyst interviews

โค๏ธ React if you want more scenario-based SQL questions
โค4
๐Ÿง  SQL Interview Question (Tricky & Logic-Based)
๐Ÿ“Œ

logins(user_id, login_date)

โ“ Ques :

๐Ÿ‘‰ Find users who logged in for 3 or more consecutive days.

๐Ÿงฉ How Interviewers Expect You to Think

โ€ข Understand consecutive date logic
โ€ข Use date arithmetic smartly
โ€ข Create groups using row-number difference trick
โ€ข Avoid complex self-joins
โ€ข Aggregate after forming streak groups

๐Ÿ’ก SQL Solution

WITH numbered_logins AS (
SELECT
user_id,
login_date,
ROW_NUMBER() OVER (
PARTITION BY user_id
ORDER BY login_date
) AS rn
FROM logins
),
grouped_logins AS (
SELECT
user_id,
login_date,
DATE_SUB(login_date, INTERVAL rn DAY) AS grp
FROM numbered_logins
)

SELECT
user_id
FROM grouped_logins
GROUP BY user_id, grp
HAVING COUNT(*) >= 3;

๐Ÿ”ฅ Why this question is powerful:

โ€ข Tests advanced window function usage
โ€ข Checks understanding of gaps & islands concept
โ€ข Evaluates real-world product analytics thinking
โ€ข Very common in growth / engagement analytics interviews

โค๏ธ React if you want more scenario-based SQL questions
โค6
๐Ÿง  SQL Interview Question (Moderate & Analytical)
๐Ÿ“Œ

events(user_id, event_name, event_date)

-- event_name values: 'Visited', 'Added_to_Cart', 'Purchased'

โ“ Ques :

๐Ÿ‘‰ Find users who added a product to cart but never completed the purchase.

๐Ÿงฉ How Interviewers Expect You to Think

โ€ข Understand funnel stage logic
โ€ข Apply conditional aggregation correctly
โ€ข Ensure absence of a specific event
โ€ข Avoid double counting users

๐Ÿ’ก SQL Solution

SELECT
user_id
FROM events
GROUP BY user_id
HAVING
SUM(CASE WHEN event_name = 'Added_to_Cart' THEN 1 ELSE 0 END) > 0
AND SUM(CASE WHEN event_name = 'Purchased' THEN 1 ELSE 0 END) = 0;

๐Ÿ”ฅ Why This Question Is Powerful

โ€ข Tests real business thinking (conversion funnel analysis)
โ€ข Checks ability to detect missing conditions
โ€ข Common in product & e-commerce analytics interviews
โ€ข Evaluates aggregation + logical filtering skills together

โค๏ธ React if you want more real interview-level SQL questions
โค6
๐—”๐—œ & ๐——๐—ฎ๐˜๐—ฎ ๐—ฆ๐—ฐ๐—ถ๐—ฒ๐—ป๐—ฐ๐—ฒ ๐—–๐—ฒ๐—ฟ๐˜๐—ถ๐—ณ๐—ถ๐—ฐ๐—ฎ๐˜๐—ถ๐—ผ๐—ป ๐—ฃ๐—ฟ๐—ผ๐—ด๐—ฟ๐—ฎ๐—บ ๐—•๐˜† ๐—œ๐—œ๐—ง ๐—ฅ๐—ผ๐—ผ๐—ฟ๐—ธ๐—ฒ๐—ฒ ๐Ÿ˜

๐Ÿ‘‰Learn from IIT faculty and industry experts
๐Ÿ”ฅ100% Online | 6 Months
๐ŸŽ“Get Prestigious Certificate

 ๐Ÿ’ซCompanies are actively hiring candidates with Data Science & AI skills.

 Deadline: 8th March 2026

๐—ฅ๐—ฒ๐—ด๐—ถ๐˜€๐˜๐—ฒ๐—ฟ ๐—™๐—ผ๐—ฟ ๐—ฆ๐—ฐ๐—ต๐—ผ๐—น๐—ฎ๐—ฟ๐˜€๐—ต๐—ถ๐—ฝ ๐—ง๐—ฒ๐˜€๐˜ ๐Ÿ‘‡ :- 

https://pdlink.in/4kucM7E

โœ… Limited seats only
1. What is the difference between the RANK() and DENSE_RANK() functions?

The RANK() function in the result set defines the rank of each row within your ordered partition. If both rows have the same rank, the next number in the ranking will be the previous rank plus a number of duplicates. If we have three records at rank 4, for example, the next level indicated is 7. The DENSE_RANK() function assigns a distinct rank to each row within a partition based on the provided column value, with no gaps. If we have three records at rank 4, for example, the next level indicated is 5.

2. Explain One-hot encoding and Label Encoding. How do they affect the dimensionality of the given dataset?

One-hot encoding is the representation of categorical variables as binary vectors. Label Encoding is converting labels/words into numeric form. Using one-hot encoding increases the dimensionality of the data set. Label encoding doesnโ€™t affect the dimensionality of the data set. One-hot encoding creates a new variable for each level in the variable whereas, in Label encoding, the levels of a variable get encoded as 1 and 0.

3. Explain the Difference Between Tableau Worksheet, Dashboard, Story, and Workbook in Tableau?

Tableau uses a workbook and sheet file structure, much like Microsoft Excel.
A workbook contains sheets, which can be a worksheet, dashboard, or a story.
A worksheet contains a single view along with shelves, legends, and the Data pane.
A dashboard is a collection of views from multiple worksheets.
A story contains a sequence of worksheets or dashboards that work together to convey information.

4. How can you split a column into 2 or more columns?

You can split a column into 2 or more columns by following the below steps:
1. Select the cell that you want to split. Then, navigate to the Data tab, after that, select Text to Columns. 2. Select the delimiter. 3. Choose the column data format and select the destination you want to display the split. 4. The final output will look like below where the text is split into multiple columns.

5. Do you wanna make your career in Data Science & Analytics but don't know how to start ?

https://t.me/sqlspecialist/851

Here are free resources that will make you technically strong enough to crack any Data Analyst and also learn Pro Career Growth Hacks to land on your Dream Job.
โค4
๐Ÿง  SQL Interview Question (Moderate & Revenue Analysis)
๐Ÿ“Œ

orders(order_id, customer_id, order_amount)

โ“ Ques :

๐Ÿ‘‰ Find customers who contribute more than 30% of the total company revenue.

๐Ÿงฉ How Interviewers Expect You to Think

โ€ข Calculate overall total revenue
โ€ข Aggregate revenue at customer level
โ€ข Compare individual contribution against total
โ€ข Avoid recalculating total multiple times inefficiently

๐Ÿ’ก SQL Solution

WITH total_revenue AS (
SELECT SUM(order_amount) AS total_rev
FROM orders
),
customer_revenue AS (
SELECT
customer_id,
SUM(order_amount) AS cust_rev
FROM orders
GROUP BY customer_id
)
SELECT c.customer_id
FROM customer_revenue c
CROSS JOIN total_revenue t
WHERE c.cust_rev > 0.30 * t.total_rev;

๐Ÿ”ฅ Why This Question Is Powerful

โ€ข Tests percentage-based business logic
โ€ข Evaluates ability to combine multiple aggregations
โ€ข Reflects real-world Pareto (80/20) analysis scenarios
โ€ข Common in product, growth & revenue analytics interviews

โค๏ธ React if you want more real interview-level SQL questions
โค1
SQL beginner to advanced level
๐Ÿš€๐—š๐—ฒ๐˜ ๐—ง๐—ผ๐—ฝ ๐—–๐—ฒ๐—ฟ๐˜๐—ถ๐—ณ๐—ถ๐—ฐ๐—ฎ๐˜๐—ถ๐—ผ๐—ป๐˜€ ๐—œ๐—œ๐—ง's & ๐—œ๐—œ๐—  

Dreaming of studying at an IIT and building a career in AI ? This is your chance

โœ… Prestigious IIT  Certification
โœ… Learn directly from IIT Professors
โœ… Placement Assistance with 5000+ Companies

๐Ÿ’ก Todayโ€™s top companies are actively looking for professionals with AI skills. 

๐—ฅ๐—ฒ๐—ด๐—ถ๐˜€๐˜๐—ฒ๐—ฟ ๐—ก๐—ผ๐˜„ ๐Ÿ‘‡ :- 

๐—”๐—œ & ๐——๐—ฎ๐˜๐—ฎ ๐—ฆ๐—ฐ๐—ถ๐—ฒ๐—ป๐—ฐ๐—ฒ :- https://pdlink.in/4kucM7E

๐—”๐—œ & ๐— ๐—ฎ๐—ฐ๐—ต๐—ถ๐—ป๐—ฒ ๐—Ÿ๐—ฒ๐—ฎ๐—ฟ๐—ป๐—ถ๐—ป๐—ด :- https://pdlink.in/4rMivIA

๐——๐—ฎ๐˜๐—ฎ ๐—”๐—ป๐—ฎ๐—น๐˜†๐˜๐—ถ๐—ฐ๐˜€ ๐—ช๐—ถ๐˜๐—ต ๐—”๐—œ :- https://pdlink.in/4ay4wPG

โณ Limited seats โ€“ Register before the link expires!
๐Ÿ“Š Interviewer: How do you remove duplicate records in SQL?

๐Ÿ‘‹ Me: We can remove duplicates using DISTINCT, GROUP BY, or delete duplicate rows using ROW_NUMBER().


โœ… 1๏ธโƒฃ Using DISTINCT (to fetch unique values)

SELECT DISTINCT column_name
FROM employees;


๐Ÿ‘‰ Returns unique records but does not delete duplicates.


โœ… 2๏ธโƒฃ Using GROUP BY (to identify duplicates)

SELECT name, COUNT(*)
FROM employees
GROUP BY name
HAVING COUNT(*) > 1;


๐Ÿ‘‰ Helps find duplicate records.


โœ… 3๏ธโƒฃ Delete Duplicates Using ROW_NUMBER() (Most Important โญ)
(Keeps one record and deletes others)

DELETE FROM employees
WHERE id IN (
SELECT id FROM (
SELECT id,
ROW_NUMBER() OVER (
PARTITION BY name, salary
ORDER BY id
) AS rn
FROM employees
) t
WHERE rn > 1
);


๐Ÿง  Logic Breakdown:

- DISTINCT โ†’ shows unique records
- GROUP BY โ†’ identifies duplicates
- ROW_NUMBER() โ†’ removes duplicates safely


โœ… Use Case: Data cleaning, ETL processes, data quality checks.

๐Ÿ’ก Tip: Always take a backup before deleting duplicate records.

๐Ÿ’ฌ Tap โค๏ธ for more!
โค2
๐——๐—ฒ๐˜ƒ๐—ข๐—ฝ๐˜€ ๐—™๐—ฅ๐—˜๐—˜ ๐—ข๐—ป๐—น๐—ถ๐—ป๐—ฒ ๐— ๐—ฎ๐˜€๐˜๐—ฒ๐—ฟ๐—ฐ๐—น๐—ฎ๐˜€๐˜€ ๐—•๐˜† ๐—œ๐—ป๐—ฑ๐˜‚๐˜€๐˜๐—ฟ๐˜† ๐—˜๐˜…๐—ฝ๐—ฒ๐—ฟ๐˜๐˜€๐Ÿ˜

- Bridge the Gap Between Your Current Skills and What DevOps Roles Demand
- Know The Roadmap To Become DevOps Engineer In 2026

Eligibility :- Students ,Freshers & Working Professionals

๐—ฅ๐—ฒ๐—ด๐—ถ๐˜€๐˜๐—ฒ๐—ฟ ๐—™๐—ผ๐—ฟ ๐—™๐—ฅ๐—˜๐—˜๐Ÿ‘‡ :- 

https://pdlink.in/40YmeqV

( Limited Slots ..Hurry Up๐Ÿƒโ€โ™‚๏ธ )

Date & Time :- March 10 , 2026 , 7:00 PM
โค3
๐Ÿ“Š Essential SQL Concepts Every Data Analyst Must Know

๐Ÿš€ SQL is the most important skill for Data Analysts. Almost every analytics job requires working with databases to extract, filter, analyze, and summarize data.

Understanding the following SQL concepts will help you write efficient queries and solve real business problems with data.

1๏ธโƒฃ SELECT Statement (Data Retrieval)

What it is: Retrieves data from a table.

SELECT name, salary
FROM employees;


Use cases: Retrieving specific columns, viewing datasets, extracting required information.

2๏ธโƒฃ WHERE Clause (Filtering Data)

What it is: Filters rows based on specific conditions.

SELECT *
FROM orders
WHERE order_amount > 500;


Common conditions: =, >, <, >=, <=, BETWEEN, IN, LIKE

3๏ธโƒฃ ORDER BY (Sorting Data)

What it is: Sorts query results in ascending or descending order.

SELECT name, salary
FROM employees
ORDER BY salary DESC;


Sorting options: ASC (default), DESC

4๏ธโƒฃ GROUP BY (Aggregation)

What it is: Groups rows with same values into summary rows.

SELECT department, COUNT(*)
FROM employees
GROUP BY department;


Use cases: Sales per region, customers per country, orders per product category.

5๏ธโƒฃ Aggregate Functions

What they do: Perform calculations on multiple rows.

SELECT AVG(salary)
FROM employees;


Common functions: COUNT(), SUM(), AVG(), MIN(), MAX()

6๏ธโƒฃ HAVING Clause

What it is: Filters grouped data after aggregation.

SELECT department, COUNT(*)
FROM employees
GROUP BY department
HAVING COUNT(*) > 5;


Key difference: WHERE filters rows before grouping, HAVING filters groups after aggregation.

7๏ธโƒฃ SQL JOINS (Combining Tables)

What they do: Combine tables.

-- INNER JOIN
SELECT orders.order_id, customers.customer_name
FROM orders
INNER JOIN customers
ON orders.customer_id = customers.customer_id;


-- LEFT JOIN
SELECT customers.customer_name, orders.order_id
FROM customers
LEFT JOIN orders
ON customers.customer_id = orders.customer_id;


Common types: INNER JOIN, LEFT JOIN, RIGHT JOIN, FULL JOIN

8๏ธโƒฃ Subqueries

What it is: Query inside another query.

SELECT name
FROM employees
WHERE salary > (SELECT AVG(salary) FROM employees);


Use cases: Comparing values, filtering based on aggregated results.

9๏ธโƒฃ Common Table Expressions (CTE)

What it is: Temporary result set used inside a query.

WITH high_salary AS (
SELECT name, salary
FROM employees
WHERE salary > 70000
)
SELECT *
FROM high_salary;


Benefits: Cleaner queries, easier debugging, better readability.

๐Ÿ”Ÿ Window Functions

What they do: Perform calculations across rows related to current row.

SELECT name, salary, RANK() OVER (ORDER BY salary DESC) AS salary_rank
FROM employees;


Common functions: ROW_NUMBER(), RANK(), DENSE_RANK(), LAG(), LEAD()

Why SQL is Critical for Data Analysts
โ€ข Extract data from databases
โ€ข Analyze large datasets efficiently
โ€ข Generate reports and dashboards
โ€ข Support business decision-making

SQL Resources: https://whatsapp.com/channel/0029VanC5rODzgT6TiTGoa1v

Double Tap โ™ฅ๏ธ For More
โค4๐Ÿ‘1
๐Ÿ“ข ๐—–๐—ฒ๐—ฟ๐˜๐—ถ๐—ณ๐—ถ๐—ฐ๐—ฎ๐˜๐—ถ๐—ผ๐—ป ๐—”๐—น๐—ฒ๐—ฟ๐˜ โ€“ ๐——๐—ฎ๐˜๐—ฎ ๐—”๐—ป๐—ฎ๐—น๐˜†๐˜๐—ถ๐—ฐ๐˜€ ๐—ช๐—ถ๐˜๐—ต ๐—”๐—œ

Upgrade your career with AI-powered data analytics skills.

๐Ÿ“Š Learn Data Analytics from Scratch
๐Ÿค– AI Tools & Automation
๐Ÿ“ˆ Data Visualization & Insights
๐ŸŽ“ Certification Program

๐Ÿ”ฅ Highly demanded skill in todayโ€™s job market.

๐—”๐—ฝ๐—ฝ๐—น๐˜† ๐—ก๐—ผ๐˜„๐Ÿ‘‡ :- 

https://pdlink.in/4syEItX

๐Ÿš€ Perfect for Students ,Freshers & Working Professionals
Essential Excel Functions for Data Analysts ๐Ÿš€

1๏ธโƒฃ Basic Functions

SUM() โ€“ Adds a range of numbers. =SUM(A1:A10)

AVERAGE() โ€“ Calculates the average. =AVERAGE(A1:A10)

MIN() / MAX() โ€“ Finds the smallest/largest value. =MIN(A1:A10)


2๏ธโƒฃ Logical Functions

IF() โ€“ Conditional logic. =IF(A1>50, "Pass", "Fail")

IFS() โ€“ Multiple conditions. =IFS(A1>90, "A", A1>80, "B", TRUE, "C")

AND() / OR() โ€“ Checks multiple conditions. =AND(A1>50, B1<100)


3๏ธโƒฃ Text Functions

LEFT() / RIGHT() / MID() โ€“ Extract text from a string.

=LEFT(A1, 3) (First 3 characters)

=MID(A1, 3, 2) (2 characters from the 3rd position)


LEN() โ€“ Counts characters. =LEN(A1)

TRIM() โ€“ Removes extra spaces. =TRIM(A1)

UPPER() / LOWER() / PROPER() โ€“ Changes text case.


4๏ธโƒฃ Lookup Functions

VLOOKUP() โ€“ Searches for a value in a column.

=VLOOKUP(1001, A2:B10, 2, FALSE)


HLOOKUP() โ€“ Searches in a row.

XLOOKUP() โ€“ Advanced lookup replacing VLOOKUP.

=XLOOKUP(1001, A2:A10, B2:B10, "Not Found")



5๏ธโƒฃ Date & Time Functions

TODAY() โ€“ Returns the current date.

NOW() โ€“ Returns the current date and time.

YEAR(), MONTH(), DAY() โ€“ Extracts parts of a date.

DATEDIF() โ€“ Calculates the difference between two dates.


6๏ธโƒฃ Data Cleaning Functions

REMOVE DUPLICATES โ€“ Found in the "Data" tab.

CLEAN() โ€“ Removes non-printable characters.

SUBSTITUTE() โ€“ Replaces text within a string.

=SUBSTITUTE(A1, "old", "new")



7๏ธโƒฃ Advanced Functions

INDEX() & MATCH() โ€“ More flexible alternative to VLOOKUP.

TEXTJOIN() โ€“ Joins text with a delimiter.

UNIQUE() โ€“ Returns unique values from a range.

FILTER() โ€“ Filters data dynamically.

=FILTER(A2:B10, B2:B10>50)



8๏ธโƒฃ Pivot Tables & Power Query

PIVOT TABLES โ€“ Summarizes data dynamically.

GETPIVOTDATA() โ€“ Extracts data from a Pivot Table.

POWER QUERY โ€“ Automates data cleaning & transformation.


You can find Free Excel Resources here: https://t.me/excel_data

Hope it helps :)

#dataanalytics
โค4