πΉ DATA ANALYST β INTERVIEW REVISION SHEET
1οΈβ£ Role Clarity
> βA data analyst collects, cleans, analyzes data, and converts it into insights that help businesses make decisions.β
2οΈβ£ SQL (Most Important)
Must-know clauses:
β’ SELECT, WHERE, ORDER BY, LIMIT
β’ GROUP BY, HAVING
β’ JOINS (INNER, LEFT)
β’ Subqueries, CTEs
β’ Window functions (ROW_NUMBER, RANK)
Golden rules:
β’ WHERE β before aggregation
β’ HAVING β after aggregation
β’ LEFT JOIN β keeps all left table rows
β’ NULLs break calculations β use COALESCE
Classic questions:
β’ Top N per group
β’ Find duplicates
β’ Running totals
3οΈβ£ Excel Essentials
Formulas:
β’ IF, XLOOKUP
β’ COUNTIFS, SUMIFS
β’ TRIM, LEFT, RIGHT
Core features:
β’ Pivot tables
β’ Conditional formatting
β’ Data validation (dropdowns)
Avoid:
β’ Merged cells
β’ Hard-coded values
4οΈβ£ Power BI / Tableau
Concepts:
β’ Data model (star schema)
β’ Relationships (one-to-many)
β’ Measures > calculated columns
Must-know DAX:
β’ Total Sales = SUM(Sales[Amount])
β’ YTD Sales = TOTALYTD(SUM(Sales[Amount]), Sales[Date])
Design rules:
β’ KPIs on top
β’ One story per dashboard
β’ Minimal visuals
5οΈβ£ Statistics (Only What Matters)
β’ Mean vs Median
β’ Standard deviation
β’ Correlation β causation
β’ Outliers distort averages
β’ Use median for Salaries, House prices
6οΈβ£ Data Cleaning (Interview Gold)
Steps you should say:
1. Remove duplicates
2. Handle missing values
3. Fix data types
4. Standardize text
7οΈβ£ Business Metrics
β’ Revenue
β’ Growth rate
β’ Conversion rate
β’ Churn
β’ Retention
β’ Average order value
Always connect metrics to business impact.
8οΈβ£ Case Question Framework (Very Important)
Always answer like this:
1. What happened
2. Why it happened
3. What should be done
Example:
> βSales dropped due to lower traffic in one region, so Iβd recommend increasing marketing spend there.β
9οΈβ£ Project Explanation Template
> βThe goal was . I used to clean data, to analyze, and to visualize. The key insight was . The business impact was .β
Memorize this.
π HR Power Answers
Why data analyst?
> βI enjoy finding patterns in data and turning them into actionable insights.β
Strength:
βI combine technical skills with business understanding.β
Weakness:
βI used to over-analyze, but now I focus on impact.β
π§ Last-Day Interview Tips
β’ Think out loud
β’ Ask clarifying questions
β’ Donβt jump to tools immediately
β’ Focus on impact, not syntax
π¬ Tap β€οΈ for more!
1οΈβ£ Role Clarity
> βA data analyst collects, cleans, analyzes data, and converts it into insights that help businesses make decisions.β
2οΈβ£ SQL (Most Important)
Must-know clauses:
β’ SELECT, WHERE, ORDER BY, LIMIT
β’ GROUP BY, HAVING
β’ JOINS (INNER, LEFT)
β’ Subqueries, CTEs
β’ Window functions (ROW_NUMBER, RANK)
Golden rules:
β’ WHERE β before aggregation
β’ HAVING β after aggregation
β’ LEFT JOIN β keeps all left table rows
β’ NULLs break calculations β use COALESCE
Classic questions:
β’ Top N per group
β’ Find duplicates
β’ Running totals
3οΈβ£ Excel Essentials
Formulas:
β’ IF, XLOOKUP
β’ COUNTIFS, SUMIFS
β’ TRIM, LEFT, RIGHT
Core features:
β’ Pivot tables
β’ Conditional formatting
β’ Data validation (dropdowns)
Avoid:
β’ Merged cells
β’ Hard-coded values
4οΈβ£ Power BI / Tableau
Concepts:
β’ Data model (star schema)
β’ Relationships (one-to-many)
β’ Measures > calculated columns
Must-know DAX:
β’ Total Sales = SUM(Sales[Amount])
β’ YTD Sales = TOTALYTD(SUM(Sales[Amount]), Sales[Date])
Design rules:
β’ KPIs on top
β’ One story per dashboard
β’ Minimal visuals
5οΈβ£ Statistics (Only What Matters)
β’ Mean vs Median
β’ Standard deviation
β’ Correlation β causation
β’ Outliers distort averages
β’ Use median for Salaries, House prices
6οΈβ£ Data Cleaning (Interview Gold)
Steps you should say:
1. Remove duplicates
2. Handle missing values
3. Fix data types
4. Standardize text
7οΈβ£ Business Metrics
β’ Revenue
β’ Growth rate
β’ Conversion rate
β’ Churn
β’ Retention
β’ Average order value
Always connect metrics to business impact.
8οΈβ£ Case Question Framework (Very Important)
Always answer like this:
1. What happened
2. Why it happened
3. What should be done
Example:
> βSales dropped due to lower traffic in one region, so Iβd recommend increasing marketing spend there.β
9οΈβ£ Project Explanation Template
> βThe goal was . I used to clean data, to analyze, and to visualize. The key insight was . The business impact was .β
Memorize this.
π HR Power Answers
Why data analyst?
> βI enjoy finding patterns in data and turning them into actionable insights.β
Strength:
βI combine technical skills with business understanding.β
Weakness:
βI used to over-analyze, but now I focus on impact.β
π§ Last-Day Interview Tips
β’ Think out loud
β’ Ask clarifying questions
β’ Donβt jump to tools immediately
β’ Focus on impact, not syntax
π¬ Tap β€οΈ for more!
β€19π2
β
Step-by-Step Approach to Learn Data Analytics ππ§
β Excel Fundamentals:
β Master formulas, pivot tables, data validation, charts, and graphs.
β SQL Basics:
β Learn to query databases, use SELECT, FROM, WHERE, JOIN, GROUP BY, and aggregate functions.
β Data Visualization:
β Get proficient with tools like Tableau or Power BI to create insightful dashboards.
β Statistical Concepts:
β Understand descriptive statistics (mean, median, mode), distributions, and hypothesis testing.
β Data Cleaning & Preprocessing:
β Learn how to handle missing data, outliers, and data inconsistencies.
β Exploratory Data Analysis (EDA):
β Explore datasets, identify patterns, and formulate hypotheses.
β Python for Data Analysis (Optional but Recommended):
β Learn Pandas and NumPy for data manipulation and analysis.
β Real-World Projects:
β Analyze datasets from Kaggle, UCI Machine Learning Repository, or your own collection.
β Business Acumen:
β Understand key business metrics and how data insights impact business decisions.
β Build a Portfolio:
β Showcase your projects on GitHub, Tableau Public, or a personal website. Highlight the impact of your analysis.
π Tap β€οΈ for more!
β Excel Fundamentals:
β Master formulas, pivot tables, data validation, charts, and graphs.
β SQL Basics:
β Learn to query databases, use SELECT, FROM, WHERE, JOIN, GROUP BY, and aggregate functions.
β Data Visualization:
β Get proficient with tools like Tableau or Power BI to create insightful dashboards.
β Statistical Concepts:
β Understand descriptive statistics (mean, median, mode), distributions, and hypothesis testing.
β Data Cleaning & Preprocessing:
β Learn how to handle missing data, outliers, and data inconsistencies.
β Exploratory Data Analysis (EDA):
β Explore datasets, identify patterns, and formulate hypotheses.
β Python for Data Analysis (Optional but Recommended):
β Learn Pandas and NumPy for data manipulation and analysis.
β Real-World Projects:
β Analyze datasets from Kaggle, UCI Machine Learning Repository, or your own collection.
β Business Acumen:
β Understand key business metrics and how data insights impact business decisions.
β Build a Portfolio:
β Showcase your projects on GitHub, Tableau Public, or a personal website. Highlight the impact of your analysis.
π Tap β€οΈ for more!
β€25π5
If you want to Excel as a Data Analyst, master these powerful skills:
β’ SQL Queries β SELECT, JOINs, GROUP BY, CTEs, Window Functions
β’ Excel Functions β VLOOKUP, XLOOKUP, PIVOT TABLES, POWER QUERY
β’ Data Cleaning β Handle missing values, duplicates, and inconsistencies
β’ Python for Data Analysis β Pandas, NumPy, Matplotlib, Seaborn
β’ Data Visualization β Create dashboards in Power BI/Tableau
β’ Statistical Analysis β Hypothesis testing, correlation, regression
β’ ETL Process β Extract, Transform, Load data efficiently
β’ Business Acumen β Understand industry-specific KPIs
β’ A/B Testing β Data-driven decision-making
β’ Storytelling with Data β Present insights effectively
Like it if you need a complete tutorial on all these topics! πβ€οΈ
β’ SQL Queries β SELECT, JOINs, GROUP BY, CTEs, Window Functions
β’ Excel Functions β VLOOKUP, XLOOKUP, PIVOT TABLES, POWER QUERY
β’ Data Cleaning β Handle missing values, duplicates, and inconsistencies
β’ Python for Data Analysis β Pandas, NumPy, Matplotlib, Seaborn
β’ Data Visualization β Create dashboards in Power BI/Tableau
β’ Statistical Analysis β Hypothesis testing, correlation, regression
β’ ETL Process β Extract, Transform, Load data efficiently
β’ Business Acumen β Understand industry-specific KPIs
β’ A/B Testing β Data-driven decision-making
β’ Storytelling with Data β Present insights effectively
Like it if you need a complete tutorial on all these topics! πβ€οΈ
β€41π14
β
SQL Interview Challenge! π§ π»
ππ»ππ²πΏππΆπ²ππ²πΏ: Find all employees who *donβt have a manager* (i.e.,
π π²: Using
β Why it works:
β
β Simple and fast for identifying top-level employees in an organization.
π Bonus Tip: Combine with
π¬ Tap β€οΈ if this helped you!
ππ»ππ²πΏππΆπ²ππ²πΏ: Find all employees who *donβt have a manager* (i.e.,
manager_id is NULL) and list their names and departments. π π²: Using
WHERE with IS NULL:SELECT name, department
FROM employees
WHERE manager_id IS NULL;
β Why it works:
β
IS NULL filters rows where manager_id is missing. β Simple and fast for identifying top-level employees in an organization.
π Bonus Tip: Combine with
LEFT JOIN to also include department names from another table if needed. π¬ Tap β€οΈ if this helped you!
β€29π7
π Complete Roadmap to Become a Power BI Expert
π 1. Understand Basics of Data & BI
β What is Business Intelligence?
β Importance of data visualization
π 2. Learn Power BI Interface
β Power BI Desktop overview
β Power Query Editor basics
π 3. Connect to Data Sources
β Excel, SQL Server, SharePoint, APIs, CSV, etc.
π 4. Data Transformation & Cleaning
β Use Power Query to shape, clean, and prepare data
π 5. Learn Data Modeling
β Create relationships between tables
β Understand star schema & normalization basics
π 6. Master DAX (Data Analysis Expressions)
β Calculated columns, measures, time intelligence functions
π 7. Create Interactive Visualizations
β Charts, slicers, maps, tables, and custom visuals
π 8. Build Dashboards & Reports
β Combine visuals for insightful dashboards
β Use bookmarks, drill-throughs, tooltips
π 9. Publish & Share Reports
β Power BI Service basics
β Sharing, workspaces, and app creation
π 10. Learn Power BI Administration
β Row-level security (RLS)
β Gateway setup & scheduled refresh
π 11. Practice Real-World Projects
β Sales dashboards, financial reports, customer insights
π Like for more!
π 1. Understand Basics of Data & BI
β What is Business Intelligence?
β Importance of data visualization
π 2. Learn Power BI Interface
β Power BI Desktop overview
β Power Query Editor basics
π 3. Connect to Data Sources
β Excel, SQL Server, SharePoint, APIs, CSV, etc.
π 4. Data Transformation & Cleaning
β Use Power Query to shape, clean, and prepare data
π 5. Learn Data Modeling
β Create relationships between tables
β Understand star schema & normalization basics
π 6. Master DAX (Data Analysis Expressions)
β Calculated columns, measures, time intelligence functions
π 7. Create Interactive Visualizations
β Charts, slicers, maps, tables, and custom visuals
π 8. Build Dashboards & Reports
β Combine visuals for insightful dashboards
β Use bookmarks, drill-throughs, tooltips
π 9. Publish & Share Reports
β Power BI Service basics
β Sharing, workspaces, and app creation
π 10. Learn Power BI Administration
β Row-level security (RLS)
β Gateway setup & scheduled refresh
π 11. Practice Real-World Projects
β Sales dashboards, financial reports, customer insights
π Like for more!
β€19
SQL From Basic to Advanced level
Basic SQL is ONLY 7 commands:
- SELECT
- FROM
- WHERE (also use SQL comparison operators such as =, <=, >=, <> etc.)
- ORDER BY
- Aggregate functions such as SUM, AVERAGE, COUNT etc.
- GROUP BY
- CREATE, INSERT, DELETE, etc.
You can do all this in just one morning.
Once you know these, take the next step and learn commands like:
- LEFT JOIN
- INNER JOIN
- LIKE
- IN
- CASE WHEN
- HAVING (undertstand how it's different from GROUP BY)
- UNION ALL
This should take another day.
Once both basic and intermediate are done, start learning more advanced SQL concepts such as:
- Subqueries (when to use subqueries vs CTE?)
- CTEs (WITH AS)
- Stored Procedures
- Triggers
- Window functions (LEAD, LAG, PARTITION BY, RANK, DENSE RANK)
These can be done in a couple of days.
Learning these concepts is NOT hard at all
- what takes time is practice and knowing what command to use when. How do you master that?
- First, create a basic SQL project
- Then, work on an intermediate SQL project (search online) -
Lastly, create something advanced on SQL with many CTEs, subqueries, stored procedures and triggers etc.
This is ALL you need to become a badass in SQL, and trust me when I say this, it is not rocket science. It's just logic.
Remember that practice is the key here. It will be more clear and perfect with the continous practice
Best telegram channel to learn SQL: https://t.me/sqlanalyst
Data Analyst Jobsπ
https://t.me/jobs_SQL
Join @free4unow_backup for more free resources.
Like this post if it helps πβ€οΈ
ENJOY LEARNING ππ
Basic SQL is ONLY 7 commands:
- SELECT
- FROM
- WHERE (also use SQL comparison operators such as =, <=, >=, <> etc.)
- ORDER BY
- Aggregate functions such as SUM, AVERAGE, COUNT etc.
- GROUP BY
- CREATE, INSERT, DELETE, etc.
You can do all this in just one morning.
Once you know these, take the next step and learn commands like:
- LEFT JOIN
- INNER JOIN
- LIKE
- IN
- CASE WHEN
- HAVING (undertstand how it's different from GROUP BY)
- UNION ALL
This should take another day.
Once both basic and intermediate are done, start learning more advanced SQL concepts such as:
- Subqueries (when to use subqueries vs CTE?)
- CTEs (WITH AS)
- Stored Procedures
- Triggers
- Window functions (LEAD, LAG, PARTITION BY, RANK, DENSE RANK)
These can be done in a couple of days.
Learning these concepts is NOT hard at all
- what takes time is practice and knowing what command to use when. How do you master that?
- First, create a basic SQL project
- Then, work on an intermediate SQL project (search online) -
Lastly, create something advanced on SQL with many CTEs, subqueries, stored procedures and triggers etc.
This is ALL you need to become a badass in SQL, and trust me when I say this, it is not rocket science. It's just logic.
Remember that practice is the key here. It will be more clear and perfect with the continous practice
Best telegram channel to learn SQL: https://t.me/sqlanalyst
Data Analyst Jobsπ
https://t.me/jobs_SQL
Join @free4unow_backup for more free resources.
Like this post if it helps πβ€οΈ
ENJOY LEARNING ππ
β€18π1
Top Career Paths in Data Analytics ππΌ
1οΈβ£ Data Analyst
πΉ Analyzes data to drive business decisions
πΉ Creates reports, dashboards, and visualizations
πΉ Skills: SQL, Excel, Tableau, Power BI
2οΈβ£ Data Scientist
πΉ Extracts insights from complex data using ML stats
πΉ Builds predictive models and algorithms
πΉ Skills: Python, R, ML, stats
3οΈβ£ Business Intelligence (BI) Analyst
πΉ Translates data into business actions
πΉ Focus on reporting and data visualization
πΉ Skills: BI tools, SQL, data warehousing
4οΈβ£ Data Engineer
πΉ Builds and maintains data pipelines
πΉ Ensures data quality and infrastructure
πΉ Skills: SQL, Python, data warehousing, ETL
5οΈβ£ Marketing Analyst
πΉ Analyzes customer data for marketing insights
πΉ Optimizes campaigns and strategies
πΉ Skills: Analytics tools, SQL, marketing metrics
6οΈβ£ Financial Analyst
πΉ Uses data for financial planning and analysis
πΉ Forecasting, budgeting, and reporting
πΉ Skills: Excel, financial modeling, SQL
7οΈβ£ Operations Analyst
πΉ Improves business processes using data
πΉ Focus on efficiency and optimization
πΉ Skills: Process mapping, SQL, analytics tools
8οΈβ£ Data Visualization Specialist
πΉ Creates visual stories with data
πΉ Uses tools like Tableau, Power BI, D3.js
πΉ Skills: Design, storytelling, BI tools
9οΈβ£ Quantitative Analyst
πΉ Applies math models to financial data
πΉ Risk analysis, trading strategies
πΉ Skills: Math, Python, financial markets
π Data Analytics Consultant
πΉ Helps businesses implement data strategies
πΉ Focus on insights and problem-solving
πΉ Skills: Analytics tools, business acumen
π‘ Double Tap β₯οΈ For More
1οΈβ£ Data Analyst
πΉ Analyzes data to drive business decisions
πΉ Creates reports, dashboards, and visualizations
πΉ Skills: SQL, Excel, Tableau, Power BI
2οΈβ£ Data Scientist
πΉ Extracts insights from complex data using ML stats
πΉ Builds predictive models and algorithms
πΉ Skills: Python, R, ML, stats
3οΈβ£ Business Intelligence (BI) Analyst
πΉ Translates data into business actions
πΉ Focus on reporting and data visualization
πΉ Skills: BI tools, SQL, data warehousing
4οΈβ£ Data Engineer
πΉ Builds and maintains data pipelines
πΉ Ensures data quality and infrastructure
πΉ Skills: SQL, Python, data warehousing, ETL
5οΈβ£ Marketing Analyst
πΉ Analyzes customer data for marketing insights
πΉ Optimizes campaigns and strategies
πΉ Skills: Analytics tools, SQL, marketing metrics
6οΈβ£ Financial Analyst
πΉ Uses data for financial planning and analysis
πΉ Forecasting, budgeting, and reporting
πΉ Skills: Excel, financial modeling, SQL
7οΈβ£ Operations Analyst
πΉ Improves business processes using data
πΉ Focus on efficiency and optimization
πΉ Skills: Process mapping, SQL, analytics tools
8οΈβ£ Data Visualization Specialist
πΉ Creates visual stories with data
πΉ Uses tools like Tableau, Power BI, D3.js
πΉ Skills: Design, storytelling, BI tools
9οΈβ£ Quantitative Analyst
πΉ Applies math models to financial data
πΉ Risk analysis, trading strategies
πΉ Skills: Math, Python, financial markets
π Data Analytics Consultant
πΉ Helps businesses implement data strategies
πΉ Focus on insights and problem-solving
πΉ Skills: Analytics tools, business acumen
π‘ Double Tap β₯οΈ For More
β€21
π¨Do not miss this (Top FREE AI certificate courses)
Enroll now in these 50+ Free AI certification courses , available for a limited time: https://docs.google.com/spreadsheets/d/1k0XXLD2e8FnXgN2Ja_mG4MI7w1ImW5AF_JKWUscTyq8/edit?usp=sharing
LIFETIME ACCESS
Top FREE AI, ML, & Python Certificate courses which will help to boost resume & in getting better jobs.
Enroll now in these 50+ Free AI certification courses , available for a limited time: https://docs.google.com/spreadsheets/d/1k0XXLD2e8FnXgN2Ja_mG4MI7w1ImW5AF_JKWUscTyq8/edit?usp=sharing
LIFETIME ACCESS
Top FREE AI, ML, & Python Certificate courses which will help to boost resume & in getting better jobs.
β€6
PREPARATION GUIDE FOR DATA ANALYST INTERVIEW
π Review the job description and requirements: Carefully review the job description and requirements for the data analyst position to understand the specific skills and knowledge required.
π Brush up on data analysis concepts and techniques: Make sure you have a solid understanding of data analysis concepts, such as data cleaning, data visualization, and statistical analysis. Review the basics of these techniques, and be familiar with the tools and software used for data analysis.
π Study data visualization tools: Familiarize yourself with data visualization tools like Tableau, PowerBI, and others, and be able to explain how to use them to analyze and present data.
π Brush up on SQL: SQL is a key tool for data analysts, so be sure to review basic SQL commands and be familiar with more advanced concepts such as joining tables and aggregating data.
π Practice your communication skills: Data analysts need to be able to effectively communicate their findings to others, so make sure you have strong written and verbal communication skills.
π Be prepared to discuss real-life examples: Be prepared to discuss specific examples of data analysis projects you have worked on, and be able to explain the methods and techniques you used to complete them.
π Review the company's data and analytics strategy: Research the company's data and analytics strategy, and be prepared to discuss how your skills and experience align with their goals and objectives.
π Free learning resources
https://t.me/free4unow_backup/361
ENJOY LEARNING ππ
π Review the job description and requirements: Carefully review the job description and requirements for the data analyst position to understand the specific skills and knowledge required.
π Brush up on data analysis concepts and techniques: Make sure you have a solid understanding of data analysis concepts, such as data cleaning, data visualization, and statistical analysis. Review the basics of these techniques, and be familiar with the tools and software used for data analysis.
π Study data visualization tools: Familiarize yourself with data visualization tools like Tableau, PowerBI, and others, and be able to explain how to use them to analyze and present data.
π Brush up on SQL: SQL is a key tool for data analysts, so be sure to review basic SQL commands and be familiar with more advanced concepts such as joining tables and aggregating data.
π Practice your communication skills: Data analysts need to be able to effectively communicate their findings to others, so make sure you have strong written and verbal communication skills.
π Be prepared to discuss real-life examples: Be prepared to discuss specific examples of data analysis projects you have worked on, and be able to explain the methods and techniques you used to complete them.
π Review the company's data and analytics strategy: Research the company's data and analytics strategy, and be prepared to discuss how your skills and experience align with their goals and objectives.
π Free learning resources
https://t.me/free4unow_backup/361
ENJOY LEARNING ππ
β€9
β
Scenario-Based Data Analyst Practice Questions with Answers ππ₯
π Q1. Sales dropped by 20% last month. How would you analyze the problem?
β Answer:
Compare sales with previous months
Break down by region, product, and customer segment
Check seasonal trends and external factors
Identify root cause using data patterns
π Q2. You find missing values in a dataset. What will you do?
β Answer:
Remove rows if data is small
Replace with mean/median/mode
Use interpolation or business logic
Analyze impact before handling
π Q3. A stakeholder asks for insights from raw data. What steps will you follow?
β Answer:
Data collection β Data cleaning β Data exploration β Analysis β Visualization β Business insights.
π Q4. How would you identify top-performing products?
β Answer:
Use revenue or sales metrics, apply sorting or ranking, and compare performance across categories.
π Q5. How do you explain technical results to non-technical stakeholders?
β Answer:
Use simple language, charts, dashboards, and focus on business impact instead of technical details.
π Q6. How would you detect outliers in data?
β Answer:
Use box plots, statistical methods (IQR, Z-score), or visualization techniques.
π Q7. A dashboard is slow. How would you improve performance?
β Answer:
Optimize queries, reduce data size, remove unnecessary visuals, improve data model.
π Q8. How would you measure customer churn?
β Answer:
Calculate customers lost during a period Γ· total customers at the start Γ 100.
π Q9. What would you check before trusting a dataset?
β Answer:
Data source reliability, missing values, duplicates, consistency, and accuracy.
π Q10. How do you prioritize multiple analysis requests?
β Answer:
Based on business impact, urgency, stakeholder needs, and deadlines.
Double Tap β₯οΈ For More
π Q1. Sales dropped by 20% last month. How would you analyze the problem?
β Answer:
Compare sales with previous months
Break down by region, product, and customer segment
Check seasonal trends and external factors
Identify root cause using data patterns
π Q2. You find missing values in a dataset. What will you do?
β Answer:
Remove rows if data is small
Replace with mean/median/mode
Use interpolation or business logic
Analyze impact before handling
π Q3. A stakeholder asks for insights from raw data. What steps will you follow?
β Answer:
Data collection β Data cleaning β Data exploration β Analysis β Visualization β Business insights.
π Q4. How would you identify top-performing products?
β Answer:
Use revenue or sales metrics, apply sorting or ranking, and compare performance across categories.
π Q5. How do you explain technical results to non-technical stakeholders?
β Answer:
Use simple language, charts, dashboards, and focus on business impact instead of technical details.
π Q6. How would you detect outliers in data?
β Answer:
Use box plots, statistical methods (IQR, Z-score), or visualization techniques.
π Q7. A dashboard is slow. How would you improve performance?
β Answer:
Optimize queries, reduce data size, remove unnecessary visuals, improve data model.
π Q8. How would you measure customer churn?
β Answer:
Calculate customers lost during a period Γ· total customers at the start Γ 100.
π Q9. What would you check before trusting a dataset?
β Answer:
Data source reliability, missing values, duplicates, consistency, and accuracy.
π Q10. How do you prioritize multiple analysis requests?
β Answer:
Based on business impact, urgency, stakeholder needs, and deadlines.
Double Tap β₯οΈ For More
β€19π1π1
β
SQL Roadmap: Step-by-Step Guide to Master SQL π§ π»
Whether you're aiming to be a backend dev, data analyst, or full-time SQL pro β this roadmap has got you covered π
π 1. SQL Basics
β¦ SELECT, FROM, WHERE
β¦ ORDER BY, LIMIT, DISTINCT
Learn data retrieval & filtering.
π 2. Joins Mastery
β¦ INNER JOIN, LEFT/RIGHT/FULL OUTER JOIN
β¦ SELF JOIN, CROSS JOIN
Master table relationships.
π 3. Aggregate Functions
β¦ COUNT(), SUM(), AVG(), MIN(), MAX()
Key for reporting & analytics.
π 4. Grouping Data
β¦ GROUP BY to group
β¦ HAVING to filter groups
Example: Sales by region, top categories.
π 5. Subqueries & Nested Queries
β¦ Use subqueries in WHERE, FROM, SELECT
β¦ Use EXISTS, IN, ANY, ALL
Build complex logic without extra joins.
π 6. Data Modification
β¦ INSERT INTO, UPDATE, DELETE
β¦ MERGE (advanced)
Safely change dataset content.
π 7. Database Design Concepts
β¦ Normalization (1NF to 3NF)
β¦ Primary, Foreign, Unique Keys
Design scalable, clean DBs.
π 8. Indexing & Query Optimization
β¦ Speed queries with indexes
β¦ Use EXPLAIN, ANALYZE to tune
Vital for big data/enterprise work.
π 9. Stored Procedures & Functions
β¦ Reusable logic, control flow (IF, CASE, LOOP)
Backend logic inside the DB.
π 10. Transactions & Locks
β¦ ACID properties
β¦ BEGIN, COMMIT, ROLLBACK
β¦ Lock types (SHARED, EXCLUSIVE)
Prevent data corruption in concurrency.
π 11. Views & Triggers
β¦ CREATE VIEW for abstraction
β¦ TRIGGERS auto-run SQL on events
Automate & maintain logic.
π 12. Backup & Restore
β¦ Backup/restore with tools (mysqldump, pg_dump)
Keep your data safe.
π 13. NoSQL Basics (Optional)
β¦ Learn MongoDB, Redis basics
β¦ Understand where SQL ends & NoSQL begins.
π 14. Real Projects & Practice
β¦ Build projects: Employee DB, Sales Dashboard, Blogging System
β¦ Practice on LeetCode, StrataScratch, HackerRank
π 15. Apply for SQL Dev Roles
β¦ Tailor resume with projects & optimization skills
β¦ Prepare for interviews with SQL challenges
β¦ Know common business use cases
π‘ Pro Tip: Combine SQL with Python or Excel to boost your data career options.
π¬ Double Tap β₯οΈ For More!
Whether you're aiming to be a backend dev, data analyst, or full-time SQL pro β this roadmap has got you covered π
π 1. SQL Basics
β¦ SELECT, FROM, WHERE
β¦ ORDER BY, LIMIT, DISTINCT
Learn data retrieval & filtering.
π 2. Joins Mastery
β¦ INNER JOIN, LEFT/RIGHT/FULL OUTER JOIN
β¦ SELF JOIN, CROSS JOIN
Master table relationships.
π 3. Aggregate Functions
β¦ COUNT(), SUM(), AVG(), MIN(), MAX()
Key for reporting & analytics.
π 4. Grouping Data
β¦ GROUP BY to group
β¦ HAVING to filter groups
Example: Sales by region, top categories.
π 5. Subqueries & Nested Queries
β¦ Use subqueries in WHERE, FROM, SELECT
β¦ Use EXISTS, IN, ANY, ALL
Build complex logic without extra joins.
π 6. Data Modification
β¦ INSERT INTO, UPDATE, DELETE
β¦ MERGE (advanced)
Safely change dataset content.
π 7. Database Design Concepts
β¦ Normalization (1NF to 3NF)
β¦ Primary, Foreign, Unique Keys
Design scalable, clean DBs.
π 8. Indexing & Query Optimization
β¦ Speed queries with indexes
β¦ Use EXPLAIN, ANALYZE to tune
Vital for big data/enterprise work.
π 9. Stored Procedures & Functions
β¦ Reusable logic, control flow (IF, CASE, LOOP)
Backend logic inside the DB.
π 10. Transactions & Locks
β¦ ACID properties
β¦ BEGIN, COMMIT, ROLLBACK
β¦ Lock types (SHARED, EXCLUSIVE)
Prevent data corruption in concurrency.
π 11. Views & Triggers
β¦ CREATE VIEW for abstraction
β¦ TRIGGERS auto-run SQL on events
Automate & maintain logic.
π 12. Backup & Restore
β¦ Backup/restore with tools (mysqldump, pg_dump)
Keep your data safe.
π 13. NoSQL Basics (Optional)
β¦ Learn MongoDB, Redis basics
β¦ Understand where SQL ends & NoSQL begins.
π 14. Real Projects & Practice
β¦ Build projects: Employee DB, Sales Dashboard, Blogging System
β¦ Practice on LeetCode, StrataScratch, HackerRank
π 15. Apply for SQL Dev Roles
β¦ Tailor resume with projects & optimization skills
β¦ Prepare for interviews with SQL challenges
β¦ Know common business use cases
π‘ Pro Tip: Combine SQL with Python or Excel to boost your data career options.
π¬ Double Tap β₯οΈ For More!
β€20
β
SQL Aggregate Functions Practice Questions with Answers π§ π
π Q1. Find the total salary of all employees.
ποΈ Table: "employees(emp_id, name, salary)"
β Answer:
SELECT SUM(salary) AS total_salary
FROM employees;
π Q2. Calculate the average salary of employees.
ποΈ Table: "employees(emp_id, name, salary)"
β Answer:
SELECT AVG(salary) AS avg_salary
FROM employees;
π Q3. Count total number of employees in the company.
ποΈ Table: "employees(emp_id, name)"
β Answer:
SELECT COUNT(*) AS total_employees
FROM employees;
π Q4. Find the highest and lowest salary.
ποΈ Table: "employees(emp_id, name, salary)"
β Answer:
SELECT MAX(salary) AS highest_salary,
MIN(salary) AS lowest_salary
FROM employees;
π Q5. Get total salary paid in each department.
ποΈ Table: "employees(emp_id, name, department_id, salary)"
β Answer:
SELECT department_id,
SUM(salary) AS total_salary
FROM employees
GROUP BY department_id;
Double Tap β₯οΈ For More
π Q1. Find the total salary of all employees.
ποΈ Table: "employees(emp_id, name, salary)"
β Answer:
SELECT SUM(salary) AS total_salary
FROM employees;
π Q2. Calculate the average salary of employees.
ποΈ Table: "employees(emp_id, name, salary)"
β Answer:
SELECT AVG(salary) AS avg_salary
FROM employees;
π Q3. Count total number of employees in the company.
ποΈ Table: "employees(emp_id, name)"
β Answer:
SELECT COUNT(*) AS total_employees
FROM employees;
π Q4. Find the highest and lowest salary.
ποΈ Table: "employees(emp_id, name, salary)"
β Answer:
SELECT MAX(salary) AS highest_salary,
MIN(salary) AS lowest_salary
FROM employees;
π Q5. Get total salary paid in each department.
ποΈ Table: "employees(emp_id, name, department_id, salary)"
β Answer:
SELECT department_id,
SUM(salary) AS total_salary
FROM employees
GROUP BY department_id;
Double Tap β₯οΈ For More
β€31
β
SQL Aggregate Functions Questions with Answers Part-2 ππ
π Q1. Find departments where the average salary is greater than 70,000.
ποΈ Table: "employees(emp_id, name, department_id, salary)"
β Answer:
π Q2. Count employees in each department having more than 5 employees.
ποΈ Table: "employees(emp_id, name, department_id)"
β Answer:
π Q3. Find the department with the highest total salary.
ποΈ Table: "employees(emp_id, department_id, salary)"
β Answer:
π Q4. Get departments where the minimum salary is greater than 30,000.
ποΈ Table: "employees(emp_id, department_id, salary)"
β Answer:
π Q5. Find the difference between highest and lowest salary in each department.
ποΈ Table: "employees(emp_id, department_id, salary)"
β Answer:
Double Tap β₯οΈ For More
π Q1. Find departments where the average salary is greater than 70,000.
ποΈ Table: "employees(emp_id, name, department_id, salary)"
β Answer:
SELECT department_id, AVG(salary) AS avg_salary
FROM employees
GROUP BY department_id
HAVING AVG(salary) > 70000;
π Q2. Count employees in each department having more than 5 employees.
ποΈ Table: "employees(emp_id, name, department_id)"
β Answer:
SELECT department_id, COUNT(*) AS total_employees
FROM employees
GROUP BY department_id
HAVING COUNT(*) > 5;
π Q3. Find the department with the highest total salary.
ποΈ Table: "employees(emp_id, department_id, salary)"
β Answer:
SELECT department_id
FROM employees
GROUP BY department_id
ORDER BY SUM(salary) DESC
LIMIT 1;
π Q4. Get departments where the minimum salary is greater than 30,000.
ποΈ Table: "employees(emp_id, department_id, salary)"
β Answer:
SELECT department_id, MIN(salary) AS min_salary
FROM employees
GROUP BY department_id
HAVING MIN(salary) > 30000;
π Q5. Find the difference between highest and lowest salary in each department.
ποΈ Table: "employees(emp_id, department_id, salary)"
β Answer:
SELECT department_id, MAX(salary) - MIN(salary) AS salary_difference
FROM employees
GROUP BY department_id;
Double Tap β₯οΈ For More
β€23
Most Asked SQL Interview Questions at MAANG Companiesπ₯π₯
Preparing for an SQL Interview at MAANG Companies? Here are some crucial SQL Questions you should be ready to tackle:
1. How do you retrieve all columns from a table?
SELECT * FROM table_name;
2. What SQL statement is used to filter records?
SELECT * FROM table_name
WHERE condition;
The WHERE clause is used to filter records based on a specified condition.
3. How can you join multiple tables? Describe different types of JOINs.
SELECT columns
FROM table1
JOIN table2 ON table1.column = table2.column
JOIN table3 ON table2.column = table3.column;
Types of JOINs:
1. INNER JOIN: Returns records with matching values in both tables
SELECT * FROM table1
INNER JOIN table2 ON table1.column = table2.column;
2. LEFT JOIN: Returns all records from the left table & matched records from the right table. Unmatched records will have NULL values.
SELECT * FROM table1
LEFT JOIN table2 ON table1.column = table2.column;
3. RIGHT JOIN: Returns all records from the right table & matched records from the left table. Unmatched records will have NULL values.
SELECT * FROM table1
RIGHT JOIN table2 ON table1.column = table2.column;
4. FULL JOIN: Returns records when there is a match in either left or right table. Unmatched records will have NULL values.
SELECT * FROM table1
FULL JOIN table2 ON table1.column = table2.column;
4. What is the difference between WHERE & HAVING clauses?
WHERE: Filters records before any groupings are made.
SELECT * FROM table_name
WHERE condition;
HAVING: Filters records after groupings are made.
SELECT column, COUNT(*)
FROM table_name
GROUP BY column
HAVING COUNT(*) > value;
5. How do you calculate average, sum, minimum & maximum values in a column?
Average: SELECT AVG(column_name) FROM table_name;
Sum: SELECT SUM(column_name) FROM table_name;
Minimum: SELECT MIN(column_name) FROM table_name;
Maximum: SELECT MAX(column_name) FROM table_name;
Here you can find essential SQL Interview Resourcesπ
https://t.me/mysqldata
Like this post if you need more πβ€οΈ
Hope it helps :)
Preparing for an SQL Interview at MAANG Companies? Here are some crucial SQL Questions you should be ready to tackle:
1. How do you retrieve all columns from a table?
SELECT * FROM table_name;
2. What SQL statement is used to filter records?
SELECT * FROM table_name
WHERE condition;
The WHERE clause is used to filter records based on a specified condition.
3. How can you join multiple tables? Describe different types of JOINs.
SELECT columns
FROM table1
JOIN table2 ON table1.column = table2.column
JOIN table3 ON table2.column = table3.column;
Types of JOINs:
1. INNER JOIN: Returns records with matching values in both tables
SELECT * FROM table1
INNER JOIN table2 ON table1.column = table2.column;
2. LEFT JOIN: Returns all records from the left table & matched records from the right table. Unmatched records will have NULL values.
SELECT * FROM table1
LEFT JOIN table2 ON table1.column = table2.column;
3. RIGHT JOIN: Returns all records from the right table & matched records from the left table. Unmatched records will have NULL values.
SELECT * FROM table1
RIGHT JOIN table2 ON table1.column = table2.column;
4. FULL JOIN: Returns records when there is a match in either left or right table. Unmatched records will have NULL values.
SELECT * FROM table1
FULL JOIN table2 ON table1.column = table2.column;
4. What is the difference between WHERE & HAVING clauses?
WHERE: Filters records before any groupings are made.
SELECT * FROM table_name
WHERE condition;
HAVING: Filters records after groupings are made.
SELECT column, COUNT(*)
FROM table_name
GROUP BY column
HAVING COUNT(*) > value;
5. How do you calculate average, sum, minimum & maximum values in a column?
Average: SELECT AVG(column_name) FROM table_name;
Sum: SELECT SUM(column_name) FROM table_name;
Minimum: SELECT MIN(column_name) FROM table_name;
Maximum: SELECT MAX(column_name) FROM table_name;
Here you can find essential SQL Interview Resourcesπ
https://t.me/mysqldata
Like this post if you need more πβ€οΈ
Hope it helps :)
β€17
β
π€ AβZ of Data Analyst ππΌ
A β Analytics
The process of analyzing data to discover insights and support decision-making.
B β Business Intelligence (BI)
Technologies and tools used to analyze business data (Power BI, Tableau).
C β Cleaning (Data Cleaning)
Removing errors, duplicates, and inconsistencies from data.
D β Dashboard
A visual display of key metrics and insights.
E β ETL (Extract, Transform, Load)
Process of collecting, cleaning, and storing data for analysis.
F β Forecasting
Predicting future trends using historical data.
G β Group By
A method to organize data into categories for analysis.
H β Hypothesis Testing
Testing assumptions using statistical methods.
I β Insight
Meaningful information derived from data analysis.
J β Join
Combining data from multiple tables (SQL concept).
K β KPI (Key Performance Indicator)
A measurable value showing business performance.
L β Linear Regression
A statistical method used to predict relationships between variables.
M β Metrics
Quantifiable measures used to track performance.
N β Normalization
Organizing data to reduce redundancy and improve efficiency.
O β Outlier
A data point significantly different from others.
P β Pivot Table
A tool used to summarize and analyze data quickly.
Q β Query
A request to retrieve data from a database.
R β Reporting
Presenting data insights through charts and summaries.
S β SQL
Language used to manage and analyze structured data.
T β Trend Analysis
Identifying patterns or changes over time.
U β Unstructured Data
Data without predefined format (text, images).
V β Visualization
Representing data using charts or graphs.
W β Warehousing (Data Warehouse)
Central storage of large structured datasets.
X β X-axis
Horizontal axis in charts representing variables.
Y β YoY (Year-over-Year)
Comparing data from one year to another.
Z β Z-Score
Statistical measure showing how far a value is from the mean.
Double Tap β₯οΈ For More
A β Analytics
The process of analyzing data to discover insights and support decision-making.
B β Business Intelligence (BI)
Technologies and tools used to analyze business data (Power BI, Tableau).
C β Cleaning (Data Cleaning)
Removing errors, duplicates, and inconsistencies from data.
D β Dashboard
A visual display of key metrics and insights.
E β ETL (Extract, Transform, Load)
Process of collecting, cleaning, and storing data for analysis.
F β Forecasting
Predicting future trends using historical data.
G β Group By
A method to organize data into categories for analysis.
H β Hypothesis Testing
Testing assumptions using statistical methods.
I β Insight
Meaningful information derived from data analysis.
J β Join
Combining data from multiple tables (SQL concept).
K β KPI (Key Performance Indicator)
A measurable value showing business performance.
L β Linear Regression
A statistical method used to predict relationships between variables.
M β Metrics
Quantifiable measures used to track performance.
N β Normalization
Organizing data to reduce redundancy and improve efficiency.
O β Outlier
A data point significantly different from others.
P β Pivot Table
A tool used to summarize and analyze data quickly.
Q β Query
A request to retrieve data from a database.
R β Reporting
Presenting data insights through charts and summaries.
S β SQL
Language used to manage and analyze structured data.
T β Trend Analysis
Identifying patterns or changes over time.
U β Unstructured Data
Data without predefined format (text, images).
V β Visualization
Representing data using charts or graphs.
W β Warehousing (Data Warehouse)
Central storage of large structured datasets.
X β X-axis
Horizontal axis in charts representing variables.
Y β YoY (Year-over-Year)
Comparing data from one year to another.
Z β Z-Score
Statistical measure showing how far a value is from the mean.
Double Tap β₯οΈ For More
β€32
π Top 50 Data Analyst Interview Questions ππΌ
βπ EXCEL Questions
1. Can you show me how you'd clean this messy dataset in Excel? What functions like TRIM or Remove Duplicates would you use?
2. What's the difference between absolute ($A$1) and relative (A1) references? When do you use each?
3. Walk me through creating a PivotTable to analyze sales by region and product. What are the exact steps?
4. Write a VLOOKUP formula right now. What if you get #N/A? How do you fix it?
5. Why use INDEX-MATCH over VLOOKUP? Show me both formulas for this lookup.
6. What's COUNTIF vs SUMIF vs COUNTIFS? Write formulas for conditional sales totals.
7. How does Goal Seek work? Demo target revenue scenario on this data.
8. Apply conditional formatting to highlight top 10% sales performers. Which rule?
9. Build me a dynamic dashboard. How do slicers and timelines work together?
10. Explain SUMPRODUCT. Write formula for multi-condition sales sum.
11. What's Power Query? Show basic ETL steps for cleaning data.
12. Freeze panes vs split panesβwhen do you use each?
13. XLOOKUP vs VLOOKUP advantages? Write both for this example.
14. How do you find and fix circular references in formulas?
15. Create data validation dropdown + named ranges. Demo it.
βποΈ SQL Questions
16. Write query for 2nd highest salary from Employee table. Use subquery OR window function.
17. INNER JOIN vs LEFT JOIN vs FULL JOIN? Write examples for employees + departments.
18. Find and remove duplicate records. Use CTE + ROW_NUMBER() or GROUP BY.
19. WHERE vs HAVING with GROUP BY? Show department-wise avg salary > 50k.
20. RANK() vs DENSE_RANK() vs ROW_NUMBER()? Partition by dept, order by salary.
21. Top 5 products by total sales. Write complete query with GROUP BY + LIMIT.
22. Self-join for employee-manager hierarchy. Show employee name + manager name.
23. Handle NULL salaries. Use COALESCE, IS NULL, IFNULL examples.
24. Pivot sales data by month using CASE statements. Write query.
25. Subquery vs JOINβwhich is faster for this scenario? Why?
26. Recursive CTE for company hierarchy (CEO β managers β employees).
27. Clustered vs non-clustered indexes? When does each improve performance?
βπ¨ Tableau Questions
28. {FIXED [Region]: SUM([Sales])}βwhat's this LOD doing? Write region total ignoring filters.
29. Create dual-axis chart comparing sales vs profit trends. Exact steps?
30. Data blending vs joining? When do you use each approach?
31. Parameters vs filters? Write calculated field using parameter.
32. Build dashboard with filter action + highlight action. Demo flow.
33. % of total calculated field? Write formula for region sales %.
34. FIXED vs INCLUDE vs EXCLUDE LOD? Give 3 examples.
35. Tableau Extracts vs Live connection? Performance + refresh differences?
ββ‘ Power BI Questions
36. CALCULATE(SUM(Sales), SAMEPERIODLASTYEAR())βexplain this DAX. YoY growth?
37. Measures vs Calculated Columns? When do you use each? Write both.
38. Star schema vs Snowflake? Draw relationships for sales β products β customers.
39. Power Query: Write M code for custom column parsing dates.
40. Implement Row-Level Security (RLS). Show DAX for region manager filter.
41. DirectQuery vs Import mode? Pros/cons + when to choose each?
42. TOTALYTD(SUM(Sales))βexplain time intelligence DAX.
43. Dashboard loads slow. Optimization steps? Aggregations + query folding?
βπ Python/Pandas Questions
44. Group sales by region and sum: write pandas code. .reset_index()
45. pd.merge(df1, df2, on='ID', how='inner')βexplain all merge types.
46. Three ways to handle NaN values: fillna(), dropna(), interpolate().
47. loc[] vs iloc[]? Filter sales > 1000 by region vs first 5 rows.
48. pivot_table() vs groupby()? Reshape sales by month/product.
49. Read 1GB CSV without crashing: chunksize=10000 example.
50. df['New'] = df['Sales'].apply(lambda x: x*1.1)βalternatives to apply?
Double Tap β₯οΈ For More
βπ EXCEL Questions
1. Can you show me how you'd clean this messy dataset in Excel? What functions like TRIM or Remove Duplicates would you use?
2. What's the difference between absolute ($A$1) and relative (A1) references? When do you use each?
3. Walk me through creating a PivotTable to analyze sales by region and product. What are the exact steps?
4. Write a VLOOKUP formula right now. What if you get #N/A? How do you fix it?
5. Why use INDEX-MATCH over VLOOKUP? Show me both formulas for this lookup.
6. What's COUNTIF vs SUMIF vs COUNTIFS? Write formulas for conditional sales totals.
7. How does Goal Seek work? Demo target revenue scenario on this data.
8. Apply conditional formatting to highlight top 10% sales performers. Which rule?
9. Build me a dynamic dashboard. How do slicers and timelines work together?
10. Explain SUMPRODUCT. Write formula for multi-condition sales sum.
11. What's Power Query? Show basic ETL steps for cleaning data.
12. Freeze panes vs split panesβwhen do you use each?
13. XLOOKUP vs VLOOKUP advantages? Write both for this example.
14. How do you find and fix circular references in formulas?
15. Create data validation dropdown + named ranges. Demo it.
βποΈ SQL Questions
16. Write query for 2nd highest salary from Employee table. Use subquery OR window function.
17. INNER JOIN vs LEFT JOIN vs FULL JOIN? Write examples for employees + departments.
18. Find and remove duplicate records. Use CTE + ROW_NUMBER() or GROUP BY.
19. WHERE vs HAVING with GROUP BY? Show department-wise avg salary > 50k.
20. RANK() vs DENSE_RANK() vs ROW_NUMBER()? Partition by dept, order by salary.
21. Top 5 products by total sales. Write complete query with GROUP BY + LIMIT.
22. Self-join for employee-manager hierarchy. Show employee name + manager name.
23. Handle NULL salaries. Use COALESCE, IS NULL, IFNULL examples.
24. Pivot sales data by month using CASE statements. Write query.
25. Subquery vs JOINβwhich is faster for this scenario? Why?
26. Recursive CTE for company hierarchy (CEO β managers β employees).
27. Clustered vs non-clustered indexes? When does each improve performance?
βπ¨ Tableau Questions
28. {FIXED [Region]: SUM([Sales])}βwhat's this LOD doing? Write region total ignoring filters.
29. Create dual-axis chart comparing sales vs profit trends. Exact steps?
30. Data blending vs joining? When do you use each approach?
31. Parameters vs filters? Write calculated field using parameter.
32. Build dashboard with filter action + highlight action. Demo flow.
33. % of total calculated field? Write formula for region sales %.
34. FIXED vs INCLUDE vs EXCLUDE LOD? Give 3 examples.
35. Tableau Extracts vs Live connection? Performance + refresh differences?
ββ‘ Power BI Questions
36. CALCULATE(SUM(Sales), SAMEPERIODLASTYEAR())βexplain this DAX. YoY growth?
37. Measures vs Calculated Columns? When do you use each? Write both.
38. Star schema vs Snowflake? Draw relationships for sales β products β customers.
39. Power Query: Write M code for custom column parsing dates.
40. Implement Row-Level Security (RLS). Show DAX for region manager filter.
41. DirectQuery vs Import mode? Pros/cons + when to choose each?
42. TOTALYTD(SUM(Sales))βexplain time intelligence DAX.
43. Dashboard loads slow. Optimization steps? Aggregations + query folding?
βπ Python/Pandas Questions
44. Group sales by region and sum: write pandas code. .reset_index()
45. pd.merge(df1, df2, on='ID', how='inner')βexplain all merge types.
46. Three ways to handle NaN values: fillna(), dropna(), interpolate().
47. loc[] vs iloc[]? Filter sales > 1000 by region vs first 5 rows.
48. pivot_table() vs groupby()? Reshape sales by month/product.
49. Read 1GB CSV without crashing: chunksize=10000 example.
50. df['New'] = df['Sales'].apply(lambda x: x*1.1)βalternatives to apply?
Double Tap β₯οΈ For More
β€25
SQL CHEAT SHEETπ©βπ»
Here is a quick cheat sheet of some of the most essential SQL commands:
SELECT - Retrieves data from a database
UPDATE - Updates existing data in a database
DELETE - Removes data from a database
INSERT - Adds data to a database
CREATE - Creates an object such as a database or table
ALTER - Modifies an existing object in a database
DROP -Deletes an entire table or database
ORDER BY - Sorts the selected data in an ascending or descending order
WHERE β Condition used to filter a specific set of records from the database
GROUP BY - Groups a set of data by a common parameter
HAVING - Allows the use of aggregate functions within the query
JOIN - Joins two or more tables together to retrieve data
INDEX - Creates an index on a table, to speed up search times.
Here is a quick cheat sheet of some of the most essential SQL commands:
SELECT - Retrieves data from a database
UPDATE - Updates existing data in a database
DELETE - Removes data from a database
INSERT - Adds data to a database
CREATE - Creates an object such as a database or table
ALTER - Modifies an existing object in a database
DROP -Deletes an entire table or database
ORDER BY - Sorts the selected data in an ascending or descending order
WHERE β Condition used to filter a specific set of records from the database
GROUP BY - Groups a set of data by a common parameter
HAVING - Allows the use of aggregate functions within the query
JOIN - Joins two or more tables together to retrieve data
INDEX - Creates an index on a table, to speed up search times.
β€5
π Interviewer: How do you remove duplicate records in SQL?
π Me: We can remove duplicates using DISTINCT, GROUP BY, or delete duplicate rows using ROW_NUMBER().
β 1οΈβ£ Using DISTINCT (to fetch unique values)
π Returns unique records but does not delete duplicates.
β 2οΈβ£ Using GROUP BY (to identify duplicates)
π Helps find duplicate records.
β 3οΈβ£ Delete Duplicates Using ROW_NUMBER() (Most Important β)
(Keeps one record and deletes others)
π§ Logic Breakdown:
- DISTINCT β shows unique records
- GROUP BY β identifies duplicates
- ROW_NUMBER() β removes duplicates safely
β Use Case: Data cleaning, ETL processes, data quality checks.
π‘ Tip: Always take a backup before deleting duplicate records.
π¬ Tap β€οΈ for more!
π Me: We can remove duplicates using DISTINCT, GROUP BY, or delete duplicate rows using ROW_NUMBER().
β 1οΈβ£ Using DISTINCT (to fetch unique values)
SELECT DISTINCT column_name
FROM employees;
π Returns unique records but does not delete duplicates.
β 2οΈβ£ Using GROUP BY (to identify duplicates)
SELECT name, COUNT(*)
FROM employees
GROUP BY name
HAVING COUNT(*) > 1;
π Helps find duplicate records.
β 3οΈβ£ Delete Duplicates Using ROW_NUMBER() (Most Important β)
(Keeps one record and deletes others)
DELETE FROM employees
WHERE id IN (
SELECT id FROM (
SELECT id,
ROW_NUMBER() OVER (
PARTITION BY name, salary
ORDER BY id
) AS rn
FROM employees
) t
WHERE rn > 1
);
π§ Logic Breakdown:
- DISTINCT β shows unique records
- GROUP BY β identifies duplicates
- ROW_NUMBER() β removes duplicates safely
β Use Case: Data cleaning, ETL processes, data quality checks.
π‘ Tip: Always take a backup before deleting duplicate records.
π¬ Tap β€οΈ for more!
β€15
β
Excel Interview Questions with Answers ππΌ
1οΈβ£ How do you clean a messy dataset in Excel?
Steps:
- TRIM() β removes extra spaces
- CLEAN() β removes non-printable characters
- Remove Duplicates β Data β Remove Duplicates
- Text to Columns β split data
- Find & Replace (Ctrl+H) β fix values
- Filter β remove blanks or errors
2οΈβ£ Absolute vs Relative References
Relative (A1) β changes when copied
Absolute ($A$1) β stays fixed
When to use:
- Relative β normal calculations
- Absolute β fixed values (tax rate, constants)
3οΈβ£ Create PivotTable for Sales Analysis
Steps:
1. Select data
2. Insert β PivotTable
3. Drag: Region β Rows, Product β Columns, Sales β Values
Used for fast data summarization.
4οΈβ£ VLOOKUP Formula + #N/A Fix
Formula:
Fix #N/A:
- Check lookup value exists
- Match data types
Use:
5οΈβ£ INDEX-MATCH vs VLOOKUP
VLOOKUP:
INDEX-MATCH:
β Why INDEX-MATCH?
- Faster for large data
- Works left lookup
- More flexible
6οΈβ£ COUNTIF vs SUMIF vs COUNTIFS
COUNTIF β count condition
SUMIF β sum condition
COUNTIFS β multiple conditions
7οΈβ£ Goal Seek
Used for what-if analysis.
Steps:
1. Data β What-if Analysis β Goal Seek
2. Set cell β target value
3. Change variable cell
Example: target revenue calculation.
8οΈβ£ Conditional Formatting Top 10%
Steps: Select data
Home β Conditional Formatting
Top/Bottom Rules β Top 10%
9οΈβ£ Dynamic Dashboard + Slicers
Create PivotTable
Insert β Slicer
Insert β Timeline (for dates)
Connect slicers to multiple visuals
Used for interactive dashboards.
π SUMPRODUCT (Multi-condition sum)
Used for weighted or multiple-condition calculations.
1οΈβ£1οΈβ£ What is Power Query?
Excelβs ETL tool.
Steps:
- Get Data β Load data
- Remove columns
- Change types
- Remove duplicates
- Load cleaned data
Used for automation and transformation.
1οΈβ£2οΈβ£ Freeze Panes vs Split Panes
Freeze Panes β lock rows/columns while scrolling
Split Panes β divide screen into sections
1οΈβ£3οΈβ£ XLOOKUP vs VLOOKUP
XLOOKUP:
β Advantages:
- Left lookup
- No column index
- Default exact match
- Handles errors
1οΈβ£4οΈβ£ Circular References Fix
Occurs when formula refers to itself.
Fix:
Formulas β Error Checking β Circular References
Correct formula logic
1οΈβ£5οΈβ£ Data Validation + Named Range
Steps:
1. Formulas β Define Name
2. Data β Data Validation β List
3. Select named range
Used for dropdown lists.
Excel Resources: https://whatsapp.com/channel/0029VaifY548qIzv0u1AHz3i
Double Tap β₯οΈ For More
1οΈβ£ How do you clean a messy dataset in Excel?
Steps:
- TRIM() β removes extra spaces
=TRIM(A1)- CLEAN() β removes non-printable characters
=CLEAN(A1)- Remove Duplicates β Data β Remove Duplicates
- Text to Columns β split data
- Find & Replace (Ctrl+H) β fix values
- Filter β remove blanks or errors
2οΈβ£ Absolute vs Relative References
Relative (A1) β changes when copied
Absolute ($A$1) β stays fixed
When to use:
- Relative β normal calculations
- Absolute β fixed values (tax rate, constants)
3οΈβ£ Create PivotTable for Sales Analysis
Steps:
1. Select data
2. Insert β PivotTable
3. Drag: Region β Rows, Product β Columns, Sales β Values
Used for fast data summarization.
4οΈβ£ VLOOKUP Formula + #N/A Fix
Formula:
=VLOOKUP(A2, Sheet2!A:B, 2, FALSE)Fix #N/A:
- Check lookup value exists
- Match data types
Use:
=IFERROR(VLOOKUP(A2, A:B, 2, FALSE),"Not Found")5οΈβ£ INDEX-MATCH vs VLOOKUP
VLOOKUP:
=VLOOKUP(A2,A:B,2,FALSE)INDEX-MATCH:
=INDEX(B:B, MATCH(A2,A:A,0))β Why INDEX-MATCH?
- Faster for large data
- Works left lookup
- More flexible
6οΈβ£ COUNTIF vs SUMIF vs COUNTIFS
COUNTIF β count condition
=COUNTIF(A:A,"East")SUMIF β sum condition
=SUMIF(A:A,"East",B:B)COUNTIFS β multiple conditions
=COUNTIFS(A:A,"East",B:B,">500")7οΈβ£ Goal Seek
Used for what-if analysis.
Steps:
1. Data β What-if Analysis β Goal Seek
2. Set cell β target value
3. Change variable cell
Example: target revenue calculation.
8οΈβ£ Conditional Formatting Top 10%
Steps: Select data
Home β Conditional Formatting
Top/Bottom Rules β Top 10%
9οΈβ£ Dynamic Dashboard + Slicers
Create PivotTable
Insert β Slicer
Insert β Timeline (for dates)
Connect slicers to multiple visuals
Used for interactive dashboards.
π SUMPRODUCT (Multi-condition sum)
=SUMPRODUCT((A2:A10="East")(B2:B10>500)C2:C10)Used for weighted or multiple-condition calculations.
1οΈβ£1οΈβ£ What is Power Query?
Excelβs ETL tool.
Steps:
- Get Data β Load data
- Remove columns
- Change types
- Remove duplicates
- Load cleaned data
Used for automation and transformation.
1οΈβ£2οΈβ£ Freeze Panes vs Split Panes
Freeze Panes β lock rows/columns while scrolling
Split Panes β divide screen into sections
1οΈβ£3οΈβ£ XLOOKUP vs VLOOKUP
XLOOKUP:
=XLOOKUP(A2,A:A,B:B)β Advantages:
- Left lookup
- No column index
- Default exact match
- Handles errors
1οΈβ£4οΈβ£ Circular References Fix
Occurs when formula refers to itself.
Fix:
Formulas β Error Checking β Circular References
Correct formula logic
1οΈβ£5οΈβ£ Data Validation + Named Range
Steps:
1. Formulas β Define Name
2. Data β Data Validation β List
3. Select named range
Used for dropdown lists.
Excel Resources: https://whatsapp.com/channel/0029VaifY548qIzv0u1AHz3i
Double Tap β₯οΈ For More
β€13π1
SQL Interview Questions with Answers
β 16. Write a query to find the 2nd highest salary from Employee table using subquery OR window function.
β Using Subquery
β INNER JOIN β Only matching records
β Find Duplicates
π Difference
WHERE β filter before grouping
HAVING β filter after grouping
β Using COALESCE
JOIN is usually faster, subquery is easier to read.
β 26. Write a recursive CTE for company hierarchy (CEO β managers β employees).
β Clustered Index: physically sorts table data
β Non-Clustered Index: separate structure pointing to data
SQL Resources: https://whatsapp.com/channel/0029VanC5rODzgT6TiTGoa1v
Double Tap β₯οΈ For More
β 16. Write a query to find the 2nd highest salary from Employee table using subquery OR window function.
β Using Subquery
SELECT MAX(salary) AS second_highest_salaryβ Using Window Function
FROM employees
WHERE salary < (SELECT MAX(salary) FROM employees);
SELECT salaryβ 17. Explain INNER JOIN vs LEFT JOIN vs FULL JOIN with examples for employees and departments.
FROM (
SELECT salary, DENSE_RANK() OVER (ORDER BY salary DESC) AS rnk
FROM employees
) t
WHERE rnk = 2;
β INNER JOIN β Only matching records
SELECT e.name, d.department_nameβ LEFT JOIN β All employees + matching departments
FROM employees e
INNER JOIN departments d ON e.department_id = d.id;
SELECT e.name, d.department_nameβ FULL JOIN β All records from both tables
FROM employees e
LEFT JOIN departments d ON e.department_id = d.id;
SELECT e.name, d.department_nameβ 18. Find and remove duplicate records using CTE + ROW_NUMBER().
FROM employees e
FULL JOIN departments d ON e.department_id = d.id;
β Find Duplicates
WITH cte AS (β Remove Duplicates
SELECT *, ROW_NUMBER() OVER(PARTITION BY email ORDER BY id) rn
FROM employees
)
SELECT * FROM cte WHERE rn > 1;
WITH cte AS (β 19. Explain WHERE vs HAVING with GROUP BY. Show department-wise avg salary > 50k.
SELECT *, ROW_NUMBER() OVER(PARTITION BY email ORDER BY id) rn
FROM employees
)
DELETE FROM cte WHERE rn > 1;
π Difference
WHERE β filter before grouping
HAVING β filter after grouping
SELECT department_id, AVG(salary) AS avg_salaryβ 20. Explain RANK vs DENSE_RANK vs ROW_NUMBER partitioned by department ordered by salary.
FROM employees
GROUP BY department_id
HAVING AVG(salary) > 50000;
SELECT name, department_id, salary,β 21. Find top 5 products by total sales using GROUP BY + LIMIT.
ROW_NUMBER() OVER(PARTITION BY department_id ORDER BY salary DESC) rn,
RANK() OVER(PARTITION BY department_id ORDER BY salary DESC) rnk,
DENSE_RANK() OVER(PARTITION BY department_id ORDER BY salary DESC) drnk
FROM employees;
SELECT product_id, SUM(sales_amount) AS total_salesβ 22. Write a self join to show employee name and manager name.
FROM sales
GROUP BY product_id
ORDER BY total_sales DESC
LIMIT 5;
SELECT e.name AS employee, m.name AS managerβ 23. Handle NULL salaries using COALESCE, IS NULL, IFNULL.
FROM employees e
LEFT JOIN employees m ON e.manager_id = m.employee_id;
β Using COALESCE
SELECT name, COALESCE(salary, 0) AS salaryβ Using IS NULL
FROM employees;
SELECT * FROM employees WHERE salary IS NULL;β 24. Pivot sales data by month using CASE statement.
SELECTβ 25. Subquery vs JOIN β which is faster? Why?
SUM(CASE WHEN month = 'Jan' THEN sales ELSE 0 END) AS Jan,
SUM(CASE WHEN month = 'Feb' THEN sales ELSE 0 END) AS Feb,
SUM(CASE WHEN month = 'Mar' THEN sales ELSE 0 END) AS Mar
FROM sales;
JOIN is usually faster, subquery is easier to read.
β 26. Write a recursive CTE for company hierarchy (CEO β managers β employees).
WITH RECURSIVE emp_hierarchy AS (β 27. Explain clustered vs non-clustered indexes. When to use each?
SELECT employee_id, name, manager_id
FROM employees
WHERE manager_id IS NULL
UNION ALL
SELECT e.employee_id, e.name, e.manager_id
FROM employees e
JOIN emp_hierarchy h ON e.manager_id = h.employee_id
)
SELECT * FROM emp_hierarchy;
β Clustered Index: physically sorts table data
β Non-Clustered Index: separate structure pointing to data
SQL Resources: https://whatsapp.com/channel/0029VanC5rODzgT6TiTGoa1v
Double Tap β₯οΈ For More
β€14
β
π€ AβZ of Data Analyst Terms ππ»π
A β A/B Testing
Experiment comparing two versions to see which performs better.
B β Business Intelligence (BI)
Technologies and processes for analyzing business data.
C β Correlation
Measure of relationship between two variables.
D β Data Cleaning
Process of fixing or removing incorrect/incomplete data.
E β ETL (Extract, Transform, Load)
Process of moving and preparing data for analysis.
F β Forecasting
Predicting future trends based on historical data.
G β Granularity
Level of detail in data (daily, monthly, yearly).
H β Hypothesis
Assumption made for testing using data.
I β Insight
Meaningful interpretation derived from data analysis.
J β Join
Combining data from multiple tables.
K β KPI (Key Performance Indicator)
Metric used to measure performance.
L β Linear Regression
Statistical method to model relationship between variables.
M β Metrics
Quantifiable measures used to track performance.
N β Normalization
Organizing data to reduce redundancy.
O β Outlier
Data point significantly different from others.
P β Pivot Table
Tool to summarize and analyze data.
Q β Query
Request to retrieve specific data.
R β Regression Analysis
Technique for predicting relationships between variables.
S β Segmentation
Dividing data into groups for analysis.
T β Trend Analysis
Identifying patterns over time.
U β Unstructured Data
Data without predefined format (text, images).
V β Visualization
Presenting data graphically (charts, dashboards).
W β Warehouse (Data Warehouse)
Central repository for integrated data.
X β X-Axis
Horizontal axis in charts.
Y β YoY (Year-over-Year)
Comparison of metrics from one year to another.
Z β Z-Score
Statistical measurement of how far a value is from mean.
β€οΈ Double Tap for More
A β A/B Testing
Experiment comparing two versions to see which performs better.
B β Business Intelligence (BI)
Technologies and processes for analyzing business data.
C β Correlation
Measure of relationship between two variables.
D β Data Cleaning
Process of fixing or removing incorrect/incomplete data.
E β ETL (Extract, Transform, Load)
Process of moving and preparing data for analysis.
F β Forecasting
Predicting future trends based on historical data.
G β Granularity
Level of detail in data (daily, monthly, yearly).
H β Hypothesis
Assumption made for testing using data.
I β Insight
Meaningful interpretation derived from data analysis.
J β Join
Combining data from multiple tables.
K β KPI (Key Performance Indicator)
Metric used to measure performance.
L β Linear Regression
Statistical method to model relationship between variables.
M β Metrics
Quantifiable measures used to track performance.
N β Normalization
Organizing data to reduce redundancy.
O β Outlier
Data point significantly different from others.
P β Pivot Table
Tool to summarize and analyze data.
Q β Query
Request to retrieve specific data.
R β Regression Analysis
Technique for predicting relationships between variables.
S β Segmentation
Dividing data into groups for analysis.
T β Trend Analysis
Identifying patterns over time.
U β Unstructured Data
Data without predefined format (text, images).
V β Visualization
Presenting data graphically (charts, dashboards).
W β Warehouse (Data Warehouse)
Central repository for integrated data.
X β X-Axis
Horizontal axis in charts.
Y β YoY (Year-over-Year)
Comparison of metrics from one year to another.
Z β Z-Score
Statistical measurement of how far a value is from mean.
β€οΈ Double Tap for More
β€17π1