Data Analytics
108K subscribers
126 photos
2 files
791 links
Perfect channel to learn Data Analytics

Learn SQL, Python, Alteryx, Tableau, Power BI and many more

For Promotions: @coderfun @love_data
Download Telegram
Which of the following is DML command in SQL?
Anonymous Quiz
66%
INSERT
22%
CREATE
7%
NOTIFY
6%
NULL
πŸ‘17πŸ₯°2
Which of the following function is used to return length of the string?
Anonymous Quiz
79%
LEN()
3%
LEFT()
1%
RIGHT()
17%
STRINGLENGTH()
πŸ‘11πŸ‘2😁1
SQL Notes PART-4
Data Wrangling functions on string data types πŸ˜„πŸ‘
πŸ‘24❀2πŸ‘2🀯2πŸ‘Ž1
SQL Notes Part-5
(DATE data type Functions)
πŸ‘18πŸ‘Ž1
Data Analytics
SQL Notes Part-5 (DATE data type Functions)
Do you need more notes like this?
Anonymous Poll
94%
Yes
6%
No, don't need SQL notes anymore
πŸ‘19❀4
Data Analytics
Nice to see amazing response for SQL, Power BI, Statistics, Python & Excel :)
Data Analysis with Excel
πŸ‘‡πŸ‘‡
https://t.me/excel_analyst/2

Power BI DAX Functions
πŸ‘‡πŸ‘‡
https://t.me/PowerBI_analyst/2

All about SQL
πŸ‘‡πŸ‘‡
https://t.me/sqlanalyst/29

Python for data analysis
πŸ‘‡πŸ‘‡
https://t.me/pythonanalyst/26

Statistics Book and other useful resources
πŸ‘‡πŸ‘‡
https://t.me/DataAnalystInterview/34

Join channel as per your interest :)
πŸ‘51❀15πŸ‘7πŸ”₯2πŸ₯°2😁1
Do you also need data analysis projects and more free courses to be shared in this channel?
Anonymous Poll
98%
Yes
2%
Not needed as of now
πŸ‘33πŸ‘5
SQL Interview Questions Part-1

Like if you want more Interview Questions on Data analytics, SQL, R, Python & Power BI πŸ˜„πŸ‘
πŸ‘75❀7πŸ‘5
Which of the following function is used to capitalize first letter in SQL?
Anonymous Quiz
53%
UPPER()
29%
INITCAP()
1%
LOWER()
17%
FIRSTLETTERCAPITAL()
πŸ‘32❀1πŸ”₯1
πŸ”Ÿ Project Ideas for a data analyst

Customer Segmentation: Analyze customer data to segment them based on their behaviors, preferences, or demographics, helping businesses tailor their marketing strategies.

Churn Prediction: Build a model to predict customer churn, identifying factors that contribute to churn and proposing strategies to retain customers.

Sales Forecasting: Use historical sales data to create a predictive model that forecasts future sales, aiding inventory management and resource planning.

Market Basket Analysis: Analyze
transaction data to identify associations between products often purchased together, assisting retailers in optimizing product placement and cross-selling.

Sentiment Analysis: Analyze social media or customer reviews to gauge public sentiment about a product or service, providing valuable insights for brand reputation management.

Healthcare Analytics: Examine medical records to identify trends, patterns, or correlations in patient data, aiding in disease prediction, treatment optimization, and resource allocation.

Financial Fraud Detection: Develop algorithms to detect anomalous transactions and patterns in financial data, helping prevent fraud and secure transactions.

A/B Testing Analysis: Evaluate the results of A/B tests to determine the effectiveness of different strategies or changes on websites, apps, or marketing campaigns.

Energy Consumption Analysis: Analyze energy usage data to identify patterns and inefficiencies, suggesting strategies for optimizing energy consumption in buildings or industries.

Real Estate Market Analysis: Study housing market data to identify trends in property prices, rental rates, and demand, assisting buyers, sellers, and investors in making informed decisions.

Remember to choose a project that aligns with your interests and the domain you're passionate about.

Data Analyst Roadmap

https://t.me/sqlspecialist/379

ENJOY LEARNING πŸ‘πŸ‘
πŸ‘48❀12πŸ”₯2πŸ₯°1πŸ‘1
1. What is concurrency control in DBMS?
This is a process of managing simultaneous operations in a database so that database integrity is not compromised. The following are the two approaches involved in concurrency control:
Optimistic approach – Involves versioning
Pessimistic approach – Involves locking

2. What is a checkpoint in DBMS and when does it occur?
A checkpoint is a mechanism where all the previous logs are removed from the system and are permanently stored on the storage disk. So, basically, checkpoints are those points from where the transaction log record can be used to recover all the committed data up to the point of crash.

3. What are groups in Tableau?
A group is a combination of dimension members that make higher level categories. For example, if you are working with a view that shows average test scores by major, you may want to group certain majors together to create major categories.

4. How are nested IF statements used in Excel?
The function IF() can be nested when we have multiple conditions to meet. The FALSE value in the first IF function is replaced by another IF function to make a further test.

5. What is a Recursive Stored Procedure?

A stored procedure that calls itself until a boundary condition is reached, is called a recursive stored procedure. This recursive function helps the programmers to deploy the same set of code several times as and when required.
πŸ‘25❀10πŸ”₯1πŸ‘1
Which of the following form should not have any transitive dependency in SQL?
Anonymous Quiz
41%
1st normal form
30%
2nd normal form
28%
3rd normal form
😁12πŸ‘9πŸ‘6
1. What are the different subsets of SQL?

Data Definition Language (DDL) – It allows you to perform various operations on the database such as CREATE, ALTER, and DELETE objects.
Data Manipulation Language(DML) – It allows you to access and manipulate data. It helps you to insert, update, delete and retrieve data from the database.
Data Control Language(DCL) – It allows you to control access to the database. Example – Grant, Revoke access permissions.

2. List the different types of relationships in SQL.

There are different types of relations in the database:
One-to-One – This is a connection between two tables in which each record in one table corresponds to the maximum of one record in the other.
One-to-Many and Many-to-One – This is the most frequent connection, in which a record in one table is linked to several records in another.
Many-to-Many – This is used when defining a relationship that requires several instances on each sides.
Self-Referencing Relationships – When a table has to declare a connection with itself, this is the method to employ.

3. How to create empty tables with the same structure as another table?

To create empty tables:
Using the INTO operator to fetch the records of one table into a new table while setting a WHERE clause to false for all entries, it is possible to create empty tables with the same structure. As a result, SQL creates a new table with a duplicate structure to accept the fetched entries, but nothing is stored into the new table since the WHERE clause is active.

4. What is Normalization and what are the advantages of it?

Normalization in SQL is the process of organizing data to avoid duplication and redundancy. Some of the advantages are:
Better Database organization
More Tables with smaller rows
Efficient data access
Greater Flexibility for Queries
Quickly find the information
Easier to implement Security
πŸ‘38❀10πŸ”₯1
TOP 10 SQL Concepts for Job Interview

1. Aggregate Functions (SUM/AVG)
2. Group By and Order By
3. JOINs (Inner/Left/Right)
4. Union and Union All
5. Date and Time processing
6. String processing
7. Window Functions (Partition by)
8. Subquery
9. View and Index
10. Common Table Expression (CTE)


TOP 10 Statistics Concepts for Job Interview

1. Sampling
2. Experiments (A/B tests)
3. Descriptive Statistics
4. p-value
5. Probability Distributions
6. t-test
7. ANOVA
8. Correlation
9. Linear Regression
10. Logistics Regression


TOP 10 Python Concepts for Job Interview

1. Reading data from file/table
2. Writing data to file/table
3. Data Types
4. Function
5. Data Preprocessing (numpy/pandas)
6. Data Visualisation (Matplotlib/seaborn/bokeh)
7. Machine Learning (sklearn)
8. Deep Learning (Tensorflow/Keras/PyTorch)
9. Distributed Processing (PySpark)
10. Functional and Object Oriented Programming
πŸ‘71❀7πŸ”₯2
Coding and Aptitude Round before interview

Coding challenges are meant to test your coding skills (especially if you are applying for ML engineer role). The coding challenges can contain algorithm and data structures problems of varying difficulty. These challenges will be timed based on how complicated the questions are. These are intended to test your basic algorithmic thinking.
Sometimes, a complicated data science question like making predictions based on twitter data are also given. These challenges are hosted on HackerRank, HackerEarth, CoderByte etc. In addition, you may even be asked multiple-choice questions on the fundamentals of data science and statistics. This round is meant to be a filtering round where candidates whose fundamentals are little shaky are eliminated. These rounds are typically conducted without any manual intervention, so it is important to be well prepared for this round.

Sometimes a separate Aptitude test is conducted or along with the technical round an aptitude test is also conducted to assess your aptitude skills. A Data Scientist is expected to have a good aptitude as this field is continuously evolving and a Data Scientist encounters new challenges every day. If you have appeared for GMAT / GRE or CAT, this should be easy for you.

Resources for Prep:

For algorithms and data structures prep,Leetcode and Hackerrank are good resources.

For aptitude prep, you can refer to IndiaBixand Practice Aptitude.

With respect to data science challenges, practice well on GLabs and Kaggle.

Brilliant is an excellent resource for tricky math and statistics questions.

For practising SQL, SQL Zoo and Mode Analytics are good resources that allow you to solve the exercises in the browser itself.

Things to Note:

Ensure that you are calm and relaxed before you attempt to answer the challenge. Read through all the questions before you start attempting the same. Let your mind go into problem-solving mode before your fingers do!

In case, you are finished with the test before time, recheck your answers and then submit.

Sometimes these rounds don’t go your way, you might have had a brain fade, it was not your day etc. Don’t worry! Shake if off for there is always a next time and this is not the end of the world.
πŸ‘25❀5πŸ‘2
Career Path for a Data Analyst

Education: Start by earning a bachelor's degree in fields like math, stats, economics, or computer science.

Skills Growth: Learn programming (Python/R), data tools (SQL/Excel), and visualization. Master data analysis basics.

Entry-Level Role: Begin as a Junior Data Analyst. Learn data cleaning, organization, and basic analysis.

Specialization: Deepen your expertise in a specific industry. Explore advanced analytics and visualization tools.

Advanced Analytics: Move up to Senior Data Analyst. Tackle complex projects and predictive modeling.

Machine Learning: Explore machine learning and data modeling techniques. Familiarize yourself with algorithms, and learn how to implement predictive and classification models.

Domain Expertise: Develop expertise in a particular industry, such as healthcare, finance, e-commerce, etc. This knowledge will enable you to provide more valuable insights from data.

Leadership Roles: As you gain experience, you can move into roles like Data Analytics Manager or Data Science Manager, where you'll oversee teams and projects.

Continuous Learning: Stay updated with the latest tools, techniques, and industry trends. Attend workshops, conferences, and online courses to keep your skills relevant.

Networking: Build a strong professional network within the data analytics community. This can open up opportunities and help you stay informed about industry developments.

Remember, your career path can be personalized based on your interests and strengths. Continuous learning and adaptability are key in the ever-evolving field of data analysis :)
πŸ‘48❀18🍌5πŸ‘¨β€πŸ’»3πŸ”₯2πŸ€”2
Top 10 interview questions for Tableau with answers

What is Tableau and what are its key features?
Tableau is a data visualization tool that allows users to create interactive and shareable dashboards. Key features include drag-and-drop functionality, real-time data connection, and the ability to handle large datasets.

Explain the difference between Tableau Desktop, Tableau Server, and Tableau Online.
Tableau Desktop is for creating and editing visualizations. Tableau Server is for sharing and collaborating on Tableau content within an organization. Tableau Online is a cloud-based version of Tableau Server.

How do you connect to data sources in Tableau, and what types of data sources does Tableau support?
You can connect to data sources in Tableau by selecting "Data" and choosing your source. Tableau supports a wide range of data sources, including databases, spreadsheets, and web data connectors.

What is a dimension and a measure in Tableau? Provide examples.
Dimensions are categorical data that define the structure of the data, like "Product Category" or "Customer Name." Measures are numerical data that can be aggregated, like "Sales" or "Profit."

Describe the difference between a calculated field and a table calculation in Tableau.
A calculated field is a new field created by a user-defined calculation. A table calculation is a computation performed on the data within a visualization, like running totals or percent of total.

What are the various types of joins in Tableau, and when would you use them?
Tableau supports inner, left, right, and full outer joins. Use them based on your data relationship requirements, like combining data from two tables with common keys.

How can you create a dashboard in Tableau, and what elements can be added to a dashboard?
To create a dashboard, drag sheets, objects, and filters onto the dashboard canvas. Elements that can be added include charts, images, text, and web content.

Explain the concept of data blending in Tableau and when it is necessary.
Data blending is used when data comes from different data sources, and you need to combine them in a single visualization. It's necessary when a simple join isn't possible.

What are the different ways to share Tableau visualizations and reports with others?
You can share Tableau workbooks via Tableau Server, Tableau Online, or by exporting them as PDFs or images. You can also create Tableau Public dashboards for sharing publicly.

How can you optimize the performance of a Tableau workbook?
To optimize performance, minimize the use of live connections, limit the data displayed, optimize calculations, use extracts, and consider hardware and server resources.

These answers should give you a good foundation for discussing Tableau in an interview. Remember to expand on these answers based on your experience and knowledge by giving personalized examples :)
πŸ‘36❀7πŸ₯°1
Top 10 Power BI Interview Questions with Answers

What is Power BI, and what are its main components?
Power BI is a business analytics tool by Microsoft. Its main components include Power Query, Power Pivot, Power View, and Power Map.

How do you import data into Power BI?
Data can be imported into Power BI through various sources like Excel files, databases, web services, or by using Power Query to transform and load data.

What is a Power BI Desktop and how does it differ from Power BI Service?
Power BI Desktop is used for creating reports and dashboards, while Power BI Service is a cloud-based platform for publishing, sharing, and collaborating on reports created in Power BI Desktop.

Explain the difference between a measure and a calculated column in Power BI.
A measure is a calculation performed on the fly, often used in visualizations. A calculated column is a column added to a table with a calculated formula and is precomputed during data load.

What is the purpose of DAX (Data Analysis Expressions) in Power BI?
DAX is a formula language used for creating custom calculations, aggregations, and calculated columns in Power BI. It is essential for building complex calculations.

How can you create relationships between tables in Power BI, and why is it important?
Relationships are established by linking common fields between tables. They are crucial for enabling data integration and accurate calculations in Power BI reports.

What are the differences between a slicer and a filter in Power BI?
Slicers are visual elements that allow users to make selections, affecting visuals on a report page. Filters, on the other hand, affect the entire report by restricting data based on certain criteria.

Explain the difference between a calculated column and a measure in Power BI.
A calculated column is computed at the row level and can be used in visuals like any other column. Measures are computed at the aggregate level and are used in calculations.

What is the role of Power Query in Power BI, and how does it help in data transformation?
Power Query is used for data transformation and cleaning. It helps import, transform, and load data into Power BI by shaping and structuring it for analysis.

How can you share Power BI reports with others, and what are the sharing options available?
You can share Power BI reports by publishing them to the Power BI Service, sharing them with specific users, or embedding them in websites and applications. Sharing options can be controlled via workspace permissions.

These questions and answers should help you prepare for a Power BI interview as a data analyst. Be ready to expand on these responses based on your experience and knowledge :)
❀30πŸ‘19πŸ₯°2πŸ”₯1
Top 15 advanced Power BI interview questions

1. Explain the concept of row-level security in Power BI and how to implement it.

2. What are calculated tables in Power BI, and when would you use them?

3. Describe the differences between DirectQuery, Live Connection, and Import Data storage modes in Power BI.

4. How can you optimize the performance of a Power BI report or dashboard with large datasets?

5. What is the DAX language, and how is it used in Power BI? Provide an example of a complex DAX calculation.

6. Explain the role of Power Query in data transformation within Power BI. What are some common data cleansing techniques in Power Query?

7. What is the purpose of the Power BI Data Model, and how do relationships between tables impact report development?

8. How can you create custom visuals or extensions in Power BI? Provide an example of when you would use custom visuals.

9. Describe the steps involved in setting up Power BI Gateway and its significance in a corporate environment.

10. What are the differences between Power BI Desktop, Power BI Service, and Power BI Mobile? How do they work together in a typical Power BI workflow?

11. Discuss the process of incremental data refresh in Power BI and its benefits.

12. How can you implement dynamic security roles in Power BI, and why might you need them in a multi-user environment?

13. What are Power BI paginated reports, and when would you choose to use them over standard interactive reports?

14. Explain the concept of drill-through in Power BI, including its configuration and use cases.

15. How can you integrate Power BI with other Microsoft products, such as Azure Data Lake Storage or SharePoint?

Like this post if you want the answers in next post β€οΈπŸ‘
πŸ‘53❀8πŸ”₯4
Here are brief answers to the above advanced Power BI interview questions:

Row-level security in Power BI is a feature that allows you to restrict data access at a granular level based on user roles. To implement it, you define roles and their corresponding filters in Power BI Desktop. These filters are then applied to the data model, ensuring that users only see the data that's relevant to their role. It's commonly used in scenarios where different users should have access to different subsets of the same dataset.

Calculated tables in Power BI are tables created by defining a DAX formula. They don't exist in the data source but are generated within Power BI based on the formula you specify. Calculated tables are useful when you need to create custom tables with calculated values or to simplify complex data models. They can improve performance by precalculating values.

DirectQuery, Live Connection, and Import Data are storage modes in Power BI. DirectQuery connects directly to the data source for real-time querying, suitable for large datasets that shouldn't be imported. Live Connection connects to a dataset hosted in Power BI Service, ideal for collaborative report development. Import Data loads data into Power BI for fast performance but may not be suitable for large datasets due to storage limitations.

To optimize performance with large datasets in Power BI, you can:
Use data compression techniques.
Limit unnecessary data columns.
Optimize DAX calculations.
Use summary tables.
Implement data partitioning and incremental refresh.

DAX (Data Analysis Expressions) is a formula language used in Power BI for creating custom calculations and aggregations. An example of a complex DAX calculation might be calculating a moving average of sales over a rolling 3-month period, involving functions like SUMX, FILTER, and DATESINPERIOD.

Power Query is used for data transformation. Common data cleansing techniques include removing duplicates, handling missing values, and transforming data types.

The Power BI Data Model defines relationships between tables, which impact how data is retrieved and displayed in reports. Properly defining relationships is crucial for report development.

Custom visuals or extensions in Power BI are created using tools like Power BI Visuals SDK or Charticulator. They are used to create custom visualizations beyond the built-in visuals.

Power BI Gateway is used to connect on-premises data sources to Power BI Service. It's important for refreshing data from on-premises sources in the cloud.

Power BI Desktop is used for report creation, Power BI Service for sharing and collaboration, and Power BI Mobile for accessing reports on mobile. They work together to create an end-to-end BI solution.

Incremental data refresh is a feature that allows you to refresh only new or changed data, reducing data refresh time and resource usage.

Dynamic security roles in Power BI are based on DAX expressions and allow data access control based on user-specific criteria, such as region or department.

Paginated reports are used for pixel-perfect, printable reports. They are suitable when precise formatting and printing are required.

Drill-through in Power BI enables users to explore details by clicking on data points. You configure it by defining drill-through fields and target pages.

Power BI can be integrated with other Microsoft products using connectors or APIs. For example, you can connect to Azure Data Lake Storage for data storage or embed reports in SharePoint for collaboration.

These answers provide a brief overview of the topics covered in the questions. In interviews, candidates should provide more detailed and practical responses based on their experience and knowledge :)
πŸ‘19❀11πŸ‘1