@Codingdidi
9.18K subscribers
26 photos
7 videos
47 files
260 links
Free learning Resources For Data Analysts, Data science, ML, AI, GEN AI and Job updates, career growth, Tech updates
Download Telegram
Gartner is hiring Associate Data Scientist πŸš€

Experience : 0-3 Years
Location : Gurugram

Apply link : https://gartner.wd5.myworkdayjobs.com/EXT/job/Gurgaon/Associate-Data-Scientist_101739-1/apply?source=JB-10120
❀2
American Express is hiring Analyst πŸš€

Min. Experience : 1 Year
Location : Gurugram

Apply link : https://aexp.eightfold.ai/careers/job/30504056?hl=en&utm_source=linkedin&domain=aexp.com
❀4
Here are some essential SQL tips for beginners πŸ‘‡πŸ‘‡

β—† Primary Key = Unique Key + Not Null constraint
β—† To perform case insensitive search use UPPER() function ex. UPPER(customer_name) LIKE β€˜A%A’
β—† LIKE operator is for string data type
β—† COUNT(*), COUNT(1), COUNT(0) all are same
β—† All aggregate functions ignore the NULL values
β—† Aggregate functions MIN, MAX, SUM, AVG, COUNT are for int data type whereas STRING_AGG is for string data type
β—† For row level filtration use WHERE and aggregate level filtration use HAVING
β—† UNION ALL will include duplicates where as UNION excludes duplicates 
β—† If the results will not have any duplicates, use UNION ALL instead of UNION
β—† We have to alias the subquery if we are using the columns in the outer select query
β—† Subqueries can be used as output with NOT IN condition.
β—† CTEs look better than subqueries. Performance wise both are same.
β—† When joining two tables , if one table has only one value then we can use 1=1 as a condition to join the tables. This will be considered as CROSS JOIN.
β—† Window functions work at ROW level.
β—† The difference between RANK() and DENSE_RANK() is that RANK() skips the rank if the values are the same.
β—† EXISTS works on true/false conditions. If the query returns at least one value, the condition is TRUE. All the records corresponding to the conditions are returned.

Like for more πŸ˜„πŸ˜„

@codingdidi
❀5
Q. Explain the data preprocessing steps in data analysis.

Ans. Data preprocessing transforms the data into a format that is more easily and effectively processed in data mining, machine learning and other data science tasks.
1. Data profiling.
2. Data cleansing.
3. Data reduction.
4. Data transformation.
5. Data enrichment.
6. Data validation.

Q. What Are the Three Stages of Building a Model in Machine Learning?

Ans. The three stages of building a machine learning model are:

Model Building: Choosing a suitable algorithm for the model and train it according to the requirement

Model Testing: Checking the accuracy of the model through the test data

Applying the Model: Making the required changes after testing and use the final model for real-time projects


Q. What are the subsets of SQL?

Ans. The following are the four significant subsets of the SQL:

Data definition language (DDL): It defines the data structure that consists of commands like CREATE, ALTER, DROP, etc.

Data manipulation language (DML): It is used to manipulate existing data in the database. The commands in this category are SELECT, UPDATE, INSERT, etc.

Data control language (DCL): It controls access to the data stored in the database. The commands in this category include GRANT and REVOKE.

Transaction Control Language (TCL): It is used to deal with the transaction operations in the database. The commands in this category are COMMIT, ROLLBACK, SET TRANSACTION, SAVEPOINT, etc.


Q. What is a Parameter in Tableau? Give an Example.

Ans. A parameter is a dynamic value that a customer could select, and you can use it to replace constant values in calculations, filters, and reference lines.
For example, when creating a filter to show the top 10 products based on total profit instead of the fixed value, you can update the filter to show the top 10, 20, or 30 products using a parameter.
❀4
IBM Summer Internship Program!
Position: Research Intern - AI
Qualifications: Bachelor’s Degree
Salary: 30K - 50K Per Month (Expected)
Batch: 2024/ 2025/ 2026/ 2027
Experience: Freshers
Location: Bangalore; Gurgaon, India (Hybrid)

πŸ“ŒApply Now: https://ibmglobal.avature.net/en_US/careers/JobDetail?jobId=59041&source=WEB_Search_INDIA

All the best πŸ‘πŸ‘
❀2
Template to ask for referrals
(For freshers)
πŸ‘‡πŸ‘‡

Hi [Name],

I hope this message finds you well.

My name is [Your Name], and I recently graduated with a degree in [Your Degree] from [Your University]. I am passionate about data analytics and have developed a strong foundation through my coursework and practical projects.
I am currently seeking opportunities to start my career as a Data Analyst and came across the exciting roles at [Company Name].

I am reaching out to you because I admire your professional journey and expertise in the field of data analytics. Your role at [Company Name] is particularly inspiring, and I am very interested in contributing to such an innovative and dynamic team.

I am confident that my skills and enthusiasm would make me a valuable addition to this role [Job ID / Link]. If possible, I would be incredibly grateful for your referral or any advice you could offer on how to best position myself for this opportunity.

Thank you very much for considering my request. I understand how busy you must be and truly appreciate any assistance you can provide.

Best regards,
[Your Full Name]
[Your Email Address]
❀4
Data Engineering Roadmap for Beginners (2025)

> Language β†’ Python + SQL.
> OS Basics β†’ Linux + Bash + Git.
> Data Modeling β†’ Normalization + Star/Snowflake Schema.
> Databases β†’ PostgreSQL + MySQL + MongoDB.
> Data Warehousing β†’ Snowflake + BigQuery + Redshift.
> Data Processing β†’ Apache Spark + PySpark.
> Workflow Orchestration β†’ Airflow + Prefect.
> Data Lakes β†’ Delta Lake + Apache Hudi + Iceberg.

> Streaming β†’ Kafka + Flink
> Cloud Platforms β†’ AWS (S3, Glue, EMR) / GCP (GCS, Dataflow, BigQuery) / Azure (Data Factory, Synapse).
> Data Quality/Validation β†’ Great Expectations.
> Containerization β†’ Docker + Kubernetes.
> Infra as Code β†’ Terraform.
> Visualization β†’ dbt + Looker/PowerBI/Tableau.
❀5
βœ… Step-by-Step Approach to Learn Data Analytics πŸ“ˆπŸ§ 

➊ Excel Fundamentals:
βœ” Master formulas, pivot tables, data validation, charts, and graphs.

βž‹ SQL Basics:
βœ” Learn to query databases, use SELECT, FROM, WHERE, JOIN, GROUP BY, and aggregate functions.

➌ Data Visualization:
βœ” Get proficient with tools like Tableau or Power BI to create insightful dashboards.

➍ Statistical Concepts:
βœ” Understand descriptive statistics (mean, median, mode), distributions, and hypothesis testing.

➎ Data Cleaning & Preprocessing:
βœ” Learn how to handle missing data, outliers, and data inconsistencies.

➏ Exploratory Data Analysis (EDA):
βœ” Explore datasets, identify patterns, and formulate hypotheses.

➐ Python for Data Analysis (Optional but Recommended):
βœ” Learn Pandas and NumPy for data manipulation and analysis.

βž‘ Real-World Projects:
βœ” Analyze datasets from Kaggle, UCI Machine Learning Repository, or your own collection.

βž’ Business Acumen:
βœ” Understand key business metrics and how data insights impact business decisions.

βž“ Build a Portfolio:
βœ” Showcase your projects on GitHub, Tableau Public, or a personal website. Highlight the impact of your analysis.

πŸ‘ Tap ❀️ for more!
❀7