How Do We Compute PCA Manually, and Why Should We Know It?
In data science, machine learning, and statistics, Principal Component Analysis (PCA) is a dimensionality-reduction method: it transforms a large set of variables into a smaller one that still contains most of the information in the original set.
Reducing the number of variables in a data set naturally comes at the expense of some accuracy, but the trick in dimensionality reduction is to trade a little accuracy for simplicity. Smaller data sets are easier to explore and visualize, and machine learning algorithms can analyze them faster because there are no extraneous variables to process.
PCA finds the directions of maximal variance in the data, and those directions are mutually orthogonal. This makes PCA a global algorithm: every direction, and every new feature built from it, must satisfy a single global constraint, namely mutual orthogonality.
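In symbols, writing Σ for the covariance matrix of the standardized data, the i-th principal direction wᵢ solves a constrained variance-maximization problem:

maximize wᵢᵀ Σ wᵢ   subject to   ‖wᵢ‖ = 1   and   wᵢᵀ wⱼ = 0 for all j < i

The solutions are exactly the eigenvectors of Σ, ordered by decreasing eigenvalue, which is why the recipe below revolves around an eigen-decomposition of the covariance matrix.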
Let's see how we can manually compute PCA given some random table of values (see the illustration); a NumPy sketch of the same steps follows the list.
Step 1: Standardize the dataset.
Step 2: Calculate the covariance matrix of the standardized features.
Step 3: Calculate the eigenvalues of the covariance matrix (the roots of its characteristic polynomial).
Step 4: Sort the eigenvalues in descending order.
Step 5: Calculate the eigenvector for each eigenvalue, e.g., using Cramer's rule.
Step 6: Build the eigenvector matrix.
Step 7: Pick the top k eigenvalues and form the matrix of their eigenvectors.
Step 8: Transform the original (standardized) matrix.
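Here is a minimal NumPy sketch of those eight steps. The values in X are made up as a stand-in for the table in the illustration, and np.linalg.eigh is used for the eigen-decomposition rather than working through the characteristic polynomial and Cramer's rule by hand:

```python
import numpy as np

# Made-up stand-in for the table of values in the illustration:
# 4 samples (rows) x 3 features (columns).
X = np.array([
    [2.5, 2.4, 0.5],
    [0.5, 0.7, 1.1],
    [2.2, 2.9, 0.8],
    [1.9, 2.2, 1.3],
])

# Step 1: standardize each feature (zero mean, unit variance).
X_std = (X - X.mean(axis=0)) / X.std(axis=0)

# Step 2: covariance matrix of the standardized features (3 x 3).
cov = np.cov(X_std, rowvar=False)

# Step 3: eigenvalues of the symmetric covariance matrix.
# Step 5 is folded in here: eigh also returns the eigenvector for each
# eigenvalue, so we skip solving the linear systems via Cramer's rule.
eigvals, eigvecs = np.linalg.eigh(cov)

# Step 4: sort eigenvalues (and matching eigenvectors) in descending order.
order = np.argsort(eigvals)[::-1]
eigvals, eigvecs = eigvals[order], eigvecs[:, order]

# Step 6: eigvecs is now the eigenvector matrix, one eigenvector per column.

# Step 7: keep the top-k eigenvectors (here k = 2).
k = 2
W = eigvecs[:, :k]

# Step 8: project the standardized data onto the k principal components.
X_pca = X_std @ W

print("explained variance ratio:", eigvals[:k] / eigvals.sum())
print("transformed data:\n", X_pca)
```

In practice you would reach for sklearn.decomposition.PCA, but reproducing the steps once like this makes it clear what the library is doing under the hood.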
Knowing how to compute PCA manually can be essential for several reasons:
▸ Conceptual understanding enhances our grasp of the underlying mathematical principles.
▸ Sometimes we may need to customize the PCA process to suit specific requirements or constraints. Manual computation enables us to adapt PCA to our needs as necessary.
▸ Understanding the inner workings of PCA through manual computation can enhance our problem-solving skills in data analysis and dimensionality reduction. We will be better equipped to tackle complex data-related challenges.
▸ A solid grasp of manual PCA can be a foundation for understanding more advanced dimensionality reduction techniques and related machine learning and data analysis methods.
▸ Manual computation can be a valuable educational tool if we teach or learn about PCA. It allows instructors and students to see how PCA works from a foundational perspective.
Hi All
Welcome to GeekyCodes. Thank you for the 500 members in this group. Join our channel for the latest programming blogs, job openings at various organizations, and machine learning blogs.
If you have any doubts regarding ML/Data Science, please reach out to me @ved1104.
Why we can never fully optimize ML models