The TikTok recommender system is widely regarded as one of the best in the world at the scale it operates at. It recommends both videos and ads, and even the other big tech companies have struggled to compete. Recommending on a platform like TikTok is hard because the training data is non-stationary: a user's interests can change in a matter of minutes, and the number of users, videos, and ads keeps changing.
The predictive performance of a recommender system on a social media platform deteriorates in a matter of hours, so the model needs to be updated as often as possible. TikTok built a streaming engine to ensure the model is continuously trained online. The model server generates features and serves recommendations; in return, the user interacts with the recommended items. This feedback loop produces new training samples that are immediately sent to the training server. The training server holds a copy of the model and writes the updated parameters to the parameter server, which pushes them to the production model every minute.
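The feedback loop can be sketched with a toy in-memory version of the three components. Everything here is an illustrative stand-in, not TikTok's actual design: the "model" is just a dict of per-(user, item) scores, and the update rule is a simple running average toward the click label.

```python
from collections import deque

training_samples = deque()   # joins the model server to the training server
scores = {}                  # training server's model: (user, item) -> score
serving_scores = {}          # production copy used by the model server

ITEMS = ["video_a", "video_b", "video_c"]

def recommend(user):
    """Model server: pick the item the production model scores highest."""
    return max(ITEMS, key=lambda item: serving_scores.get((user, item), 0.0))

def observe(user, item, clicked):
    """Each user interaction immediately becomes a new training sample."""
    training_samples.append((user, item, 1.0 if clicked else 0.0))

def train(lr=0.5):
    """Training server: fold fresh samples into its copy of the model."""
    while training_samples:
        user, item, label = training_samples.popleft()
        key = (user, item)
        old = scores.get(key, 0.0)
        scores[key] = old + lr * (label - old)

def sync():
    """Parameter server: push updated parameters to production (every minute)."""
    serving_scores.update(scores)
```

After a user clicks a recommended video, one `train()` plus `sync()` cycle is enough for the production model to start preferring that video for them.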
The recommendation model is several terabytes in size, so synchronizing the full model across the network is very slow. That is why the model is only partially updated. The leading cause of non-stationarity (concept drift) is the sparse variables (users, videos, ads, etc.) represented by embedding tables. When a user interacts with a recommended item, only the vectors associated with that user and that item get updated, along with some of the network weights. Therefore, only the updated vectors are synchronized every minute, while the network weights are synchronized on a longer time frame.
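A minimal sketch of this partial synchronization, assuming a dict-based embedding table where only the rows touched since the last sync are pushed to the serving copy (all names, shapes, and the learning rate are illustrative):

```python
import numpy as np

rng = np.random.default_rng(0)

# Training-side parameters: a sparse embedding table plus dense network weights.
embeddings = {uid: rng.normal(size=4) for uid in ["u1", "u2", "u3"]}
dense_weights = rng.normal(size=(4, 4))
touched = set()  # IDs whose vectors changed since the last sync

def train_on_interaction(uid, grad, lr=0.01):
    """An interaction only updates that user's row, so only it needs syncing."""
    embeddings[uid] -= lr * grad
    touched.add(uid)

def minute_sync(serving_embeddings):
    """Push only the rows that actually changed; the dense weights are
    synchronized separately, on a longer schedule."""
    for uid in touched:
        serving_embeddings[uid] = embeddings[uid].copy()
    touched.clear()
```

With millions of users but only thousands active in any given minute, shipping just the touched rows keeps the per-minute sync traffic tiny relative to the full table.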
Typical recommender systems use fixed-size embedding tables, and the categories of the sparse variables are assigned to vectors through a hash function. The hash size is usually smaller than the number of categories, so multiple categories collide onto the same vector; for example, multiple users share one vector. This handles the cold-start problem for new users and bounds the maximum memory the table can use, but it also tends to hurt model performance because the behaviors of unrelated users get conflated. Instead, TikTok uses dynamically sized embedding tables with a collisionless hash function, so each new user gets a vector of their own. Because low-activity users barely influence model performance, low-occurrence IDs are filtered out and stale IDs are dynamically evicted. This keeps the embedding table small while preserving the quality of the model.
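A collisionless table with frequency filtering and stale-ID eviction can be sketched like this (a Python dict is collision-free by key; the class name, thresholds, and API are hypothetical, not TikTok's implementation):

```python
import time

class CollisionlessEmbeddingTable:
    """Each ID gets its own slot. IDs must be seen `min_count` times before
    they are granted a vector, and vectors untouched for `ttl` seconds
    are evicted to keep the table small."""

    def __init__(self, dim=4, min_count=2, ttl=3600.0):
        self.dim, self.min_count, self.ttl = dim, min_count, ttl
        self.counts = {}   # occurrence count per ID
        self.table = {}    # ID -> (vector, last_access_time)

    def lookup(self, key, now=None):
        now = time.time() if now is None else now
        self.counts[key] = self.counts.get(key, 0) + 1
        if key not in self.table and self.counts[key] >= self.min_count:
            self.table[key] = ([0.0] * self.dim, now)  # fresh zero vector
        if key in self.table:
            vec, _ = self.table[key]
            self.table[key] = (vec, now)               # refresh access time
            return vec
        return None  # low-occurrence ID: no dedicated vector yet

    def evict_stale(self, now=None):
        now = time.time() if now is None else now
        stale = [k for k, (_, t) in self.table.items() if now - t > self.ttl]
        for k in stale:
            del self.table[k]
        return stale
```

The first lookup of a rare ID returns nothing (the model can fall back to a shared default vector); once an ID proves itself by recurring, it gets its own row, and rows that go quiet past the TTL are reclaimed.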
How Is PCA Manually Computed, and Why Should We Know It?
In data science, machine learning, and statistics, Principal Component Analysis (PCA) is a dimensionality-reduction method: it transforms a large set of variables into a smaller one that still contains most of the information in the original set.
Reducing the number of variables in a data set naturally comes at the expense of accuracy, but the trick in dimensionality reduction is to trade a little accuracy for simplicity. Smaller data sets are easier to explore and visualize, and machine learning algorithms can analyze them much faster when there are no extraneous variables to process.
PCA finds the directions of maximal variance in the data, and these directions are mutually orthogonal. That orthogonality is what makes PCA a global algorithm: every new feature direction it finds is constrained to be orthogonal to all the others.
Let's see how we can manually compute PCA given some random table of values (see the illustration).
Step 1: Standardize the dataset.
Step 2: Calculate the covariance matrix of the features in the dataset.
Step 3: Calculate the eigenvalues of the covariance matrix.
Step 4: Sort the eigenvalues in decreasing order.
Step 5: Calculate the eigenvector for each eigenvalue using Cramer's rule.
Step 6: Build the eigenvector matrix.
Step 7: Pick the top k eigenvalues and form a matrix from their eigenvectors.
Step 8: Transform the original matrix.
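The eight steps above can be sketched in NumPy on a small made-up table of values. Here `np.linalg.eigh` stands in for the by-hand characteristic-polynomial and Cramer's-rule work of steps 3-5; the data matrix is purely illustrative.

```python
import numpy as np

# An illustrative 5x2 table of values (5 samples, 2 features).
X = np.array([[2.5, 2.4],
              [0.5, 0.7],
              [2.2, 2.9],
              [1.9, 2.2],
              [3.1, 3.0]])

# Step 1: standardize each feature to zero mean and unit variance.
Z = (X - X.mean(axis=0)) / X.std(axis=0)

# Step 2: covariance matrix of the standardized features.
C = np.cov(Z, rowvar=False)

# Steps 3-5: eigenvalues and eigenvectors of the covariance matrix
# (by hand: solve det(C - lambda*I) = 0, then Cramer's rule per eigenvalue).
eigvals, eigvecs = np.linalg.eigh(C)

# Steps 4 and 6: sort eigenpairs by decreasing eigenvalue;
# the eigenvector matrix holds one eigenvector per column.
order = np.argsort(eigvals)[::-1]
eigvals, eigvecs = eigvals[order], eigvecs[:, order]

# Step 7: keep the top-k eigenvectors as the projection matrix.
k = 1
W = eigvecs[:, :k]

# Step 8: transform the original (standardized) matrix.
X_pca = Z @ W
```

A useful sanity check: the sample variance of each projected component equals its eigenvalue, which is exactly why picking the largest eigenvalues retains the most information.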
Knowing how to compute PCA manually can be essential for several reasons:
▸ Conceptual understanding enhances your grasp of the underlying mathematical principles.
▸ Sometimes we may need to customize the PCA process to suit specific requirements or constraints. Manual computation enables us to adapt PCA and adjust it to our needs as necessary.
▸ Understanding the inner workings of PCA through manual computation can enhance our problem-solving skills in data analysis and dimensionality reduction. We will be better equipped to tackle complex data-related challenges.
▸ A solid grasp of manual PCA is a foundation for understanding more advanced dimensionality reduction techniques and related machine learning and data analysis methods.
▸ Manual computation can be a valuable educational tool if we teach or learn about PCA. It allows instructors and students to see how PCA works from a foundational perspective.