Machine Learning And AI

💡 𝗛𝗼𝘄 𝗱𝗼𝗲𝘀 𝗮𝗻 𝗟𝗟𝗠 (𝗹𝗮𝗿𝗴𝗲 𝗹𝗮𝗻𝗴𝘂𝗮𝗴𝗲 𝗺𝗼𝗱𝗲𝗹) 𝗮𝗰𝘁𝘂𝗮𝗹𝗹𝘆 𝗹𝗲𝗮𝗿𝗻?
It’s a journey through 3 key phases:

1️⃣ 𝗦𝗲𝗹𝗳-𝗦𝘂𝗽𝗲𝗿𝘃𝗶𝘀𝗲𝗱 𝗟𝗲𝗮𝗿𝗻𝗶𝗻𝗴 (𝗨𝗻𝗱𝗲𝗿𝘀𝘁𝗮𝗻𝗱𝗶𝗻𝗴 𝗟𝗮𝗻𝗴𝘂𝗮𝗴𝗲)
The model is trained on massive text datasets (Wikipedia, blogs, websites). This is where the transformer architecture comes into picture which you can simply think of it as neural networks that sees words and predicts what comes next.
For example:
“A flash flood watch will be in effect all _____.”
The model ranks possible answers like “night,” “day,” or even “giraffe.” Over time, it gets really good at picking the right one.

2️⃣ 𝗦𝘂𝗽𝗲𝗿𝘃𝗶𝘀𝗲𝗱 𝗟𝗲𝗮𝗿𝗻𝗶𝗻𝗴 (𝗨𝗻𝗱𝗲𝗿𝘀𝘁𝗮𝗻𝗱𝗶𝗻𝗴 𝗜𝗻𝘀𝘁𝗿𝘂𝗰𝘁𝗶𝗼𝗻𝘀)
Next, we teach it how humans like their answers. Thousands of examples of questions and well-crafted responses are fed to the model. This step is smaller but crucial, it’s where the model learns to align with human intent.

3️⃣ 𝗥𝗲𝗶𝗻𝗳𝗼𝗿𝗰𝗲𝗺𝗲𝗻𝘁 𝗟𝗲𝗮𝗿𝗻𝗶𝗻𝗴 (𝗜𝗺𝗽𝗿𝗼𝘃𝗶𝗻𝗴 𝗕𝗲𝗵𝗮𝘃𝗶𝗼𝗿)
Finally, the model learns to improve its behavior based on feedback. Humans rate its answers (thumbs up or thumbs down), and the model adjusts.
This helps it avoid harmful or wrong answers and focus on being helpful, honest, and safe.

Through this process, the model learns patterns and relationships in language, which are stored as numerical weights. These weights are then compressed into the parameter file, the core of what makes the model function.

⚙️ So what happens when you ask a question?
The model breaks your question into tokens (small pieces of text, turned into numbers). It processes these numbers through its neural networks and predicts the most likely response.

For example:
“What should I eat today?” might turn into numbers like [123, 11, 45, 78], which the model uses to calculate the next best words to give you the answer.

❗️But here’s something important: every model has a token limit -> a maximum number of tokens it can handle at once. This can vary between small and larger models. Once it reaches that limit, it forgets the earlier context and focuses only on the most recent tokens.

Finally, you can imagine an LLM as just two files:

➡️ 𝗣𝗮𝗿𝗮𝗺𝗲𝘁𝗲𝗿 𝗳𝗶𝗹𝗲 – This is the big file, where all the knowledge lives. Think of it like a giant zip file containing everything the model has learned about language.

➡️ 𝗥𝘂𝗻 𝗳𝗶𝗹𝗲 – This is the set of instructions needed to use the parameter file. It defines the model’s architecture, handles text tokenization, and manages how the model generates outputs.

That’s a very simple way to break down how LLMs work!
These models are the backbone of AI agents, so lets not forget about them 😉

530 views11:21