Analysis for Llama 3.1.
1. 15.6T tokens, Tools & Multilingual
2. Llama arch + new RoPE
3. fp16 & static fp8 quant for 405b
4. Dedicated pad token
5. <|python_tag|><|eom_id|> for tools?
6. Roberta to classify good quality data
7. 6 staged 800B tokens long context expansion
Data mixture
- 50% general knowledge
- 25% maths & reasoning
- 17% code data and tasks
- 8% multilingual data
Source
1. 15.6T tokens, Tools & Multilingual
2. Llama arch + new RoPE
3. fp16 & static fp8 quant for 405b
4. Dedicated pad token
5. <|python_tag|><|eom_id|> for tools?
6. Roberta to classify good quality data
7. 6 staged 800B tokens long context expansion
Data mixture
- 50% general knowledge
- 25% maths & reasoning
- 17% code data and tasks
- 8% multilingual data
Source
π2
Forwarded from Techα’α΅ (Hilina)
It's been an incredibly exciting week in the world of AI:
- OpenAI launched a new search tool called SearchGPT
- Meta updated its Llama language model to version 3.1
- Mistral AI released a new and improved Mistral Large 2 model
- DeepMind's AI achieved a silver medal at the International Math Olympiad
- Elon Musk announced plans to develop Grok 2 and 3
- OpenAI launched a new search tool called SearchGPT
- Meta updated its Llama language model to version 3.1
- Mistral AI released a new and improved Mistral Large 2 model
- DeepMind's AI achieved a silver medal at the International Math Olympiad
- Elon Musk announced plans to develop Grok 2 and 3
β‘5
Btw it's depressing that almost allπ of them are almost inaccessible, either beta version or needs high gpus.
π’6
I expect
1. bank rates will rapidly converge to the current parallel market rate, maybe a bit lower.
2. market rate will not go down but it will rise more slowly, over the next 6 months, % change of ETB/USD will be less than the last 6 months.
3. longer term, ETB will strengthen
Source from Nemo Semret
1. bank rates will rapidly converge to the current parallel market rate, maybe a bit lower.
2. market rate will not go down but it will rise more slowly, over the next 6 months, % change of ETB/USD will be less than the last 6 months.
3. longer term, ETB will strengthen
Source from Nemo Semret
πRanking Programming Languages by Energy Efficiency
Compiled languages βtend to beβ the most energy-efficient and fastest-running.
...the five slowest languages were all interpreted: Lua, Python, Perl, Ruby and Typescript. And the five languages which consumed the most energy were also interpreted ones.
Paper
Compiled languages βtend to beβ the most energy-efficient and fastest-running.
...the five slowest languages were all interpreted: Lua, Python, Perl, Ruby and Typescript. And the five languages which consumed the most energy were also interpreted ones.
Paper
β‘7π1
Who do you think are some of the smartest or coolest or awesome people in AI/ML?
I'll start Alec Radford (the real guy behind almost every OpenAI projects and amazing things)
I'll start Alec Radford (the real guy behind almost every OpenAI projects and amazing things)
Forwarded from Dagmawi Babi
NGL my only paid AI service is ChatGPT and it's been so worth it!
Forwarded from Luna's pathwayπ€ (Luna)
Be Smart on your Pricing Strategy
Front and BackπΈ
Well let me explain I was trying to upgrade my GPT and I thought It's 20$ - as you see on the pricing plan. But after I dive to the confirmation page it Include 4$ VAT which is not displayed on the front pageπ If the User see the price is 24$ they may hesitate to buy it but ..
Front and Back
Well let me explain I was trying to upgrade my GPT and I thought It's 20$ - as you see on the pricing plan. But after I dive to the confirmation page it Include 4$ VAT which is not displayed on the front page
Please open Telegram to view this post
VIEW IN TELEGRAM
Please open Telegram to view this post
VIEW IN TELEGRAM
π1
Forwarded from Techα’α΅ (Tolo$a)
π¨ Exciting News! π¨
Join us in just ONE HOUR for an incredible episode of the Techα’α΅ Podcast S02E07 π
π We're featuring BEKA, a top developer turning unique ideas into reality! π
Spread the word and share this with anyone you think will love it! πβ¨
πππJoin
@Techinethio
Join us in just ONE HOUR for an incredible episode of the Techα’α΅ Podcast S02E07 π
π We're featuring BEKA, a top developer turning unique ideas into reality! π
Spread the word and share this with anyone you think will love it! πβ¨
πππJoin
@Techinethio
π1π₯1
π9
If you usually use generated images and you want to enhance their resolution use AuraSR-v2: a GAN-based Super-Resolution for upscaling generated images,
Try the model on hf spaces with your images for free: https://huggingface.co/spaces/gokaygokay/AuraSR-v2
Btw a little about GANs, they have of two neural networks, the generator(create data that is similar to the real data) and the discriminator(distinguish between real data (from the training set) and fake data (produced by the generator), which are trained together through a process of adversarial learning where the generator and discriminator are in sort of a competition.
Try the model on hf spaces with your images for free: https://huggingface.co/spaces/gokaygokay/AuraSR-v2
Btw a little about GANs, they have of two neural networks, the generator(create data that is similar to the real data) and the discriminator(distinguish between real data (from the training set) and fake data (produced by the generator), which are trained together through a process of adversarial learning where the generator and discriminator are in sort of a competition.
β€7
Linear Algebra for Data Science
Covers a vector of topics from matrices, vector spaces, orthogonality and projections, singular value decomposition, determinants, eigen values and vectors, big theorems, to name a few!
https://kyunghyuncho.me/linear-algebra-for-data-science/
Covers a vector of topics from matrices, vector spaces, orthogonality and projections, singular value decomposition, determinants, eigen values and vectors, big theorems, to name a few!
https://kyunghyuncho.me/linear-algebra-for-data-science/
π8
Let's talk about Samπ
Introducing Meta Segment Anything Model 2 (SAM 2) β the first unified model for real-time, promptable object segmentation in images & videos.
SAM 2 is available today under Apache 2.0 so that anyone can use it to build their own experience.
I will try to test it on something very soon and will post the results here.
Source
Introducing Meta Segment Anything Model 2 (SAM 2) β the first unified model for real-time, promptable object segmentation in images & videos.
SAM 2 is available today under Apache 2.0 so that anyone can use it to build their own experience.
I will try to test it on something very soon and will post the results here.
Source
β‘8
Let's talk about Luna
Luna-AI-Llama2-Uncensored
This is is a Llama2 based Chat model fine-tuned on over 40,000 long form chat discussions. This model was fine-tuned by Tap, the creator of Luna AI.
https://huggingface.co/Tap-M/Luna-AI-Llama2-Uncensored
Luna-AI-Llama2-Uncensored
This is is a Llama2 based Chat model fine-tuned on over 40,000 long form chat discussions. This model was fine-tuned by Tap, the creator of Luna AI.
https://huggingface.co/Tap-M/Luna-AI-Llama2-Uncensored
huggingface.co
Tap-M/Luna-AI-Llama2-Uncensored Β· Hugging Face
Weβre on a journey to advance and democratize artificial intelligence through open source and open science.