Henok | Neural Nets
1.61K subscribers
233 photos
20 videos
13 files
157 links
Download Telegram
Henok | Neural Nets
This is interesting, how is tensorflow beating Jax in a normal 10,000 × 10,000 matrices multiplication(element-wise)? Anway, if you only need per-element operations, use multiply() (very fast) and tf will be faster than jax. If you need actual matrix computations…
Spent the last 2 weeks exploring JAX(I've some prior experience) for no particular reason, for most of the time Numpy outperforms it for a few things I tried esp CPU and low GPU requiring tasks, but I'll try to train/inference in big models like Gemma to see the difference.

If you hated PyTorch for not being functional, then go for JAX.

I really want to know if anyone worked with JAX extensively and see how your experience was.

Here is a very good notebook to learn more by some friends: Notebook

Oh and Deepmind uses JAX😉
9👍2
Llama 3.2 400M Amharic

This is a smaller version of the Meta's Llama-3.2-1B decoder transformer model pretrained from scratch for 23 hours using a single A100 40GB GPU and 274 million tokens of Amharic text.



https://huggingface.co/rasyosef/Llama-3.2-400M-Amharic
🔥15
Forwarded from Beka (Beka)
Hey guys good news :)

Better Auth has been accepted into Y Combinator's Spring 2025 batch (X25)! 🎉

Myself and @kinfishfarms, will be part of YC's first spring batch. Super excited and thanks everyone here for being part of my journey so far :)) but a lot more to come!
13🔥7👍1
Congrats @beka_cru on this🎉, let's not make fun of him at least for today 😂
😁22🤣5
OpenAI is the company that's going to take us to the next chapter !!!
🔥14😈1
Btw how can I make my papers title like this, or something quirky like "Attention is all you need" 😂
😁15
100k is a lotttt.
😁8🤔1
This media is not supported in your browser
VIEW IN TELEGRAM
Hasab AI 🔥

Here is a great practical use case of ML in Ethiopia. The inference is really optimized 🔥.

Take a look at the demo.

https://www.hasab.ai/
🔥28
Why is the field of robotics so slow in the past 15 years?

Boston Dynamics is the only one I can think of.
💯6
Let's have fun😁
🤣27😁7
take it easy bro, where is the fun :)
😁3
My favorite open source model series. The Llama 4 herd is out

https://ai.meta.com/blog/llama-4-multimodal-intelligence/
🔥11
What are some *practical* ways for reducing input tokens (e.g. chunking, summarizing, selective filtering) ? Just looking for something that worked well.
Forwarded from Debugging Epohul (epohul)
I'm craving a PhD
❤‍🔥3
I'm not signing a doc to access a so called "open source dataset", it's also very clear that i won't be able to develop a model with such small data, let alone use for commercial purposes.
😁9😭2🥱1