Henok | Neural Nets
This is interesting, how is tensorflow beating Jax in a normal 10,000 × 10,000 matrices multiplication(element-wise)? Anway, if you only need per-element operations, use multiply() (very fast) and tf will be faster than jax. If you need actual matrix computations…
Spent the last 2 weeks exploring JAX(I've some prior experience) for no particular reason, for most of the time Numpy outperforms it for a few things I tried esp CPU and low GPU requiring tasks, but I'll try to train/inference in big models like Gemma to see the difference.
If you hated PyTorch for not being functional, then go for JAX.
I really want to know if anyone worked with JAX extensively and see how your experience was.
Here is a very good notebook to learn more by some friends: Notebook
Oh and Deepmind uses JAX😉
If you hated PyTorch for not being functional, then go for JAX.
I really want to know if anyone worked with JAX extensively and see how your experience was.
Here is a very good notebook to learn more by some friends: Notebook
Oh and Deepmind uses JAX😉
GitHub
indaba-pracs-2024/practicals/Intro_to_ML_using_JAX/Introduction_to_ML_using_JAX.ipynb at main · deep-learning-indaba/indaba-pracs…
Notebooks for the Practicals at the Deep Learning Indaba 2024. - deep-learning-indaba/indaba-pracs-2024
❤9👍2
Forwarded from Beka (Beka)
Better Auth is 500 stars away from 10k stars ✨ could you please give us a star if you haven't ;)
https://github.com/better-auth/better-auth
https://github.com/better-auth/better-auth
GitHub
GitHub - better-auth/better-auth: The most comprehensive authentication framework for TypeScript
The most comprehensive authentication framework for TypeScript - better-auth/better-auth
❤3
Llama 3.2 400M Amharic
This is a smaller version of the Meta's Llama-3.2-1B decoder transformer model pretrained from scratch for 23 hours using a single A100 40GB GPU and 274 million tokens of Amharic text.
https://huggingface.co/rasyosef/Llama-3.2-400M-Amharic
This is a smaller version of the Meta's Llama-3.2-1B decoder transformer model pretrained from scratch for 23 hours using a single A100 40GB GPU and 274 million tokens of Amharic text.
https://huggingface.co/rasyosef/Llama-3.2-400M-Amharic
🔥15
Forwarded from Beka (Beka)
Hey guys good news :)
Better Auth has been accepted into Y Combinator's Spring 2025 batch (X25)! 🎉
Myself and @kinfishfarms, will be part of YC's first spring batch. Super excited and thanks everyone here for being part of my journey so far :)) but a lot more to come!
Better Auth has been accepted into Y Combinator's Spring 2025 batch (X25)! 🎉
Myself and @kinfishfarms, will be part of YC's first spring batch. Super excited and thanks everyone here for being part of my journey so far :)) but a lot more to come!
⚡13🔥7👍1
Congrats @beka_cru on this🎉, let's not make fun of him at least for today 😂
😁22🤣5
OpenAI is the company that's going to take us to the next chapter !!!
🔥14😈1
This media is not supported in your browser
VIEW IN TELEGRAM
Hasab AI 🔥
Here is a great practical use case of ML in Ethiopia. The inference is really optimized 🔥.
Take a look at the demo.
https://www.hasab.ai/
Here is a great practical use case of ML in Ethiopia. The inference is really optimized 🔥.
Take a look at the demo.
https://www.hasab.ai/
🔥28
Why is the field of robotics so slow in the past 15 years?
Boston Dynamics is the only one I can think of.
Boston Dynamics is the only one I can think of.
💯6
Henok | Neural Nets
If you are interested in Computer Vision make sure to apply here. It's a great program and I was a TA there last year and learned a lot too and met some cool people. The curriculum emphasizes the importance of ethical considerations, geometry and math, deep…
Started reviewing applicants today. So many great people.
Text to Bark, what a breakthrough in human civilization.
https://x.com/elevenlabsio/status/1907014022009876508
This got me fooled tbh
https://x.com/elevenlabsio/status/1907014022009876508
X (formerly Twitter)
ElevenLabs (@elevenlabsio) on X
We pioneered the first ultra-realistic Text to Speech model, and recently launched the world's most accurate Speech to Text model, Scribe.
But we're not stopping there.
Today, we're taking one small step for man, and one giant leap for man's best friend...…
But we're not stopping there.
Today, we're taking one small step for man, and one giant leap for man's best friend...…
🤣10
My favorite open source model series. The Llama 4 herd is out
https://ai.meta.com/blog/llama-4-multimodal-intelligence/
https://ai.meta.com/blog/llama-4-multimodal-intelligence/
🔥11
What are some *practical* ways for reducing input tokens (e.g. chunking, summarizing, selective filtering) ? Just looking for something that worked well.
Anthropic’s Evaluation of Chain-of-Thought Faithfulness
https://www.marktechpost.com/2025/04/05/anthropics-evaluation-of-chain-of-thought-faithfulness-investigating-hidden-reasoning-reward-hacks-and-the-limitations-of-verbal-ai-transparency-in-reasoning-models/
https://www.marktechpost.com/2025/04/05/anthropics-evaluation-of-chain-of-thought-faithfulness-investigating-hidden-reasoning-reward-hacks-and-the-limitations-of-verbal-ai-transparency-in-reasoning-models/
MarkTechPost
Anthropic’s Evaluation of Chain-of-Thought Faithfulness: Investigating Hidden Reasoning, Reward Hacks, and the Limitations of Verbal…
A key advancement in AI capabilities is the development and use of chain-of-thought (CoT) reasoning, where models explain their steps before reaching an answer. This structured intermediate reasoning is not just a performance tool; it’s also expected to enhance…
I'm not signing a doc to access a so called "open source dataset", it's also very clear that i won't be able to develop a model with such small data, let alone use for commercial purposes.
😁9😭2🥱1