Henok | Neural Nets

If you're interested in pretraining any kind of text-based model, you may have felt that there isn't much you can do other than feed in the text in bulk. But there's more to it.

Here are some slides on Gemini's pretraining and on pretraining in general. It's super compute-intensive, even for older decoder-based models.

https://vladfeinberg.com/assets/2025-04-24-princeton-talk.pdf

Gemini Pretraining

❤6👍2

1.37K views, 10:58

Henok | Neural Nets

On April 29 I'll be talking about running many kinds of open-source models, from small ~100M models to 7B LLMs, on the web, Android, and iOS. I'll also cover optimization and quantization, efficient fine-tuning techniques, and local-first AI development in general.

For anyone interested: link

🔥12👍3

1.51K views, edited 11:05

Henok | Neural Nets

So PewDiePie is now miles ahead of me on Linux 😂. The only time I tried installing Linux was in high school; it was Kali, and I hated the experience so much.

So what's next for him, a collab with ThePrimeagen to bash JavaScript? lol

😁13

1.36K views, 14:53

Henok | Neural Nets

PhD at Inria Paris

If you have an MS, are interested in Physics-Grounded Vision Foundation Models, and have some experience in computer vision, DM me your CV and I'll refer your application.

Raoul is a great researcher and an amazing guy to work with, and it would be super cool to see an Ethiopian in his lab.

https://astra-vision.github.io/jobs/

astra-vision.github.io

Jobs | Astra-vision - Computer vision group, Astra, Inria

Computer vision group of the Astra research team, Inria, Paris.

👍6

1.67K views, edited 17:51

Henok | Neural Nets

Why do I feel burned out on days when I'm doing nothing but sleeping?

😁17🤝6😭1

1.38K views, 18:32

Henok | Neural Nets

In case anyone is interested in submitting a paper to an Ethiopian NLP workshop: it's a good way to get some writing experience and to get into research.

https://ethionlp.github.io/index.html#call-for-papers

❤4🔥1

2.06K views, edited 12:49

Henok | Neural Nets

Job Post

If anyone is interested in a project involving a sync engine, AI agents, and vertical SaaS, DM me here: @feeling_stoic.

It's a US startup, and the guy running it is super cool; you'll learn a lot from him and enjoy working with him.

💰The offer is $800-$1000 + some equity.

Share it with others if you know anyone.

👍9❤4⚡3

1.9K views, edited 19:40

Henok | Neural Nets

Job Closed❗️

I've shared your CVs with the founder and he will take care of it moving forward.

Good luck to you all.

🙏10👏2

1.37K views, 14:14

Henok | Neural Nets

Damn China🥶

https://stanfordreview.org/investigation-uncovering-chinese-academic-espionage-at-stanford/

🤯5

1.4K views, 07:54

Henok | Neural Nets

Oh no... students, please make sure you don't do this kind of crap.

https://futurism.com/college-students-ai-typos

😁9

1.59K views, 15:30

Henok | Neural Nets

Forwarded from Beka (Beka)

We just did our YC launch. I need you guys to do me a favor. Can you go ahead and upvote please 🙏

https://www.ycombinator.com/launches/NUm-better-auth-the-authentication-framework-for-typescript

Y Combinator

Launch YC: Better Auth - The Authentication Framework for TypeScript | Y Combinator

The fastest growing Auth framework for TypeScript: 13K stars + 100K weekly downloads!

🔥6👍1

981 views, 15:36

Henok | Neural Nets

Hakuna Matata !!!

🔥16❤3

1.55K views, 12:22

Henok | Neural Nets

Last statement is too bold

👍9🤨1

1.99K views, 08:56

Henok | Neural Nets

Gemini Diffusion is super fast: 564 tokens/s. That means it can write almost a 200-page book in ~3 minutes.
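
A quick sanity check of that estimate, as a rough sketch. The 564 tokens/s, ~3 minutes, and 200 pages come from the post; the 0.75 words-per-token ratio is an assumption (a common rule of thumb for English text), not a measured figure.

```python
# Back-of-the-envelope check of the throughput claim.
TOKENS_PER_SECOND = 564   # figure quoted in the post
MINUTES = 3               # "~3 minutes" from the post
PAGES = 200               # "almost a 200 page book" from the post
WORDS_PER_TOKEN = 0.75    # assumption: rough rule of thumb for English

# Total tokens produced at a constant rate over the whole interval.
total_tokens = TOKENS_PER_SECOND * MINUTES * 60
# Spread those tokens evenly across the pages.
tokens_per_page = total_tokens / PAGES
words_per_page = tokens_per_page * WORDS_PER_TOKEN

print(f"tokens generated in {MINUTES} min: {total_tokens}")
print(f"tokens per page: {tokens_per_page:.0f}")
print(f"approx. words per page: {words_per_page:.0f}")
```

So ~3 minutes gives roughly 100K tokens, which works out to about 500 tokens per page across 200 pages. Under the assumed ratio that is a bit under 400 words per page, so the claim holds up as a ballpark.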

⚡5🔥4❤1

1.47K views, 19:07

Henok | Neural Nets

If AI can do it, why shouldn't it take over software engineering jobs entirely, or the job of a research scientist?

1.19K views, edited 15:53

Henok | Neural Nets

We don't care about AI, AGI, or whatever for tonight, let's cook this chicken, let's go United🔥

🔥12🤣10😭3

1.2K views, 18:27

Henok | Neural Nets

We are back to AI

🤣18😁7💔2

1.38K views, 21:19

Henok | Neural Nets

I hate lo-fi and I don't really get it. But if you need something to listen to while working, just check out Mulatu's jazz. He's simply the best.

Here are my favorites

Yekermo Sew
https://youtu.be/jwdBRqIsVUY?si=X-7T9QIUiMsO4a5a

Tizita
https://youtu.be/sXLfV2kegUI?si=cjZdWw_FKUXhmXLi

YouTube

Yèkèrmo Sèw

Provided to YouTube by K7 Records GmbH

Yèkèrmo Sèw · Mulatu Astatke

New York - Addis - London: The Story of Ethio Jazz 1965-1975

℗ 1969 Amha Records

Released on: 2009-10-19

Music Publisher: Copyright Control
Composer: Mulatu Astatke
Lyricist: Mulatu…

❤13🔥3👍1🤝1

1.35K views, edited 19:11

Henok | Neural Nets

Religious benchmarks for LLM evaluation seem cool; I haven't seen much work on this. Are today's best models biased against a particular religion or teaching? How would they interpret things?

Recommend me a paper if you've seen one in this area; I'll be happy to read it.

❤7

1.24K views, 04:53

Henok | Neural Nets

Building LLMs from scratch has to be one of the most challenging things I've been involved in, and it's very underrated. Pretraining data, deciding how many parameters are enough, instruction fine-tuning, generalization, and alignment, all under resource constraints (even big tech companies have compute budgets): all of this is really hard.

So whenever a new model comes out and beats others in some areas, it's a huge W.

So one suggestion: if you don't have to, don't start from scratch. Also, approaches like expanding the tokenizer and updating model weights per user should be studied more thoroughly for adapting models to new languages and tasks.
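
To make the tokenizer-expansion idea concrete, here is a toy sketch in plain Python. It assumes one common heuristic, initialising each new token's embedding to the mean of the existing embeddings; the `expand_vocab` helper and the tiny vocabulary are hypothetical and only for illustration. In practice, libraries such as Hugging Face Transformers handle this step with `tokenizer.add_tokens` and `model.resize_token_embeddings`, and the new rows are then trained on data from the new language or task.

```python
# Toy sketch of vocabulary expansion: new tokens get embedding rows
# initialised to the mean of the existing rows (an assumed heuristic,
# not necessarily what any particular model uses).

def expand_vocab(vocab, embeddings, new_tokens):
    """Return copies of vocab/embeddings extended with unseen new_tokens."""
    vocab = dict(vocab)
    embeddings = [row[:] for row in embeddings]
    dim = len(embeddings[0])
    # Column-wise mean over the original embedding rows.
    mean_row = [sum(row[i] for row in embeddings) / len(embeddings)
                for i in range(dim)]
    for token in new_tokens:
        if token in vocab:
            continue  # already known: keep its trained embedding
        vocab[token] = len(embeddings)   # next free row index
        embeddings.append(mean_row[:])   # fresh row, to be fine-tuned later
    return vocab, embeddings

# Hypothetical two-token vocabulary with 2-dimensional embeddings.
vocab = {"hello": 0, "world": 1}
embeddings = [[1.0, 2.0], [3.0, 4.0]]
new_vocab, new_embeddings = expand_vocab(vocab, embeddings, ["ሰላም", "world"])
print(new_vocab)
print(new_embeddings)
```

Starting new rows at the mean keeps them inside the distribution the rest of the model already expects, which tends to be a gentler starting point for fine-tuning than random initialisation.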

👍7❤3

1.5K views, edited 14:25
