07:54:04
[ReTweet]
RT @beffjezos: The rarest object type in the universe isn't black holes. It's us. Conscious matter. The flame of life.
We have a duty to e…
08:22:00
[ReTweet]
RT @cremieuxrecueil: For the first year on record, wind and solar produced more electricity than coal in the U.S. https://t.co/SiXbr0Mn6G
15:12:48
[Reply]
@BasilTheGreat Not even close to good enough. Those officers committed a serious crime.
16:40:02
[Reply]
Next will be writing the inference stack in C for simultaneous high-speed RL across a large block of GB300s.
(We do use a little C++ tbh, but not much)
17:39:24
[Reply]
@eastdakota Yes.
It’s not that we’ve discovered some magic bullet, but rather that JAX, or at least the open source version of it, is mostly optimized for small to medium-sized training runs on Google TPUs, whereas we need to do massive training runs on Nvidia GPUs.
Pipeline parallelism is…
19:41:39
[ReTweet]
RT @SwipeWright: ANNOUNCEMENT: WE’RE SAVING SCIENCE!
We’re often told that science is “self-correcting.”
But that’s not really true.
Sci…
19:48:36
[Reply]
Note, I am posting this to encourage those who enjoy getting incredible performance out of hardware to join SpaceX
19:57:42
[Reply]
@__tinygrad__ Agreed, the CPU cores on the GB300 should be tasked with agentic jobs. Very little of their compute is needed for GPU management.