Vol Building AGI

Channel name was changed to «Vol Building AGI»

03:50

Cheng Lu and Yang Song have solved diffusion https://arxiv.org/abs/2410.11081

Simplifying, Stabilizing and Scaling Continuous-Time Consistency Models

Consistency models (CMs) are a powerful class of diffusion-based generative models optimized for fast sampling. Most existing CMs are trained using discretized timesteps, which introduce...

🤯2

241 views03:52

Vol Building AGI

https://youtu.be/ZANbujPTvOY

This graduate-level benchmark was solved by o1 less than a year after its release, even though it was supposed to stay unsolvable by language models for a while.

YouTube

GPQA: A Graduate-Level Google-Proof Q&A Benchmark

Authors: David Rein, Betty Li Hou, Asa Cooper Stickland, Jackson Petty, Richard Yuanzhe Pang, Julien Dirani, Julian Michael, Samuel R. Bowman

We present GPQA, a challenging dataset of 448 multiple-choice questions written by domain experts in biology, physics…

150 viewsedited 06:50

Vol Building AGI

How to build AGI, Ukrainian book from 1979. Still relevant

https://scholar.google.com/citations?view_op=view_citation&hl=en&user=QkZlKBMAAAAJ&citation_for_view=QkZlKBMAAAAJ:CHSYGLWDkRkC

English version

https://archive.org/details/modelingofthinki0000amos/mode/2up

Google

Алгоритмы разума

НМ Амосов, 1979 - Cited by 257

👍4

162 viewsedited 05:44

Vol Building AGI

Sora Turbo is generally available now in Ukraine, https://x.com/model_mechanic/status/1866183714407141603?s=46&t=qNUYWfgTfF4u1RKfN0ir3A

❤4

310 views20:56

Vol Building AGI

Mechinterp on chain of thought circuits https://arxiv.org/abs/2406.02128

arXiv.org

Iteration Head: A Mechanistic Study of Chain-of-Thought

Chain-of-Thought (CoT) reasoning is known to improve Large Language Models both empirically and in terms of theoretical approximation power. However, our understanding of the inner workings and...

113 views06:17

Vol Building AGI

Pretraining is dead. I love you all.

https://youtu.be/1yvBqasHLZs

YouTube

Ilya Sutskever: "Sequence to sequence learning with neural networks: what a decade"

Ilya Sutskever full talk "Sequence to sequence learning with neural networks: what a decade" at NeurIPS 2024 in Vancouver, Canada.

"Pre-training as we know it will end" and what comes next is superintelligence: agentic, reasons, understands and is self aware.…

❤3

134 viewsedited 17:38

Vol Building AGI

o1 excels at generating GPU kernels that are harder than level 1

126 views18:48

Vol Building AGI

https://gpu-mode.github.io/popcorn/

127 views18:48

Vol Building AGI

Debug neural networks by casting them as neural fields https://github.com/neale/neural-canvas

GitHub

GitHub - neale/neural-canvas: creative deep learning with implicit neural representations

creative deep learning with implicit neural representations - neale/neural-canvas

175 views20:24

Vol Building AGI

ARC-AGI has been solved. Apply for safety testing of o3: https://openai.com/12-days/

OpenAI

12 Days of OpenAI

162 viewsedited 18:24

Vol Building AGI

Heatmap of additively smoothed log-probabilities of character bigrams, log p(target|source), in Common Voice uk 10.0. Every word is padded with a space on each side.

що looks like a very popular word. ї is missing entirely! You can see vowels making up distinct columns and rows.
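A minimal sketch of how such a bigram table could be computed. The function name `bigram_logprobs`, the smoothing constant `alpha=1.0`, the row-is-source orientation and the three-word toy corpus are all assumptions for illustration, not the code behind the heatmap.

```python
import numpy as np

def bigram_logprobs(words, alpha=1.0):
    """Return (alphabet, L) with L[i, j] = log p(alphabet[j] | alphabet[i])."""
    padded = [f" {w} " for w in words]            # pad every word with spaces
    alphabet = sorted(set("".join(padded)))
    index = {ch: i for i, ch in enumerate(alphabet)}
    counts = np.zeros((len(alphabet), len(alphabet)))
    for w in padded:
        for src, tgt in zip(w, w[1:]):            # consecutive character pairs
            counts[index[src], index[tgt]] += 1
    smoothed = counts + alpha                     # additive smoothing (assumed alpha)
    return alphabet, np.log(smoothed / smoothed.sum(axis=1, keepdims=True))

# Toy corpus standing in for the Common Voice transcripts.
alphabet, logp = bigram_logprobs(["що", "це", "що"])
```

Passing `logp` to matplotlib's `imshow` with `alphabet` as tick labels would give a heatmap of the same kind; a character that never occurs in the corpus, like ї here, simply gets no row or column.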

284 viewsedited 02:52

Vol Building AGI

Plotting columns of the DFT basis with matplotlib looks very vibrant out of the box. A small enough ratio of the window size to the sampling rate allows for stripes to reveal underlying higher frequency filters at the back.


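import numpy as np

def dft(size=512, rate=16000, low=50):
    # size frequencies in Hz, evenly spaced from low up to (excluding) rate / 2
    k = np.linspace(low, rate / 2, size, endpoint=False)
    t = np.arange(size) / rate  # sample times in seconds
    # row i holds the complex sinusoid at frequency k[i]
    return np.exp(-2j * np.pi * k[:, None] * t)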

160 views06:06

Vol Building AGI

Marcus Hutter has provided a recipe to build an agent that provably solves any problem. He wrote a new book about it: https://x.com/mhutter42/status/1871426793380688255?s=46&t=qNUYWfgTfF4u1RKfN0ir3A

X (formerly Twitter)

Marcus Hutter (@mhutter42) on X

Santa Arrived! The PDF (of a colorful Xmas version) of the "Introduction to Universal AI" book is now freely available online at https://t.co/r9COEBnf4S Wishing you all joyful reading, Merry Xmas & a :-) New Year.

🔥2

192 views17:51

Vol Building AGI

Why you want high-precision accumulation when doing low-precision computations
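A small sketch of the effect, under assumptions: the helper `dot_with_accumulator` and the test vectors are made up for illustration. Both runs multiply the same float16 inputs; only the dtype of the running sum differs.

```python
import numpy as np

def dot_with_accumulator(x, y, acc_dtype):
    """Dot product of float16 vectors with the running sum kept in acc_dtype."""
    acc = acc_dtype(0)
    for a, b in zip(x, y):
        # The sum is rounded to acc_dtype after every addition.
        acc = acc_dtype(acc + acc_dtype(a) * acc_dtype(b))
    return float(acc)

x = np.full(4096, 0.01, dtype=np.float16)
y = np.ones(4096, dtype=np.float16)

reference = float(np.dot(x.astype(np.float64), y.astype(np.float64)))
low = dot_with_accumulator(x, y, np.float16)    # low-precision accumulator
high = dot_with_accumulator(x, y, np.float32)   # high-precision accumulator
print(reference, low, high)
```

Once the float16 running sum grows large enough, each new term is smaller than half the gap between neighbouring float16 values and gets rounded away, so the sum stops growing. The float32 accumulator keeps tracking the reference.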

👍1

142 views21:56

I installed Ghostty so my terminal can render images and apply fragment shaders to the whole window.

👍2🤯1

141 views01:13

Let's animate the process of extracting a 13-coefficient Mel-Frequency Cepstral Coefficient (MFCC) spectrogram from an MP3 file.
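The pipeline being animated can be sketched in plain NumPy. This is a generic textbook MFCC under assumed settings (512-sample Hann window, 160-sample hop, 40 mel filters, unnormalized DCT-II), not the exact configuration used for the animation. MP3 decoding is left out; a synthetic sine wave stands in for the decoded audio.

```python
import numpy as np

def hz_to_mel(f):
    return 2595.0 * np.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def mel_filterbank(n_mels, n_fft, rate):
    """Triangular filters evenly spaced on the mel scale, shape (n_mels, n_fft // 2 + 1)."""
    edges = mel_to_hz(np.linspace(hz_to_mel(0.0), hz_to_mel(rate / 2), n_mels + 2))
    freqs = np.fft.rfftfreq(n_fft, d=1.0 / rate)
    lower, center, upper = edges[:-2, None], edges[1:-1, None], edges[2:, None]
    rising = (freqs - lower) / (center - lower)
    falling = (upper - freqs) / (upper - center)
    return np.clip(np.minimum(rising, falling), 0.0, None)

def mfcc(signal, rate=16000, n_fft=512, hop=160, n_mels=40, n_coef=13):
    # 1. Slice the waveform into overlapping frames and apply a Hann window.
    n_frames = 1 + (len(signal) - n_fft) // hop
    idx = np.arange(n_fft)[None, :] + hop * np.arange(n_frames)[:, None]
    frames = signal[idx] * np.hanning(n_fft)
    # 2. Power spectrum of every frame.
    power = np.abs(np.fft.rfft(frames, axis=1)) ** 2
    # 3. Pool the spectrum with the mel filters and take the log.
    log_mel = np.log(power @ mel_filterbank(n_mels, n_fft, rate).T + 1e-10)
    # 4. DCT-II along the mel axis; keep the first n_coef coefficients.
    n = np.arange(n_mels)
    dct = np.cos(np.pi / n_mels * (n[None, :] + 0.5) * np.arange(n_coef)[:, None])
    return log_mel @ dct.T

rate = 16000
t = np.arange(rate) / rate                      # one second of audio
features = mfcc(np.sin(2 * np.pi * 440.0 * t), rate=rate)
print(features.shape)                           # (frames, 13)
```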

369 views00:00

Vol Building AGI

To update Gaussian mixture models with 16384 components on 13-dimensional MFCC frames using expectation maximization, I need to initialize the mixtures. The simplest data-driven initializer for GMMs is taking cluster centroids.

I decided to compare three algorithms for clustering:

1. random sampling (usually a decent init for other algorithms, no hyperparameters)
2. minibatch Lloyd (performs EMA updates on mini-batches, has one EMA weight hyperparameter)
3. Linde-Buzo-Gray (LBG) with minibatch Lloyd refinement: a classic algorithm for computing quantization codebooks. Its main trick is to start with a codebook of size 1 and progressively double its size by perturbing the current centroids and refining them with k-means.

I have 11 million frames from Common Voice uk 10.0, so Lloyd's algorithm (classical k-means) and SVD/QR are out: they require materializing matrices that are a bit too big for my MacBook.

On the plot, the x axis is the number of steps and the y axis is the quantization loss.
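A rough sketch of LBG with minibatch Lloyd refinement, as described in item 3. Everything concrete here is an assumption for illustration: the function names, the additive Gaussian perturbation `eps`, the step and batch sizes, and reading the EMA weight as the fraction of the old centroid that is kept. The toy data is four 2-D blobs rather than MFCC frames.

```python
import numpy as np

def assign(x, codebook):
    """Index of the nearest centroid for every row of x."""
    d = ((x[:, None, :] - codebook[None, :, :]) ** 2).sum(-1)
    return d.argmin(1)

def minibatch_lloyd(data, codebook, rng, steps=50, batch=256, ema=0.9):
    """Refine centroids with EMA updates computed on random mini-batches."""
    codebook = codebook.copy()
    for _ in range(steps):
        x = data[rng.integers(0, len(data), size=batch)]
        nearest = assign(x, codebook)
        for k in np.unique(nearest):
            batch_mean = x[nearest == k].mean(0)
            # Assumed convention: ema is the weight of the old centroid.
            codebook[k] = ema * codebook[k] + (1 - ema) * batch_mean
    return codebook

def lbg(data, n_codes, rng, eps=1e-2, **kwargs):
    """Grow a codebook from 1 to n_codes (a power of two) entries by splitting."""
    codebook = data.mean(0, keepdims=True)        # start with a single centroid
    while len(codebook) < n_codes:
        noise = eps * rng.standard_normal(codebook.shape)
        codebook = np.concatenate([codebook + noise, codebook - noise])  # double
        codebook = minibatch_lloyd(data, codebook, rng, **kwargs)        # refine
    return codebook

def quantization_loss(data, codebook):
    return float(((data - codebook[assign(data, codebook)]) ** 2).sum(-1).mean())

rng = np.random.default_rng(0)
data = np.concatenate([rng.normal(c, 0.1, size=(500, 2)) for c in (-3.0, -1.0, 1.0, 3.0)])
codebook = lbg(data, n_codes=4, rng=rng)
print(quantization_loss(data, codebook))
```

Most refinement steps run while the codebook is still small, which is where the compute savings would come from.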

154 viewsedited 07:48

Vol Building AGI

I found that LBG is very compute efficient: it spends most of its time running k-means on small clusterings, so it wins on wall clock (ballpark 10x faster?) in my CPU-only pure-NumPy implementation with efficient L2 distance computation. It also seems to be more data efficient: I couldn't get better results with Lloyd when I ran it for more steps.
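The post doesn't say which trick "efficient L2 distance computation" refers to. One common option, sketched here as an assumption, expands the squared norm so a single matrix product replaces the large (n, k, d) temporary. The name `pairwise_sq_l2` and the array sizes are illustrative.

```python
import numpy as np

def pairwise_sq_l2(x, c):
    """Squared Euclidean distances between rows of x (n, d) and rows of c (k, d)."""
    x_norm = (x * x).sum(1)[:, None]              # (n, 1)
    c_norm = (c * c).sum(1)[None, :]              # (1, k)
    # ||x - c||^2 = ||x||^2 - 2 x.c + ||c||^2; clamp tiny negatives from rounding.
    return np.maximum(x_norm - 2.0 * x @ c.T + c_norm, 0.0)

rng = np.random.default_rng(0)
x = rng.standard_normal((1000, 13))               # e.g. a batch of 13-d frames
c = rng.standard_normal((64, 13))                 # e.g. 64 centroids
nearest = pairwise_sq_l2(x, c).argmin(1)          # nearest centroid per frame
```

For nearest-centroid assignment the `x_norm` term could even be dropped, since it is constant along each row and does not change the argmin.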

I didn't bother tuning the EMA learning rate too much and settled at 0.9. Progressive scaling is all we need?

👍1

153 viewsedited 07:48

Vol Building AGI

Alignment self-training can work even with a single utterance. In this video expectation maximization for a GMM acoustic model with a linear chain HMM successfully finds a plausible alignment in 30 steps.

My algorithm only updates the GMM mixture coefficients in the maximization step. The expectation step uses my implementation of scaled forward-backward recursions.

1024 GMM means are pretrained using LBG, the HMM prior is a linear chain — similar to what you see on the right hand side in the plot.
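For reference, a generic sketch of scaled forward-backward recursions on a linear-chain HMM. It is not the implementation from the video: the names `linear_chain` and `forward_backward`, the 0.5 self-loop probability, the forced start in state 0 and the random stand-in for GMM frame likelihoods are all assumptions.

```python
import numpy as np

def linear_chain(n_states, stay=0.5):
    """Left-to-right transitions: stay in a state or advance to the next one."""
    A = np.eye(n_states) * stay + np.eye(n_states, k=1) * (1.0 - stay)
    A[-1, -1] = 1.0                               # the last state absorbs
    return A

def forward_backward(A, B):
    """B[t, s] = p(frame t | state s). Returns state posteriors and log-likelihood."""
    T, S = B.shape
    alpha, beta, scale = np.zeros((T, S)), np.ones((T, S)), np.zeros(T)
    alpha[0, 0] = B[0, 0]                         # assumption: start in state 0
    scale[0] = alpha[0].sum()
    alpha[0] /= scale[0]
    for t in range(1, T):
        alpha[t] = (alpha[t - 1] @ A) * B[t]
        scale[t] = alpha[t].sum()
        alpha[t] /= scale[t]                      # rescaling prevents underflow
    for t in range(T - 2, -1, -1):
        beta[t] = A @ (B[t + 1] * beta[t + 1]) / scale[t + 1]
    gamma = alpha * beta                          # posteriors; each row sums to 1
    return gamma, float(np.log(scale).sum())

rng = np.random.default_rng(0)
A = linear_chain(4)
B = rng.uniform(0.1, 1.0, size=(20, 4))           # stand-in for GMM likelihoods
gamma, loglik = forward_backward(A, B)
alignment = gamma.argmax(1)                       # most probable state per frame
```

In an EM loop, `gamma` would supply the per-frame state weights that the maximization step uses to update the mixture coefficients.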

175 viewsedited 08:21
