AI, Python, Cognitive Neuroscience

In a paper published recently, researchers from MIT’s Computer Science & Artificial Intelligence Laboratory have proposed a method for learning a face from audio recordings of that person speaking.

In their architecture, researchers utilize facial recognition pre-trained models as well as a face decoder model which takes as an input a latent vector and outputs an image with a reconstruction.

Paper: https://lnkd.in/fiUBjqh

#machinelearning #deeplearning #speech2face

✴️ @AI_Python_EN

767 viewsFarzad, 19:21

About

Blog

Apps

Platform