In a paper published recently, researchers from MIT’s Computer Science & Artificial Intelligence Laboratory have proposed a method for learning a face from audio recordings of that person speaking.
In their architecture, researchers utilize facial recognition pre-trained models as well as a face decoder model which takes as an input a latent vector and outputs an image with a reconstruction.
Paper: https://lnkd.in/fiUBjqh
#machinelearning #deeplearning #speech2face
✴️ @AI_Python_EN
In their architecture, researchers utilize facial recognition pre-trained models as well as a face decoder model which takes as an input a latent vector and outputs an image with a reconstruction.
Paper: https://lnkd.in/fiUBjqh
#machinelearning #deeplearning #speech2face
✴️ @AI_Python_EN