A neural network that turns sketches into realistic photos.
The paper is titled «Semantic Image Synthesis with Spatially-Adaptive Normalization».
#CVPR19 oral paper on a new conditional normalization layer for semantic image synthesis #SPADE and its demo app #GauGAN
arXiv: https://arxiv.org/abs/1903.07291
Website: https://nvlabs.github.io/SPADE/
#GAN #CV #DL
arXiv.org
Semantic Image Synthesis with Spatially-Adaptive Normalization
We propose spatially-adaptive normalization, a simple but effective layer for synthesizing photorealistic images given an input semantic layout. Previous methods directly feed the semantic layout...
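To make the idea of a spatially-adaptive (conditional) normalization layer concrete, here is a minimal sketch in plain Python. It is not the authors' implementation. It assumes a single feature channel of a single image, so the normalization statistics are taken over that one map, and it replaces the learned network that produces the per-pixel scale and shift with a hypothetical per-class lookup table (`gamma_by_class`, `beta_by_class`). All names are illustrative.

```python
import math


def spade_channel(features, labels, gamma_by_class, beta_by_class, eps=1e-5):
    """Normalize one feature channel, then modulate each pixel by its label.

    features: 2D list of floats (H x W activations of one channel).
    labels:   2D list of ints (H x W semantic class per pixel).
    gamma_by_class / beta_by_class: hypothetical per-class scale and shift,
    standing in for the values a learned network would predict from the layout.
    """
    values = [v for row in features for v in row]
    mean = sum(values) / len(values)
    var = sum((v - mean) ** 2 for v in values) / len(values)
    std = math.sqrt(var + eps)

    out = []
    for feat_row, label_row in zip(features, labels):
        out.append([
            # The scale and shift depend on the pixel's class, so they vary
            # across the image instead of being one pair per channel.
            gamma_by_class[c] * (v - mean) / std + beta_by_class[c]
            for v, c in zip(feat_row, label_row)
        ])
    return out


if __name__ == "__main__":
    features = [[1.0, 3.0], [1.0, 3.0]]
    labels = [[0, 1], [0, 1]]  # left column is class 0, right column is class 1
    result = spade_channel(features, labels, {0: 1.0, 1: 2.0}, {0: 0.0, 1: 0.5})
    print(result)
```

The point of the sketch is that the semantic layout enters through the modulation parameters rather than being fed in only as the network input, which is the contrast the abstract draws with previous methods.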
A website that uses deep learning to colorize pictures.
Link: https://colourise.sg/#colorize
#DL #CV #demo
Using AI to generate recipes from food images
Facebook developed an image-to-recipe architecture.
Paper: https://research.fb.com/publications/inverse-cooking-recipe-generation-from-food-images/
Code: https://github.com/facebookresearch/inversecooking
Link: https://ai.facebook.com/blog/inverse-cooking/
#CV #DL
Meta Research
Inverse Cooking: Recipe Generation from Food Images - Meta Research
People enjoy food photography because they appreciate food. Behind each meal there is a story described in a complex recipe and, unfortunately, by simply looking at a food image we do not have access to its preparation process. Therefore, in this paper we…
Head animation from a single shot by the #SamsungAI team
Samsung researchers have released a model that can generate faces in new poses from just a single image/frame (one for the face, one for the pose). This is done by training a landmark model well in advance and one-shotting from it, using keypoints, adaptive instance normalization and GANs. The model performs no 3D face modelling!
arXiv: https://arxiv.org/abs/1905.08233v1
YouTube: https://www.youtube.com/watch?v=p1b5aiTrGzY
#GAN #CV #DL
arXiv.org
Few-Shot Adversarial Learning of Realistic Neural Talking Head Models
Several recent works have shown how highly realistic human head images can be obtained by training convolutional neural networks to generate them. In order to create a personalized talking head...
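For readers unfamiliar with adaptive instance normalization, mentioned in the post above, here is a minimal sketch of the general operation in plain Python. It is not the model from the paper. It handles one feature channel, and the `scale` and `shift` arguments are hypothetical values passed in by hand; how the actual model obtains them is not covered here.

```python
import math


def adain_channel(content, scale, shift, eps=1e-5):
    """Adaptive instance normalization for one channel of one image.

    content: 2D list of floats (H x W activations).
    scale, shift: target standard deviation and mean for the channel,
    supplied from outside the layer (hypothetical values in this sketch).
    """
    values = [v for row in content for v in row]
    mean = sum(values) / len(values)
    var = sum((v - mean) ** 2 for v in values) / len(values)
    std = math.sqrt(var + eps)
    # Strip the channel's own statistics, then impose the supplied ones.
    return [[scale * (v - mean) / std + shift for v in row] for row in content]


if __name__ == "__main__":
    content = [[0.0, 2.0], [4.0, 6.0]]
    styled = adain_channel(content, scale=0.5, shift=10.0)
    print(styled)
```

After the call, the channel's mean is approximately `shift` and its standard deviation is approximately `scale`, regardless of the statistics of the input.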
New deep learning framework from Facebook
Pythia is a deep learning framework that supports multitasking in the vision and language domain. Built on our open-source #PyTorch framework, the modular, plug-and-play design enables researchers to quickly build, reproduce, and benchmark AI models. #Pythia is designed for vision and language tasks, such as answering questions related to visual data and automatically generating image captions.
Link: https://code.fb.com/ai-research/pythia/
GitHub: https://github.com/facebookresearch/pythia
#Facebook #FacebookAI #DL #CV #multimodal
Engineering at Meta
Releasing Pythia for vision and language multimodal AI models
Pythia is a new open source deep learning framework that enables researchers to quickly build, reproduce, and benchmark AI models.