This media is not supported in your browser
VIEW IN TELEGRAM
πͺ° EasyMocap: Open Neural Mocap πͺ°
πEasyMocap: open-source marker-less mocap with novel view synthesis from RGB
ππ’π π‘π₯π’π π‘ππ¬ (of last paper added):
β Editable free-viewpoint video
β Layered neural representation of humans
β Multi-pax -> instances, weakly-supervised
β HQ neural representation of the humans
β Addressing camera error by human poses
More: https://bit.ly/3p6lUDO
πEasyMocap: open-source marker-less mocap with novel view synthesis from RGB
ππ’π π‘π₯π’π π‘ππ¬ (of last paper added):
β Editable free-viewpoint video
β Layered neural representation of humans
β Multi-pax -> instances, weakly-supervised
β HQ neural representation of the humans
β Addressing camera error by human poses
More: https://bit.ly/3p6lUDO
π€―6π3π3β€2
This media is not supported in your browser
VIEW IN TELEGRAM
π° Texturify: Neural Textures Generator π°
πA step towards automated content creation. HQ textures directly on surface of 3D object
ππ’π π‘π₯π’π π‘ππ¬:
β TUM + Max Planck + Apple π
β Realistic, HQ textures from 2D pics
β 3D shape geometry, no 3D supervision
β 3D-aware surface-based generation net
More: https://bit.ly/3BW7UUU
πA step towards automated content creation. HQ textures directly on surface of 3D object
ππ’π π‘π₯π’π π‘ππ¬:
β TUM + Max Planck + Apple π
β Realistic, HQ textures from 2D pics
β 3D shape geometry, no 3D supervision
β 3D-aware surface-based generation net
More: https://bit.ly/3BW7UUU
π8
This media is not supported in your browser
VIEW IN TELEGRAM
π¨ Scaling Neural Indoor Scene π¨
πNeural scene rendering for indoor: scalable in both training/rendering
ππ’π π‘π₯π’π π‘ππ¬:
β Neural scene rendering for indoor
β #3D into tiles with MLPs to scale up
β Parallel training of tile-based MLPs
β View-indep. components (via surf-MLP)
More: https://bit.ly/3bH94IX
πNeural scene rendering for indoor: scalable in both training/rendering
ππ’π π‘π₯π’π π‘ππ¬:
β Neural scene rendering for indoor
β #3D into tiles with MLPs to scale up
β Parallel training of tile-based MLPs
β View-indep. components (via surf-MLP)
More: https://bit.ly/3bH94IX
π₯2π1
AI with Papers - Artificial Intelligence & Deep Learning
π₯ MobileNeRF is out -> Pure Fire! π₯ πMobileNeRF is out: the mobile evolution of NeRF via textured polygons. ππ’π π‘π₯π’π π‘ππ¬: β
Same quality, 10x faster than SNeRG β
Memory-- by storing surface textures β
Integrated GPUs: less memory/power β
Suitable for browser &β¦
π₯π₯UPDATEπ₯π₯
Code Released: https://github.com/google-research/jax3d/tree/main/jax3d/projects/mobilenerf
Code Released: https://github.com/google-research/jax3d/tree/main/jax3d/projects/mobilenerf
β€6π₯°1
This media is not supported in your browser
VIEW IN TELEGRAM
π₯Stable Diffusion on clips. INSANEπ₯
πThe most advanced latent text-to-image DM. #RunwayML just announced is going to apply it on clips
ππ’π π‘π₯π’π π‘ππ¬:
β Latent DM on 512p from LAION-5B
β Frozen CLIP ViT-L/14 text encoder
β Lightweight, runs on a 10GB-GPU
β Checkpoints only for research
More: https://bit.ly/3QfkRx3
πThe most advanced latent text-to-image DM. #RunwayML just announced is going to apply it on clips
ππ’π π‘π₯π’π π‘ππ¬:
β Latent DM on 512p from LAION-5B
β Frozen CLIP ViT-L/14 text encoder
β Lightweight, runs on a 10GB-GPU
β Checkpoints only for research
More: https://bit.ly/3QfkRx3
π€―13π±12π2π₯1
This media is not supported in your browser
VIEW IN TELEGRAM
π Implicitron: "democratizing" NeRFπ
π#META opens a novel framework for NeRF-world in #PyTorch3D #pytorch
ππ’π π‘π₯π’π π‘ππ¬:
β Implicit representations (NeRF) / Render
β RaySampler/PointSampler & more
β NeRFβs MLP, IDRβs FF, SRN, etc.
β Renderers: MEAR, LSTMRenderer, etc.
More: https://bit.ly/3bPyJPJ
π#META opens a novel framework for NeRF-world in #PyTorch3D #pytorch
ππ’π π‘π₯π’π π‘ππ¬:
β Implicit representations (NeRF) / Render
β RaySampler/PointSampler & more
β NeRFβs MLP, IDRβs FF, SRN, etc.
β Renderers: MEAR, LSTMRenderer, etc.
More: https://bit.ly/3bPyJPJ
π₯4π€―2
This media is not supported in your browser
VIEW IN TELEGRAM
π§° FGT: flow-guided inpainting π§°
π#Microsoft (+USTC) unveils FGT: flow-guided ViT for video inpainting π€―
ππ’π π‘π₯π’π π‘ππ¬:
β OF into transformer for attention++
β Flow completion net w/ local feats.
β Dual perspective spatial MHSA
β Local attention with global content
More: https://bit.ly/3pk5J5S
π#Microsoft (+USTC) unveils FGT: flow-guided ViT for video inpainting π€―
ππ’π π‘π₯π’π π‘ππ¬:
β OF into transformer for attention++
β Flow completion net w/ local feats.
β Dual perspective spatial MHSA
β Local attention with global content
More: https://bit.ly/3pk5J5S
β€11π5
This media is not supported in your browser
VIEW IN TELEGRAM
πNeuMan: Human NeRF in the wildπ
π#Apple opens a novel human pose/view from just a single in-the-wild video
ππ’π π‘π₯π’π π‘ππ¬:
β No extra devices/annotations
β Both Human (novel poses) + Scene
β E2E SMPL optimization + error-corr.
β Applications such as "telegathering"
More: https://bit.ly/3K4iTO6
π#Apple opens a novel human pose/view from just a single in-the-wild video
ππ’π π‘π₯π’π π‘ππ¬:
β No extra devices/annotations
β Both Human (novel poses) + Scene
β E2E SMPL optimization + error-corr.
β Applications such as "telegathering"
More: https://bit.ly/3K4iTO6
π15
This media is not supported in your browser
VIEW IN TELEGRAM
π₯ CLIP-based Neural Style Transfer π₯
πFrom #Nvidia a novel method for transferring the style to a #3D object
ππ’π π‘π₯π’π π‘ππ¬:
β Texture style for 3D by CLIP-ResNet50
β Nearest-neighbor feature matching loss
β CLIP-based loss extraction of textures
β NNFM for multiple style pics / control
β No source code or models available π
More: https://bit.ly/3c32dK5
πFrom #Nvidia a novel method for transferring the style to a #3D object
ππ’π π‘π₯π’π π‘ππ¬:
β Texture style for 3D by CLIP-ResNet50
β Nearest-neighbor feature matching loss
β CLIP-based loss extraction of textures
β NNFM for multiple style pics / control
β No source code or models available π
More: https://bit.ly/3c32dK5
π€―12π₯5β€4π2π±2π1
This media is not supported in your browser
VIEW IN TELEGRAM
π₯ KeypointNeRF: code is out! π₯
πKeypointNeRF by #Meta: "NeRF"-avatars
ππ’π π‘π₯π’π π‘ππ¬:
β Generalizable NeRF for virtual avatar
β Sparse 3D keypoints for SOTA avatar
β Novel unseen subjects from 2/3 views
β "iPhone" captures for #metaverse
More: https://bit.ly/3pyl17e
πKeypointNeRF by #Meta: "NeRF"-avatars
ππ’π π‘π₯π’π π‘ππ¬:
β Generalizable NeRF for virtual avatar
β Sparse 3D keypoints for SOTA avatar
β Novel unseen subjects from 2/3 views
β "iPhone" captures for #metaverse
More: https://bit.ly/3pyl17e
π₯8π3π1
This media is not supported in your browser
VIEW IN TELEGRAM
π₯Massive GTA-V human datasetπ₯
πGTA-Human: outperforming SOTA with a purely synthetic training.
ππ’π π‘π₯π’π π‘ππ¬:
β 600+ gender, age, ethnicity & clothing
β 20,000+ clips, variety of human activities
β 6 categories of location, different BGs
β Occlusions, lighting, and weather system
More: https://bit.ly/3wpZyRD
πGTA-Human: outperforming SOTA with a purely synthetic training.
ππ’π π‘π₯π’π π‘ππ¬:
β 600+ gender, age, ethnicity & clothing
β 20,000+ clips, variety of human activities
β 6 categories of location, different BGs
β Occlusions, lighting, and weather system
More: https://bit.ly/3wpZyRD
π₯14β€2π1
This media is not supported in your browser
VIEW IN TELEGRAM
πDeepBillboards: old-school trick for #VRπ
πDeepBillboards models a 3D object implicitly using neural net on the userβs viewing direction
ππ’π π‘π₯π’π π‘ππ¬:
β #Google Brain +Tsukuba + Tokyo
β Rendering at higher res., improving #VR
β NeRF into interactive VR with accuracy++
β NeRF (or any others) directly in #Unity
More: https://bit.ly/3CsTQ5y
πDeepBillboards models a 3D object implicitly using neural net on the userβs viewing direction
ππ’π π‘π₯π’π π‘ππ¬:
β #Google Brain +Tsukuba + Tokyo
β Rendering at higher res., improving #VR
β NeRF into interactive VR with accuracy++
β NeRF (or any others) directly in #Unity
More: https://bit.ly/3CsTQ5y
π6π1
This media is not supported in your browser
VIEW IN TELEGRAM
πRelPose: Probabilistic Relative Poseπ
πA novel method for core component in #SLAM / NeRF-powered apps.
ππ’π π‘π₯π’π π‘ππ¬:
β Core component of SfM/SLAM
β Pre-processing for neural (NeRF)
β Energy-based over rotations
β SOTA on both seen/unseen objects
More: https://bit.ly/3T60TXw
πA novel method for core component in #SLAM / NeRF-powered apps.
ππ’π π‘π₯π’π π‘ππ¬:
β Core component of SfM/SLAM
β Pre-processing for neural (NeRF)
β Energy-based over rotations
β SOTA on both seen/unseen objects
More: https://bit.ly/3T60TXw
π₯12π2π2β€1
This media is not supported in your browser
VIEW IN TELEGRAM
π #StableDiffusion archive is outπ
πLexica art is a Stable Diffusion prompt search engine. Real-time, countless #stablediffusion results for everyone. I had fun with the GOAT, #Maradona.
ππ’π π‘π₯π’π π‘ππ¬:
β Maradona scoring against a capybara...
β A poster of space jam with Maradona...
β Painting of Maradona very detailed...
β Painting of Maradona in heaven...
More: https://bit.ly/3PTXHLH
πLexica art is a Stable Diffusion prompt search engine. Real-time, countless #stablediffusion results for everyone. I had fun with the GOAT, #Maradona.
ππ’π π‘π₯π’π π‘ππ¬:
β Maradona scoring against a capybara...
β A poster of space jam with Maradona...
β Painting of Maradona very detailed...
β Painting of Maradona in heaven...
More: https://bit.ly/3PTXHLH
β€9π5
This media is not supported in your browser
VIEW IN TELEGRAM
π¦PANDORA: Polarized Neural Decompositionπ¦
πCIL lab unveils PANDORA: polarimetric inverse rendering approach via INR
ππ’π π‘π₯π’π π‘ππ¬:
β Geometry, reflectance & illumination
β normal, signed distance field, mesh
β Diffuse-specular separation
β Hi-fI incident illumination
More https://bit.ly/3CzGp3F
πCIL lab unveils PANDORA: polarimetric inverse rendering approach via INR
ππ’π π‘π₯π’π π‘ππ¬:
β Geometry, reflectance & illumination
β normal, signed distance field, mesh
β Diffuse-specular separation
β Hi-fI incident illumination
More https://bit.ly/3CzGp3F
π3π₯3
This media is not supported in your browser
VIEW IN TELEGRAM
π₯IDOL (#CVPR2022 winner): code is out!π₯
πIDOL for VIS: outperforming all online/offline methods, the new SOTA!
ππ’π π‘π₯π’π π‘ππ¬:
β Online usually inferior by >10AP
β Online based on contrast-learning
β Discriminative++ instance embeddings
β Full exploiting history for stability
More https://bit.ly/3dXCDXw
πIDOL for VIS: outperforming all online/offline methods, the new SOTA!
ππ’π π‘π₯π’π π‘ππ¬:
β Online usually inferior by >10AP
β Online based on contrast-learning
β Discriminative++ instance embeddings
β Full exploiting history for stability
More https://bit.ly/3dXCDXw
π€―16π1
This media is not supported in your browser
VIEW IN TELEGRAM
π₯ #AIwithPapers: we are 4,000+! π₯
ππLot of people joined, and we talked about #StableDiffusion only twice! Can't believe it.ππ
π Invite your friends -> https://t.me/AI_DeepLearning
ππLot of people joined, and we talked about #StableDiffusion only twice! Can't believe it.ππ
π Invite your friends -> https://t.me/AI_DeepLearning
π₯10
This media is not supported in your browser
VIEW IN TELEGRAM
π΅ Deep Saliency: driving the attention π΅
πGoogle unveils a family of operators to "drive" human saliency
ππ’π π‘π₯π’π π‘ππ¬:
β Editing image to drive Saliency
β Transforms to hide distractors
β Warping operator for distractor
β GAN-op for less-saliency altern.
More: https://bit.ly/3KoQQc2
πGoogle unveils a family of operators to "drive" human saliency
ππ’π π‘π₯π’π π‘ππ¬:
β Editing image to drive Saliency
β Transforms to hide distractors
β Warping operator for distractor
β GAN-op for less-saliency altern.
More: https://bit.ly/3KoQQc2
π9π€©4