AI with Papers - Artificial Intelligence & Deep Learning
14.8K subscribers
95 photos
235 videos
11 files
1.26K links
All the AI with papers. Every day fresh updates on Deep Learning, Machine Learning, and Computer Vision (with Papers).

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/
Download Telegram
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿฅซ Plug 'n' play self-checkout ๐Ÿฅซ

๐Ÿ‘‰#Google's new shelf-checking #AI: recognizing billions of products, even purchased/moved

๐Ÿ˜ŽReview https://bit.ly/3J58hQe
๐Ÿ˜ŽNews https://cloud.google.com/blog/transform/nrf-2023-google-cloud-big-show-big-moment-hybrid-retail
This media is not supported in your browser
VIEW IN TELEGRAM
๐ŸŠ GLIGEN: Grounded T2I Diffusion ๐ŸŠ

๐Ÿ‘‰New (insane๐Ÿคฏ) SOTA in zero-shot layout-to-image generation. Demo available!

๐Ÿ˜ŽReview https://bit.ly/3J0rnHw
๐Ÿ˜ŽPaper arxiv.org/pdf/2301.07093.pdf
๐Ÿ˜ŽCode github.com/gligen/GLIGEN
๐Ÿ˜ŽDemo dev.hliu.cc/gligen_mirror2/
๐Ÿ˜ŽProject gligen.github.io/
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿงค Handy: Hands "Pipeline" by #Shopify ๐Ÿงค

๐Ÿ‘‰Shopify open-sourced Handy: hand gestures via #metaquest headsets -> into #Blender

๐Ÿ˜ŽReview https://bit.ly/3Wpkpi2
๐Ÿ˜ŽProject github.com/Shopify/handy
๐Ÿ˜ŽDemo diegomacario.github.io/Hands-In-The-Web/public/index.html
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ€ NeRF2NeRF: 3D registration on NeRFs ๐Ÿ€

๐Ÿ‘‰A novel 3D registration that operates directly on NeRFs

๐Ÿ˜ŽReview https://bit.ly/3ZRgz4a
๐Ÿ˜ŽPaper arxiv.org/pdf/2211.01600.pdf
๐Ÿ˜ŽCode github.com/nerf2nerf
๐Ÿ˜ŽProject https://nerf2nerf.github.io/
๐Ÿ˜ŽDataset https://drive.google.com/drive/folders/1jNpwAv1T1ntjIHUMJ1wABePA2Z8_nRRQ
This media is not supported in your browser
VIEW IN TELEGRAM
๐ŸŽจ RecolorNeRF: #3D Color Editing ๐ŸŽจ

๐Ÿ‘‰INSANE palette-based color editing of NeRF scenes

๐Ÿ˜ŽReview https://bit.ly/3GYjhfR
๐Ÿ˜ŽPaper arxiv.org/pdf/2301.07958.pdf
๐Ÿ˜ŽProject sites.google.com/view/recolornerf
This media is not supported in your browser
VIEW IN TELEGRAM
๐ŸžOmniObject3D: Realistic 3D Dataset ๐Ÿž

๐Ÿ‘‰Large-vocabulary #3D dataset for realistic perception, reconstruction & generation

๐Ÿ˜ŽReview https://bit.ly/3HlXyjp
๐Ÿ˜ŽPaper arxiv.org/pdf/2301.07525.pdf
๐Ÿ˜ŽProject omniobject3d.github.io/
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ„ BTS: Density Fields from Single View ๐Ÿ„

๐Ÿ‘‰Volumetric scene representation from a single image in challenging conditions

๐Ÿ˜ŽReview https://bit.ly/3wjHDvH
๐Ÿ˜ŽPaper arxiv.org/pdf/2301.07668.pdf
๐Ÿ˜ŽProject fwmb.github.io/bts/
This media is not supported in your browser
VIEW IN TELEGRAM
โšกStyleGAN-T: unlocking Power of GANsโšก

๐Ÿ‘‰#Nvidia unveils StyleGAN-T to regain competitiveness to GANs vs. Diffusive Models

๐Ÿ˜ŽReview https://bit.ly/3HtKxEA
๐Ÿ˜ŽPaper arxiv.org/pdf/2301.09515.pdf
๐Ÿ˜ŽProject sites.google.com/view/stylegan-t
๐Ÿ˜ŽCode github.com/autonomousvision/stylegan-t
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿช€ NeRF in Time, Space and Appearance ๐Ÿช€

๐Ÿ‘‰From Berkeley k-planes: a white-box model for radiance fields in arbitrary dimensions

๐Ÿ˜ŽReview https://bit.ly/3J8GiiS
๐Ÿ˜ŽPaper arxiv.org/pdf/2301.10241.pdf
๐Ÿ˜ŽProject sarafridov.github.io/K-Planes/
๐Ÿ˜ŽCode github.com/sarafridov/K-Planes
Media is too big
VIEW IN TELEGRAM
๐Ÿ”ฅ Neural Tracking via Weighted OF ๐Ÿ”ฅ

๐Ÿ‘‰The new SOTA in planar neural tracking is INSANE!

๐Ÿ˜ŽReview https://bit.ly/404gcDs
๐Ÿ˜ŽPaper arxiv.org/pdf/2301.10057.pdf
๐Ÿ˜ŽCode github.com/serycjon/WOFT
๐Ÿ˜ŽProject cmp.felk.cvut.cz/~serycjon/WOFT
This media is not supported in your browser
VIEW IN TELEGRAM
โ™ฟ Detecting Vulnerable Pedestrian โ™ฟ

๐Ÿ‘‰ BGSU opens a novel pedestrian dataset for vulnerable people

๐Ÿ˜ŽReview https://bit.ly/3JjVmu2
๐Ÿ˜ŽPaper arxiv.org/pdf/2212.06218.pdf
๐Ÿ˜ŽData github.com/devvansh1997/BGVP
๐Ÿง  SERENA: LLM for Mental Health Support ๐Ÿง 

๐Ÿ‘‰Interactive #AI (in "#chatgpt" style) designed for mental health counseling

๐Ÿ˜ŽReview https://bit.ly/3wtbW37
๐Ÿ˜ŽPaper arxiv.org/pdf/2301.09412.pdf
๐Ÿ˜ŽProject https://serena.chat/
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ• MAV3D: #3D Video from Text ๐Ÿ•

๐Ÿ‘‰#META unveils a novel #AI for generating #3D dynamic videos from text

๐Ÿ˜ŽReview https://bit.ly/3XN0zin
๐Ÿ˜ŽPaper arxiv.org/pdf/2301.11280.pdf
๐Ÿ˜ŽProject make-a-video3d.github.io
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ”ฅCutLER: Unsupervised Segmentation ๐Ÿ”ฅ

๐Ÿ‘‰Novel paper by #META on detection & instance segmentation without human annotations

๐Ÿ˜ŽReview https://bit.ly/3DlFiUG
๐Ÿ˜ŽPaper arxiv.org/pdf/2301.11320.pdf
๐Ÿ˜ŽCode github.com/facebookresearch/CutLER
๐Ÿ˜ŽProject people.eecs.berkeley.edu/~xdwang/projects/CutLER
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ˜ CLIP/GPT3-driven Affective Faces ๐Ÿ˜

๐Ÿ‘‰Columbia unveils a neural framework for facial expressions retrieval given the context of the speaker

๐Ÿ˜ŽReview https://bit.ly/3HERna0
๐Ÿ˜ŽPaper arxiv.org/pdf/2301.10939.pdf
๐Ÿ˜ŽProject realtalk.cs.columbia.edu
๐Ÿ˜ŽCode github.com/scottgeng00/realtalk
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿฆ Physics-inspired Computer Vision ๐Ÿฆ

๐Ÿ‘‰UCLA unveils PhyCV, the first Physics-inspired Computer Vision Library

๐Ÿ˜ŽReview https://bit.ly/3HEWozI
๐Ÿ˜ŽCode github.com/JalaliLabUCLA/phycv
๐Ÿ˜ŽProject photonics.ucla.edu/2022/05/12/jalali-lab-open-sources-phycv-a-physics-inspired-computer-vision-library/
This media is not supported in your browser
VIEW IN TELEGRAM
๐ŸŽทAudio-Visual Semantic Segmentation๐ŸŽท

๐Ÿ‘‰A novel problem in #AI: pixel-level segmentation of objects that produce sound in the image frame

๐Ÿ˜ŽReview https://bit.ly/3wFY6dw
๐Ÿ˜ŽPaper arxiv.org/pdf/2301.13190.pdf
๐Ÿ˜ŽProject opennlplab.github.io/AVSBench
๐Ÿ˜ŽCode github.com/OpenNLPLab/AVSBench
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿš› Text-driven Video Neural Editing ๐Ÿš›

๐Ÿ‘‰A novel text-guided video editing with both appearance/shape

๐Ÿ˜ŽReview https://bit.ly/3YcfMJO
๐Ÿ˜ŽPaper arxiv.org/pdf/2301.13173.pdf
๐Ÿ˜ŽProject text-video-edit.github.io/
This media is not supported in your browser
VIEW IN TELEGRAM
โญ Mono-STAR: Unified Track/3D โญ

๐Ÿ‘‰Real-time 3D unified framework for semantic fusion, tracking, non-rigid deformation, and topological changes

๐Ÿ˜ŽReview https://bit.ly/3Dxvxmx
๐Ÿ˜ŽPaper arxiv.org/pdf/2301.13244.pdf
๐Ÿ˜ŽProject github.com/changhaonan/Mono-STAR-demo