AI with Papers - Artificial Intelligence & Deep Learning
15K subscribers
95 photos
235 videos
11 files
1.26K links
All the AI with papers. Every day fresh updates on Deep Learning, Machine Learning, and Computer Vision (with Papers).

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/
Download Telegram
This media is not supported in your browser
VIEW IN TELEGRAM
๐ŸฅMarker-free 6D-point tracking๐Ÿฅ

๐Ÿ‘‰Full position and rotation of skeletal joints, with only a RGB frame

๐‡๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โœ…Full 3-axis joint rotations
โœ…V-markers, emulating mocap
โœ…#3D from monocular with NN
โœ…Generalization, no retraining
โœ…SOTA rotation/position est.

More: https://bit.ly/34GdoF5
๐Ÿ”ฅ12๐Ÿคฏ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿงผ Synthetic dataset for #Retail ๐Ÿงผ

๐Ÿ‘‰A large-scale photorealistic synthetic dataset with annotations for semantic segmentation, instance segmentation, depth estimation, and object detection.

๐‡๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โœ…Dataset from Standard.AI
โœ…2,134 unique scenes
โœ…25k+ annotated samples
โœ…Introducing the "change detection"
โœ…Multi-view representation learning
โœ…NonCommercial-ShareAlike 4.0

More: https://bit.ly/3uXqubB
๐Ÿคฏ6๐Ÿฅฐ3๐Ÿ‘1๐Ÿ”ฅ1๐Ÿ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ŸŒˆ Graph Neural Nets Forecasting๐ŸŒˆ

๐Ÿ‘‰Data-driven approach for forecasting global weather using graph neural networks

๐‡๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โœ…Data-driven forecasting via GNNs
โœ…Model: 6.7M parameters, float32
โœ…6-hours forecast in 0.04 secs.
โœ…A 5-day forecast in 0.8 secs.

More: https://bit.ly/3LH4CXR
๐Ÿ‘4๐Ÿ‘2๐Ÿค”1
Media is too big
VIEW IN TELEGRAM
๐ŸฅซWatch Those Words!๐Ÿฅซ

๐Ÿ‘‰Berkeley unveils a novel approach to discover cheap-fake and visually persuasive deep-fakes

๐‡๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โœ…Regardless of falsification
โœ…Semantic person-specific
โœ…Word-conditioned analysis
โœ…Generalization across fakes

More: https://bit.ly/3oXWmcd
๐Ÿ‘5๐Ÿ˜ฑ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ”‹V2X-sim for #selfdriving is out!๐Ÿ”‹

๐Ÿ‘‰V2X: collaboration between a vehicle and any surrounding entity

๐‡๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โœ…Suitable for #selfdrivingcars
โœ…Rec. from road & vehicles
โœ…Multi-streams/perception
โœ…Detection, tracking, & segmentation
โœ…RGB, depth, semantic, BEV & LiDAR

More: https://bit.ly/3H6veOI
๐Ÿ”ฅ6๐Ÿคฉ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ŸInfinite Synthetic dataset for Fitness๐Ÿ

๐Ÿ‘‰Opensource synthetic images for fitness, single/multi-person, and realistic variation in lighting, camera angles, and occlusions

๐‡๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โœ…60k images, 1-5 avatars
โœ…15 categories, 21 variations
โœ…Blender and ray-tracing
โœ…SMPL-X + facial expression
โœ…Cloth/skin tone sampled
โœ…147 4K HDRI panoramas
โœ…Creative Commons 4.0

More: https://bit.ly/33B1R9q
๐Ÿคฉ5โค1๐Ÿ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
โ™Š DITTO: Digital Twins from Interaction โ™Š

๐Ÿ‘‰Digitizing objects for #metaverse through interactive perception

๐‡๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โœ…DIgital Twin of arTiculated Objects
โœ…Geometry & kinematic articulation
โœ…Articulation & 3D via perception
โœ…Source code under MIT License

More:https://bit.ly/3LMazCV
๐Ÿ”ฅ5โค2๐Ÿ‘1๐Ÿคฏ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿค– Robotic Telekinesis from Youtube ๐Ÿค–

๐Ÿ‘‰CMU unveils a Robot that observes humans and imitates their actions in real-time

๐‡๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โœ…Enabling robo-hand teleoperation
โœ…Suitable for untrained operator
โœ…Single uncalibrated RGB camera
โœ…Leveraging unlabeled #youtube
โœ…No active fine-tuning or setup
โœ…No collision via Adv-Training

More: https://bit.ly/3H7zUnh
๐Ÿ”ฅ3๐Ÿคฏ2๐Ÿ‘1๐Ÿ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ’„DIGAN: #AI for video generation๐Ÿ’„

๐Ÿ‘‰A novel INR-based generative adversarial network for video generation

๐‡๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โœ…Dynamics-aware generator
โœ…INR-based clip generator
โœ…Manipulating space/time
โœ…Identifying unnatural motion

More: https://bit.ly/3H6sHE4
๐Ÿ”ฅ4๐Ÿคฏ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿฆ„FILM Neural Frame Interpolation๐Ÿฆ„

๐Ÿ‘‰Frame interpolation that synthesizes multiple intermediate frames from two input images with large in-between motion

๐‡๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โœ…Single unified network
โœ…High quality output
โœ…SOTA on the Xiph
โœ…Apache License 2.0

More: https://bit.ly/3pl4ZxH
๐Ÿ”ฅ5๐Ÿ‘2๐Ÿฅฐ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ”ˆNeural Maintenance via listening๐Ÿ”ˆ

๐Ÿ‘‰Novel neural-method to detect whether a machine is "healthy" or requires maintenance

๐‡๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โœ…Defects at an early stage
โœ…FDWT, fast discrete wavelet
โœ…Learnable wavelet/denoising
โœ…Unsupervised learnable FDWT
โœ…The new SOTA in PM

More: https://bit.ly/3hiKWeX
๐Ÿคฏ6๐Ÿค”1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ŸŸฆ๐ŸŸจ StyleGAN on Internet pics ๐ŸŸฆ๐ŸŸจ

๐Ÿ‘‰StyleGAN on raw uncurated images collected from Internet

๐‡๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โœ…Outliers & multi-modal
โœ…Self-distillation approach
โœ…Self-filtering of outliers
โœ…Perceptual clustering

More: https://bit.ly/33Z1d5H
โค2๐Ÿ‘1๐Ÿ”ฅ1๐Ÿคฏ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ŸฆœThe new SOTA for Unsupervised ๐Ÿฆœ

๐Ÿ‘‰Self-supervised transformer to discover objects in images

๐‡๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โœ…Visual tokens as nodes in graph
โœ…Edges as connectivity score
โœ…The second smallest eV = fg
โœ…Suitable for unsupervised saliency
โœ…Weakly supervised obj. detection
โœ…Code under MIT License


More: https://bit.ly/3sqbFg3
๐Ÿ‘4๐Ÿ”ฅ3๐Ÿคฏ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿฅฆ GAN-generated CryptoPunks ๐Ÿฅฆ

๐Ÿ‘‰A simple (and funny) SN-GAN to generate cryptopunks

๐‡๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โœ…Spectral normalization (2018)
โœ…Easy to incorporate into training
โœ…A project by Teddy Koker ๐ŸŽฉ

More: https://bit.ly/35C1rQI
โค3๐Ÿ˜3๐Ÿ‘1๐Ÿ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ŸคชSEER: self-AI from BILLIONS pic๐Ÿคช

๐Ÿ‘‰META + INRIA trained models on billions of random images without any pre-processing or assumptions

๐‡๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โœ…Self-supervised on pics from web
โœ…Discovering properties in datasets
โœ…More fair, less biased & less harmful
โœ…Better OOD generalization
โœ…Source code available!

More: https://bit.ly/3vy69dd
๐Ÿ”ฅ4๐Ÿ‘3๐Ÿคฏ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ŸฒA novel AI-controllable synthesis๐Ÿฒ

๐Ÿ‘‰Modeling local semantic parts separately and synthesizing images in a compositional way

๐‡๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โœ…Structure & texture locally controlled
โœ…Disentanglement between areas
โœ…Fine-grained editing of images
โœ…Extendible via transfer learning
โœ…Just accepted to #CVPR2022

More: https://bit.ly/3IBgkBy
๐Ÿ˜ฑ3๐Ÿคฏ2โค1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿฅฃ #AI-Generation with Dream Fields ๐Ÿฅฃ

๐Ÿ‘‰Neural rendering with multi-modal image and text representations

๐‡๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โœ…Aligned image & text models
โœ…3D from natural language
โœ…No additional data
โœ…D.F. neural-scene

More: https://bit.ly/3Mhwm5D
๐Ÿ‘10๐Ÿ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ŸŸช Mip-NeRF 360 for unbounded scenes ๐ŸŸช

๐Ÿ‘‰An extension of NeRF to overcome the challenges presented by unbounded scenes

๐‡๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โœ…Realistic synthesized views
โœ…Intricate/unbounded scenes
โœ…Detailed depth maps
โœ…Mean-squared error -54%
โœ…No code provided ๐Ÿ˜ฅ

More: https://bit.ly/36ZxsD4
๐Ÿคฏ4โค1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ“ PINA: personal Neural Avatar ๐Ÿ“

๐Ÿ‘‰A novel method to acquire neural avatars from RGB-D videos

๐‡๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โœ…A virtual copy of themselves
โœ…Realistic clothing deformations
โœ…Shape & non-rigid deformation
โœ…Avatars from RGB-D sequences
โœ…Creative Commons Zero v1.0

More: https://bit.ly/3HAtRIh
๐Ÿ‘4โค1๐Ÿ‘1๐Ÿ˜1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿฆ EfficientVIS: new SOTA for VIS ๐Ÿฆ

๐Ÿ‘‰Simultaneous classification, segmentation, and tracking multiple object instances in videos

๐‡๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โœ…Efficient and fully end-to-end
โœ…Iterative query-video interaction
โœ…First RoI-wise clip-level RT-VIS
โœ…Requires 15ร— fewer epochs

More: https://bit.ly/3KfqurN
๐Ÿ‘10๐Ÿ”ฅ3๐Ÿ‘Ž1๐Ÿคฏ1