AI with Papers - Artificial Intelligence & Deep Learning
15K subscribers
95 photos
235 videos
11 files
1.26K links
All the AI with papers. Every day fresh updates on Deep Learning, Machine Learning, and Computer Vision (with Papers).

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/
Download Telegram
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ”΄ Geogram: geometric algos in C++ πŸ”΄

πŸ‘‰Novel open-source programming library with (research) geometric algorithms in C++

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Geometry Processing from #INRIA
βœ…30+ papers from SIGGRAPH, etc.
βœ…Grants: GOODSHAPE & VORPALINE
βœ…Code (mostly C++) under BSD 3

More: https://bit.ly/3mhS4L7
πŸ”₯6πŸ‘3❀1
🍏 Open Source Vision from #Apple 🍏

πŸ‘‰CVNets: open-source (not a joke) lib for neural vision.

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…PyTorch-based neural lib. for vision
βœ…Train 2βˆ’4Γ— longer w/ augmentations
βœ…Plug-and-play components for CV
βœ…Source code under a custom license

More: https://bit.ly/39d1dSj
πŸ‘9
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ‡πŸ»Neural Clips by #Nvidia: INSANE πŸ‡πŸ»

πŸ‘‰Neural generation with changes in camera viewpoint & content that arises over time 🀯

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Novel hierarchical generator architecture
βœ…Temp. receptive field + temporal embed.
βœ…Multi-res. with super-resolution network
βœ…SOTA in long clip with motion & changes
βœ…Code, data & models in August 2022 πŸ–οΈ

More: https://bit.ly/3zroWsC
🀯9πŸ‘Ž2❀1
This media is not supported in your browser
VIEW IN TELEGRAM
⚽ Zero to #Messi with #deeplearning ⚽

πŸ‘‰EA unveils a neural system to learn multiple soccer juggling skills 😍

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Learning difficult soccer juggling skills
βœ…Layer-wise mixture-of-experts architecture
βœ…Specialization arises naturally
βœ…Adaptive random walk training strategy

More: https://bit.ly/3mwRaL2
πŸ”₯7πŸ‘3
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ–οΈ HumanNeRF: source code is out! πŸ–οΈ

πŸ‘‰Pausing the video at any frame and rendering the subject from arbitrary views!

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Synthesizing photorealistic humans
βœ…Synthesizing details, ie. cloth & face
βœ…Volumetric canonical T-pose
βœ…Skeletal rigid/non-rigid decomposition

More: https://bit.ly/3NEkTNY
🀯17πŸ”₯5πŸ‘2
This media is not supported in your browser
VIEW IN TELEGRAM
πŸŽ’ EG3D: source code is out! πŸŽ’

πŸ‘‰#Nvidia just opened EG3D: real time multi-view faces w/ HQ #3D geometry!

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Tri-plane-based 3D GAN framework
βœ…Pose-correlated attribute (expression)
βœ…SOTA in uncond. 3D-aware synthesis
βœ…Source code & models NOW available!

More: https://bit.ly/3aOfHs0
πŸ”₯7🀯6πŸ‘4❀2
πŸ”₯One Millisecond Backbone. Fire!πŸ”₯

πŸ‘‰MobileOne by #Apple: efficient mobile backbone with inference <1 ms on #iPhone12!

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…75.9% top-1 accuracy on ImageNet
βœ…38Γ— faster than MobileFormer net
βœ…Classification, detection & segmentation
βœ…Source code & model soon available!

More: https://bit.ly/3tsT7f2
❀24πŸ‘2
This media is not supported in your browser
VIEW IN TELEGRAM
🧨 Scaling Transformers to GigaPixels!🧨

πŸ‘‰Novel ViT called Hierarchical Image Pyramid Transformer (HIPT) -> Scaling to GigaPixels!

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Gigapixel whole-slide imaging (WSI)
βœ…Leveraging natural hier. structure of WSI
βœ…Self-supervised Hi-Res representations
βœ…Source code and models available!

More: https://bit.ly/3xLuzkg
🀯16πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ‘—BodyMap: Hyper-Detailed HumansπŸ‘—

πŸ‘‰#META unveils 1st-ever dense continuous correspondence for clothed humans

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…1st-ever dense continuous corresp.
βœ…HQ fingers, hair, and clothes
βœ…Novel ViT-based architecture
βœ…SOTA on DensePose COCO

More: https://bit.ly/39nEPps
πŸ‘13❀2
🐹 NOAH just open-sourced! 🐹

πŸ‘‰A novel approach to find the optimal design of prompt modules through NAS algos.

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…NOAH from Neural prOmpt seArcH
βœ…Parameter-efficient β€œprompt modules”
βœ…Efficient NAS-based implementation
βœ…Better than transfer, few-shot & domain gen.

More: https://bit.ly/3MKfVhi
πŸ‘5πŸ‘2πŸ₯°1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ„πŸ»β€β™€οΈNeural Super-Resolution in MoviesπŸ„πŸ»β€β™€οΈ

πŸ‘‰Implicit neural representation to get arbitrary spatial resolution & FPS -> Super Resolution!

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Video as continuous video representation
βœ…Clips in arbitrary space/time resolution
βœ…OOD generalization in space-time
βœ…Source code and models available

More: https://bit.ly/3xsqccf
πŸ”₯6πŸ‘2
This media is not supported in your browser
VIEW IN TELEGRAM
🧠 Bias in #AI, explained simple 🧠

πŸ‘‰Asking DallE-Mini to help me to show what the BIAS in #AI is

π†πžπ§πžπ«πšπ­πžπ π’πšπ¦π©π₯𝐞𝐬:
βœ…Best eng.->men/Caucasians
βœ…Best doctors->men/Caucasians
βœ…Top CEOs->men/Caucasians
βœ…Chef, kitchen->men/Caucasians
βœ…Rich People->only Caucasians
βœ…Poor People->non-Caucasians
βœ…Italian engineers->back in 30's
βœ…Chinese eng.->infrastructures
βœ…Italian working->local market
βœ…Chinese working->vegetables
βœ…Men workers->constructions
βœ…Women workers->only office

More: https://bit.ly/3b0UFqd
πŸ‘13❀6😁4
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ¦• SAVi++: Segmentation by #Google πŸ¦•

πŸ‘‰Novel unsupervised object-centric #AI to predict depth signals from slot-based video representation

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Segmenting complex dynamic scenes
βœ…Static/Moving objects on naturalistic BG
βœ…LiDAR-SAVi: segmenting in the wild
βœ…Source code and model soon available!

More: https://bit.ly/3n3hywd
πŸ”₯7πŸ‘6πŸ₯°1
This media is not supported in your browser
VIEW IN TELEGRAM
βœ‹HaGRID : Half Million HandsπŸ‘‹

πŸ‘‰Russian Sberbank opens HaGRID, enormous dataset for HGR. "Peace" label is present πŸ”΅πŸŸ‘

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…552,992 samples, 18 classes
βœ…HD resolution in RGB format
βœ…BBox, gesture, leading hands
βœ…Dataset/models available

More: https://bit.ly/3n2cd8r
❀11πŸ€”2
πŸ”₯ #AIwithPapers: we are 2,900+! πŸ”₯

πŸ’™πŸ’› Cheers from "Black Metal Lady Gaga" plotted by DallE-mini πŸ’™πŸ’›

😈 Invite your friends -> https://t.me/AI_DeepLearning
😁8πŸ‘3❀2
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ…Segmentation with INSANE OcclusionsπŸ…

πŸ‘‰CMU unveils WALT: segmenting in severe occlusion scenarios. Performance over human.

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…WALT: Watch & Learn Time-lapse
βœ…4K/1080p cams on streets over a year
βœ…Performance over human-supervised
βœ…Object-occluder-occluded neural layers
βœ…Source code under MIT license

More: https://bit.ly/3n7pvjO
🀯14πŸ‘4πŸ”₯3
This media is not supported in your browser
VIEW IN TELEGRAM
🐠Largest Dataset for #autonomousdriving🐠

πŸ‘‰SHIFT: largest synthetic dataset for #selfdrivingcars. Shifts in cloud, rain, fog, time of day, vehicle & pedestrian density🀯

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…4,800+ clips, multi-view sensor suite
βœ…Semantic/instance, M/stereo depth
βœ…2D/3D object detection, MOT
βœ…Optical flow, point cloud registration
βœ…Visual-Odo, trajectory & human pose

More: https://bit.ly/3HJBUUT
🀯9πŸ‘5❀2
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ¦‘Big Egocentric Dataset by #Meta πŸ¦‘

πŸ‘‰Novel dataset to speed-up research on egocentric MR/AI

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…159 sequences, multiple sensors
βœ…Scenarios: cooking, exercising, etc.
βœ…β€˜Desktop Activities’ via multi-view mocap
βœ…Dataset available upon request

More: https://bit.ly/3QDccVW
πŸ”₯8πŸ‘3
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ¦‹Transf-Codebook HD-Face RestorationπŸ¦‹

πŸ‘‰S-Lab unveils CodeFormer: hyper-datailed face restoration from degraded clips

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Face restoration as a code prediction
βœ…Discrete CB prior in small proxy space
βœ…Controllable transformation for LQ->HQ
βœ…Robustness and global coherence
βœ…Code and models soon available

More: https://bit.ly/3QEa9B5
πŸ”₯13πŸ‘7❀1