AI with Papers - Artificial Intelligence & Deep Learning

🪔 AvatarCLIP: Text-Driven Avatar 🪔

👉Zero-shot text-driven for #3D avatar in #metaverse

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅First text-driven synthesis
✅Shape, texture, and motion
✅Animation-ready, HQ texture/geometry
✅Zero-shot text-guided ref-based motion
✅Code and model under MIT license

More: https://bit.ly/3LjTWgB

🔥4👍2🤯2❤1

2.77K viewsedited 14:00

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🔥#AIwithPapers: we are 2,500!🔥

💙💛Only 2 Billion papers remaining on arXiv. The more we are, the faster we read💙💛

😈 Invite your friends -> https://t.me/AI_DeepLearning

🔥9❤4👍2🤔2👏1

2.5K viewsedited 15:14

AI with Papers - Artificial Intelligence & Deep Learning

AI with Papers - Artificial Intelligence & Deep Learning pinned a GIF

15:14

AI with Papers - Artificial Intelligence & Deep Learning

💥Podcasting AI & CV💥

👉🏼For people fluent in Italian: 1 hour podcast in which I talk about AI, CV, Startup and more (included this wonderful project).

More: https://bit.ly/38DtBwB

👏6❤3👍1

2.51K viewsedited 14:02

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🔥Inpainting: new SOTA! INSANE🔥

👉Novel two-stream approach: inpainting at the next level!

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅High-freq locally, low-freq globally
✅Local to global -> error correction
✅44% / 26% improvements FID/scores
✅Source code, more clips available

More: https://bit.ly/3ltIX9R

👍8🤯3🔥1🥰1

2.55K viewsedited 07:21

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🔥Super-Human Crossword Solver🔥

👉Solving crosswords outperforming best humans

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Crossword solving based on NNs
✅Q&A, structured decoding, local search
✅Wide domains with perfect accuracy
✅Large question-answer dataset

More: https://bit.ly/3a3zzqQ

🔥4🤯3👏2👍1

2.62K views12:07

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🥸Imagen: far beyond DALL·E 2🥸

👉#Google: unprecedented photorealism and deep level of language understanding

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Dynamic thresh diffusion sampling
✅Efficient U-Net, efficient++ variant
✅DrawBench, new text-to-image
✅The new SOTA, COCO FID of 7.27

More: https://bit.ly/3lVtkbz

🔥9🤯6👍1

2.48K viewsedited 11:41

AI with Papers - Artificial Intelligence & Deep Learning

0:08

This media is not supported in your browser

VIEW IN TELEGRAM

🪤Tracking over SOTA detectors🪤

👉Lightweight Python lib for real-time 2D object tracking 💥

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Layer of tracking over SOTA detectors
✅Suitable for complex video processing
✅Source code under BSD 3-Clause
✅Maintained by Tryolabs team

More: https://bit.ly/3wKtGqg

👍7🔥3🤩3

2.81K viewsedited 15:49

AI with Papers - Artificial Intelligence & Deep Learning

0:11

This media is not supported in your browser

VIEW IN TELEGRAM

🥷🏿 FCA: #3D Neural Camouflage 🥷🏿

👉#3D full-camouflage adversarial patch to fool neural detectors

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Attack by diff-neural render
✅E2E physical adversarial attack
✅Envs, vehicles & detectors
✅Source code available!

More: https://bit.ly/38kKyfa

👍5🔥3🤯2👏1

2.83K viewsedited 12:06

AI with Papers - Artificial Intelligence & Deep Learning

0:18

Media is too big

VIEW IN TELEGRAM

🍋 One-Shot Object Pose 🍋

👉A novel one-shot object pose estimator

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Visual localization pipeline for object pose
✅Handling novel objects without CAD model
✅Novel graph attention for 2D-3D matching
✅Large dataset for one-shot object pose

More: https://bit.ly/3MTogjJ

🔥11❤4👍2🤯2

2.87K viewsedited 15:42

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

☄️STEVE: Slot-TransformEr for VidEos☄️

👉STEVE: unsupervised model for object-centric learning in videos

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Adoption of a slot decoder (SLATE)
✅SLATE with slot-level recurrence model
✅Complex and naturalistic videos
✅Significantly outperforms previous SOTA

More: https://bit.ly/3PNxxM3

🔥7👍1🤯1

2.43K viewsedited 15:38

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🦔 CogVideo: insane text-to-clip 🦔

👉CogVideo: 9B-parameters world's first large scale open-source text-to-video 😵

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Largest open-source T2C transformer
✅Finetuning of text-to-image model
✅Multi-frame-rate hierarchical training
✅From pretrained model CogView2

More: https://bit.ly/3Gzfl4n

🔥9👍6

2.48K views06:35

AI with Papers - Artificial Intelligence & Deep Learning

0:05

This media is not supported in your browser

VIEW IN TELEGRAM

🦄Time-Aware Neural Voxels🦄

👉TiNeuVox: "NeRF" with time-aware voxel features 😵

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Dynamic scene w/ optimizable structure
✅Temporal information in radiance net
✅Small/large motion w/ single-res of feats
✅192× faster than previous Hyper-NeRF

More: https://bit.ly/3wR4O08

👍11🔥2🤯1

2.46K viewsedited 15:20

AI with Papers - Artificial Intelligence & Deep Learning

🫐Neural Anomaly Detection by AWS🫐

👉Ultra-competitive inference and SOTA for both detection and localization

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Locally aggregated, mid-level feats patch
✅Maximizing nominal information at test time
✅Reducing biases towards ImageNet classes
✅Image-level anomaly AUROC of up to 99.6%

More: https://bit.ly/3t7Ndjg

🔥7🤯3👍2

2.53K views06:46

AI with Papers - Artificial Intelligence & Deep Learning

0:06

This media is not supported in your browser

VIEW IN TELEGRAM

🛹 Project Skate from Google #AI 🛹

👉#AI tool to analyze the skateboarder's tricks in real-time

More: https://bit.ly/3zbQS3M

🔥15🤩3👏1

2.45K views15:07

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🧬Neural Text2Human Generation🧬

👉Text-driven neural human generation

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Full-body from a given human pose
✅Hierarchical texture-aware codebook
✅DeepFashion -> 44k Hi-Res images
✅Code and models available!

More: https://bit.ly/3Mdnpt0

🔥15👍1

2.75K views15:54

AI with Papers - Artificial Intelligence & Deep Learning

🧨EfficientFormers: 1.6ms inference 🧨

👉Transformers fast as MobileNet? Snap shows that on #iphone!

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Low latency on mobile, high performance!
✅Revisiting the design of ViT through latency
✅New dimension-consistent design paradigm
✅EfficientFormers: a new ViT for mobile!

More: https://bit.ly/3MdgW15

🔥16👍1🤯1

2.51K views10:35

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🐢 Transformer-Based Sens-Fusion 🐢

👉Updating TransFuser (CVPR21): image + LiDAR representations with self-attention

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Existing approach can't handle traffic 😢
✅Novel multi-modal fusion transformer
✅The new SOTA in driving performance
✅Reducing avg collisions per KM by 48%
✅Insights on current limitations of E2E

More: https://bit.ly/391dmd6

👍11🔥2

2.5K views07:07

AI with Papers - Artificial Intelligence & Deep Learning

🧘🏻‍♂️YogNet: neural yoga assistant🧘🏻‍♂️

👉Multi-person yoga neural expert for 20 asanas

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅CNNs & reg.LSTMs + 3D-CNNs
✅Multi-person asanas in real-time
✅YAR: dataset for yoga & posture
✅1206 videos, 2D RGB camera

More: https://bit.ly/3NncVbE

❤13👍1

2.97K viewsedited 08:41

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🔴 Geogram: geometric algos in C++ 🔴

👉Novel open-source programming library with (research) geometric algorithms in C++

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Geometry Processing from #INRIA
✅30+ papers from SIGGRAPH, etc.
✅Grants: GOODSHAPE & VORPALINE
✅Code (mostly C++) under BSD 3

More: https://bit.ly/3mhS4L7

🔥6👍3❤1

2.58K views09:35

About

Blog

Apps

Platform