AI with Papers - Artificial Intelligence & Deep Learning
15K subscribers
95 photos
235 videos
11 files
1.26K links
All the AI with papers. Every day fresh updates on Deep Learning, Machine Learning, and Computer Vision (with Papers).

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/
Download Telegram
This media is not supported in your browser
VIEW IN TELEGRAM
đŸ’Ĩ🚗 CrashCar101: Generative Damaged CarsđŸ’Ĩ🚗

👉 CrashCar101: procedural generation pipeline that damages 3D car models to obtain synthetic damaged cars paired with pixel-accurate annotations

👉 Review https://t.ly/pITHm
👉 Paper https://lnkd.in/dzp6q3T5
👉 Project https://lnkd.in/daRXg73N
❤7👍1đŸ”Ĩ1đŸ¤¯1
This media is not supported in your browser
VIEW IN TELEGRAM
🐓 Emu: image edit / video gen. 🐓

👉#Meta the new SOTA in text-to-video generation and instruction-based image editing

👉 Review https://t.ly/PMTBc
👉 Paper (images): https://lnkd.in/eVadH-QS
👉 Project https://lnkd.in/eG8eWUJY
👉 Paper (video): https://lnkd.in/eVadH-QS
👉 Project https://lnkd.in/eu6Zu6gp
đŸ”Ĩ8đŸ¤¯2👍1😱1
This media is not supported in your browser
VIEW IN TELEGRAM
đŸŒĻī¸ 100+ GPU weather training đŸŒĻī¸

👉#NVIDIA just released Makani: massively parallel training of weather and climate prediction models on 100+ GPUs and to enable the development of the next generation of weather and climate models.

👉 Review https://t.ly/jageY
👉 Code https://lnkd.in/d4NFZ5xi
❤23đŸ¤¯7⚡1😱1
This media is not supported in your browser
VIEW IN TELEGRAM
đŸŋ Segmenting anything in 3D đŸŋ

👉 OmniSeg3D: omniversal segmentation method aims for segmenting anything in 3D all at once.

👉Review https://t.ly/Q0jrK
👉Paper https://lnkd.in/d9qpxXY9
👉Project https://oceanying.github.io/OmniSeg3D
👉Code (soon)
❤17đŸ”Ĩ7👍4đŸ¤¯2😱2👏1
This media is not supported in your browser
VIEW IN TELEGRAM
đŸ”ŗ SOTA Semantic Boundary đŸ”ŗ

👉Mobile-Seed, a lightweight, dual-task framework tailored for simultaneous semantic segmentation and boundary detection.

👉Review https://t.ly/GsArZ
👉Project whu-usi3dv.github.io/Mobile-Seed/
👉Paper arxiv.org/pdf/2311.12651.pdf
👉Code github.com/WHU-USI3DV/Mobile-Seed
❤5👍1đŸ”Ĩ1đŸ¤¯1😱1
đŸ§ŋ SOTA Model-aware 3D Gaze đŸ§ŋ

👉 Novel hybrid approach that outputs 3D eye model, semantic segmentation, cam-intrinsic & pose. Only 2D eye semantic segmentation masks and fewer 3D gaze labels for supervision.

👉Review https://t.ly/AdKRf
👉Paper https://lnkd.in/dWb9GHPh
👉Code https://lnkd.in/dfAWFVky
đŸ”Ĩ11👍3đŸ¤¯3❤1😱1
This media is not supported in your browser
VIEW IN TELEGRAM
đŸĻ–T-Rex: Counting by Visual PromptingđŸĻ–

👉T-Rex: a novel interactive object counting model to detect and count any objects. Impressive results!

👉Review https://t.ly/4SfFX
👉Project https://lnkd.in/dVtEndHv
👉Paper https://lnkd.in/dBGQsbdP
👉Code (not announced, but an empty repo exists): https://lnkd.in/dnZnGRUn
👍16đŸ”Ĩ15❤4đŸ¤¯2😱1
This media is not supported in your browser
VIEW IN TELEGRAM
đŸ”Ĩ Stable (Stability.AI) Video Diffusion đŸ”Ĩ

👉 #StabilityAI released Stable Video Diffusion: latent video diffusion model for high-resolution, SOTA text-to-video and image-to-video generation

👉 Review https://t.ly/XwHys
👉 Code https://lnkd.in/dQw_yNuV
👉 Paper https://lnkd.in/dHn6f787
đŸ”Ĩ17👍6đŸ¤¯3❤1🤩1
This media is not supported in your browser
VIEW IN TELEGRAM
🎡 Panoptic Video Scene Graph 🎡

👉Combining video scene graph generation w/ panoptic segmentation for holistic video understanding. Novel HQ dataset with fine, temporal scene graph annotations & panoptic segmentation. Code released!đŸ”Ĩ

👉Review https://t.ly/tckDT
👉Project jingkang50.github.io/PVSG/
👉Paper arxiv.org/pdf/2311.17058.pdf
👉Code github.com/LilyDaytoy/OpenPVSG
👉Tool github.com/lilyDaytoy/PVSGAnnotation
đŸ”Ĩ7👍4❤3đŸ¤¯1
NebulOS.pdf
5.3 MB
đŸŒŗ NebulOS: (more than) Green AI đŸŒŗ

👉A novel hardware-aware Training-Free NAS approach that considers both training-free metrics & HW constraints, aiming to find the optimal balance between validation accuracy & energy consumption. 🚀

👉Review https://t.ly/Ozso1
👉Project sites.google.com/view/nebulos
👉Code github.com/fracapuano/NebulOS
👉Video https://lnkd.in/exN4Q2Fu
👉Hugging Face https://lnkd.in/eyCcPEPc
❤5🍾1
This media is not supported in your browser
VIEW IN TELEGRAM
🧱 Material Palette from Images 🧱

👉A novel problem in #AI: material extraction from a real-world image without any prior knowledge đŸ¤¯

👉Discussion https://t.ly/AIWs-
👉Paper https://lnkd.in/dBFAVWPF
👉Project https://lnkd.in/dV5jK8Sm
👉Code https://lnkd.in/dNhMnfFb
👉Dataset (coming) ...
❤9👍2đŸ”Ĩ1đŸĨ°1đŸ¤¯1
This media is not supported in your browser
VIEW IN TELEGRAM
👑 HD Generative #AI With No $$👑

👉DemoFusion: a novel approach for HD image generation w/ no money. Progressive Upscaling, Skip Residual, & Dilated Sampling to achieve higher-resolution ever đŸ”Ĩ

👉Review https://t.ly/sIqDV
👉Paper https://lnkd.in/deDt-zcK
👉Project https://lnkd.in/dFGj47Xw
👉Code https://lnkd.in/dY3UcXwp
👍4❤2đŸ¤¯2👏1😱1
This media is not supported in your browser
VIEW IN TELEGRAM
🍡 Animate Anyone: new SOTA! 🍡

👉Alibaba unveils Animate Anyone: novel #AI for transforming character images into animated videos controlled by desired pose sequences. Animating any character image into a video, unconstrained by specific domains 🚀

👉Review https://t.ly/qCahZ
👉Paper https://lnkd.in/d-zi8EZ6
👉Project https://lnkd.in/djwjQRvq
👉Repo https://lnkd.in/dDMkjnKz
đŸ¤¯22👍8đŸ”Ĩ4⚡1❤1😱1
This media is not supported in your browser
VIEW IN TELEGRAM
🔎 Generative Powers of Ten 🔍

👉A text-to-image model to generate consistent content across multiple image scales, enabling extreme semantic zooms into a scene. From universe to a human cell đŸ¤¯

👉Review https://t.ly/2DG44
👉Paper https://lnkd.in/eDcSpU59
👉Project https://lnkd.in/e6NKu8n9
đŸ¤¯21❤4đŸ”Ĩ3👏2😱1
Hello everybody,
a lot of you asked me to re-open the sharing of the contents to involve more people. I want to follow your suggestion, hope you will enjoy this new mood!

👍 FREE TO FORWARD TO OTHER TELEGRAM CHANNELS

đŸ”Ĩ NO COPY OF THE POSTS
đŸ”Ĩ NO COMMERCIAL USAGE
đŸ”Ĩ NO UNRESPECTFUL USAGE

âš ī¸ UNDO THE FORWARDING OPTION AT THE FIRST VIOLATION âš ī¸
❤19👍10👏3đŸĨ°1🍾1
This media is not supported in your browser
VIEW IN TELEGRAM
🩰 Magic Animating Human 🩰

👉MagicAnimate: the new SOTA in human animation. Code available: let's dance!

👉Review https://t.ly/Oq7Za
👉Paper https://lnkd.in/dSUbGgCs
👉Project https://lnkd.in/dkVFf-SV
👉Code https://lnkd.in/dj2dbzdg
👉Demo https://lnkd.in/dHEKPE9q
đŸ¤¯6❤2👍1đŸ”Ĩ1đŸĨ°1
This media is not supported in your browser
VIEW IN TELEGRAM
đŸ”Ĩ EfficientSAM: 20x faster Segment Anything đŸ”Ĩ

👉Meta AI Research unveils a novel family of SAM-like models, light-weight SAM models with SOTA quality-efficiency trade-offs. Up to 20x faster!

👉Review https://t.ly/966QS
👉Paper https://lnkd.in/duijp_Rh
👉Project https://lnkd.in/dW-p2CuH
👉Code https://lnkd.in/dAbZaB2t
👉Demo https://lnkd.in/d-tjKiUd
đŸ”Ĩ15❤4👍4đŸ¤¯2
This media is not supported in your browser
VIEW IN TELEGRAM
đŸĢļ3D Hands with TransformersđŸĢļ

👉 HaMeR is a robust and accurate Hand Mesh Recovery from images and video frames, based on Transformer architecture. It's the new SOTA.

👉Review https://t.ly/YtAW8
👉Paper https://arxiv.org/pdf/2312.05251.pdf
👉Project https://geopavlakos.github.io/hamer
👉Demo huggingface.co/spaces/geopavlakos/HaMeR
👉Colab colab.research.google.com/drive/1rQbQzegFWGVOm1n1d-S6koOWDo7F2ucu
👍10❤1👏1đŸ¤¯1😱1
This media is not supported in your browser
VIEW IN TELEGRAM
đŸĒŠ DreaMoving: Human Dancer đŸĒŠ

👉Alibaba strikes again with DreaMoving: a diffusion-based controllable video generation framework to produce HQ customized human videos.

👉Review https://t.ly/BD_Yf
👉Paper https://lnkd.in/gepP6Rjw
👉Project https://lnkd.in/gwm72cfS
👉Repo (empty) https://lnkd.in/gsc2Qt-F
👍7💩6❤2đŸĨ°1
This media is not supported in your browser
VIEW IN TELEGRAM
📲 EdgeSAM: Mobile 40x SAM 📲

👉A novel hyper-optimized version of SAM for mobile devices such as #Iphone. Pure CNNs backbone (better suitable for ANE), up to 40x faster. Code available 😉

👉Review https://t.ly/m_vLH
👉Paper https://lnkd.in/gHZVZN2x
👉Project https://lnkd.in/gK8qEK8p
👉Repo https://lnkd.in/gj6YAGNv
👉Hugging Face https://lnkd.in/gUUHJvxz
đŸ”Ĩ20⚡2❤2🤩1