AI with Papers - Artificial Intelligence & Deep Learning
15K subscribers
95 photos
235 videos
11 files
1.26K links
All the AI with papers. Every day fresh updates on Deep Learning, Machine Learning, and Computer Vision (with Papers).

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/
Download Telegram
This media is not supported in your browser
VIEW IN TELEGRAM
🫀 Segmentation of Human 🫀

👉TotalSegmentator_v2: segmenting 104 anatomical structures (27 organs, 59 bones, 10 muscles, 8 vessels) in CT. Now suitable in 3D Slicer, open source platform for image visualization.

😎Review https://t.ly/yHMm1
😎Code https://lnkd.in/dvgrbsCE
😎Paper https://lnkd.in/dkwHuuzU
🔥14👍7🤯6😱21🤩1
🪐 Spacecraft Pose Estimation 🪐

👉SnT (Luxembourg) unveils the most advanced event-based dataset for Spacecrafts: Unreal Engine + data from ICNS simulator + Real images + Real event data acquired in lab

😎Review https://t.ly/m8JPB
😎Paper https://lnkd.in/d_edvc3n
😎Project https://lnkd.in/dPp375aY
7🤯2👍1😱1
This media is not supported in your browser
VIEW IN TELEGRAM
🔥Florence-2: unified Computer Vision🔥

👉#Microsoft announces Florence-2: novel foundation model with unified, prompt-based, representation for a large variety of #computervision & vision-language task. One backbone -> multiple tasks!

👉Review https://t.ly/pOins
👉Paper arxiv.org/pdf/2311.06242.pdf
👉Project www.microsoft.com/en-us/research/project/projectflorence/
😱95🔥3👍1👏1🍾1
This media is not supported in your browser
VIEW IN TELEGRAM
💥🚗 CrashCar101: Generative Damaged Cars💥🚗

👉 CrashCar101: procedural generation pipeline that damages 3D car models to obtain synthetic damaged cars paired with pixel-accurate annotations

👉 Review https://t.ly/pITHm
👉 Paper https://lnkd.in/dzp6q3T5
👉 Project https://lnkd.in/daRXg73N
7👍1🔥1🤯1
This media is not supported in your browser
VIEW IN TELEGRAM
🐓 Emu: image edit / video gen. 🐓

👉#Meta the new SOTA in text-to-video generation and instruction-based image editing

👉 Review https://t.ly/PMTBc
👉 Paper (images): https://lnkd.in/eVadH-QS
👉 Project https://lnkd.in/eG8eWUJY
👉 Paper (video): https://lnkd.in/eVadH-QS
👉 Project https://lnkd.in/eu6Zu6gp
🔥8🤯2👍1😱1
This media is not supported in your browser
VIEW IN TELEGRAM
🌦️ 100+ GPU weather training 🌦️

👉#NVIDIA just released Makani: massively parallel training of weather and climate prediction models on 100+ GPUs and to enable the development of the next generation of weather and climate models.

👉 Review https://t.ly/jageY
👉 Code https://lnkd.in/d4NFZ5xi
23🤯71😱1
This media is not supported in your browser
VIEW IN TELEGRAM
🍿 Segmenting anything in 3D 🍿

👉 OmniSeg3D: omniversal segmentation method aims for segmenting anything in 3D all at once.

👉Review https://t.ly/Q0jrK
👉Paper https://lnkd.in/d9qpxXY9
👉Project https://oceanying.github.io/OmniSeg3D
👉Code (soon)
17🔥7👍4🤯2😱2👏1
This media is not supported in your browser
VIEW IN TELEGRAM
🔳 SOTA Semantic Boundary 🔳

👉Mobile-Seed, a lightweight, dual-task framework tailored for simultaneous semantic segmentation and boundary detection.

👉Review https://t.ly/GsArZ
👉Project whu-usi3dv.github.io/Mobile-Seed/
👉Paper arxiv.org/pdf/2311.12651.pdf
👉Code github.com/WHU-USI3DV/Mobile-Seed
5👍1🔥1🤯1😱1
🧿 SOTA Model-aware 3D Gaze 🧿

👉 Novel hybrid approach that outputs 3D eye model, semantic segmentation, cam-intrinsic & pose. Only 2D eye semantic segmentation masks and fewer 3D gaze labels for supervision.

👉Review https://t.ly/AdKRf
👉Paper https://lnkd.in/dWb9GHPh
👉Code https://lnkd.in/dfAWFVky
🔥11👍3🤯31😱1
This media is not supported in your browser
VIEW IN TELEGRAM
🦖T-Rex: Counting by Visual Prompting🦖

👉T-Rex: a novel interactive object counting model to detect and count any objects. Impressive results!

👉Review https://t.ly/4SfFX
👉Project https://lnkd.in/dVtEndHv
👉Paper https://lnkd.in/dBGQsbdP
👉Code (not announced, but an empty repo exists): https://lnkd.in/dnZnGRUn
👍16🔥154🤯2😱1
This media is not supported in your browser
VIEW IN TELEGRAM
🔥 Stable (Stability.AI) Video Diffusion 🔥

👉 #StabilityAI released Stable Video Diffusion: latent video diffusion model for high-resolution, SOTA text-to-video and image-to-video generation

👉 Review https://t.ly/XwHys
👉 Code https://lnkd.in/dQw_yNuV
👉 Paper https://lnkd.in/dHn6f787
🔥17👍6🤯31🤩1
This media is not supported in your browser
VIEW IN TELEGRAM
🎡 Panoptic Video Scene Graph 🎡

👉Combining video scene graph generation w/ panoptic segmentation for holistic video understanding. Novel HQ dataset with fine, temporal scene graph annotations & panoptic segmentation. Code released!🔥

👉Review https://t.ly/tckDT
👉Project jingkang50.github.io/PVSG/
👉Paper arxiv.org/pdf/2311.17058.pdf
👉Code github.com/LilyDaytoy/OpenPVSG
👉Tool github.com/lilyDaytoy/PVSGAnnotation
🔥7👍43🤯1
NebulOS.pdf
5.3 MB
🌳 NebulOS: (more than) Green AI 🌳

👉A novel hardware-aware Training-Free NAS approach that considers both training-free metrics & HW constraints, aiming to find the optimal balance between validation accuracy & energy consumption. 🚀

👉Review https://t.ly/Ozso1
👉Project sites.google.com/view/nebulos
👉Code github.com/fracapuano/NebulOS
👉Video https://lnkd.in/exN4Q2Fu
👉Hugging Face https://lnkd.in/eyCcPEPc
5🍾1
This media is not supported in your browser
VIEW IN TELEGRAM
🧱 Material Palette from Images 🧱

👉A novel problem in #AI: material extraction from a real-world image without any prior knowledge 🤯

👉Discussion https://t.ly/AIWs-
👉Paper https://lnkd.in/dBFAVWPF
👉Project https://lnkd.in/dV5jK8Sm
👉Code https://lnkd.in/dNhMnfFb
👉Dataset (coming) ...
9👍2🔥1🥰1🤯1
This media is not supported in your browser
VIEW IN TELEGRAM
👑 HD Generative #AI With No $$👑

👉DemoFusion: a novel approach for HD image generation w/ no money. Progressive Upscaling, Skip Residual, & Dilated Sampling to achieve higher-resolution ever 🔥

👉Review https://t.ly/sIqDV
👉Paper https://lnkd.in/deDt-zcK
👉Project https://lnkd.in/dFGj47Xw
👉Code https://lnkd.in/dY3UcXwp
👍42🤯2👏1😱1
This media is not supported in your browser
VIEW IN TELEGRAM
🍡 Animate Anyone: new SOTA! 🍡

👉Alibaba unveils Animate Anyone: novel #AI for transforming character images into animated videos controlled by desired pose sequences. Animating any character image into a video, unconstrained by specific domains 🚀

👉Review https://t.ly/qCahZ
👉Paper https://lnkd.in/d-zi8EZ6
👉Project https://lnkd.in/djwjQRvq
👉Repo https://lnkd.in/dDMkjnKz
🤯22👍8🔥411😱1
This media is not supported in your browser
VIEW IN TELEGRAM
🔎 Generative Powers of Ten 🔍

👉A text-to-image model to generate consistent content across multiple image scales, enabling extreme semantic zooms into a scene. From universe to a human cell 🤯

👉Review https://t.ly/2DG44
👉Paper https://lnkd.in/eDcSpU59
👉Project https://lnkd.in/e6NKu8n9
🤯214🔥3👏2😱1
Hello everybody,
a lot of you asked me to re-open the sharing of the contents to involve more people. I want to follow your suggestion, hope you will enjoy this new mood!

👍 FREE TO FORWARD TO OTHER TELEGRAM CHANNELS

🔥 NO COPY OF THE POSTS
🔥 NO COMMERCIAL USAGE
🔥 NO UNRESPECTFUL USAGE

⚠️ UNDO THE FORWARDING OPTION AT THE FIRST VIOLATION ⚠️
19👍10👏3🥰1🍾1
This media is not supported in your browser
VIEW IN TELEGRAM
🩰 Magic Animating Human 🩰

👉MagicAnimate: the new SOTA in human animation. Code available: let's dance!

👉Review https://t.ly/Oq7Za
👉Paper https://lnkd.in/dSUbGgCs
👉Project https://lnkd.in/dkVFf-SV
👉Code https://lnkd.in/dj2dbzdg
👉Demo https://lnkd.in/dHEKPE9q
🤯62👍1🔥1🥰1
This media is not supported in your browser
VIEW IN TELEGRAM
🔥 EfficientSAM: 20x faster Segment Anything 🔥

👉Meta AI Research unveils a novel family of SAM-like models, light-weight SAM models with SOTA quality-efficiency trade-offs. Up to 20x faster!

👉Review https://t.ly/966QS
👉Paper https://lnkd.in/duijp_Rh
👉Project https://lnkd.in/dW-p2CuH
👉Code https://lnkd.in/dAbZaB2t
👉Demo https://lnkd.in/d-tjKiUd
🔥154👍4🤯2