AI with Papers - Artificial Intelligence & Deep Learning
15K subscribers
95 photos
236 videos
11 files
1.26K links
All the AI with papers. Every day fresh updates on Deep Learning, Machine Learning, and Computer Vision (with Papers).

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/
Download Telegram
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿฟ Segmenting anything in 3D ๐Ÿฟ

๐Ÿ‘‰ OmniSeg3D: omniversal segmentation method aims for segmenting anything in 3D all at once.

๐Ÿ‘‰Review https://t.ly/Q0jrK
๐Ÿ‘‰Paper https://lnkd.in/d9qpxXY9
๐Ÿ‘‰Project https://oceanying.github.io/OmniSeg3D
๐Ÿ‘‰Code (soon)
โค17๐Ÿ”ฅ7๐Ÿ‘4๐Ÿคฏ2๐Ÿ˜ฑ2๐Ÿ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ”ณ SOTA Semantic Boundary ๐Ÿ”ณ

๐Ÿ‘‰Mobile-Seed, a lightweight, dual-task framework tailored for simultaneous semantic segmentation and boundary detection.

๐Ÿ‘‰Review https://t.ly/GsArZ
๐Ÿ‘‰Project whu-usi3dv.github.io/Mobile-Seed/
๐Ÿ‘‰Paper arxiv.org/pdf/2311.12651.pdf
๐Ÿ‘‰Code github.com/WHU-USI3DV/Mobile-Seed
โค5๐Ÿ‘1๐Ÿ”ฅ1๐Ÿคฏ1๐Ÿ˜ฑ1
๐Ÿงฟ SOTA Model-aware 3D Gaze ๐Ÿงฟ

๐Ÿ‘‰ Novel hybrid approach that outputs 3D eye model, semantic segmentation, cam-intrinsic & pose. Only 2D eye semantic segmentation masks and fewer 3D gaze labels for supervision.

๐Ÿ‘‰Review https://t.ly/AdKRf
๐Ÿ‘‰Paper https://lnkd.in/dWb9GHPh
๐Ÿ‘‰Code https://lnkd.in/dfAWFVky
๐Ÿ”ฅ11๐Ÿ‘3๐Ÿคฏ3โค1๐Ÿ˜ฑ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿฆ–T-Rex: Counting by Visual Prompting๐Ÿฆ–

๐Ÿ‘‰T-Rex: a novel interactive object counting model to detect and count any objects. Impressive results!

๐Ÿ‘‰Review https://t.ly/4SfFX
๐Ÿ‘‰Project https://lnkd.in/dVtEndHv
๐Ÿ‘‰Paper https://lnkd.in/dBGQsbdP
๐Ÿ‘‰Code (not announced, but an empty repo exists): https://lnkd.in/dnZnGRUn
๐Ÿ‘16๐Ÿ”ฅ15โค4๐Ÿคฏ2๐Ÿ˜ฑ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ”ฅ Stable (Stability.AI) Video Diffusion ๐Ÿ”ฅ

๐Ÿ‘‰ #StabilityAI released Stable Video Diffusion: latent video diffusion model for high-resolution, SOTA text-to-video and image-to-video generation

๐Ÿ‘‰ Review https://t.ly/XwHys
๐Ÿ‘‰ Code https://lnkd.in/dQw_yNuV
๐Ÿ‘‰ Paper https://lnkd.in/dHn6f787
๐Ÿ”ฅ17๐Ÿ‘6๐Ÿคฏ3โค1๐Ÿคฉ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ŸŽก Panoptic Video Scene Graph ๐ŸŽก

๐Ÿ‘‰Combining video scene graph generation w/ panoptic segmentation for holistic video understanding. Novel HQ dataset with fine, temporal scene graph annotations & panoptic segmentation. Code released!๐Ÿ”ฅ

๐Ÿ‘‰Review https://t.ly/tckDT
๐Ÿ‘‰Project jingkang50.github.io/PVSG/
๐Ÿ‘‰Paper arxiv.org/pdf/2311.17058.pdf
๐Ÿ‘‰Code github.com/LilyDaytoy/OpenPVSG
๐Ÿ‘‰Tool github.com/lilyDaytoy/PVSGAnnotation
๐Ÿ”ฅ7๐Ÿ‘4โค3๐Ÿคฏ1
NebulOS.pdf
5.3 MB
๐ŸŒณ NebulOS: (more than) Green AI ๐ŸŒณ

๐Ÿ‘‰A novel hardware-aware Training-Free NAS approach that considers both training-free metrics & HW constraints, aiming to find the optimal balance between validation accuracy & energy consumption. ๐Ÿš€

๐Ÿ‘‰Review https://t.ly/Ozso1
๐Ÿ‘‰Project sites.google.com/view/nebulos
๐Ÿ‘‰Code github.com/fracapuano/NebulOS
๐Ÿ‘‰Video https://lnkd.in/exN4Q2Fu
๐Ÿ‘‰Hugging Face https://lnkd.in/eyCcPEPc
โค5๐Ÿพ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿงฑ Material Palette from Images ๐Ÿงฑ

๐Ÿ‘‰A novel problem in #AI: material extraction from a real-world image without any prior knowledge ๐Ÿคฏ

๐Ÿ‘‰Discussion https://t.ly/AIWs-
๐Ÿ‘‰Paper https://lnkd.in/dBFAVWPF
๐Ÿ‘‰Project https://lnkd.in/dV5jK8Sm
๐Ÿ‘‰Code https://lnkd.in/dNhMnfFb
๐Ÿ‘‰Dataset (coming) ...
โค9๐Ÿ‘2๐Ÿ”ฅ1๐Ÿฅฐ1๐Ÿคฏ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ‘‘ HD Generative #AI With No $$๐Ÿ‘‘

๐Ÿ‘‰DemoFusion: a novel approach for HD image generation w/ no money. Progressive Upscaling, Skip Residual, & Dilated Sampling to achieve higher-resolution ever ๐Ÿ”ฅ

๐Ÿ‘‰Review https://t.ly/sIqDV
๐Ÿ‘‰Paper https://lnkd.in/deDt-zcK
๐Ÿ‘‰Project https://lnkd.in/dFGj47Xw
๐Ÿ‘‰Code https://lnkd.in/dY3UcXwp
๐Ÿ‘4โค2๐Ÿคฏ2๐Ÿ‘1๐Ÿ˜ฑ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿก Animate Anyone: new SOTA! ๐Ÿก

๐Ÿ‘‰Alibaba unveils Animate Anyone: novel #AI for transforming character images into animated videos controlled by desired pose sequences. Animating any character image into a video, unconstrained by specific domains ๐Ÿš€

๐Ÿ‘‰Review https://t.ly/qCahZ
๐Ÿ‘‰Paper https://lnkd.in/d-zi8EZ6
๐Ÿ‘‰Project https://lnkd.in/djwjQRvq
๐Ÿ‘‰Repo https://lnkd.in/dDMkjnKz
๐Ÿคฏ22๐Ÿ‘8๐Ÿ”ฅ4โšก1โค1๐Ÿ˜ฑ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ”Ž Generative Powers of Ten ๐Ÿ”

๐Ÿ‘‰A text-to-image model to generate consistent content across multiple image scales, enabling extreme semantic zooms into a scene. From universe to a human cell ๐Ÿคฏ

๐Ÿ‘‰Review https://t.ly/2DG44
๐Ÿ‘‰Paper https://lnkd.in/eDcSpU59
๐Ÿ‘‰Project https://lnkd.in/e6NKu8n9
๐Ÿคฏ21โค4๐Ÿ”ฅ3๐Ÿ‘2๐Ÿ˜ฑ1
Hello everybody,
a lot of you asked me to re-open the sharing of the contents to involve more people. I want to follow your suggestion, hope you will enjoy this new mood!

๐Ÿ‘ FREE TO FORWARD TO OTHER TELEGRAM CHANNELS

๐Ÿ”ฅ NO COPY OF THE POSTS
๐Ÿ”ฅ NO COMMERCIAL USAGE
๐Ÿ”ฅ NO UNRESPECTFUL USAGE

โš ๏ธ UNDO THE FORWARDING OPTION AT THE FIRST VIOLATION โš ๏ธ
โค19๐Ÿ‘10๐Ÿ‘3๐Ÿฅฐ1๐Ÿพ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿฉฐ Magic Animating Human ๐Ÿฉฐ

๐Ÿ‘‰MagicAnimate: the new SOTA in human animation. Code available: let's dance!

๐Ÿ‘‰Review https://t.ly/Oq7Za
๐Ÿ‘‰Paper https://lnkd.in/dSUbGgCs
๐Ÿ‘‰Project https://lnkd.in/dkVFf-SV
๐Ÿ‘‰Code https://lnkd.in/dj2dbzdg
๐Ÿ‘‰Demo https://lnkd.in/dHEKPE9q
๐Ÿคฏ6โค2๐Ÿ‘1๐Ÿ”ฅ1๐Ÿฅฐ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ”ฅ EfficientSAM: 20x faster Segment Anything ๐Ÿ”ฅ

๐Ÿ‘‰Meta AI Research unveils a novel family of SAM-like models, light-weight SAM models with SOTA quality-efficiency trade-offs. Up to 20x faster!

๐Ÿ‘‰Review https://t.ly/966QS
๐Ÿ‘‰Paper https://lnkd.in/duijp_Rh
๐Ÿ‘‰Project https://lnkd.in/dW-p2CuH
๐Ÿ‘‰Code https://lnkd.in/dAbZaB2t
๐Ÿ‘‰Demo https://lnkd.in/d-tjKiUd
๐Ÿ”ฅ15โค4๐Ÿ‘4๐Ÿคฏ2
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿซถ3D Hands with Transformers๐Ÿซถ

๐Ÿ‘‰ HaMeR is a robust and accurate Hand Mesh Recovery from images and video frames, based on Transformer architecture. It's the new SOTA.

๐Ÿ‘‰Review https://t.ly/YtAW8
๐Ÿ‘‰Paper https://arxiv.org/pdf/2312.05251.pdf
๐Ÿ‘‰Project https://geopavlakos.github.io/hamer
๐Ÿ‘‰Demo huggingface.co/spaces/geopavlakos/HaMeR
๐Ÿ‘‰Colab colab.research.google.com/drive/1rQbQzegFWGVOm1n1d-S6koOWDo7F2ucu
๐Ÿ‘10โค1๐Ÿ‘1๐Ÿคฏ1๐Ÿ˜ฑ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿชฉ DreaMoving: Human Dancer ๐Ÿชฉ

๐Ÿ‘‰Alibaba strikes again with DreaMoving: a diffusion-based controllable video generation framework to produce HQ customized human videos.

๐Ÿ‘‰Review https://t.ly/BD_Yf
๐Ÿ‘‰Paper https://lnkd.in/gepP6Rjw
๐Ÿ‘‰Project https://lnkd.in/gwm72cfS
๐Ÿ‘‰Repo (empty) https://lnkd.in/gsc2Qt-F
๐Ÿ‘7๐Ÿ’ฉ6โค2๐Ÿฅฐ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ“ฒ EdgeSAM: Mobile 40x SAM ๐Ÿ“ฒ

๐Ÿ‘‰A novel hyper-optimized version of SAM for mobile devices such as #Iphone. Pure CNNs backbone (better suitable for ANE), up to 40x faster. Code available ๐Ÿ˜‰

๐Ÿ‘‰Review https://t.ly/m_vLH
๐Ÿ‘‰Paper https://lnkd.in/gHZVZN2x
๐Ÿ‘‰Project https://lnkd.in/gK8qEK8p
๐Ÿ‘‰Repo https://lnkd.in/gj6YAGNv
๐Ÿ‘‰Hugging Face https://lnkd.in/gUUHJvxz
๐Ÿ”ฅ20โšก2โค2๐Ÿคฉ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ŸชผPatchFusion: SOTA Mono-Depth๐Ÿชผ

๐Ÿ‘‰PatchFusion: novel end-to-end tile-based framework for hi-res monocular metric depth estimation. It's the new SOTA in metric depth estimation from mono. Code & Demo on Hugging Face able ๐Ÿ”ฅ

๐Ÿ‘‰Review https://t.ly/hv3yT
๐Ÿ‘‰Paper https://lnkd.in/d9dXP7iP
๐Ÿ‘‰Project https://lnkd.in/dQcvVJSx
๐Ÿ‘‰Repo https://lnkd.in/dW2GdVR5
๐Ÿ‘‰Demo https://lnkd.in/dFW-gAiY
๐Ÿ”ฅ10โค5๐Ÿ‘1๐Ÿคฏ1๐Ÿ˜ฑ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ’ƒOutfit Anyone: Ultra-HQ VTO๐Ÿ’ƒ

๐Ÿ‘‰Alibaba unveils Outfit Anyone: a two-stream conditional diffusion able to adeptly handle garment deformation for more lifelike results in VOT. Extra: Outfit Anyone + Animate Anyone for outfit + motion generation of any character. NO CODE / NO PAPER / DEMO AVAILABLE :)

๐Ÿ‘‰Review https://t.ly/o6UR9
๐Ÿ‘‰Demo https://lnkd.in/dpQYdXhc
๐Ÿ‘‰Repo (empty) https://lnkd.in/dBsNST6r
๐Ÿคฏ10๐Ÿ‘4โค3๐Ÿ”ฅ2
๐Ÿ”ฅ #AIwithPapers: we are 8k+ ๐Ÿ”ฅ

๐Ÿ‘‰ After flirting with #ChatGpt for months, you back in love with this channel. I felt bad, but I forgive you ๐Ÿงก

๐Ÿ˜ˆ Hey Telegram Premium Subscribers, what about boosting us? Click: https://t.me/AI_DeepLearning?boost

๐Ÿ˜ˆ Invite -> https://t.me/AI_DeepLearning
โค16๐Ÿคฃ7๐Ÿ”ฅ1๐Ÿฅฐ1