AI with Papers - Artificial Intelligence & Deep Learning
15K subscribers
95 photos
237 videos
11 files
1.27K links
All the AI with papers. Every day fresh updates on Deep Learning, Machine Learning, and Computer Vision (with Papers).

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/
Download Telegram
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿงฑ Material Palette from Images ๐Ÿงฑ

๐Ÿ‘‰A novel problem in #AI: material extraction from a real-world image without any prior knowledge ๐Ÿคฏ

๐Ÿ‘‰Discussion https://t.ly/AIWs-
๐Ÿ‘‰Paper https://lnkd.in/dBFAVWPF
๐Ÿ‘‰Project https://lnkd.in/dV5jK8Sm
๐Ÿ‘‰Code https://lnkd.in/dNhMnfFb
๐Ÿ‘‰Dataset (coming) ...
โค9๐Ÿ‘2๐Ÿ”ฅ1๐Ÿฅฐ1๐Ÿคฏ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ‘‘ HD Generative #AI With No $$๐Ÿ‘‘

๐Ÿ‘‰DemoFusion: a novel approach for HD image generation w/ no money. Progressive Upscaling, Skip Residual, & Dilated Sampling to achieve higher-resolution ever ๐Ÿ”ฅ

๐Ÿ‘‰Review https://t.ly/sIqDV
๐Ÿ‘‰Paper https://lnkd.in/deDt-zcK
๐Ÿ‘‰Project https://lnkd.in/dFGj47Xw
๐Ÿ‘‰Code https://lnkd.in/dY3UcXwp
๐Ÿ‘4โค2๐Ÿคฏ2๐Ÿ‘1๐Ÿ˜ฑ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿก Animate Anyone: new SOTA! ๐Ÿก

๐Ÿ‘‰Alibaba unveils Animate Anyone: novel #AI for transforming character images into animated videos controlled by desired pose sequences. Animating any character image into a video, unconstrained by specific domains ๐Ÿš€

๐Ÿ‘‰Review https://t.ly/qCahZ
๐Ÿ‘‰Paper https://lnkd.in/d-zi8EZ6
๐Ÿ‘‰Project https://lnkd.in/djwjQRvq
๐Ÿ‘‰Repo https://lnkd.in/dDMkjnKz
๐Ÿคฏ22๐Ÿ‘8๐Ÿ”ฅ4โšก1โค1๐Ÿ˜ฑ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ”Ž Generative Powers of Ten ๐Ÿ”

๐Ÿ‘‰A text-to-image model to generate consistent content across multiple image scales, enabling extreme semantic zooms into a scene. From universe to a human cell ๐Ÿคฏ

๐Ÿ‘‰Review https://t.ly/2DG44
๐Ÿ‘‰Paper https://lnkd.in/eDcSpU59
๐Ÿ‘‰Project https://lnkd.in/e6NKu8n9
๐Ÿคฏ21โค4๐Ÿ”ฅ3๐Ÿ‘2๐Ÿ˜ฑ1
Hello everybody,
a lot of you asked me to re-open the sharing of the contents to involve more people. I want to follow your suggestion, hope you will enjoy this new mood!

๐Ÿ‘ FREE TO FORWARD TO OTHER TELEGRAM CHANNELS

๐Ÿ”ฅ NO COPY OF THE POSTS
๐Ÿ”ฅ NO COMMERCIAL USAGE
๐Ÿ”ฅ NO UNRESPECTFUL USAGE

โš ๏ธ UNDO THE FORWARDING OPTION AT THE FIRST VIOLATION โš ๏ธ
โค19๐Ÿ‘10๐Ÿ‘3๐Ÿฅฐ1๐Ÿพ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿฉฐ Magic Animating Human ๐Ÿฉฐ

๐Ÿ‘‰MagicAnimate: the new SOTA in human animation. Code available: let's dance!

๐Ÿ‘‰Review https://t.ly/Oq7Za
๐Ÿ‘‰Paper https://lnkd.in/dSUbGgCs
๐Ÿ‘‰Project https://lnkd.in/dkVFf-SV
๐Ÿ‘‰Code https://lnkd.in/dj2dbzdg
๐Ÿ‘‰Demo https://lnkd.in/dHEKPE9q
๐Ÿคฏ6โค2๐Ÿ‘1๐Ÿ”ฅ1๐Ÿฅฐ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ”ฅ EfficientSAM: 20x faster Segment Anything ๐Ÿ”ฅ

๐Ÿ‘‰Meta AI Research unveils a novel family of SAM-like models, light-weight SAM models with SOTA quality-efficiency trade-offs. Up to 20x faster!

๐Ÿ‘‰Review https://t.ly/966QS
๐Ÿ‘‰Paper https://lnkd.in/duijp_Rh
๐Ÿ‘‰Project https://lnkd.in/dW-p2CuH
๐Ÿ‘‰Code https://lnkd.in/dAbZaB2t
๐Ÿ‘‰Demo https://lnkd.in/d-tjKiUd
๐Ÿ”ฅ15โค4๐Ÿ‘4๐Ÿคฏ2
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿซถ3D Hands with Transformers๐Ÿซถ

๐Ÿ‘‰ HaMeR is a robust and accurate Hand Mesh Recovery from images and video frames, based on Transformer architecture. It's the new SOTA.

๐Ÿ‘‰Review https://t.ly/YtAW8
๐Ÿ‘‰Paper https://arxiv.org/pdf/2312.05251.pdf
๐Ÿ‘‰Project https://geopavlakos.github.io/hamer
๐Ÿ‘‰Demo huggingface.co/spaces/geopavlakos/HaMeR
๐Ÿ‘‰Colab colab.research.google.com/drive/1rQbQzegFWGVOm1n1d-S6koOWDo7F2ucu
๐Ÿ‘10โค1๐Ÿ‘1๐Ÿคฏ1๐Ÿ˜ฑ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿชฉ DreaMoving: Human Dancer ๐Ÿชฉ

๐Ÿ‘‰Alibaba strikes again with DreaMoving: a diffusion-based controllable video generation framework to produce HQ customized human videos.

๐Ÿ‘‰Review https://t.ly/BD_Yf
๐Ÿ‘‰Paper https://lnkd.in/gepP6Rjw
๐Ÿ‘‰Project https://lnkd.in/gwm72cfS
๐Ÿ‘‰Repo (empty) https://lnkd.in/gsc2Qt-F
๐Ÿ‘7๐Ÿ’ฉ6โค2๐Ÿฅฐ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ“ฒ EdgeSAM: Mobile 40x SAM ๐Ÿ“ฒ

๐Ÿ‘‰A novel hyper-optimized version of SAM for mobile devices such as #Iphone. Pure CNNs backbone (better suitable for ANE), up to 40x faster. Code available ๐Ÿ˜‰

๐Ÿ‘‰Review https://t.ly/m_vLH
๐Ÿ‘‰Paper https://lnkd.in/gHZVZN2x
๐Ÿ‘‰Project https://lnkd.in/gK8qEK8p
๐Ÿ‘‰Repo https://lnkd.in/gj6YAGNv
๐Ÿ‘‰Hugging Face https://lnkd.in/gUUHJvxz
๐Ÿ”ฅ20โšก2โค2๐Ÿคฉ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ŸชผPatchFusion: SOTA Mono-Depth๐Ÿชผ

๐Ÿ‘‰PatchFusion: novel end-to-end tile-based framework for hi-res monocular metric depth estimation. It's the new SOTA in metric depth estimation from mono. Code & Demo on Hugging Face able ๐Ÿ”ฅ

๐Ÿ‘‰Review https://t.ly/hv3yT
๐Ÿ‘‰Paper https://lnkd.in/d9dXP7iP
๐Ÿ‘‰Project https://lnkd.in/dQcvVJSx
๐Ÿ‘‰Repo https://lnkd.in/dW2GdVR5
๐Ÿ‘‰Demo https://lnkd.in/dFW-gAiY
๐Ÿ”ฅ10โค5๐Ÿ‘1๐Ÿคฏ1๐Ÿ˜ฑ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ’ƒOutfit Anyone: Ultra-HQ VTO๐Ÿ’ƒ

๐Ÿ‘‰Alibaba unveils Outfit Anyone: a two-stream conditional diffusion able to adeptly handle garment deformation for more lifelike results in VOT. Extra: Outfit Anyone + Animate Anyone for outfit + motion generation of any character. NO CODE / NO PAPER / DEMO AVAILABLE :)

๐Ÿ‘‰Review https://t.ly/o6UR9
๐Ÿ‘‰Demo https://lnkd.in/dpQYdXhc
๐Ÿ‘‰Repo (empty) https://lnkd.in/dBsNST6r
๐Ÿคฏ10๐Ÿ‘4โค3๐Ÿ”ฅ2
๐Ÿ”ฅ #AIwithPapers: we are 8k+ ๐Ÿ”ฅ

๐Ÿ‘‰ After flirting with #ChatGpt for months, you back in love with this channel. I felt bad, but I forgive you ๐Ÿงก

๐Ÿ˜ˆ Hey Telegram Premium Subscribers, what about boosting us? Click: https://t.me/AI_DeepLearning?boost

๐Ÿ˜ˆ Invite -> https://t.me/AI_DeepLearning
โค16๐Ÿคฃ7๐Ÿ”ฅ1๐Ÿฅฐ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ŸงŠ Depth Conditioning ๐ŸงŠ

๐Ÿ‘‰LooseControl to control the generative image modeling process. Layout by boundaries and #3D box control via object locations (approximate bounding boxes)

๐Ÿ‘‰Review https://t.ly/9y72m
๐Ÿ‘‰Paper https://arxiv.org/pdf/2312.03079.pdf
๐Ÿ‘‰Project https://shariqfarooq123.github.io/loose-control/
๐Ÿ‘‰Repo https://github.com/shariqfarooq123/LooseControl
๐Ÿ”ฅ14โค6๐Ÿคฏ4๐Ÿ‘1๐Ÿฅฐ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ–ฒ๏ธ Amodal Tracking Any Object ๐Ÿ–ฒ๏ธ

๐Ÿ‘‰Amodal tracking": inferring complete object boundaries, even when certain portions are occluded. New benchmark & approach, 2x better than SOTA in people tracking ๐Ÿ”ฅ

๐Ÿ‘‰Review https://t.ly/Rc6Ku
๐Ÿ‘‰Paper https://lnkd.in/d39rFYT4
๐Ÿ‘‰Project https://lnkd.in/d7bkEcni
๐Ÿ‘‰(empty) Repo https://lnkd.in/dTsNKdfz
โค16๐Ÿคฏ8๐Ÿ”ฅ3๐Ÿ‘2๐Ÿ‘1๐Ÿ˜ฑ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿšฟ Event-Cam (1000 fps) Hands ๐Ÿšฟ

๐Ÿ‘‰Ev2Hands, the first method for the 3D reconstruction of two interacting hands from a single event camera. Code available.

๐Ÿ‘‰Review https://t.ly/YpQpX
๐Ÿ‘‰Paper arxiv.org/pdf/2312.14157.pdf
๐Ÿ‘‰Project 4dqv.mpi-inf.mpg.de/Ev2Hands
๐Ÿ‘‰Repo github.com/Chris10M/Ev2Hands
๐Ÿ”ฅ3โค2๐Ÿ‘2๐Ÿ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ŸŽ„UniSDF: Unifying Neural Representations๐ŸŽ„

๐Ÿ‘‰UniSDF: novel general purpose 3D reconstruction for large complex scenes with reflections. SOTA on DTU, Shiny Blender, Mip-NeRF 360 and Ref-NeRF dataset.

๐Ÿ‘‰Review https://t.ly/2QEul
๐Ÿ‘‰Paper https://arxiv.org/pdf/2312.13285.pdf
๐Ÿ‘‰Project https://fangjinhuawang.github.io/UniSDF/
๐Ÿ‘‰Repo: No code :(
๐Ÿ”ฅ7๐Ÿ‘2โค1๐Ÿฅฐ1๐Ÿคฏ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ŸชฎHAAR: Text-Driven Generative Hairstyles๐Ÿชฎ

๐Ÿ‘‰ HAAR: new strand-based generative model for #3D human hairstyles driven by textual input.

๐Ÿ‘‰Review https://t.ly/L38iD
๐Ÿ‘‰Project https://haar.is.tue.mpg.de/
๐Ÿ‘‰Paper https://arxiv.org/pdf/2312.11666.pdf
๐Ÿ‘‰Repo coming
๐Ÿคฏ4๐Ÿพ3๐Ÿ‘2๐Ÿ”ฅ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ŸชฒUniRef++: Segment Every Reference๐Ÿชฒ

๐Ÿ‘‰ UniRef++ is a unified model for RIS, FSS, RVOS & VOS. Code available!

๐Ÿ‘‰Review https://t.ly/OxtOx
๐Ÿ‘‰Paper https://lnkd.in/eTrmDTK3
๐Ÿ‘‰Repo https://lnkd.in/etfTm4Wq
๐Ÿ‘11โค3๐Ÿคฏ3โšก1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿˆš Seeing Through Occlusions ๐Ÿˆš

๐Ÿ‘‰Novel NSF to see through occlusions, reflection suppression & shadow removal.

๐Ÿ‘‰Review https://t.ly/5jcIG
๐Ÿ‘‰Project https://light.princeton.edu/publication/nsf
๐Ÿ‘‰Paper https://arxiv.org/pdf/2312.14235.pdf
๐Ÿ‘‰Repo https://github.com/princeton-computational-imaging/NSF
โค10๐Ÿคฏ7๐Ÿ”ฅ3๐Ÿพ1