AI with Papers - Artificial Intelligence & Deep Learning
15K subscribers
95 photos
235 videos
11 files
1.26K links
All the AI with papers. Every day fresh updates on Deep Learning, Machine Learning, and Computer Vision (with Papers).

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/
Download Telegram
This media is not supported in your browser
VIEW IN TELEGRAM
🥻SF: Towards Virtual Cloth🥻

👉SEA AI Lab unveils a novel #AI to recovery the garment sewing patterns from daily photos for #AR / #VR worlds

😎Review https://t.ly/MwpAV
😎Project https://sewformer.github.io/
😎Paper https://arxiv.org/pdf/2311.04218.pdf
😎Code https://github.com/sail-sg/sewformer
👍4🔥2🥰2👏2🤯1🤩1
This media is not supported in your browser
VIEW IN TELEGRAM
🛋️ 3DiffTection: new SOTA 3D detection 🛋️

👉#Nvidia unveils 3DiffTection, the new SOTA for 3D object detection from single images. A powerful 3D detector powered by diffusion model

😎Review https://t.ly/PciXY
😎Paper https://arxiv.org/pdf/2311.04391.pdf
😎Code https://github.com/nv-tlabs/3DiffTection
😎Project research.nvidia.com/labs/toronto-ai/3difftection
🔥86👍3😱3👏1
This media is not supported in your browser
VIEW IN TELEGRAM
🐪 30x Faster Neural Scenes 🐪

👉 NeuRas: realistic real-time novel-view synthesis of VERY large scenes (>10000 m2 ). 30× faster rendering than previous SOTA w/ comparable or better realism

😎Review https://t.ly/ELJSE
😎Paper https://arxiv.org/pdf/2311.05607.pdf
😎Project https://waabi.ai/NeuRas/
🔥91👍1🤯1🤩1
This media is not supported in your browser
VIEW IN TELEGRAM
🔥 Hu.ma.ne #AI Pin is out! 🔥

👉Hu.ma.ne just launched #AI Pin: the new standalone AI-powered screenless device. Running on the GPT-4 LLMs, suitable for real-time translation. #AI-powered camera and laser projector

😎 More https://t.ly/IvoN7
6🔥4💩2👍1😱1
This media is not supported in your browser
VIEW IN TELEGRAM
🫀 Segmentation of Human 🫀

👉TotalSegmentator_v2: segmenting 104 anatomical structures (27 organs, 59 bones, 10 muscles, 8 vessels) in CT. Now suitable in 3D Slicer, open source platform for image visualization.

😎Review https://t.ly/yHMm1
😎Code https://lnkd.in/dvgrbsCE
😎Paper https://lnkd.in/dkwHuuzU
🔥14👍7🤯6😱21🤩1
🪐 Spacecraft Pose Estimation 🪐

👉SnT (Luxembourg) unveils the most advanced event-based dataset for Spacecrafts: Unreal Engine + data from ICNS simulator + Real images + Real event data acquired in lab

😎Review https://t.ly/m8JPB
😎Paper https://lnkd.in/d_edvc3n
😎Project https://lnkd.in/dPp375aY
7🤯2👍1😱1
This media is not supported in your browser
VIEW IN TELEGRAM
🔥Florence-2: unified Computer Vision🔥

👉#Microsoft announces Florence-2: novel foundation model with unified, prompt-based, representation for a large variety of #computervision & vision-language task. One backbone -> multiple tasks!

👉Review https://t.ly/pOins
👉Paper arxiv.org/pdf/2311.06242.pdf
👉Project www.microsoft.com/en-us/research/project/projectflorence/
😱95🔥3👍1👏1🍾1
This media is not supported in your browser
VIEW IN TELEGRAM
💥🚗 CrashCar101: Generative Damaged Cars💥🚗

👉 CrashCar101: procedural generation pipeline that damages 3D car models to obtain synthetic damaged cars paired with pixel-accurate annotations

👉 Review https://t.ly/pITHm
👉 Paper https://lnkd.in/dzp6q3T5
👉 Project https://lnkd.in/daRXg73N
7👍1🔥1🤯1
This media is not supported in your browser
VIEW IN TELEGRAM
🐓 Emu: image edit / video gen. 🐓

👉#Meta the new SOTA in text-to-video generation and instruction-based image editing

👉 Review https://t.ly/PMTBc
👉 Paper (images): https://lnkd.in/eVadH-QS
👉 Project https://lnkd.in/eG8eWUJY
👉 Paper (video): https://lnkd.in/eVadH-QS
👉 Project https://lnkd.in/eu6Zu6gp
🔥8🤯2👍1😱1
This media is not supported in your browser
VIEW IN TELEGRAM
🌦️ 100+ GPU weather training 🌦️

👉#NVIDIA just released Makani: massively parallel training of weather and climate prediction models on 100+ GPUs and to enable the development of the next generation of weather and climate models.

👉 Review https://t.ly/jageY
👉 Code https://lnkd.in/d4NFZ5xi
23🤯71😱1
This media is not supported in your browser
VIEW IN TELEGRAM
🍿 Segmenting anything in 3D 🍿

👉 OmniSeg3D: omniversal segmentation method aims for segmenting anything in 3D all at once.

👉Review https://t.ly/Q0jrK
👉Paper https://lnkd.in/d9qpxXY9
👉Project https://oceanying.github.io/OmniSeg3D
👉Code (soon)
17🔥7👍4🤯2😱2👏1
This media is not supported in your browser
VIEW IN TELEGRAM
🔳 SOTA Semantic Boundary 🔳

👉Mobile-Seed, a lightweight, dual-task framework tailored for simultaneous semantic segmentation and boundary detection.

👉Review https://t.ly/GsArZ
👉Project whu-usi3dv.github.io/Mobile-Seed/
👉Paper arxiv.org/pdf/2311.12651.pdf
👉Code github.com/WHU-USI3DV/Mobile-Seed
5👍1🔥1🤯1😱1
🧿 SOTA Model-aware 3D Gaze 🧿

👉 Novel hybrid approach that outputs 3D eye model, semantic segmentation, cam-intrinsic & pose. Only 2D eye semantic segmentation masks and fewer 3D gaze labels for supervision.

👉Review https://t.ly/AdKRf
👉Paper https://lnkd.in/dWb9GHPh
👉Code https://lnkd.in/dfAWFVky
🔥11👍3🤯31😱1
This media is not supported in your browser
VIEW IN TELEGRAM
🦖T-Rex: Counting by Visual Prompting🦖

👉T-Rex: a novel interactive object counting model to detect and count any objects. Impressive results!

👉Review https://t.ly/4SfFX
👉Project https://lnkd.in/dVtEndHv
👉Paper https://lnkd.in/dBGQsbdP
👉Code (not announced, but an empty repo exists): https://lnkd.in/dnZnGRUn
👍16🔥154🤯2😱1
This media is not supported in your browser
VIEW IN TELEGRAM
🔥 Stable (Stability.AI) Video Diffusion 🔥

👉 #StabilityAI released Stable Video Diffusion: latent video diffusion model for high-resolution, SOTA text-to-video and image-to-video generation

👉 Review https://t.ly/XwHys
👉 Code https://lnkd.in/dQw_yNuV
👉 Paper https://lnkd.in/dHn6f787
🔥17👍6🤯31🤩1
This media is not supported in your browser
VIEW IN TELEGRAM
🎡 Panoptic Video Scene Graph 🎡

👉Combining video scene graph generation w/ panoptic segmentation for holistic video understanding. Novel HQ dataset with fine, temporal scene graph annotations & panoptic segmentation. Code released!🔥

👉Review https://t.ly/tckDT
👉Project jingkang50.github.io/PVSG/
👉Paper arxiv.org/pdf/2311.17058.pdf
👉Code github.com/LilyDaytoy/OpenPVSG
👉Tool github.com/lilyDaytoy/PVSGAnnotation
🔥7👍43🤯1
NebulOS.pdf
5.3 MB
🌳 NebulOS: (more than) Green AI 🌳

👉A novel hardware-aware Training-Free NAS approach that considers both training-free metrics & HW constraints, aiming to find the optimal balance between validation accuracy & energy consumption. 🚀

👉Review https://t.ly/Ozso1
👉Project sites.google.com/view/nebulos
👉Code github.com/fracapuano/NebulOS
👉Video https://lnkd.in/exN4Q2Fu
👉Hugging Face https://lnkd.in/eyCcPEPc
5🍾1
This media is not supported in your browser
VIEW IN TELEGRAM
🧱 Material Palette from Images 🧱

👉A novel problem in #AI: material extraction from a real-world image without any prior knowledge 🤯

👉Discussion https://t.ly/AIWs-
👉Paper https://lnkd.in/dBFAVWPF
👉Project https://lnkd.in/dV5jK8Sm
👉Code https://lnkd.in/dNhMnfFb
👉Dataset (coming) ...
9👍2🔥1🥰1🤯1
This media is not supported in your browser
VIEW IN TELEGRAM
👑 HD Generative #AI With No $$👑

👉DemoFusion: a novel approach for HD image generation w/ no money. Progressive Upscaling, Skip Residual, & Dilated Sampling to achieve higher-resolution ever 🔥

👉Review https://t.ly/sIqDV
👉Paper https://lnkd.in/deDt-zcK
👉Project https://lnkd.in/dFGj47Xw
👉Code https://lnkd.in/dY3UcXwp
👍42🤯2👏1😱1
This media is not supported in your browser
VIEW IN TELEGRAM
🍡 Animate Anyone: new SOTA! 🍡

👉Alibaba unveils Animate Anyone: novel #AI for transforming character images into animated videos controlled by desired pose sequences. Animating any character image into a video, unconstrained by specific domains 🚀

👉Review https://t.ly/qCahZ
👉Paper https://lnkd.in/d-zi8EZ6
👉Project https://lnkd.in/djwjQRvq
👉Repo https://lnkd.in/dDMkjnKz
🤯22👍8🔥411😱1