AI with Papers - Artificial Intelligence & Deep Learning
15K subscribers
95 photos
235 videos
11 files
1.26K links
All the AI with papers. Every day fresh updates on Deep Learning, Machine Learning, and Computer Vision (with Papers).

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/
Download Telegram
๐Ÿ˜ป CatFLW: Cat Neural Landmarks ๐Ÿ˜ป

๐Ÿ‘‰Landmark convolution neural network-based model for cat faces

๐Ÿ˜ŽReview https://t.ly/Y3mQ8
๐Ÿ˜ŽPaper arxiv.org/pdf/2305.04232.pdf
๐Ÿ˜ŽDataset www.tech4animals.org/catflw
๐Ÿฅฐ17โค4๐Ÿ‘3๐Ÿ˜ฑ1๐Ÿคฉ1๐Ÿ˜1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿก4K4D: Real-Time 4D at 4K๐Ÿก

๐Ÿ‘‰THE new SOTA in view synthesis of dynamic 3D scenes at 4K. 30x faster, up to 400 FPS. Nuts!

๐Ÿ˜ŽReview https://t.ly/6ddQh
๐Ÿ˜ŽPaper arxiv.org/pdf/2310.11448.pdf
๐Ÿ˜ŽProject zju3dv.github.io/4k4d/
๐Ÿ˜ŽCode github.com/zju3dv/4K4D
๐Ÿ”ฅ8๐Ÿ‘5๐Ÿคฏ5โค1๐Ÿ˜ฑ1๐Ÿคฉ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ›ฃ๏ธ Holistic Parking Detection (YOLO) ๐Ÿ›ฃ๏ธ

๐Ÿ‘‰ One-step Holistic Parking Slot Network: a tailor-made adaptation of YOLOv4 algorithm for all-shaped parking slot detection

๐Ÿ˜ŽReview https://t.ly/2l4ZG
๐Ÿ˜ŽPaper arxiv.org/pdf/2310.11629.pdf
๐Ÿ”ฅ8๐Ÿคฏ6โค4๐Ÿคฉ3๐Ÿ‘1๐Ÿพ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿˆ Cutie: VOS with heavy occlusions๐Ÿˆ

๐Ÿ‘‰Cutie: novel VOS for challenging scenarios with heavy occlusions & distractors

๐Ÿ˜ŽReview https://t.ly/W3FR-
๐Ÿ˜ŽPaper arxiv.org/pdf/2310.12982.pdf
๐Ÿ˜ŽProject https://hkchengrex.com/Cutie
๐Ÿ˜ŽCode https://github.com/hkchengrex/Cutie
๐Ÿ‘13๐Ÿคฃ3โค1๐Ÿคฏ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿงก Rotoscoping Prince Of Persia (1985) ๐Ÿงก

๐Ÿ‘‰ A rare footage for the animation of Prince of Persia (1989). Damn Romantic.

๐Ÿ˜Ž More https://t.ly/xJife
โค17๐Ÿ‘2๐Ÿ‘2๐Ÿฅฐ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿช›PACE: new SOTA Motion๐Ÿช›

๐Ÿ‘‰#Nvidia unveils the novel SOTA to estimate the human motion in a global scene from moving cams. Stunning results.

๐Ÿ˜ŽReview https://t.ly/20you
๐Ÿ˜ŽProject https://nvlabs.github.io/PACE
๐Ÿ˜ŽPaper https://arxiv.org/pdf/2310.13768.pdf
๐Ÿคฃ5โค4๐Ÿ”ฅ1๐Ÿคฏ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ŸฅคNanoSAM: SAM on low-cost boards๐Ÿฅค

๐Ÿ‘‰NanoSAM is a Segment Anything variant capable of running in real-time on #NVIDIA Jetson Orin with TensorRT

๐Ÿ˜ŽReview https://t.ly/UErq_
๐Ÿ˜ŽTutorial https://github.com/NVIDIA-AI-IOT/nanosam
๐Ÿ”ฅ11๐Ÿ‘1๐Ÿ‘1๐Ÿคฏ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿง‚ SOTA RGB-D Video Salient Object ๐Ÿง‚

๐Ÿ‘‰ DCTNet+ (model) and RDVS(dataset) for a new SOTA in Video Saliency Object Detection

๐Ÿ˜ŽReview https://t.ly/DapLV
๐Ÿ˜ŽCode github.com/kerenfu/RDVS
๐Ÿ˜ŽPaper arxiv.org/pdf/2310.15482.pdf
๐Ÿ”ฅ4๐Ÿ‘1๐Ÿคฏ1
This media is not supported in your browser
VIEW IN TELEGRAM
โœŒ๏ธ Relighted 3D Hands ๐Ÿคž

๐Ÿ‘‰#META unveils Re:InterHand: a large dataset of relighted 3D interacting hands

๐Ÿ˜ŽReview https://t.ly/I1dQk
๐Ÿ˜ŽPaper arxiv.org/pdf/2310.17768.pdf
๐Ÿ˜ŽProject mks0601.github.io/ReInterHand
๐Ÿ˜ŽData github.com/mks0601/ReInterHand
๐Ÿคฏ8โค1๐Ÿ˜ฑ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ„ Video Understanding with GPT-4V(ision) ๐Ÿ„

๐Ÿ‘‰ #Microsoft unveils MM-Vid, the most advanced video understanding framework (w/ #chatgpt4). Impressive results on long-form videos & intricate tasks such as audio description & multimodal high-level comprehension

๐Ÿ˜ŽReview https://t.ly/RISMm
๐Ÿ˜ŽPaper arxiv.org/pdf/2310.19773.pdf
๐Ÿ˜ŽProject https://multimodal-vid.github.io
๐Ÿคฏ22๐Ÿ‘9๐Ÿ”ฅ2๐Ÿ‘1๐Ÿ˜ฑ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ‘ฃ Foot via Synthetic Data ๐Ÿ‘ฃ

๐Ÿ‘‰ 50,000 synthetic/photorealistic foot images + a novel SOTA library for foot

๐Ÿ˜ŽReview https://t.ly/TVanP
๐Ÿ˜ŽPaper https://arxiv.org/pdf/2310.18279.pdf
๐Ÿ˜ŽProject https://ollieboyne.github.io/FOUND
๐Ÿ˜ŽCode https://github.com/OllieBoyne/FOUND
๐Ÿคฃ8๐Ÿ‘4โค2๐Ÿฅฐ2๐Ÿคฉ2
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿš› OYSTER: unsupervised detection w/ LIDAR ๐Ÿš›

๐Ÿ‘‰Waabi unveils OYSTER: a novel unsupervised object detection from LiDAR point clouds.

๐Ÿ˜ŽReview https://t.ly/EMi58
๐Ÿ˜ŽProject https://waabi.ai/oyster/
๐Ÿ˜ŽPaper arxiv.org/pdf/2311.02007.pdf
โค15๐Ÿ‘3๐Ÿ”ฅ2๐Ÿ‘1
๐Ÿ”ฅGPT-4 Pass the Turing Test?๐Ÿ”ฅ

๐Ÿ‘‰No. I mean...not yet. Read this Paper from UC San Diego๐Ÿ‘‡

๐Ÿ˜ŽReview https://t.ly/o8HgM
๐Ÿ˜ŽPaper https://arxiv.org/pdf/2310.20216.pdf
โค4๐Ÿ”ฅ3๐Ÿ‘1๐Ÿคฉ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ŸฅปSF: Towards Virtual Cloth๐Ÿฅป

๐Ÿ‘‰SEA AI Lab unveils a novel #AI to recovery the garment sewing patterns from daily photos for #AR / #VR worlds

๐Ÿ˜ŽReview https://t.ly/MwpAV
๐Ÿ˜ŽProject https://sewformer.github.io/
๐Ÿ˜ŽPaper https://arxiv.org/pdf/2311.04218.pdf
๐Ÿ˜ŽCode https://github.com/sail-sg/sewformer
๐Ÿ‘4๐Ÿ”ฅ2๐Ÿฅฐ2๐Ÿ‘2๐Ÿคฏ1๐Ÿคฉ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ›‹๏ธ 3DiffTection: new SOTA 3D detection ๐Ÿ›‹๏ธ

๐Ÿ‘‰#Nvidia unveils 3DiffTection, the new SOTA for 3D object detection from single images. A powerful 3D detector powered by diffusion model

๐Ÿ˜ŽReview https://t.ly/PciXY
๐Ÿ˜ŽPaper https://arxiv.org/pdf/2311.04391.pdf
๐Ÿ˜ŽCode https://github.com/nv-tlabs/3DiffTection
๐Ÿ˜ŽProject research.nvidia.com/labs/toronto-ai/3difftection
๐Ÿ”ฅ8โค6๐Ÿ‘3๐Ÿ˜ฑ3๐Ÿ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿช 30x Faster Neural Scenes ๐Ÿช

๐Ÿ‘‰ NeuRas: realistic real-time novel-view synthesis of VERY large scenes (>10000 m2 ). 30ร— faster rendering than previous SOTA w/ comparable or better realism

๐Ÿ˜ŽReview https://t.ly/ELJSE
๐Ÿ˜ŽPaper https://arxiv.org/pdf/2311.05607.pdf
๐Ÿ˜ŽProject https://waabi.ai/NeuRas/
๐Ÿ”ฅ9โค1๐Ÿ‘1๐Ÿคฏ1๐Ÿคฉ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ”ฅ Hu.ma.ne #AI Pin is out! ๐Ÿ”ฅ

๐Ÿ‘‰Hu.ma.ne just launched #AI Pin: the new standalone AI-powered screenless device. Running on the GPT-4 LLMs, suitable for real-time translation. #AI-powered camera and laser projector

๐Ÿ˜Ž More https://t.ly/IvoN7
โค6๐Ÿ”ฅ4๐Ÿ’ฉ2๐Ÿ‘1๐Ÿ˜ฑ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿซ€ Segmentation of Human ๐Ÿซ€

๐Ÿ‘‰TotalSegmentator_v2: segmenting 104 anatomical structures (27 organs, 59 bones, 10 muscles, 8 vessels) in CT. Now suitable in 3D Slicer, open source platform for image visualization.

๐Ÿ˜ŽReview https://t.ly/yHMm1
๐Ÿ˜ŽCode https://lnkd.in/dvgrbsCE
๐Ÿ˜ŽPaper https://lnkd.in/dkwHuuzU
๐Ÿ”ฅ14๐Ÿ‘7๐Ÿคฏ6๐Ÿ˜ฑ2โค1๐Ÿคฉ1
๐Ÿช Spacecraft Pose Estimation ๐Ÿช

๐Ÿ‘‰SnT (Luxembourg) unveils the most advanced event-based dataset for Spacecrafts: Unreal Engine + data from ICNS simulator + Real images + Real event data acquired in lab

๐Ÿ˜ŽReview https://t.ly/m8JPB
๐Ÿ˜ŽPaper https://lnkd.in/d_edvc3n
๐Ÿ˜ŽProject https://lnkd.in/dPp375aY
โค7๐Ÿคฏ2๐Ÿ‘1๐Ÿ˜ฑ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ”ฅFlorence-2: unified Computer Vision๐Ÿ”ฅ

๐Ÿ‘‰#Microsoft announces Florence-2: novel foundation model with unified, prompt-based, representation for a large variety of #computervision & vision-language task. One backbone -> multiple tasks!

๐Ÿ‘‰Review https://t.ly/pOins
๐Ÿ‘‰Paper arxiv.org/pdf/2311.06242.pdf
๐Ÿ‘‰Project www.microsoft.com/en-us/research/project/projectflorence/
๐Ÿ˜ฑ9โค5๐Ÿ”ฅ3๐Ÿ‘1๐Ÿ‘1๐Ÿพ1