🆔🆔 Magic-Me: Identity-Specific Video 🆔🆔
👉#ByteDance (+ UC Berkeley) unveils VCD for video generation: with just a few images of a specific identity, it can generate temporally consistent videos aligned with the given prompt. Impressive results, source code under Apache 2.0 💙
𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Novel Video Custom Diffusion (VCD) framework
✅High-quality ID-specific video generation
✅Improved alignment between ID images and text
✅Robust 3D Gaussian noise prior for denoising
✅Better inter-frame correlation / video consistency
✅New F-VCD/T-VCD modules for video upscaling
✅New training with masked loss via prompt-to-segmentation
#artificialintelligence #machinelearning #ml #AI #deeplearning #computervision #AIwithPapers #metaverse
👉Channel: @deeplearning_ai
👉Paper https://arxiv.org/pdf/2402.09368.pdf
👉Project https://magic-me-webpage.github.io/
👉Code https://github.com/Zhen-Dong/Magic-Me
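The "3D Gaussian noise prior" bullet can be illustrated with a minimal sketch: sharing part of the initial noise across frames correlates the latents that denoising starts from, which encourages temporal consistency. The mixing weight `alpha` and the function name below are illustrative assumptions, not the paper's exact formulation.

```python
import numpy as np

def correlated_video_noise(frames, h, w, c, alpha=0.2, seed=0):
    """Sketch of a 3D Gaussian noise prior: each frame's noise mixes a
    shared base tensor with independent per-frame noise, so adjacent
    frames start denoising from correlated latents. `alpha` is the
    shared-noise weight (an illustrative parameter)."""
    rng = np.random.default_rng(seed)
    shared = rng.standard_normal((1, h, w, c))          # one tensor reused for the whole clip
    per_frame = rng.standard_normal((frames, h, w, c))  # independent noise per frame
    # Mixing with sqrt weights keeps unit variance: alpha + (1 - alpha) = 1
    return np.sqrt(alpha) * shared + np.sqrt(1 - alpha) * per_frame

noise = correlated_video_noise(frames=16, h=8, w=8, c=4)
print(noise.shape)  # (16, 8, 8, 4)
```

With `alpha = 0`, frames are fully independent; with `alpha = 1`, every frame starts from identical noise. Intermediate values trade diversity for inter-frame correlation.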
Introducing ECoDepth: The New Benchmark in Diffusive Mono-Depth
From the labs of IIT Delhi, we unveil ECoDepth, our groundbreaking single-image depth estimation (SIDE) model powered by a diffusion backbone and enriched with ViT embeddings. This innovation sets a new standard in SIDE, offering unprecedented accuracy and semantic understanding.
Key Features:
✅Revolutionary monocular depth estimation (MDE) approach tailored for SIDE tasks
✅Enhanced semantic context via ViT embeddings
✅Superior performance in zero-shot transfer tasks
✅Surpasses previous SOTA models by up to 14%
Dive into the future of depth estimation with ECoDepth. Access our source code and explore the full potential of our model.
📖 Read the Paper
💻 Get the Code
#ArtificialIntelligence #MachineLearning #DeepLearning #ComputerVision #AIwithPapers #Metaverse
join our community:
👉 @deeplearning_ai
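The ViT-embedding idea above can be sketched as follows: a global image embedding from a ViT is projected into a few context tokens for a diffusion backbone to cross-attend to. The random projection weights, token count, and function name are stand-ins that assume this general conditioning pattern; they do not reproduce ECoDepth's actual architecture.

```python
import numpy as np

def vit_conditioning(vit_embed, num_tokens=4, dim=768, seed=0):
    """Sketch: project a ViT's global embedding into `num_tokens`
    context tokens of width `dim` that a diffusion U-Net could
    cross-attend to. The projection matrix here is random stand-in
    weights, purely illustrative."""
    rng = np.random.default_rng(seed)
    W = rng.standard_normal((vit_embed.shape[-1], num_tokens * dim)) * 0.02
    return (vit_embed @ W).reshape(num_tokens, dim)

tokens = vit_conditioning(np.ones(512))
print(tokens.shape)  # (4, 768)
```

The appeal of this pattern is that the semantic content the ViT captures (objects, scene layout) reaches the depth decoder without retraining the vision encoder.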
Neural Bodies with Clothes: Overview
Introduction: Neural-ABC is a cutting-edge parametric model, developed by the University of Science and Technology of China, for representing clothed human bodies.
Key Features:
✅Novel approach for modeling clothed human figures.
✅Unified framework accommodating various clothing types.
✅Consistent representation of both body and clothing.
✅Enables seamless modification of identity, shape, clothing, and pose.
✅Extensive dataset with detailed clothing information.
Explore More:
💻Project Details: Discover More
📖Read the Paper: Access Here
💻Source Code: Explore on GitHub
Relevance: #artificialintelligence #machinelearning #AI #deeplearning #computervision
join our community:
👉 @deeplearning_ai
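The disentangled editing the bullets describe can be sketched with one latent code per factor: because identity, shape, clothing, and pose are stored independently, swapping one code leaves the others untouched. Field names, code sizes, and the `swap_clothing` helper are hypothetical, not Neural-ABC's API.

```python
from dataclasses import dataclass, replace
import numpy as np

@dataclass(frozen=True)
class AvatarCodes:
    """Hypothetical disentangled latent codes, mirroring the factors
    Neural-ABC is described as controlling; sizes are illustrative."""
    identity: np.ndarray
    shape: np.ndarray
    clothing: np.ndarray
    pose: np.ndarray

def swap_clothing(avatar, new_clothing):
    # Disentangled factors: editing one code leaves the rest intact.
    return replace(avatar, clothing=new_clothing)

a = AvatarCodes(identity=np.zeros(64), shape=np.zeros(16),
                clothing=np.zeros(32), pose=np.zeros(24))
b = swap_clothing(a, np.ones(32))
```

In a real pipeline a decoder network would map such codes to geometry; the point here is only the interface that makes "seamless modification" possible.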
🔍 Discover the Power of Fine-Grained Gaze Estimation with L2CS-Net! 🌟
🚀 Key Features:
✅Advanced Architecture: Built using state-of-the-art neural network structures.
✅Versatile Utilities: Packed with utility functions and classes for seamless integration.
✅Robust Data Handling: Efficient data loading, preprocessing, and augmentation.
✅Comprehensive Training & Testing: Easy-to-follow scripts for training and testing your models.
👀 Live Demo:
Visualize the power of L2CS-Net on your own video.
🌟 Join Us:
Star our repo on GitHub and be part of the innovative community pushing the boundaries of gaze estimation. Your support drives us forward!
🔗 GitHub Repository
Let's advance gaze estimation together! 🚀🌐 #GazeEstimation #DeepLearning #AI #MachineLearning #ComputerVision
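For the live demo, a gaze estimator like L2CS-Net typically predicts pitch and yaw angles per face; to overlay a gaze arrow on a video frame you first convert those angles to a direction vector. The convention below is a common one for visualization and is an assumption, not taken verbatim from the L2CS-Net repository.

```python
import numpy as np

def gaze_to_vector(pitch, yaw):
    """Convert predicted pitch/yaw (radians) into a unit 3D gaze
    direction, with -z pointing into the screen. A common convention
    for drawing gaze arrows; other repos may differ in sign."""
    x = -np.cos(pitch) * np.sin(yaw)
    y = -np.sin(pitch)
    z = -np.cos(pitch) * np.cos(yaw)
    return np.array([x, y, z])

v = gaze_to_vector(0.0, 0.0)
print(v)  # [ 0. -0. -1.] — looking straight ahead
```

With OpenCV, scaling the x/y components by an arrow length gives the endpoint for `cv2.arrowedLine` drawn from the detected eye center.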