demo.gif
15 MB
VSP-LLM (Visual Speech Processing incorporated with LLMs)
Code: https://github.com/Sally-SH/VSP-LLM?tab=readme-ov-file
Paper: https://arxiv.org/pdf/2402.15151
@deeplearning_ai
Code: https://github.com/Sally-SH/VSP-LLM?tab=readme-ov-file
Paper: https://arxiv.org/pdf/2402.15151
@deeplearning_ai
π11β€6π₯4π1
This media is not supported in your browser
VIEW IN TELEGRAM
π Introducing: Face Landmark Auto-Labeling Toolkit π§
Tired of manually annotating facial landmarks? Here's a deep learning-powered desktop app that will change your workflow forever!
π§ Key Features:
β 68-point facial landmark detection (PIPNet + ONNX)
β Modern GUI with theme support (Tkinter + TTKBootstrap)
β Smart drag-and-drop editing
β Batch processing of entire folders
β Export as .json and .pts
β Real-time dataset statistics
π½οΈ See it in action:
βΆοΈ Demo Video #1
βΆοΈ Demo Video #2
π Star the repo & share with your community!
π GitHub: https://github.com/Shohruh72/Landmark-Auto-Label
https://github.com/Shohruh72
@deeplearning_ai
Tired of manually annotating facial landmarks? Here's a deep learning-powered desktop app that will change your workflow forever!
π§ Key Features:
β 68-point facial landmark detection (PIPNet + ONNX)
β Modern GUI with theme support (Tkinter + TTKBootstrap)
β Smart drag-and-drop editing
β Batch processing of entire folders
β Export as .json and .pts
β Real-time dataset statistics
π½οΈ See it in action:
βΆοΈ Demo Video #1
βΆοΈ Demo Video #2
π Star the repo & share with your community!
π GitHub: https://github.com/Shohruh72/Landmark-Auto-Label
https://github.com/Shohruh72
@deeplearning_ai
β€9π8π₯4π€©2
π SmolVLM Realtime Inference UI
On-device visionβlanguage in your browserβwebcam, video & image. Zero-config HTML/JS powered by llama.cpp.
π Try it: https://github.com/Shohruh72/SmolVLM-UI
Join my channel:
ππππππ
@MachineLearning_Programming
On-device visionβlanguage in your browserβwebcam, video & image. Zero-config HTML/JS powered by llama.cpp.
π Try it: https://github.com/Shohruh72/SmolVLM-UI
Join my channel:
ππππππ
@MachineLearning_Programming
π11β€8π₯6
π Introducing: High-Resolution Facial Landmark Detection! π―
π Source: Github
@deeplearning_ai
π Source: Github
@deeplearning_ai
β€16π9π₯6
This media is not supported in your browser
VIEW IN TELEGRAM
π LLaVA-FastVLM: One-Click Visual Language API
One-click web interface for Apple's FastVLM vision models
π Source: Github
@deeplearning_ai
One-click web interface for Apple's FastVLM vision models
π Source: Github
@deeplearning_ai
β€12π₯5π€©2
fastvlm-handwriting.gif
4.1 MB
π LLaVA-FastVLM: One-Click Visual Language API
One-click web interface for Apple's FastVLM vision models
π Source: Github
@deeplearning_ai
One-click web interface for Apple's FastVLM vision models
π Source: Github
@deeplearning_ai
β€9π±5π4π₯2
π Refactored HRNet Now Live! π
π₯ Supercharge your computer vision projects with high-resolution HRNet models β fully refactored for easy training/testing!
β Multiple ImageNet-pretrained models
β Lightning-fast setup
β Top-tier accuracy
π Check it out & βοΈ Star the repo if you find it useful!
GitHub: Shohruh72/HRNet
#AI #DeepLearning #OpenSource
@deeplearning_ai
π₯ Supercharge your computer vision projects with high-resolution HRNet models β fully refactored for easy training/testing!
β Multiple ImageNet-pretrained models
β Lightning-fast setup
β Top-tier accuracy
π Check it out & βοΈ Star the repo if you find it useful!
GitHub: Shohruh72/HRNet
#AI #DeepLearning #OpenSource
@deeplearning_ai
β€6π4π4π₯4π€©2
π Generative AI is Taking Over β Will You Lead or Follow? π
β οΈ FREE TRAINING
The AI revolution is here, and top companies are hiring for roles like AI Engineers, Gen AI Experts, and Generative AI Developers.
Register Free Now:
https://live.psitrontech.com
π‘ What Youβll Learn:
β Master Gen AI on AWS, Azure and GCP
β Master GPT, Claude, LLaMA & Stable Diffusion
β Build AI-powered applications & AI Agents
β Stay ahead with LLMOps & AI strategies
π₯ High demand, low competition β Now is the time to upskill!
β³ Opportunities like this donβt last forever!
β οΈ FREE TRAINING
The AI revolution is here, and top companies are hiring for roles like AI Engineers, Gen AI Experts, and Generative AI Developers.
Register Free Now:
https://live.psitrontech.com
π‘ What Youβll Learn:
β Master Gen AI on AWS, Azure and GCP
β Master GPT, Claude, LLaMA & Stable Diffusion
β Build AI-powered applications & AI Agents
β Stay ahead with LLMOps & AI strategies
π₯ High demand, low competition β Now is the time to upskill!
β³ Opportunities like this donβt last forever!
β€10π5
Forwarded from Python | Machine Learning | Coding | R
This channels is for Programmers, Coders, Software Engineers.
0οΈβ£ Python
1οΈβ£ Data Science
2οΈβ£ Machine Learning
3οΈβ£ Data Visualization
4οΈβ£ Artificial Intelligence
5οΈβ£ Data Analysis
6οΈβ£ Statistics
7οΈβ£ Deep Learning
8οΈβ£ programming Languages
β
https://t.me/addlist/8_rRW2scgfRhOTc0
β
https://t.me/Codeprogrammer
Please open Telegram to view this post
VIEW IN TELEGRAM
β€6
This media is not supported in your browser
VIEW IN TELEGRAM
π 3DGazeNet: Next-Gen Gaze Estimation! - get instant results with just one click.
Discover how to train powerful gaze estimation models using only synthetic data and weak supervisionβno huge real-world datasets needed.
Perfect for AR/VR, HCI, and beyond.
Cutting-edge, open-source, and ready for your next project!
π Try it now: https://github.com/Shohruh72/3DGazeNet
#DeepLearning #GazeEstimation #AI
@deeplearning_ai
Discover how to train powerful gaze estimation models using only synthetic data and weak supervisionβno huge real-world datasets needed.
Perfect for AR/VR, HCI, and beyond.
Cutting-edge, open-source, and ready for your next project!
π Try it now: https://github.com/Shohruh72/3DGazeNet
#DeepLearning #GazeEstimation #AI
@deeplearning_ai
β€9π₯3π2