StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
📝https://github.com/yl4579/StyleTTS2
Black-Box Prompt Optimization: Aligning Large Language Models without Model Training
📝https://github.com/thu-coai/bpo
Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large Datasets
📝https://github.com/stability-ai/generative-models
Improving Sample Quality of Diffusion Models Using Self-Attention Guidance
📝https://github.com/lllyasviel/fooocus
Animatable Gaussians: Learning Pose-dependent Gaussian Maps for High-fidelity Human Avatar Modeling
📝https://github.com/lizhe00/animatablegaussians
Code of [CVPR 2024] "Animatable Gaussians: Learning Pose-dependent Gaussian Maps for High-fidelity Human Avatar Modeling" - lizhe00/AnimatableGaussians
Battle of the Backbones: A Large-Scale Comparison of Pretrained Models across Computer Vision Tasks
📝https://github.com/hsouri/battle-of-the-backbones
Evaluating Large Language Models: A Comprehensive Survey
📝https://github.com/tjunlp-lab/awesome-llms-evaluation-papers
VideoCrafter1: Open Diffusion Models for High-Quality Video Generation
📝https://github.com/videocrafter/videocrafter
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models - AILab-CVC/VideoCrafter
OpenChat: Advancing Open-source Language Models with Mixed-Quality Data
📝https://github.com/imoneoi/openchat
Sequential Modeling Enables Scalable Learning for Large Vision Models
📝https://github.com/ytongbai/LVM
Aligning and Prompting Everything All at Once for Universal Visual Perception
📝https://github.com/shenyunhang/ape
PatchFusion: An End-to-End Tile-Based Framework for High-Resolution Monocular Metric Depth Estimation
📝https://github.com/zhyever/PatchFusion
[CVPR 2024] An End-to-End Tile-Based Framework for High-Resolution Monocular Metric Depth Estimation - zhyever/PatchFusion
EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM
📝https://github.com/chongzhou96/edgesam
AnimateZero: Video Diffusion Models are Zero-Shot Image Animators
📝https://github.com/vvictoryuki/animatezero
FreeInit: Bridging Initialization Gap in Video Diffusion Models
📝https://github.com/tianxingwu/freeinit