FreeNoise: Tuning-Free Longer Video Diffusion Via Noise Rescheduling
📝https://github.com/arthur-qiu/longercrafter
GitHub - AILab-CVC/FreeNoise: [ICLR 2024] Code for FreeNoise based on VideoCrafter
SALMONN: Towards Generic Hearing Abilities for Large Language Models
📝https://github.com/bytedance/salmonn
GitHub - bytedance/SALMONN: SALMONN: Speech Audio Language Music Open Neural Network
DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior
📝https://github.com/deepseek-ai/dreamcraft3d
GitHub - deepseek-ai/DreamCraft3D: [ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior
DISC-FinLLM: A Chinese Financial Large Language Model based on Multiple Experts Fine-tuning
📝https://github.com/fudandisc/disc-finllm
GitHub - FudanDISC/DISC-FinLLM: DISC-FinLLM, a Chinese financial large language model (LLM) designed to provide users with professional, intelligent, and comprehensive financ...
Evaluating Large Language Models: A Comprehensive Survey
📝https://github.com/tjunlp-lab/awesome-llms-evaluation-papers
GitHub - tjunlp-lab/Awesome-LLMs-Evaluation-Papers: The papers are organized according to our survey: Evaluating Large Language Models: A Comprehensive Survey.
DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
📝https://github.com/ailab-cvc/videocrafter
GitHub - AILab-CVC/VideoCrafter: VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
VideoCrafter1: Open Diffusion Models for High-Quality Video Generation
📝https://github.com/videocrafter/videocrafter
GitHub - AILab-CVC/VideoCrafter: VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo Labelling
📝https://github.com/huggingface/distil-whisper
GitHub - huggingface/distil-whisper: Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
A Survey of Large Language Models for Autonomous Driving
📝https://github.com/thinklab-sjtu/awesome-llm4ad
GitHub - Thinklab-SJTU/Awesome-LLM4AD: A curated list of awesome LLM for Autonomous Driving resources (continually updated)
LCM-LoRA: A Universal Stable-Diffusion Acceleration Module
📝https://github.com/luosiallen/latent-consistency-model
GitHub - luosiallen/latent-consistency-model: Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
Contrastive Post-training Large Language Models on Data Curriculum
📝https://github.com/imoneoi/openchat
GitHub - imoneoi/openchat: OpenChat: Advancing Open-source Language Models with Imperfect Data
Qwen-Audio: Advancing Universal Audio Understanding via Unified Large-Scale Audio-Language Models
📝https://github.com/qwenlm/qwen-audio
GitHub - QwenLM/Qwen-Audio: The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.
GraphCast: Learning skillful medium-range global weather forecasting
📝https://github.com/deepmind/graphcast
GitHub - google-deepmind/graphcast
Monkey: Image Resolution and Text Label Are Important Things for Large Multi-modal Models
📝https://github.com/yuliang-liu/monkey
GitHub - Yuliang-Liu/Monkey: 【CVPR 2024 Highlight】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models
Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
📝https://github.com/PKU-YuanGroup/Video-LLaVA
GitHub - PKU-YuanGroup/Video-LLaVA: 【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
📝https://github.com/yl4579/StyleTTS2
GitHub - yl4579/StyleTTS2: StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
Black-Box Prompt Optimization: Aligning Large Language Models without Model Training
📝https://github.com/thu-coai/bpo
GitHub - thu-coai/BPO