InternLM-XComposer: A Vision-Language Large Model for Advanced Text-image Comprehension and Composition
📝https://github.com/internlm/internlm-xcomposer
Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation
📝https://github.com/showlab/show-1
DreamGaussian: Generative Gaussian Splatting for Efficient 3D Content Creation
📝https://github.com/dreamgaussian/dreamgaussian
[ICLR 2024 Oral]
RoleLLM: Benchmarking, Eliciting, and Enhancing Role-Playing Abilities of Large Language Models
📝https://github.com/interactivenlp-team/rolellm-public
MathCoder: Seamless Code Integration in LLMs for Enhanced Mathematical Reasoning
📝https://github.com/mathllm/mathcoder
Family of LLMs for mathematical reasoning.
GoLLIE: Annotation Guidelines improve Zero-Shot Information-Extraction
📝https://github.com/hitz-zentroa/gollie
Guideline-following Large Language Model for Information Extraction.
Can large language models provide useful feedback on research papers? A large-scale empirical analysis
📝https://github.com/weixin-liang/llm-scientific-feedback
AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation
📝https://github.com/microsoft/autogen
A programming framework for agentic AI. PyPI: autogen-agentchat | Discord: https://aka.ms/autogen-discord | Office Hour: https://aka.ms/autogen-officehour
Aligning Text-to-Image Diffusion Models with Reward Backpropagation
📝https://github.com/mihirp1998/alignprop
AlignProp uses direct reward backpropagation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more sample- and compute-efficient than reinforcement learning methods...
MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens
📝https://github.com/eric-ai-lab/minigpt-5
Official implementation of the paper.
ScaleCrafter: Tuning-free Higher-Resolution Visual Generation with Diffusion Models
📝https://github.com/yingqinghe/scalecrafter
[ICLR 2024 Spotlight] Official implementation of ScaleCrafter for higher-resolution visual generation at inference time.
Octopus: Embodied Vision-Language Programmer from Environmental Feedback
📝https://github.com/dongyh20/octopus
🐙 Octopus, an embodied vision-language model trained with RLEF, emerging superior in embodied visual planning and programming.
From CLIP to DINO: Visual Encoders Shout in Multi-modal Large Language Models
📝https://github.com/yuchenliu98/comm
PyTorch code for the paper.
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
📝https://github.com/luosiallen/latent-consistency-model