OpenChat: Advancing Open-source Language Models with Mixed-Quality Data
📝https://github.com/imoneoi/openchat
Sequential Modeling Enables Scalable Learning for Large Vision Models
📝https://github.com/ytongbai/LVM
Aligning and Prompting Everything All at Once for Universal Visual Perception
📝https://github.com/shenyunhang/ape
PatchFusion: An End-to-End Tile-Based Framework for High-Resolution Monocular Metric Depth Estimation
📝https://github.com/zhyever/PatchFusion
[CVPR 2024] An End-to-End Tile-Based Framework for High-Resolution Monocular Metric Depth Estimation
EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM
📝https://github.com/chongzhou96/edgesam
Official PyTorch implementation of "EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM"
AnimateZero: Video Diffusion Models are Zero-Shot Image Animators
📝https://github.com/vvictoryuki/animatezero
Official PyTorch implementation for the paper "AnimateZero: Video Diffusion Models are Zero-Shot Image Animators"
FreeInit: Bridging Initialization Gap in Video Diffusion Models
📝https://github.com/tianxingwu/freeinit
ProlificDreamer: High-Fidelity and Diverse Text-to-3D Generation with Variational Score Distillation
📝https://github.com/threestudio-project/threestudio
threestudio: A unified framework for 3D content generation.
OccNeRF: Self-Supervised Multi-Camera Occupancy Prediction with Neural Radiance Fields
📝https://github.com/linshan-bin/occnerf
Using Sequences of Life-events to Predict Human Lives
📝https://github.com/SocialComplexityLab/life2vec
PowerInfer: Fast Large Language Model Serving with a Consumer-grade GPU
📝https://github.com/sjtu-ipads/powerinfer
KwaiAgents: Generalized Information-seeking Agent System with Large Language Models
📝https://github.com/kwaikeg/kwaiagents
Video Understanding with Large Language Models: A Survey
📝https://github.com/yunlong10/awesome-llms-for-video-understanding
Awesome-LLMs-for-Video-Understanding: 🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.
WikiChat: Stopping the Hallucination of Large Language Model Chatbots by Few-Shot Grounding on Wikipedia
📝https://github.com/stanford-oval/wikichat
WikiChat is an improved RAG. It stops the hallucination of large language models by retrieving data from a corpus.
From Audio to Photoreal Embodiment: Synthesizing Humans in Conversations
📝https://github.com/facebookresearch/audio2photoreal
Code and dataset for photorealistic Codec Avatars driven from audio