Transparent Image Layer Diffusion using Latent Transparency
📝https://github.com/layerdiffusion/layerdiffusion
Transparent Image Layer Diffusion using Latent Transparency - layerdiffusion/LayerDiffuse
RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval
📝https://github.com/parthsarthi03/RAPTOR
The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval - parthsarthi03/raptor
TripoSR: Fast 3D Object Reconstruction from a Single Image
📝https://github.com/vast-ai-research/triposr
V3D: Video Diffusion Models are Effective 3D Generators
📝https://github.com/heheyas/v3d
V3D: Video Diffusion Models are Effective 3D Generators - heheyas/V3D
Extreme Compression of Large Language Models via Additive Quantization
📝https://github.com/vahe1994/aqlm
Official Pytorch repository for Extreme Compression of Large Language Models via Additive Quantization https://arxiv.org/pdf/2401.06118.pdf - Vahe1994/AQLM
LLaVA-UHD: an LMM Perceiving Any Aspect Ratio and High-Resolution Images
🖥https://github.com/openbmb/minicpm-v
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone - OpenBMB/MiniCPM-o
LightAutoML: AutoML Solution for a Large Financial Services Ecosystem
🖥https://github.com/sb-ai-lab/lightautoml
Fast and customizable framework for automatic ML model creation (AutoML) - sb-ai-lab/LightAutoML
How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites
🖥https://github.com/opengvlab/internvl
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal chat model approaching GPT-4o performance - OpenGVLab/InternVL
Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
🖥https://github.com/tencent/hunyuandit
Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding - Tencent/HunyuanDiT
Scaling Synthetic Data Creation with 1,000,000,000 Personas
🖥https://github.com/tencent-ailab/persona-hub
Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas" - tencent-ailab/persona-hub
HippoRAG: Neurobiologically Inspired Long-Term Memory for Large Language Models
🖥https://github.com/osu-nlp-group/hipporag
[NeurIPS'24] HippoRAG is a novel RAG framework inspired by human long-term memory that enables LLMs to continuously integrate knowledge across external documents. RAG + Knowledge Graphs + P...
Be-Your-Outpainter: Mastering Video Outpainting through Input-Specific Adaptation
🖥https://github.com/g-u-n/be-your-outpainter
[ECCV 2024] Be-Your-Outpainter https://arxiv.org/abs/2403.13745 - G-U-N/Be-Your-Outpainter
Agentless: Demystifying LLM-based Software Engineering Agents
🖥https://github.com/OpenAutoCoder/Agentless
Agentless🐱: an agentless approach to automatically solve software development problems - OpenAutoCoder/Agentless