Проекты машинного обучения

ProlificDreamer: High-Fidelity and Diverse Text-to-3D Generation with Variational Score Distillation

📝https://github.com/threestudio-project/threestudio

GitHub

GitHub - threestudio-project/threestudio: A unified framework for 3D content generation.

A unified framework for 3D content generation. Contribute to threestudio-project/threestudio development by creating an account on GitHub.

28 views13:04

Проекты машинного обучения

OccNeRF: Self-Supervised Multi-Camera Occupancy Prediction with Neural Radiance Fields

📝https://github.com/linshan-bin/occnerf

GitHub

GitHub - LinShan-Bin/OccNeRF: Code of "OccNeRF: Self-Supervised Multi-Camera Occupancy Prediction with Neural Radiance Fields".

Code of "OccNeRF: Self-Supervised Multi-Camera Occupancy Prediction with Neural Radiance Fields". - GitHub - LinShan-Bin/OccNeRF: Code of "OccNeRF: Self-Supervised Multi-...

27 views13:04

Проекты машинного обучения

Using Sequences of Life-events to Predict Human Lives

📝https://github.com/SocialComplexityLab/life2vec

GitHub

GitHub - SocialComplexityLab/life2vec

Contribute to SocialComplexityLab/life2vec development by creating an account on GitHub.

25 views08:17

Проекты машинного обучения

PowerInfer: Fast Large Language Model Serving with a Consumer-grade GPU

📝https://github.com/sjtu-ipads/powerinfer

GitHub

GitHub - SJTU-IPADS/PowerInfer: High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs - SJTU-IPADS/PowerInfer

28 views09:17

Проекты машинного обучения

KwaiAgents: Generalized Information-seeking Agent System with Large Language Models

📝https://github.com/kwaikeg/kwaiagents

GitHub

GitHub - KwaiKEG/KwaiAgents: A generalized information-seeking agent system with Large Language Models (LLMs).

A generalized information-seeking agent system with Large Language Models (LLMs). - KwaiKEG/KwaiAgents

24 views07:49

Проекты машинного обучения

Generative Multimodal Models are In-Context Learners

📝https://github.com/baaivision/emu

GitHub

GitHub - baaivision/Emu: Emu Series: Generative Multimodal Models from BAAI

Emu Series: Generative Multimodal Models from BAAI - baaivision/Emu

28 views06:50

Проекты машинного обучения

AnyText: Multilingual Visual Text Generation And Editing

📝https://github.com/tyxsspa/anytext

GitHub

GitHub - tyxsspa/AnyText

Contribute to tyxsspa/AnyText development by creating an account on GitHub.

28 views07:25

Проекты машинного обучения

Atom: Low-bit Quantization for Efficient and Accurate LLM Serving

📝https://github.com/efeslab/atom

GitHub

GitHub - efeslab/Atom: Atom: Low-bit Quantization for Efficient and Accurate LLM Serving

Atom: Low-bit Quantization for Efficient and Accurate LLM Serving - GitHub - efeslab/Atom: Atom: Low-bit Quantization for Efficient and Accurate LLM Serving

25 views07:06

Проекты машинного обучения

Video Understanding with Large Language Models: A Survey

📝https://github.com/yunlong10/awesome-llms-for-video-understanding

GitHub

GitHub - yunlong10/Awesome-LLMs-for-Video-Understanding: 🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.

🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs. Contribute to yunlong10/Awesome-LLMs-for-Video-Understanding development by creating an account on GitHub.

35 views09:07

Проекты машинного обучения

GPT-4V(ision) is a Generalist Web Agent, if Grounded

📝https://github.com/osu-nlp-group/seeact

GitHub

GitHub - OSU-NLP-Group/SeeAct: SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website…

SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large multimodal models (LMMs) such as GPT-4V(ision). - GitHub - OSU-NLP-Group/S...

38 views14:07

Проекты машинного обучения

WikiChat: Stopping the Hallucination of Large Language Model Chatbots by Few-Shot Grounding on Wikipedia

📝https://github.com/stanford-oval/wikichat

GitHub

GitHub - stanford-oval/WikiChat: WikiChat is an improved RAG. It stops the hallucination of large language models by retrieving…

WikiChat is an improved RAG. It stops the hallucination of large language models by retrieving data from a corpus. - stanford-oval/WikiChat

37 views08:59

Проекты машинного обучения

From Audio to Photoreal Embodiment: Synthesizing Humans in Conversations

📝https://github.com/facebookresearch/audio2photoreal

GitHub

GitHub - facebookresearch/audio2photoreal: Code and dataset for photorealistic Codec Avatars driven from audio

Code and dataset for photorealistic Codec Avatars driven from audio - facebookresearch/audio2photoreal

46 views12:07

About

Blog

Apps

Platform