Machine Learning Projects

ProlificDreamer: High-Fidelity and Diverse Text-to-3D Generation with Variational Score Distillation

📝https://github.com/threestudio-project/threestudio

GitHub - threestudio-project/threestudio: A unified framework for 3D content generation.

OccNeRF: Self-Supervised Multi-Camera Occupancy Prediction with Neural Radiance Fields

📝https://github.com/linshan-bin/occnerf

GitHub - LinShan-Bin/OccNeRF: Code of "OccNeRF: Self-Supervised Multi-Camera Occupancy Prediction with Neural Radiance Fields".

Using Sequences of Life-events to Predict Human Lives

📝https://github.com/SocialComplexityLab/life2vec

GitHub - SocialComplexityLab/life2vec

PowerInfer: Fast Large Language Model Serving with a Consumer-grade GPU

📝https://github.com/sjtu-ipads/powerinfer

GitHub - SJTU-IPADS/PowerInfer: High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

KwaiAgents: Generalized Information-seeking Agent System with Large Language Models

📝https://github.com/kwaikeg/kwaiagents

GitHub - KwaiKEG/KwaiAgents: A generalized information-seeking agent system with Large Language Models (LLMs).

Generative Multimodal Models are In-Context Learners

📝https://github.com/baaivision/emu

GitHub - baaivision/Emu: Emu Series: Generative Multimodal Models from BAAI

AnyText: Multilingual Visual Text Generation And Editing

📝https://github.com/tyxsspa/anytext

GitHub - tyxsspa/AnyText

Atom: Low-bit Quantization for Efficient and Accurate LLM Serving

📝https://github.com/efeslab/atom

GitHub - efeslab/Atom: Atom: Low-bit Quantization for Efficient and Accurate LLM Serving

Video Understanding with Large Language Models: A Survey

📝https://github.com/yunlong10/awesome-llms-for-video-understanding

GitHub - yunlong10/Awesome-LLMs-for-Video-Understanding: 🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.

GPT-4V(ision) is a Generalist Web Agent, if Grounded

📝https://github.com/osu-nlp-group/seeact

GitHub - OSU-NLP-Group/SeeAct: SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large multimodal models (LMMs) such as GPT-4V(ision).

WikiChat: Stopping the Hallucination of Large Language Model Chatbots by Few-Shot Grounding on Wikipedia

📝https://github.com/stanford-oval/wikichat

GitHub - stanford-oval/WikiChat: WikiChat is an improved RAG. It stops the hallucination of large language models by retrieving data from a corpus.

From Audio to Photoreal Embodiment: Synthesizing Humans in Conversations

📝https://github.com/facebookresearch/audio2photoreal

GitHub - facebookresearch/audio2photoreal: Code and dataset for photorealistic Codec Avatars driven from audio

InstantID: Zero-shot Identity-Preserving Generation in Seconds

📝https://github.com/instantid/instantid

GitHub - instantX-research/InstantID: InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥

AesBench: An Expert Benchmark for Multimodal Large Language Models on Image Aesthetics Perception

📝https://github.com/yipoh/aesbench

GitHub - yipoh/AesBench: An expert benchmark aiming to comprehensively evaluate the aesthetic perception capacities of MLLMs.

Forging Vision Foundation Models for Autonomous Driving: Challenges, Methodologies, and Opportunities

📝https://github.com/zhanghm1995/forge_vfm4ad

GitHub - zhanghm1995/Forge_VFM4AD: A comprehensive survey of forging vision foundation models for autonomous driving, including challenges, methodologies, and opportunities.

World Model on Million-Length Video And Language With RingAttention

📝https://github.com/LargeWorldModel/LWM

GitHub - LargeWorldModel/LWM: Large World Model -- Modeling Text and Video with Millions Context

UFO: A UI-Focused Agent for Windows OS Interaction

📝https://github.com/microsoft/UFO

GitHub - microsoft/UFO: The Desktop AgentOS.

Revisiting Feature Prediction for Learning Visual Representations from Video

📝https://github.com/facebookresearch/jepa

GitHub - facebookresearch/jepa: PyTorch code and models for V-JEPA self-supervised learning from video.

GaussianObject: Just Taking Four Images to Get A High-Quality 3D Object with Gaussian Splatting

📝https://github.com/GaussianObject/GaussianObject

GitHub - chensjtu/GaussianObject: GaussianObject: High-Quality 3D Object Reconstruction from Four Views with Gaussian Splatting (SIGGRAPH Asia 2024, TOG)

SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers

📝https://github.com/willisma/sit

GitHub - willisma/SiT: Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers"

YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information

📝https://github.com/wongkinyiu/yolov9

GitHub - WongKinYiu/yolov9: Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information
