Проекты машинного обучения

AnyText: Multilingual Visual Text Generation And Editing

📝https://github.com/tyxsspa/anytext

GitHub

GitHub - tyxsspa/AnyText

Contribute to tyxsspa/AnyText development by creating an account on GitHub.

28 views07:25

Проекты машинного обучения

Atom: Low-bit Quantization for Efficient and Accurate LLM Serving

📝https://github.com/efeslab/atom

GitHub

GitHub - efeslab/Atom: Atom: Low-bit Quantization for Efficient and Accurate LLM Serving

Atom: Low-bit Quantization for Efficient and Accurate LLM Serving - GitHub - efeslab/Atom: Atom: Low-bit Quantization for Efficient and Accurate LLM Serving

25 views07:06

Проекты машинного обучения

Video Understanding with Large Language Models: A Survey

📝https://github.com/yunlong10/awesome-llms-for-video-understanding

GitHub

GitHub - yunlong10/Awesome-LLMs-for-Video-Understanding: 🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.

🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs. Contribute to yunlong10/Awesome-LLMs-for-Video-Understanding development by creating an account on GitHub.

35 views09:07

Проекты машинного обучения

GPT-4V(ision) is a Generalist Web Agent, if Grounded

📝https://github.com/osu-nlp-group/seeact

GitHub

GitHub - OSU-NLP-Group/SeeAct: SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website…

SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large multimodal models (LMMs) such as GPT-4V(ision). - GitHub - OSU-NLP-Group/S...

38 views14:07

Проекты машинного обучения

WikiChat: Stopping the Hallucination of Large Language Model Chatbots by Few-Shot Grounding on Wikipedia

📝https://github.com/stanford-oval/wikichat

GitHub

GitHub - stanford-oval/WikiChat: WikiChat is an improved RAG. It stops the hallucination of large language models by retrieving…

WikiChat is an improved RAG. It stops the hallucination of large language models by retrieving data from a corpus. - stanford-oval/WikiChat

37 views08:59

Проекты машинного обучения

From Audio to Photoreal Embodiment: Synthesizing Humans in Conversations

📝https://github.com/facebookresearch/audio2photoreal

GitHub

GitHub - facebookresearch/audio2photoreal: Code and dataset for photorealistic Codec Avatars driven from audio

Code and dataset for photorealistic Codec Avatars driven from audio - facebookresearch/audio2photoreal

46 views12:07

Проекты машинного обучения

InstantID: Zero-shot Identity-Preserving Generation in Seconds

📝https://github.com/instantid/instantid

GitHub

GitHub - instantX-research/InstantID: InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥

InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥 - instantX-research/InstantID

37 views10:45

Проекты машинного обучения

AesBench: An Expert Benchmark for Multimodal Large Language Models on Image Aesthetics Perception

📝https://github.com/yipoh/aesbench

GitHub

GitHub - yipoh/AesBench: An expert benchmark aiming to comprehensively evaluate the aesthetic perception capacities of MLLMs.

An expert benchmark aiming to comprehensively evaluate the aesthetic perception capacities of MLLMs. - GitHub - yipoh/AesBench: An expert benchmark aiming to comprehensively evaluate the aesthetic ...

43 views11:46

Проекты машинного обучения

Forging Vision Foundation Models for Autonomous Driving: Challenges, Methodologies, and Opportunities

📝https://github.com/zhanghm1995/forge_vfm4ad

GitHub

GitHub - zhanghm1995/Forge_VFM4AD: A comprehensive survey of forging vision foundation models for autonomous driving, including…

A comprehensive survey of forging vision foundation models for autonomous driving, including challenges, methodologies, and opportunities. - GitHub - zhanghm1995/Forge_VFM4AD: A comprehensive surv...

48 views14:46

Проекты машинного обучения

World Model on Million-Length Video And Language With RingAttention

📝https://github.com/LargeWorldModel/LWM

GitHub

GitHub - LargeWorldModel/LWM: Large World Model -- Modeling Text and Video with Millions Context

Large World Model -- Modeling Text and Video with Millions Context - LargeWorldModel/LWM

32 views10:48

Проекты машинного обучения

UFO: A UI-Focused Agent for Windows OS Interaction

📝https://github.com/microsoft/UFO

GitHub

GitHub - microsoft/UFO: The Desktop AgentOS.

The Desktop AgentOS. Contribute to microsoft/UFO development by creating an account on GitHub.

31 views13:48

Проекты машинного обучения

Revisiting Feature Prediction for Learning Visual Representations from Video

📝https://github.com/facebookresearch/jepa

GitHub

GitHub - facebookresearch/jepa: PyTorch code and models for V-JEPA self-supervised learning from video.

PyTorch code and models for V-JEPA self-supervised learning from video. - facebookresearch/jepa

29 views16:36

Проекты машинного обучения

GaussianObject: Just Taking Four Images to Get A High-Quality 3D Object with Gaussian Splatting

📝https://github.com/GaussianObject/GaussianObject

GitHub

GitHub - chensjtu/GaussianObject: GaussianObject: High-Quality 3D Object Reconstruction from Four Views with Gaussian Splatting…

GaussianObject: High-Quality 3D Object Reconstruction from Four Views with Gaussian Splatting (SIGGRAPH Asia 2024, TOG) - chensjtu/GaussianObject

38 views12:51

Проекты машинного обучения

SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers

📝https://github.com/willisma/sit

GitHub

GitHub - willisma/SiT: Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable…

Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers" - willisma/SiT

36 views18:16

Проекты машинного обучения

YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information

📝https://github.com/wongkinyiu/yolov9

GitHub

GitHub - WongKinYiu/yolov9: Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information

Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information - WongKinYiu/yolov9

35 views18:16

Проекты машинного обучения

Vectorized and performance-portable Quicksort

📝https://github.com/google/highway

GitHub

GitHub - google/highway: Performance-portable, length-agnostic SIMD with runtime dispatch

Performance-portable, length-agnostic SIMD with runtime dispatch - google/highway

37 views06:35

Проекты машинного обучения

Transparent Image Layer Diffusion using Latent Transparency

📝https://github.com/layerdiffusion/layerdiffusion

GitHub

GitHub - layerdiffusion/LayerDiffuse: Transparent Image Layer Diffusion using Latent Transparency

Transparent Image Layer Diffusion using Latent Transparency - layerdiffusion/LayerDiffuse

38 views06:33

Проекты машинного обучения

RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval

📝https://github.com/parthsarthi03/RAPTOR

GitHub

GitHub - parthsarthi03/raptor: The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval

The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval - parthsarthi03/raptor

28 views06:54

Проекты машинного обучения

TripoSR: Fast 3D Object Reconstruction from a Single Image

📝https://github.com/vast-ai-research/triposr

GitHub

GitHub - VAST-AI-Research/TripoSR

Contribute to VAST-AI-Research/TripoSR development by creating an account on GitHub.

32 views07:05

Проекты машинного обучения

V3D: Video Diffusion Models are Effective 3D Generators
📝https://github.com/heheyas/v3d

GitHub

GitHub - heheyas/V3D: V3D: Video Diffusion Models are Effective 3D Generators

V3D: Video Diffusion Models are Effective 3D Generators - heheyas/V3D

36 views06:25

Проекты машинного обучения

Extreme Compression of Large Language Models via Additive Quantization

📝https://github.com/vahe1994/aqlm

GitHub

GitHub - Vahe1994/AQLM: Official Pytorch repository for Extreme Compression of Large Language Models via Additive Quantization…

Official Pytorch repository for Extreme Compression of Large Language Models via Additive Quantization https://arxiv.org/pdf/2401.06118.pdf - Vahe1994/AQLM

53 views07:26

About

Blog

Apps

Platform