FMInference/FlexGen
Running large language models like OPT-175B/GPT-3 on a single GPU. Up to 100x faster than other offloading systems.
Language: Python
#chatgpt #deep_learning #gpt_3 #high_throughput #large_language_models #machine_learning #offloading #opt
Stars: 1799 Issues: 11 Forks: 72
https://github.com/FMInference/FlexGen