liltom-eth/llama2-webui
Run Llama 2 locally with gradio UI on GPU or CPU from anywhere (Linux/Windows/Mac). Supporting Llama-2-7B/13B/70B with 8-bit, 4-bit. Supporting GPU inference (6 GB VRAM) and CPU inference.
Language: Python
#llama_2 #llama2 #llm #llm_inference
Stars: 481 Issues: 2 Forks: 42
https://github.com/liltom-eth/llama2-webui
Run Llama 2 locally with gradio UI on GPU or CPU from anywhere (Linux/Windows/Mac). Supporting Llama-2-7B/13B/70B with 8-bit, 4-bit. Supporting GPU inference (6 GB VRAM) and CPU inference.
Language: Python
#llama_2 #llama2 #llm #llm_inference
Stars: 481 Issues: 2 Forks: 42
https://github.com/liltom-eth/llama2-webui
GitHub
GitHub - liltom-eth/llama2-webui: Run any Llama 2 locally with gradio UI on GPU or CPU from anywhere (Linux/Windows/Mac). Use `llama2β¦
Run any Llama 2 locally with gradio UI on GPU or CPU from anywhere (Linux/Windows/Mac). Use `llama2-wrapper` as your local llama2 backend for Generative Agents/Apps. - GitHub - liltom-eth/llama2-...
π4π₯1π₯°1π1
SJTU-IPADS/PowerInfer
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
Language: C
#falcon #large_language_models #llama #llm #llm_inference #local_inference
Stars: 792 Issues: 8 Forks: 32
https://github.com/SJTU-IPADS/PowerInfer
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
Language: C
#falcon #large_language_models #llama #llm #llm_inference #local_inference
Stars: 792 Issues: 8 Forks: 32
https://github.com/SJTU-IPADS/PowerInfer
GitHub
GitHub - SJTU-IPADS/PowerInfer: High-speed Large Language Model Serving for Local Deployment
High-speed Large Language Model Serving for Local Deployment - SJTU-IPADS/PowerInfer
π2β€1
hpcaitech/SwiftInfer
Efficient AI Inference & Serving
Language: Python
#artificial_intelligence #deep_learning #gpt #inference #llama #llama2 #llm_inference #llm_serving
Stars: 299 Issues: 3 Forks: 14
https://github.com/hpcaitech/SwiftInfer
Efficient AI Inference & Serving
Language: Python
#artificial_intelligence #deep_learning #gpt #inference #llama #llama2 #llm_inference #llm_serving
Stars: 299 Issues: 3 Forks: 14
https://github.com/hpcaitech/SwiftInfer
GitHub
GitHub - hpcaitech/SwiftInfer: Efficient AI Inference & Serving
Efficient AI Inference & Serving. Contribute to hpcaitech/SwiftInfer development by creating an account on GitHub.
π₯1
databricks/dbrx
Code examples and resources for DBRX, a large language model developed by Databricks
Language: Python
#databricks #gen_ai #generative_ai #llm #llm_inference #llm_training #mosaic_ai
Stars: 1113 Issues: 7 Forks: 86
https://github.com/databricks/dbrx
Code examples and resources for DBRX, a large language model developed by Databricks
Language: Python
#databricks #gen_ai #generative_ai #llm #llm_inference #llm_training #mosaic_ai
Stars: 1113 Issues: 7 Forks: 86
https://github.com/databricks/dbrx
GitHub
GitHub - databricks/dbrx: Code examples and resources for DBRX, a large language model developed by Databricks
Code examples and resources for DBRX, a large language model developed by Databricks - databricks/dbrx
arc53/llm-price-compass
LLM provider price comparison, gpu benchmarks to price per token calculation, gpu benchmark table
Language: TypeScript
#benchmark #gpu #inference_comparison #llm #llm_comparison #llm_inference #llm_price
Stars: 138 Issues: 1 Forks: 5
https://github.com/arc53/llm-price-compass
LLM provider price comparison, gpu benchmarks to price per token calculation, gpu benchmark table
Language: TypeScript
#benchmark #gpu #inference_comparison #llm #llm_comparison #llm_inference #llm_price
Stars: 138 Issues: 1 Forks: 5
https://github.com/arc53/llm-price-compass
GitHub
GitHub - arc53/llm-price-compass: This project collects GPU benchmarks from various cloud providers and compares them to fixedβ¦
This project collects GPU benchmarks from various cloud providers and compares them to fixed per token costs. Use our tool for efficient LLM GPU selections and cost-effective AI models. LLM provide...
MLSys-Learner-Resources/Awesome-MLSys-Blogger
The repository has collected a batch of noteworthy MLSys bloggers (Algorithms/Systems)
Language: HTML
#llm #llm_inference #llm_training #machine_learning #machine_learning_systems #mlsys
Stars: 120 Issues: 0 Forks: 0
https://github.com/MLSys-Learner-Resources/Awesome-MLSys-Blogger
The repository has collected a batch of noteworthy MLSys bloggers (Algorithms/Systems)
Language: HTML
#llm #llm_inference #llm_training #machine_learning #machine_learning_systems #mlsys
Stars: 120 Issues: 0 Forks: 0
https://github.com/MLSys-Learner-Resources/Awesome-MLSys-Blogger
GitHub
GitHub - MLSys-Learner-Resources/Awesome-MLSys-Blogger: The repository has collected a batch of noteworthy MLSys bloggers (Algβ¦
The repository has collected a batch of noteworthy MLSys bloggers (Algorithms/Systems) - MLSys-Learner-Resources/Awesome-MLSys-Blogger
π1
codelion/openevolve
Open-source implementation of AlphaEvolve
Language: Python
#alpha_evolve #alphacode #alphaevolve #coding_agent #deepmind #deepmind_lab #discovery #distributed_evolutionary_algorithms #evolutionary_algorithms #evolutionary_computation #genetic_algorithm #genetic_algorithms #iterative_methods #iterative_refinement #llm_engineering #llm_ensemble #llm_inference #openevolve #optimize
Stars: 312 Issues: 1 Forks: 26
https://github.com/codelion/openevolve
Open-source implementation of AlphaEvolve
Language: Python
#alpha_evolve #alphacode #alphaevolve #coding_agent #deepmind #deepmind_lab #discovery #distributed_evolutionary_algorithms #evolutionary_algorithms #evolutionary_computation #genetic_algorithm #genetic_algorithms #iterative_methods #iterative_refinement #llm_engineering #llm_ensemble #llm_inference #openevolve #optimize
Stars: 312 Issues: 1 Forks: 26
https://github.com/codelion/openevolve
GitHub
GitHub - codelion/openevolve: Open-source implementation of AlphaEvolve
Open-source implementation of AlphaEvolve. Contribute to codelion/openevolve development by creating an account on GitHub.