https://www.ilsoftware.it/llm-large-language-models-sulle-gpu-consumer-con-exllamav2/
LLM: Large Language Models on consumer GPUs with ExLlamaV2