Code Stars

dipampaul17/KVSplit
Run larger LLMs with longer contexts on Apple Silicon by using differentiated precision for KV cache quantization. KVSplit enables 8-bit keys & 4-bit values, reducing memory by 59% with <1% quality loss. Includes benchmarking, visualization, and one-command setup. Optimized for M1/M2/M3 Macs with Metal support.
Language:Python
Total stars: 144
Stars trend:

16 May 2025
 7pm ▏ +1
 8pm █████▌ +44
 9pm ████▊ +38
10pm ███▋ +29
11pm ██▎ +18

#python
#applesilicon, #generativeai, #kvcache, #llamacpp, #llm, #m1, #m2, #m3, #memoryoptimization, #metal, #optimization, #quantization

86 views00:17

Code Stars

MemTensor/MemOS
MemOS (Preview) | Intelligence Begins with Memory
Language:Python
Total stars: 162
Stars trend:

6 Jul 2025
11pm ▏ +1
7 Jul 2025
12am  +0
 1am  +0
 2am ▉ +7
 3am █▎ +10
 4am ▊ +6
 5am ████▍ +35
 6am ███▍ +27

#python
#agent, #kvcache, #languagemodel, #llm, #lora, #memcube, #memory, #memos, #neo4j, #tree

113 views07:17

About

Blog

Apps

Platform