I'm excited to attend KubeCon + CloudNativeCon North America 2024 https://kccncna2024.sched.com/event/1i7oh/load-aware-gpu-fractioning-for-llm-inference-on-kubernetes-olivier-tardieu-yue-zhu-ibm @sched