Notes about running a chat completion API endpoint with TensorRT-LLM and Meta-Llama-3–8B-Instruct Read More