diff --git a/docs/user_guide/request_cancellation.md b/docs/user_guide/request_cancellation.md index 2cc3e98bda..8db4e3b8c1 100644 --- a/docs/user_guide/request_cancellation.md +++ b/docs/user_guide/request_cancellation.md @@ -82,6 +82,7 @@ at various points. However, once a request has been given to the backend for execution, it is up to the individual backends to detect and handle request termination. Currently, the following backends support early termination: +- [TensorRT-LLM backend](https://github.com/triton-inference-server/tensorrtllm_backend) - [vLLM backend](https://github.com/triton-inference-server/vllm_backend) - [python backend](https://github.com/triton-inference-server/python_backend)