We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
1 parent a984bf4 commit 3399294Copy full SHA for 3399294
docs/user_guide/request_cancellation.md
@@ -82,6 +82,7 @@ at various points. However, once a request has been given to the backend
82
for execution, it is up to the individual backends to detect and handle
83
request termination.
84
Currently, the following backends support early termination:
85
+- [TensorRT-LLM backend](https://github.com/triton-inference-server/tensorrtllm_backend)
86
- [vLLM backend](https://github.com/triton-inference-server/vllm_backend)
87
- [python backend](https://github.com/triton-inference-server/python_backend)
88
0 commit comments