We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
1 parent ee68c57 commit 441d930Copy full SHA for 441d930
docs/user_guide/request_cancellation.md
@@ -82,6 +82,7 @@ at various points. However, once a request has been given to the backend
82
for execution, it is up to the individual backends to detect and handle
83
request termination.
84
Currently, the following backends support early termination:
85
+- [TensorRT-LLM backend](https://github.com/triton-inference-server/tensorrtllm_backend)
86
- [vLLM backend](https://github.com/triton-inference-server/vllm_backend)
87
- [python backend](https://github.com/triton-inference-server/python_backend)
88
0 commit comments