Commit eabc4d9

Remove mentioning increase instance count case
1 parent 4a8a85a commit eabc4d9

1 file changed, 0 additions and 3 deletions
docs/user_guide/model_management.md

@@ -229,9 +229,6 @@ the model file), Triton does not guarentee any remaining request(s) from the
 in-flight sequence(s) will be routed to the same model instance for processing.
 It is currently the responsibility of the user to ensure any in-flight
 sequence(s) are completed before reloading a sequence model.
-* If a sequence model is *updated* (i.e. increasing/decreasing the instance
-count), Triton will wait until the in-flight sequence is completed (or
-timed-out) before the instance behind the sequence is removed.
 
 ## Concurrently Loading Models
 