Update start-trained-model-deployment.asciidoc (#118887)

Updating with changes in https://github.com/elastic/elasticsearch/pull/115041
This commit is contained in:
Pius Fung 2024-12-17 13:03:28 -08:00 committed by GitHub
parent 5d9c3a2663
commit 1054503ba8
No known key found for this signature in database
GPG Key ID: B5690EEEBB952194
1 changed files with 4 additions and 4 deletions

View File

@ -138,8 +138,8 @@ normal priority deployments.
Controls how many inference requests are allowed in the queue at a time.
Every machine learning node in the cluster where the model can be allocated
has a queue of this size; when the number of requests exceeds the total value,
new requests are rejected with a 429 error. Defaults to 1024. Max allowed value
is 1000000.
new requests are rejected with a 429 error. Defaults to 10000. Max allowed value
is 100000.
`threads_per_allocation`::
(Optional, integer)
@ -173,7 +173,7 @@ The API returns the following results:
"model_bytes": 265632637,
"threads_per_allocation" : 1,
"number_of_allocations" : 1,
"queue_capacity" : 1024,
"queue_capacity" : 10000,
"priority": "normal"
},
"routing_table": {
@ -229,4 +229,4 @@ POST _ml/trained_models/my_model/deployment/_start?deployment_id=my_model_for_se
}
}
--------------------------------------------------
// TEST[skip:TBD]
// TEST[skip:TBD]