"description": "Immutable. The maximum number of replicas this DeployedModel may be deployed on when the traffic against it increases. If the requested value is too large, the deployment will error, but if deployment succeeds then the ability to scale the model to that many replicas is guaranteed (barring service outages). If traffic against the DeployedModel increases beyond what its replicas at maximum may handle, a portion of the traffic will be dropped. If this value is not provided, a no upper bound for scaling under heavy traffic will be assume, though Vertex AI may be unable to scale beyond certain replica number.",
20292
+
"description": "Immutable. The maximum number of replicas that may be deployed on when the traffic against it increases. If the requested value is too large, the deployment will error, but if deployment succeeds then the ability to scale to that many replicas is guaranteed (barring service outages). If traffic increases beyond what its replicas at maximum may handle, a portion of the traffic will be dropped. If this value is not provided, a no upper bound for scaling under heavy traffic will be assume, though Vertex AI may be unable to scale beyond certain replica number.",
20265
20293
"format": "int32",
20266
20294
"type": "integer"
20267
20295
},
20268
20296
"minReplicaCount": {
20269
-
"description": "Immutable. The minimum number of replicas this DeployedModel will be always deployed on. If traffic against it increases, it may dynamically be deployed onto more replicas up to max_replica_count, and as traffic decreases, some of these extra replicas may be freed. If the requested value is too large, the deployment will error.",
20297
+
"description": "Immutable. The minimum number of replicas that will be always deployed on. If traffic against it increases, it may dynamically be deployed onto more replicas up to max_replica_count, and as traffic decreases, some of these extra replicas may be freed. If the requested value is too large, the deployment will error.",
20270
20298
"format": "int32",
20271
20299
"type": "integer"
20272
20300
}
@@ -21137,7 +21165,7 @@
21137
21165
"type": "string"
21138
21166
},
21139
21167
"model": {
21140
-
"description": "Immutable. The name of the publisher model to use for cached content. Format: projects/{project}/locations/{location}/publishers/{publisher}/models/{model}",
21168
+
"description": "Immutable. The name of the `Model` to use for cached content. Currently, only the published Gemini base models are supported, in form of projects/{PROJECT}/locations/{LOCATION}/publishers/google/models/{MODEL}",
21141
21169
"type": "string"
21142
21170
},
21143
21171
"name": {
@@ -22929,7 +22957,7 @@
22929
22957
"type": "object"
22930
22958
},
22931
22959
"GoogleCloudAiplatformV1DedicatedResources": {
22932
-
"description": "A description of resources that are dedicated to a DeployedModel, and that need a higher degree of manual configuration.",
22960
+
"description": "A description of resources that are dedicated to a DeployedModel or DeployedIndex, and that need a higher degree of manual configuration.",
"description": "Required. Immutable. The specification of a single machine used by the prediction."
22972
+
"description": "Required. Immutable. The specification of a single machine being used."
22945
22973
},
22946
22974
"maxReplicaCount": {
22947
-
"description": "Immutable. The maximum number of replicas this DeployedModel may be deployed on when the traffic against it increases. If the requested value is too large, the deployment will error, but if deployment succeeds then the ability to scale the model to that many replicas is guaranteed (barring service outages). If traffic against the DeployedModel increases beyond what its replicas at maximum may handle, a portion of the traffic will be dropped. If this value is not provided, will use min_replica_count as the default value. The value of this field impacts the charge against Vertex CPU and GPU quotas. Specifically, you will be charged for (max_replica_count * number of cores in the selected machine type) and (max_replica_count * number of GPUs per replica in the selected machine type).",
22975
+
"description": "Immutable. The maximum number of replicas that may be deployed on when the traffic against it increases. If the requested value is too large, the deployment will error, but if deployment succeeds then the ability to scale to that many replicas is guaranteed (barring service outages). If traffic increases beyond what its replicas at maximum may handle, a portion of the traffic will be dropped. If this value is not provided, will use min_replica_count as the default value. The value of this field impacts the charge against Vertex CPU and GPU quotas. Specifically, you will be charged for (max_replica_count * number of cores in the selected machine type) and (max_replica_count * number of GPUs per replica in the selected machine type).",
22948
22976
"format": "int32",
22949
22977
"type": "integer"
22950
22978
},
22951
22979
"minReplicaCount": {
22952
-
"description": "Required. Immutable. The minimum number of machine replicas this DeployedModel will be always deployed on. This value must be greater than or equal to 1. If traffic against the DeployedModel increases, it may dynamically be deployed onto more replicas, and as traffic decreases, some of these extra replicas may be freed.",
22980
+
"description": "Required. Immutable. The minimum number of machine replicas that will be always deployed on. This value must be greater than or equal to 1. If traffic increases, it may dynamically be deployed onto more replicas, and as traffic decreases, some of these extra replicas may be freed.",
22953
22981
"format": "int32",
22954
22982
"type": "integer"
22955
22983
},
22956
22984
"requiredReplicaCount": {
22957
-
"description": "Optional. Number of required available replicas for the deployment to succeed. This field is only needed when partial model deployment/mutation is desired. If set, the model deploy/mutate operation will succeed once available_replica_count reaches required_replica_count, and the rest of the replicas will be retried. If not set, the default required_replica_count will be min_replica_count.",
22985
+
"description": "Optional. Number of required available replicas for the deployment to succeed. This field is only needed when partial deployment/mutation is desired. If set, the deploy/mutate operation will succeed once available_replica_count reaches required_replica_count, and the rest of the replicas will be retried. If not set, the default required_replica_count will be min_replica_count.",
"description": "A proto representation of a Spanner-stored ModelVersionCheckpoint. The meaning of the fields is equivalent to their in-Spanner counterparts.",
31452
+
"description": "Describes the machine learning model version checkpoint.",
"description": "Identifier. The resource name of the ModelVersionCheckpoint. Format: `projects/{project}/locations/{location}/models/{model}/versions/{version}/checkpoints/{checkpoint}`",
31466
+
"type": "string"
31467
+
},
31335
31468
"step": {
31336
31469
"description": "The step of the checkpoint.",
31337
31470
"format": "int64",
@@ -32656,6 +32789,17 @@
32656
32789
},
32657
32790
"type": "object"
32658
32791
},
32792
+
"GoogleCloudAiplatformV1OutputConfig": {
32793
+
"description": "Config for evaluation output.",
32794
+
"id": "GoogleCloudAiplatformV1OutputConfig",
32795
+
"properties": {
32796
+
"gcsDestination": {
32797
+
"$ref": "GoogleCloudAiplatformV1GcsDestination",
32798
+
"description": "Cloud storage destination for evaluation output."
32799
+
}
32800
+
},
32801
+
"type": "object"
32802
+
},
32659
32803
"GoogleCloudAiplatformV1PSCAutomationConfig": {
32660
32804
"description": "PSC config that is used to automatically create forwarding rule via ServiceConnectionMap.",
"description": "Output only. Google Cloud Storage URI points to the original text in user's bucket. The text file is up to 10MB in size.",
39053
+
"description": "Output only. Google Cloud Storage URI points to a copy of the original text in the Vertex-managed bucket in the user's project. The text file is up to 10MB in size.",
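The replica-count rules stated in the `GoogleCloudAiplatformV1DedicatedResources` descriptions above (`minReplicaCount` must be at least 1, `maxReplicaCount` and `requiredReplicaCount` fall back to `min_replica_count` when unset, and quota is charged against `max_replica_count`) can be sketched roughly as follows. This is only an illustration of the documented semantics, not the service's implementation: the class `DedicatedResourcesSketch`, the function `quota_charge`, and the parameters `cores_per_machine` and `gpus_per_replica` are hypothetical names introduced here and are not part of any Vertex AI client library.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass(frozen=True)
class DedicatedResourcesSketch:
    """Hypothetical stand-in for the replica fields; not a real client class."""

    min_replica_count: int
    max_replica_count: Optional[int] = None
    required_replica_count: Optional[int] = None

    def validate(self) -> None:
        # The description says: "This value must be greater than or equal to 1."
        if self.min_replica_count < 1:
            raise ValueError("min_replica_count must be >= 1")

    def effective_max(self) -> int:
        # "If this value is not provided, will use min_replica_count as the
        # default value."
        if self.max_replica_count is None:
            return self.min_replica_count
        return self.max_replica_count

    def effective_required(self) -> int:
        # "If not set, the default required_replica_count will be
        # min_replica_count."
        if self.required_replica_count is None:
            return self.min_replica_count
        return self.required_replica_count


def quota_charge(
    resources: DedicatedResourcesSketch,
    cores_per_machine: int,
    gpus_per_replica: int,
) -> Tuple[int, int]:
    """Return (CPU quota, GPU quota) charged, per the maxReplicaCount text.

    cores_per_machine and gpus_per_replica are assumed inputs describing the
    selected machine type; the description does not say how to look them up.
    """
    max_replicas = resources.effective_max()
    return (max_replicas * cores_per_machine, max_replicas * gpus_per_replica)


if __name__ == "__main__":
    res = DedicatedResourcesSketch(min_replica_count=2, max_replica_count=5)
    res.validate()
    print(quota_charge(res, cores_per_machine=4, gpus_per_replica=1))
```

Note that the quota charge follows the maximum, not the current, replica count, so raising `maxReplicaCount` increases the quota consumed even while the deployment is idle at its minimum.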