Interface AutomaticResourcesOrBuilder

  • All Superinterfaces:
    com.google.protobuf.MessageLiteOrBuilder, com.google.protobuf.MessageOrBuilder
    All Known Implementing Classes:
    AutomaticResources, AutomaticResources.Builder

    public interface AutomaticResourcesOrBuilder
    extends com.google.protobuf.MessageOrBuilder
    • Method Summary

      All Methods Instance Methods Abstract Methods 
      Modifier and Type Method Description
      int getMaxReplicaCount()
      Immutable.
      int getMinReplicaCount()
      Immutable.
      • Methods inherited from interface com.google.protobuf.MessageLiteOrBuilder

        isInitialized
      • Methods inherited from interface com.google.protobuf.MessageOrBuilder

        findInitializationErrors, getAllFields, getDefaultInstanceForType, getDescriptorForType, getField, getInitializationErrorString, getOneofFieldDescriptor, getRepeatedField, getRepeatedFieldCount, getUnknownFields, hasField, hasOneof
    • Method Detail

      • getMinReplicaCount

        int getMinReplicaCount()
         Immutable. The minimum number of replicas this DeployedModel will be always
         deployed on. If traffic against it increases, it may dynamically be
         deployed onto more replicas up to
         [max_replica_count][google.cloud.aiplatform.v1.AutomaticResources.max_replica_count],
         and as traffic decreases, some of these extra replicas may be freed. If the
         requested value is too large, the deployment will error.
         
        int32 min_replica_count = 1 [(.google.api.field_behavior) = IMMUTABLE];
        Returns:
        The minReplicaCount.
      • getMaxReplicaCount

        int getMaxReplicaCount()
         Immutable. The maximum number of replicas this DeployedModel may be
         deployed on when the traffic against it increases. If the requested value
         is too large, the deployment will error, but if deployment succeeds then
         the ability to scale the model to that many replicas is guaranteed (barring
         service outages). If traffic against the DeployedModel increases beyond
         what its replicas at maximum may handle, a portion of the traffic will be
         dropped. If this value is not provided, a no upper bound for scaling under
         heavy traffic will be assume, though Vertex AI may be unable to scale
         beyond certain replica number.
         
        int32 max_replica_count = 2 [(.google.api.field_behavior) = IMMUTABLE];
        Returns:
        The maxReplicaCount.