Skip to content

Create Ai Benchmark Job

sagemaker_create_ai_benchmark_job R Documentation

Creates a benchmark job that runs performance benchmarks against inference infrastructure using a predefined AI workload configuration

Description

Creates a benchmark job that runs performance benchmarks against inference infrastructure using a predefined AI workload configuration. The benchmark job measures metrics such as latency, throughput, and cost for your generative AI inference endpoints.

Usage

sagemaker_create_ai_benchmark_job(AIBenchmarkJobName, BenchmarkTarget,
  OutputConfig, AIWorkloadConfigIdentifier, RoleArn, NetworkConfig, Tags)

Arguments

AIBenchmarkJobName

[required] The name of the AI benchmark job. The name must be unique within your Amazon Web Services account in the current Amazon Web Services Region.

BenchmarkTarget

[required] The target endpoint to benchmark. Specify a SageMaker endpoint by providing its name or Amazon Resource Name (ARN).

OutputConfig

[required] The output configuration for the benchmark job, including the Amazon S3 location where benchmark results are stored.

AIWorkloadConfigIdentifier

[required] The name or Amazon Resource Name (ARN) of the AI workload configuration to use for this benchmark job.

RoleArn

[required] The Amazon Resource Name (ARN) of an IAM role that enables Amazon SageMaker AI to perform tasks on your behalf.

NetworkConfig

The network configuration for the benchmark job, including VPC settings.

Tags

The metadata that you apply to Amazon Web Services resources to help you categorize and organize them. Each tag consists of a key and a value, both of which you define.

Value

A list with the following syntax:

list(
  AIBenchmarkJobArn = "string"
)

Request syntax

svc$create_ai_benchmark_job(
  AIBenchmarkJobName = "string",
  BenchmarkTarget = list(
    Endpoint = list(
      Identifier = "string",
      TargetContainerHostname = "string",
      InferenceComponents = list(
        list(
          Identifier = "string"
        )
      )
    )
  ),
  OutputConfig = list(
    S3OutputLocation = "string"
  ),
  AIWorkloadConfigIdentifier = "string",
  RoleArn = "string",
  NetworkConfig = list(
    VpcConfig = list(
      SecurityGroupIds = list(
        "string"
      ),
      Subnets = list(
        "string"
      )
    )
  ),
  Tags = list(
    list(
      Key = "string",
      Value = "string"
    )
  )
)