gluedatabrew R Documentation

AWS Glue DataBrew

Description

Glue DataBrew is a visual, cloud-scale data-preparation service. DataBrew simplifies data preparation tasks, targeting data issues that are hard to spot and time-consuming to fix. DataBrew empowers users of all technical levels to visualize the data and perform one-click data transformations, with no coding required.

Usage

gluedatabrew(
  config = list(),
  credentials = list(),
  endpoint = NULL,
  region = NULL
)

Arguments

config

Optional configuration of credentials, endpoint, and/or region.

  • credentials:

    • creds:

      • access_key_id: AWS access key ID

      • secret_access_key: AWS secret access key

      • session_token: AWS temporary session token

    • profile: The name of a profile to use. If not given, then the default profile is used.

    • anonymous: Set anonymous credentials.

  • endpoint: The complete URL to use for the constructed client.

  • region: The AWS Region used in instantiating the client.

  • close_connection: Immediately close all HTTP connections.

  • timeout: The time in seconds until a timeout exception is thrown when attempting to make a connection. The default is 60 seconds.

  • s3_force_path_style: Set this to TRUE to force the request to use path-style addressing, i.e. http://s3.amazonaws.com/BUCKET/KEY.

  • sts_regional_endpoint: Set the STS regional endpoint resolver to regional or legacy. See https://docs.aws.amazon.com/sdkref/latest/guide/feature-sts-regionalized-endpoints.html

credentials

Optional credentials shorthand for the config parameter.

  • creds:

    • access_key_id: AWS access key ID

    • secret_access_key: AWS secret access key

    • session_token: AWS temporary session token

  • profile: The name of a profile to use. If not given, then the default profile is used.

  • anonymous: Set anonymous credentials.

endpoint

Optional shorthand for the complete URL to use for the constructed client.

region

Optional shorthand for the AWS Region used in instantiating the client.
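
As a minimal sketch of how the two argument styles relate, the snippet below expresses the same settings once as a nested config list and once through the shorthand arguments. It assumes the function is provided by the paws package; the profile name "analytics", the Region, and the timeout value are placeholders, and the clients are only constructed if paws is installed.

```r
# Full form: everything nested under `config`.
# "analytics" and "us-east-1" are hypothetical placeholder values.
cfg <- list(
  credentials = list(profile = "analytics"),
  region = "us-east-1",
  timeout = 30  # seconds; overrides the documented default of 60
)

# Shorthand form: credentials and region passed as top-level arguments.
shorthand <- list(
  credentials = list(profile = "analytics"),
  region = "us-east-1"
)

# Assumption: gluedatabrew() is exported by the paws package.
if (requireNamespace("paws", quietly = TRUE)) {
  svc_full <- paws::gluedatabrew(config = cfg)
  svc_short <- do.call(paws::gluedatabrew, shorthand)
}
```

Settings such as timeout have no shorthand argument in the Usage signature, so they have to go through config.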

Value

A client for the service. You can call the service's operations using syntax like svc$operation(...), where svc is the name you've assigned to the client. The available operations are listed in the Operations section.
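
To illustrate the svc$operation(...) syntax, here is a small sketch that calls list_datasets on a client and extracts the dataset names. The helper name dataset_names is hypothetical, and the response fields Datasets and Name are assumed from the AWS ListDatasets API rather than stated on this page. Only the first page of results is read.

```r
# Return the dataset names from one page of list_datasets() results.
# `svc` is a client created by gluedatabrew(); `dataset_names` is a
# hypothetical helper, not part of the package.
dataset_names <- function(svc) {
  resp <- svc$list_datasets()
  # Assumed response shape: resp$Datasets is a list of lists with a Name field.
  vapply(resp$Datasets, function(d) d$Name, character(1))
}
```

With valid AWS credentials configured, a call would look like dataset_names(gluedatabrew()).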

Service syntax

svc <- gluedatabrew(
  config = list(
    credentials = list(
      creds = list(
        access_key_id = "string",
        secret_access_key = "string",
        session_token = "string"
      ),
      profile = "string",
      anonymous = "logical"
    ),
    endpoint = "string",
    region = "string",
    close_connection = "logical",
    timeout = "numeric",
    s3_force_path_style = "logical",
    sts_regional_endpoint = "string"
  ),
  credentials = list(
    creds = list(
      access_key_id = "string",
      secret_access_key = "string",
      session_token = "string"
    ),
    profile = "string",
    anonymous = "logical"
  ),
  endpoint = "string",
  region = "string"
)

Operations

batch_delete_recipe_version
Deletes one or more versions of a recipe at a time
create_dataset
Creates a new DataBrew dataset
create_profile_job
Creates a new job to analyze a dataset and create its data profile
create_project
Creates a new DataBrew project
create_recipe
Creates a new DataBrew recipe
create_recipe_job
Creates a new job to transform input data, using steps defined in an existing Glue DataBrew recipe
create_ruleset
Creates a new ruleset that can be used in a profile job to validate the data quality of a dataset
create_schedule
Creates a new schedule for one or more DataBrew jobs
delete_dataset
Deletes a dataset from DataBrew
delete_job
Deletes the specified DataBrew job
delete_project
Deletes an existing DataBrew project
delete_recipe_version
Deletes a single version of a DataBrew recipe
delete_ruleset
Deletes a ruleset
delete_schedule
Deletes the specified DataBrew schedule
describe_dataset
Returns the definition of a specific DataBrew dataset
describe_job
Returns the definition of a specific DataBrew job
describe_job_run
Returns the details of one run of a DataBrew job
describe_project
Returns the definition of a specific DataBrew project
describe_recipe
Returns the definition of a specific DataBrew recipe corresponding to a particular version
describe_ruleset
Retrieves detailed information about the ruleset
describe_schedule
Returns the definition of a specific DataBrew schedule
list_datasets
Lists all of the DataBrew datasets
list_job_runs
Lists all of the previous runs of a particular DataBrew job
list_jobs
Lists all of the DataBrew jobs that are defined
list_projects
Lists all of the DataBrew projects that are defined
list_recipes
Lists all of the DataBrew recipes that are defined
list_recipe_versions
Lists the versions of a particular DataBrew recipe, except for LATEST_WORKING
list_rulesets
Lists all rulesets available in the current account or rulesets associated with a specific resource (dataset)
list_schedules
Lists the DataBrew schedules that are defined
list_tags_for_resource
Lists all the tags for a DataBrew resource
publish_recipe
Publishes a new version of a DataBrew recipe
send_project_session_action
Performs a recipe step within an interactive DataBrew session that's currently open
start_job_run
Runs a DataBrew job
start_project_session
Creates an interactive session, enabling you to manipulate data in a DataBrew project
stop_job_run
Stops a particular run of a job
tag_resource
Adds metadata tags to a DataBrew resource, such as a dataset, project, recipe, job, or schedule
untag_resource
Removes metadata tags from a DataBrew resource
update_dataset
Modifies the definition of an existing DataBrew dataset
update_profile_job
Modifies the definition of an existing profile job
update_project
Modifies the definition of an existing DataBrew project
update_recipe
Modifies the definition of the LATEST_WORKING version of a DataBrew recipe
update_recipe_job
Modifies the definition of an existing DataBrew recipe job
update_ruleset
Updates the specified ruleset
update_schedule
Modifies the definition of an existing DataBrew schedule
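
The list_* operations return results in pages. A possible way to collect every page from list_jobs is sketched below; the helper name list_all_jobs is hypothetical, and the MaxResults and NextToken parameters and the Jobs and NextToken response fields are assumed from the AWS ListJobs API.

```r
# Collect all jobs by following NextToken until no further page is reported.
# `list_all_jobs` is a hypothetical helper, not part of the package.
list_all_jobs <- function(svc, page_size = 50) {
  jobs <- list()
  token <- NULL  # NULL means "start from the first page"
  repeat {
    resp <- svc$list_jobs(MaxResults = page_size, NextToken = token)
    jobs <- c(jobs, resp$Jobs)
    token <- resp$NextToken
    # Stop when the token is missing, zero-length, or an empty string.
    if (length(token) == 0 || !nzchar(token)) break
  }
  jobs
}
```

The same loop shape should apply to the other list_* operations, with the response field name changed accordingly.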

Examples

## Not run: 
svc <- gluedatabrew()
svc$batch_delete_recipe_version(
  Name = "my-recipe",           # hypothetical recipe name
  RecipeVersions = list("1.0")  # hypothetical version to delete
)

## End(Not run)
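
As a further sketch, start_job_run and describe_job_run can be combined to start a job and wait for it to finish. The helper name run_job_and_wait, the polling interval, and the poll limit are hypothetical; the Name and RunId parameters, the RunId and State response fields, and the terminal state names are assumed from the AWS DataBrew API.

```r
# Start a DataBrew job and poll until it reaches a terminal state.
# `run_job_and_wait` is a hypothetical helper, not part of the package.
run_job_and_wait <- function(svc, job_name, poll_seconds = 10, max_polls = 60) {
  run_id <- svc$start_job_run(Name = job_name)$RunId
  # Assumed terminal values of the State field.
  terminal <- c("SUCCEEDED", "FAILED", "STOPPED", "TIMEOUT")
  for (i in seq_len(max_polls)) {
    state <- svc$describe_job_run(Name = job_name, RunId = run_id)$State
    if (state %in% terminal) return(state)
    Sys.sleep(poll_seconds)
  }
  stop("job run did not finish within the polling budget")
}
```

The function returns the final state as a string, so the caller decides how to react to a failed or stopped run.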