选择您的 Cookie 首选项

我们使用必要 Cookie 和类似工具提供我们的网站和服务。我们使用性能 Cookie 收集匿名统计数据,以便我们可以了解客户如何使用我们的网站并进行改进。必要 Cookie 无法停用,但您可以单击“自定义”或“拒绝”来拒绝性能 Cookie。

如果您同意,AWS 和经批准的第三方还将使用 Cookie 提供有用的网站功能、记住您的首选项并显示相关内容,包括相关广告。要接受或拒绝所有非必要 Cookie,请单击“接受”或“拒绝”。要做出更详细的选择,请单击“自定义”。

UpdateCrawler - AWS Glue
此页面尚未翻译为您的语言。 请求翻译

UpdateCrawler

Updates a crawler. If a crawler is running, you must stop it using StopCrawler before updating it.

Request Syntax

{ "Classifiers": [ "string" ], "Configuration": "string", "CrawlerSecurityConfiguration": "string", "DatabaseName": "string", "Description": "string", "LakeFormationConfiguration": { "AccountId": "string", "UseLakeFormationCredentials": boolean }, "LineageConfiguration": { "CrawlerLineageSettings": "string" }, "Name": "string", "RecrawlPolicy": { "RecrawlBehavior": "string" }, "Role": "string", "Schedule": "string", "SchemaChangePolicy": { "DeleteBehavior": "string", "UpdateBehavior": "string" }, "TablePrefix": "string", "Targets": { "CatalogTargets": [ { "ConnectionName": "string", "DatabaseName": "string", "DlqEventQueueArn": "string", "EventQueueArn": "string", "Tables": [ "string" ] } ], "DeltaTargets": [ { "ConnectionName": "string", "CreateNativeDeltaTable": boolean, "DeltaTables": [ "string" ], "WriteManifest": boolean } ], "DynamoDBTargets": [ { "Path": "string", "scanAll": boolean, "scanRate": number } ], "HudiTargets": [ { "ConnectionName": "string", "Exclusions": [ "string" ], "MaximumTraversalDepth": number, "Paths": [ "string" ] } ], "IcebergTargets": [ { "ConnectionName": "string", "Exclusions": [ "string" ], "MaximumTraversalDepth": number, "Paths": [ "string" ] } ], "JdbcTargets": [ { "ConnectionName": "string", "EnableAdditionalMetadata": [ "string" ], "Exclusions": [ "string" ], "Path": "string" } ], "MongoDBTargets": [ { "ConnectionName": "string", "Path": "string", "ScanAll": boolean } ], "S3Targets": [ { "ConnectionName": "string", "DlqEventQueueArn": "string", "EventQueueArn": "string", "Exclusions": [ "string" ], "Path": "string", "SampleSize": number } ] } }

Request Parameters

For information about the parameters that are common to all actions, see Common Parameters.

The request accepts the following data in JSON format.

Classifiers

A list of custom classifiers that the user has registered. By default, all built-in classifiers are included in a crawl, but these custom classifiers always override the default classifiers for a given classification.

Type: Array of strings

Length Constraints: Minimum length of 1. Maximum length of 255.

Pattern: [\u0020-\uD7FF\uE000-\uFFFD\uD800\uDC00-\uDBFF\uDFFF\t]*

Required: No

Configuration

Crawler configuration information. This versioned JSON string allows users to specify aspects of a crawler's behavior. For more information, see Setting crawler configuration options.

Type: String

Required: No

CrawlerSecurityConfiguration

The name of the SecurityConfiguration structure to be used by this crawler.

Type: String

Length Constraints: Minimum length of 0. Maximum length of 128.

Required: No

DatabaseName

The AWS Glue database where results are stored, such as: arn:aws:daylight:us-east-1::database/sometable/*.

Type: String

Required: No

Description

A description of the new crawler.

Type: String

Length Constraints: Minimum length of 0. Maximum length of 2048.

Pattern: [\u0020-\uD7FF\uE000-\uFFFD\uD800\uDC00-\uDBFF\uDFFF\r\n\t]*

Required: No

LakeFormationConfiguration

Specifies AWS Lake Formation configuration settings for the crawler.

Type: LakeFormationConfiguration object

Required: No

LineageConfiguration

Specifies data lineage configuration settings for the crawler.

Type: LineageConfiguration object

Required: No

Name

Name of the new crawler.

Type: String

Length Constraints: Minimum length of 1. Maximum length of 255.

Pattern: [\u0020-\uD7FF\uE000-\uFFFD\uD800\uDC00-\uDBFF\uDFFF\t]*

Required: Yes

RecrawlPolicy

A policy that specifies whether to crawl the entire dataset again, or to crawl only folders that were added since the last crawler run.

Type: RecrawlPolicy object

Required: No

Role

The IAM role or Amazon Resource Name (ARN) of an IAM role that is used by the new crawler to access customer resources.

Type: String

Required: No

Schedule

A cron expression used to specify the schedule (see Time-Based Schedules for Jobs and Crawlers. For example, to run something every day at 12:15 UTC, you would specify: cron(15 12 * * ? *).

Type: String

Required: No

SchemaChangePolicy

The policy for the crawler's update and deletion behavior.

Type: SchemaChangePolicy object

Required: No

TablePrefix

The table prefix used for catalog tables that are created.

Type: String

Length Constraints: Minimum length of 0. Maximum length of 128.

Required: No

Targets

A list of targets to crawl.

Type: CrawlerTargets object

Required: No

Response Elements

If the action is successful, the service sends back an HTTP 200 response with an empty HTTP body.

Errors

For information about the errors that are common to all actions, see Common Errors.

CrawlerRunningException

The operation cannot be performed because the crawler is already running.

HTTP Status Code: 400

EntityNotFoundException

A specified entity does not exist

HTTP Status Code: 400

InvalidInputException

The input provided was not valid.

HTTP Status Code: 400

OperationTimeoutException

The operation timed out.

HTTP Status Code: 400

VersionMismatchException

There was a version conflict.

HTTP Status Code: 400

See Also

For more information about using this API in one of the language-specific AWS SDKs, see the following:

隐私网站条款Cookie 首选项
© 2025, Amazon Web Services, Inc. 或其附属公司。保留所有权利。