Doc AWS SDK 예제 GitHub 리포지토리에서 더 많은 SDK 예제를 사용할 수 있습니다. [AWS](https://github.com/awsdocs/aws-doc-sdk-examples) 기계 번역으로 제공되는 번역입니다. 제공된 번역과 원본 영어의 내용이 상충하는 경우에는 영어 버전이 우선합니다. # AWS Glue SDK for Kotlin을 사용한 예제 다음 코드 예제에서는 Kotlin용 AWS SDK를와 함께 사용하여 작업을 수행하고 일반적인 시나리오를 구현하는 방법을 보여줍니다 AWS Glue. *기본 사항*은 서비스 내에서 필수 작업을 수행하는 방법을 보여주는 코드 예제입니다. *작업*은 대규모 프로그램에서 발췌한 코드이며 컨텍스트에 맞춰 실행해야 합니다. 작업은 개별 서비스 함수를 직접적으로 호출하는 방법을 보여주며 관련 시나리오의 컨텍스트에 맞는 작업을 볼 수 있습니다. 각 예시에는 전체 소스 코드에 대한 링크가 포함되어 있으며, 여기에서 컨텍스트에 맞춰 코드를 설정하고 실행하는 방법에 대한 지침을 찾을 수 있습니다. **Topics** + [기본 사항](#basics) + [작업](#actions) ## 기본 사항 ### 기본 사항 알아보기 다음 코드 예제에서는 다음과 같은 작업을 수행하는 방법을 보여줍니다. + 퍼블릭 Amazon S3 버킷을 크롤링하고 CSV 형식의 메타데이터 데이터베이스를 생성하는 크롤러를 생성합니다. + 의 데이터베이스 및 테이블에 대한 정보를 나열합니다 AWS Glue Data Catalog. + 작업을 생성하여 S3 버킷에서 CSV 데이터를 추출하고, 데이터를 변환하며, JSON 형식의 출력을 다른 S3 버킷으로 로드합니다. + 작업 실행에 대한 정보를 나열하고 변환된 데이터를 확인하며 리소스를 정리합니다. 자세한 내용은 [자습서: AWS Glue Studio 시작하기를 참조하세요](https://docs.aws.amazon.com/glue/latest/ug/tutorial-create-job.html). **SDK for Kotlin** GitHub에 더 많은 내용이 있습니다. [AWS 코드 예제 리포지토리](https://github.com/awsdocs/aws-doc-sdk-examples/tree/main/kotlin/services/glue#code-examples)에서 전체 예제를 찾고 설정 및 실행하는 방법을 배워보세요. ``` suspend fun main(args: Array) { val usage = """ Usage: Where: iam - The Amazon Resource Name (ARN) of the AWS Identity and Access Management (IAM) role that has AWS Glue and Amazon Simple Storage Service (Amazon S3) permissions. s3Path - The Amazon Simple Storage Service (Amazon S3) target that contains data (for example, CSV data). cron - A cron expression used to specify the schedule (for example, cron(15 12 * * ? *). dbName - The database name. crawlerName - The name of the crawler. jobName - The name you assign to this job definition. scriptLocation - Specifies the Amazon S3 path to a script that runs a job. locationUri - Specifies the location of the database """ if (args.size != 8) { println(usage) exitProcess(1) } val iam = args[0] val s3Path = args[1] val cron = args[2] val dbName = args[3] val crawlerName = args[4] val jobName = args[5] val scriptLocation = args[6] val locationUri = args[7] println("About to start the AWS Glue Scenario") createDatabase(dbName, locationUri) createCrawler(iam, s3Path, cron, dbName, crawlerName) getCrawler(crawlerName) startCrawler(crawlerName) getDatabase(dbName) getGlueTables(dbName) createJob(jobName, iam, scriptLocation) startJob(jobName) getJobs() getJobRuns(jobName) deleteJob(jobName) println("*** Wait for 5 MIN so the $crawlerName is ready to be deleted") TimeUnit.MINUTES.sleep(5) deleteMyDatabase(dbName) deleteCrawler(crawlerName) } suspend fun createDatabase( dbName: String?, locationUriVal: String?, ) { val input = DatabaseInput { description = "Built with the AWS SDK for Kotlin" name = dbName locationUri = locationUriVal } val request = CreateDatabaseRequest { databaseInput = input } GlueClient.fromEnvironment { region = "us-east-1" }.use { glueClient -> glueClient.createDatabase(request) println("The database was successfully created") } } suspend fun createCrawler( iam: String?, s3Path: String?, cron: String?, dbName: String?, crawlerName: String, ) { val s3Target = S3Target { path = s3Path } val targetList = ArrayList() targetList.add(s3Target) val targetOb = CrawlerTargets { s3Targets = targetList } val crawlerRequest = CreateCrawlerRequest { databaseName = dbName name = crawlerName description = "Created by the AWS Glue Java API" targets = targetOb role = iam schedule = cron } GlueClient.fromEnvironment { region = "us-east-1" }.use { glueClient -> glueClient.createCrawler(crawlerRequest) println("$crawlerName was successfully created") } } suspend fun getCrawler(crawlerName: String?) { val request = GetCrawlerRequest { name = crawlerName } GlueClient.fromEnvironment { region = "us-east-1" }.use { glueClient -> val response = glueClient.getCrawler(request) val role = response.crawler?.role println("The role associated with this crawler is $role") } } suspend fun startCrawler(crawlerName: String) { val crawlerRequest = StartCrawlerRequest { name = crawlerName } GlueClient.fromEnvironment { region = "us-east-1" }.use { glueClient -> glueClient.startCrawler(crawlerRequest) println("$crawlerName was successfully started.") } } suspend fun getDatabase(databaseName: String?) { val request = GetDatabaseRequest { name = databaseName } GlueClient.fromEnvironment { region = "us-east-1" }.use { glueClient -> val response = glueClient.getDatabase(request) val dbDesc = response.database?.description println("The database description is $dbDesc") } } suspend fun getGlueTables(dbName: String?) { val tableRequest = GetTablesRequest { databaseName = dbName } GlueClient.fromEnvironment { region = "us-east-1" }.use { glueClient -> val response = glueClient.getTables(tableRequest) response.tableList?.forEach { tableName -> println("Table name is ${tableName.name}") } } } suspend fun startJob(jobNameVal: String?) { val runRequest = StartJobRunRequest { workerType = WorkerType.G1X numberOfWorkers = 10 jobName = jobNameVal } GlueClient.fromEnvironment { region = "us-east-1" }.use { glueClient -> val response = glueClient.startJobRun(runRequest) println("The job run Id is ${response.jobRunId}") } } suspend fun createJob( jobName: String, iam: String?, scriptLocationVal: String?, ) { val commandOb = JobCommand { pythonVersion = "3" name = "MyJob1" scriptLocation = scriptLocationVal } val jobRequest = CreateJobRequest { description = "A Job created by using the AWS SDK for Java V2" glueVersion = "2.0" workerType = WorkerType.G1X numberOfWorkers = 10 name = jobName role = iam command = commandOb } GlueClient.fromEnvironment { region = "us-east-1" }.use { glueClient -> glueClient.createJob(jobRequest) println("$jobName was successfully created.") } } suspend fun getJobs() { val request = GetJobsRequest { maxResults = 10 } GlueClient.fromEnvironment { region = "us-east-1" }.use { glueClient -> val response = glueClient.getJobs(request) response.jobs?.forEach { job -> println("Job name is ${job.name}") } } } suspend fun getJobRuns(jobNameVal: String?) { val request = GetJobRunsRequest { jobName = jobNameVal } GlueClient.fromEnvironment { region = "us-east-1" }.use { glueClient -> val response = glueClient.getJobRuns(request) response.jobRuns?.forEach { job -> println("Job name is ${job.jobName}") } } } suspend fun deleteJob(jobNameVal: String) { val jobRequest = DeleteJobRequest { jobName = jobNameVal } GlueClient.fromEnvironment { region = "us-east-1" }.use { glueClient -> glueClient.deleteJob(jobRequest) println("$jobNameVal was successfully deleted") } } suspend fun deleteMyDatabase(databaseName: String) { val request = DeleteDatabaseRequest { name = databaseName } GlueClient.fromEnvironment { region = "us-east-1" }.use { glueClient -> glueClient.deleteDatabase(request) println("$databaseName was successfully deleted") } } suspend fun deleteCrawler(crawlerName: String) { val request = DeleteCrawlerRequest { name = crawlerName } GlueClient.fromEnvironment { region = "us-east-1" }.use { glueClient -> glueClient.deleteCrawler(request) println("$crawlerName was deleted") } } ``` + API 세부 정보는 *AWS SDK for Kotlin API 참조*의 다음 주제를 참조하세요. + [CreateCrawler](https://sdk.amazonaws.com/kotlin/api/latest/index.html) + [CreateJob](https://sdk.amazonaws.com/kotlin/api/latest/index.html) + [DeleteCrawler](https://sdk.amazonaws.com/kotlin/api/latest/index.html) + [DeleteDatabase](https://sdk.amazonaws.com/kotlin/api/latest/index.html) + [DeleteJob](https://sdk.amazonaws.com/kotlin/api/latest/index.html) + [DeleteTable](https://sdk.amazonaws.com/kotlin/api/latest/index.html) + [GetCrawler](https://sdk.amazonaws.com/kotlin/api/latest/index.html) + [GetDatabase](https://sdk.amazonaws.com/kotlin/api/latest/index.html) + [GetDatabases](https://sdk.amazonaws.com/kotlin/api/latest/index.html) + [GetJob](https://sdk.amazonaws.com/kotlin/api/latest/index.html) + [GetJobRun](https://sdk.amazonaws.com/kotlin/api/latest/index.html) + [GetJobRuns](https://sdk.amazonaws.com/kotlin/api/latest/index.html) + [GetTables](https://sdk.amazonaws.com/kotlin/api/latest/index.html) + [ListJobs](https://sdk.amazonaws.com/kotlin/api/latest/index.html) + [StartCrawler](https://sdk.amazonaws.com/kotlin/api/latest/index.html) + [StartJobRun](https://sdk.amazonaws.com/kotlin/api/latest/index.html) ## 작업 ### `CreateCrawler` 다음 코드 예시는 `CreateCrawler`의 사용 방법을 보여줍니다. **SDK for Kotlin** GitHub에 더 많은 내용이 있습니다. [AWS 코드 예 리포지토리](https://github.com/awsdocs/aws-doc-sdk-examples/tree/main/kotlin/services/glue#code-examples)에서 전체 예를 찾고 설정 및 실행하는 방법을 배워보세요. ``` suspend fun createGlueCrawler( iam: String?, s3Path: String?, cron: String?, dbName: String?, crawlerName: String, ) { val s3Target = S3Target { path = s3Path } // Add the S3Target to a list. val targetList = mutableListOf() targetList.add(s3Target) val targetOb = CrawlerTargets { s3Targets = targetList } val request = CreateCrawlerRequest { databaseName = dbName name = crawlerName description = "Created by the AWS Glue Kotlin API" targets = targetOb role = iam schedule = cron } GlueClient.fromEnvironment { region = "us-west-2" }.use { glueClient -> glueClient.createCrawler(request) println("$crawlerName was successfully created") } } ``` + API 세부 정보는 *AWS SDK for Kotlin API 참조*의 [CreateCrawler](https://sdk.amazonaws.com/kotlin/api/latest/index.html)를 참조하세요. ### `GetCrawler` 다음 코드 예시는 `GetCrawler`의 사용 방법을 보여줍니다. **SDK for Kotlin** GitHub에 더 많은 내용이 있습니다. [AWS 코드 예 리포지토리](https://github.com/awsdocs/aws-doc-sdk-examples/tree/main/kotlin/services/glue#code-examples)에서 전체 예를 찾고 설정 및 실행하는 방법을 배워보세요. ``` suspend fun getSpecificCrawler(crawlerName: String?) { val request = GetCrawlerRequest { name = crawlerName } GlueClient.fromEnvironment { region = "us-east-1" }.use { glueClient -> val response = glueClient.getCrawler(request) val role = response.crawler?.role println("The role associated with this crawler is $role") } } ``` + API 세부 정보는 *AWS SDK for Kotlin API 참조*의 [GetCrawler](https://sdk.amazonaws.com/kotlin/api/latest/index.html)를 참조하세요. ### `GetDatabase` 다음 코드 예시는 `GetDatabase`의 사용 방법을 보여줍니다. **SDK for Kotlin** GitHub에 더 많은 내용이 있습니다. [AWS 코드 예 리포지토리](https://github.com/awsdocs/aws-doc-sdk-examples/tree/main/kotlin/services/glue#code-examples)에서 전체 예를 찾고 설정 및 실행하는 방법을 배워보세요. ``` suspend fun getSpecificDatabase(databaseName: String?) { val request = GetDatabaseRequest { name = databaseName } GlueClient.fromEnvironment { region = "us-east-1" }.use { glueClient -> val response = glueClient.getDatabase(request) val dbDesc = response.database?.description println("The database description is $dbDesc") } } ``` + API 세부 정보는 *AWS SDK for Kotlin API 참조*의 [GetDatabase](https://sdk.amazonaws.com/kotlin/api/latest/index.html)를 참조하세요. ### `StartCrawler` 다음 코드 예시는 `StartCrawler`의 사용 방법을 보여줍니다. **SDK for Kotlin** GitHub에 더 많은 내용이 있습니다. [AWS 코드 예 리포지토리](https://github.com/awsdocs/aws-doc-sdk-examples/tree/main/kotlin/services/glue#code-examples)에서 전체 예를 찾고 설정 및 실행하는 방법을 배워보세요. ``` suspend fun startSpecificCrawler(crawlerName: String?) { val request = StartCrawlerRequest { name = crawlerName } GlueClient.fromEnvironment { region = "us-west-2" }.use { glueClient -> glueClient.startCrawler(request) println("$crawlerName was successfully started.") } } ``` + API 세부 정보는 *AWS SDK for Kotlin API 참조*의 [StartCrawler](https://sdk.amazonaws.com/kotlin/api/latest/index.html)를 참조하세요.