Publishing a product in AWS Data Exchange containing AWS Lake Formation data permission data sets (Preview)
If you're interested in publishing products containing AWS Lake Formation data permission data
sets during this Preview, contact AWS Support
An AWS Lake Formation data permission data set contains a set of LF-tags and permissions for data managed by AWS Lake Formation. When customers subscribe to a product containing Lake Formation data permissions, they are granted read-only access to the databases, tables, and columns associated with the LF-tags added to the data set.
As a data provider, you start by creating LF-tags in AWS Lake Formation and associating those tags with the data you want to make available to subscribers. For more information about tagging your resources in Lake Formation, see Lake Formation Tag-based access control in the AWS Lake Formation Developer Guide. Then you import those LF-tags and a set of data permissions into AWS Data Exchange as an asset. Subscribers are granted access to the data associated with those LF-tags upon subscription.
The following topics describe the process of publishing a product containing AWS Lake Formation data permissions. The process has the following steps:
Steps
- Step 1: Create an AWS Lake Formation data set (Preview)
- Step 2: Create an AWS Lake Formation data permission (Preview)
- Step 3: Review and finalize
- Step 5: (Optional) Create a revision
- Step 6: Publish a new product containing AWS Lake Formation data sets (Preview)
- Considerations when publishing an AWS Lake Formation data permission data set (Preview)
Step 1: Create an AWS Lake Formation data set (Preview)
To create an AWS Lake Formation data set
-
Open your web browser and sign in to the AWS Data Exchange console
. -
From the left navigation pane, under Publish data, choose Products.
-
In Owned data sets, choose Create data set to open the Data set creation steps wizard.
-
In Select data set type, choose AWS Lake Formation data permission.
-
In Define data set, enter a Name and Description for your data set. For more information, see Data set best practices.
-
Under Add tags – optional, choose Add new tag.
-
Choose Create data set and continue.
Step 2: Create an AWS Lake Formation data permission (Preview)
AWS Data Exchange uses LF-Tags to grant data permissions. Choose the LF-Tags that are associated with the data you want to share to grant subscriber permissions to the data.
To create AWS Lake Formation data permission
-
On the Create Lake Formation data permission page, choose Add LF-Tag.
-
Enter the Key and choose your LF-Tag Values.
-
Choose Preview resource(s) to view how your LF-Tags are interpreted.
-
From Preview resource(s), select your Associated data catalog resource(s).
Note
Make sure to revoke
IAMAllowedPrincipals
group on the following resources. For more information, see Revoking IAM role temporary security credentials in the IAM User Guide.
-
-
Review the interpretation of the LF-Tag expression in the dialog box below and Permissions associated with the data set.
-
For Service access, select your existing service role that allows AWS Data Exchange to assume the role and access, grant, and revoke entitlements to Lake Formation data permissions on your behalf. Then choose Create Lake Formation data permission. For more information about creating a role for an AWS service, see Creating a role to delegate permissions to an AWS service.
-
In the Define product section, under Product overview, enter information about your product, including the Product name, Product logo, Support contact information, and Product categories.
For more information, see Product best practices in AWS Data Exchange.
-
(Optional) In the Define product section, under Data dictionaries and samples – optional, choose a data set by selecting the option button next to the data set name and then choose Edit.
For more information, see Data dictionaries in AWS Data Exchange and Sample data in AWS Data Exchange.
-
In the Edit dialog box, under Upload data dictionary, choose Add file to upload a new data dictionary.
You can choose one data dictionary, in .csv format, with a maximum size of 1 MB.
-
Choose a saved data dictionary from your computer and then choose Open.
The data dictionary .csv file appears on the Edit dialog box.
Note
Your data dictionary must conform to the AWS Data Exchange data dictionary template. If you don’t have a saved data dictionary to upload, you can choose either the blank data dictionary template link or the example data dictionary link in the AWS Data Exchange console.
-
Choose Data dictionary preview to preview the data dictionary.
-
Under Samples - optional, choose Upload samples, choose a sample from your computer, and then choose Open.
The samples appear on the Edit dialog box.
Note
You can upload up to 10 samples with a maximum size of 50 MB. Samples in .csv format can be previewed.
-
Enter a description for each sample that will be visible on the product detail page.
-
Choose Save.
-
-
Under Product definition, enter a Short description and a Long description of your product.
If you want to use a template for your long description, select Apply template, choose your template type, and then provide your specific product details in the template.
-
Choose Next.
-
Configure your offer.
-
If you're creating a public offer, in the Add public offer section, configure your offer. All AWS Data Exchange products with visibility set to Public require a public offer.
-
Choose your Pricing and access duration options for the subscription.
-
Choose your US sales tax settings, data subscription agreement (DSA), and refund policy.
-
(Optional) Set Subscription verification to control who can subscribe to this product. For more information, see Subscription verification for providers in AWS Data Exchange.
-
Choose your Offer auto-renewal option. For more information, see Creating an offer for AWS Data Exchange products.
-
Choose Next.
-
-
If you're creating a private offer, configure the offer details in the Add custom offer section.
-
In the Subscriber account information section, add at least one subscriber account to which you want to extend the offer.
-
Choose your Pricing and access duration options for the subscription.
-
Choose the Offer expiration date by which the subscriber must accept the offer.
-
Choose your US sales tax settings, data subscription agreement (DSA), and refund policy.
-
Choose your Offer auto-renewal option. For more information, see Creating an offer for AWS Data Exchange products.
-
Choose Next.
-
-
-
In the Review & publish section, review your product information and then expand the Product page preview to see how it will look after it’s published.
-
If you're sure that you want to make the product and public offer visible and available to everyone, choose Publish.
You've now completed the manual portion of publishing a data product with a public offer. AWS Data Exchange prepares and publishes your product. On the Product overview page, the status of your product is Awaiting approval. The status changes to Published after the product is published.
Step 3: Review and finalize
After creating your AWS Lake Formation data permission (Preview), you can Review and finalize your data set.
To review and finalize
-
Review your Data set details and Tags in Step 1 for accuracy.
-
Review your LF-Tag expression(s), Add another Lake Formation data permission (optional), Associated data catalog resource(s), and job details.
Note
Job are deleted 90 days after they’re created.
-
Choose Finalize.
Step 5: (Optional) Create a revision
To create a revision
-
From the Owned data sets section, choose the data set for which you want to add a revision.
-
Choose the Revisions tab.
-
In the Revisions section, choose Create revision.
-
On the Revise Lake Formation data permission page, choose Add LF-Tag.
-
Review the Permissions for Database and Table.
-
From Service access, select an existing service role and then choose Create Lake Formation data permission.
Step 6: Publish a new product containing AWS Lake Formation data sets (Preview)
After you've created at least one data set and finalized a revision with assets, you're ready to publish a product with AWS Lake Formation data sets. For more information, see Product best practices in AWS Data Exchange. Make sure that you have all required details about your product.
To publish a new product containing AWS Lake Formation data sets (Preview)
-
Open your web browser and sign in to the AWS Data Exchange console
. -
From the left navigation pane, under Publish data, choose Products.
-
From Products, choose Publish new product to open the Publish new product wizard.
-
In the Product visibility section, choose your product's Product visibility options and Sensitive information configuration, and then choose Next. For more information, see Product visibility in AWS Data Exchange and Sensitive categories of information in AWS Data Exchange.
-
In the Add data section, under Owned data sets, select the check boxes next to the data sets that you want to add, and then choose Add selected.
Note
The data sets you choose must have a finalized revision. Data sets without finalized revisions aren't added.
-
Go to Selected data sets to review your selections.
You can review the Name of the data set, the Type of data set, and the timestamp of when the data set was Last updated.
-
Go to Select revision access rules, choose the revision access rules that you want to set for data sets included in this product, and then choose Next.
For more details, see Revision access rules in AWS Data Exchange.
-
Considerations when publishing an AWS Lake Formation data permission data set (Preview)
To ensure an optimal subscriber experience, we strongly advise against making any of the following modifications to any permissions where your product contains AWS Data Exchange for Lake Formation data sets (Preview) with active subscribers to that product.
-
We recommend not deleting or modifying IAM roles passed to AWS Data Exchange in published products containing AWS Lake Formation data sets. If you delete or modify such IAM roles, the following issues occur:
-
AWS accounts that have access to the Lake Formation data permissions might retain access indefinitely.
-
AWS accounts that subscribe to your product but have not yet received access to the Lake Formation data permissions will fail to receive access.
AWS Data Exchange will not be liable for any IAM roles that you delete or modify.
-
-
We recommend that you don’t revoke granted AWS Lake Formation data permissions from IAM roles passed to AWS Data Exchange in published product containing AWS Lake Formation data sets. If you revoke granted data permissions from such IAM roles, the following issues occur:
-
AWS accounts that have access to the Lake Formation data permissions might retain access indefinitely.
-
AWS accounts that subscribe to your product but have not yet received access to the Lake Formation data permissions will fail to receive access.
-
-
We recommend not revoking granted AWS Lake Formation data permissions from AWS accounts with active subscriptions to published products containing AWS Lake Formation data sets. If you revoke granted data permissions from AWS accounts subscribed to your product, those accounts will lose access, creating a poor customer experience.
-
We recommend setting the cross account version in your AWS Glue Data Catalog to version 3 when publishing products containing AWS Lake Formation data sets. If you downgrade the cross account version of your Data Lake Catalog while having published products containing AWS Lake Formation data sets, the AWS accounts that subscribe to your product but have not yet received access to the Lake Formation data permissions may fail to get access to the data.