Creating a lookalike segment
Note
You can only supply a training data set for using in a Clean Rooms ML lookalike model that has data stored in Amazon S3. However, you can supply the seed data for a lookalike model using SQL that runs across data stored in any supported data source.
A lookalike segment is a subset of the training data that most closely resembles the seed data.
To create a lookalike segment in AWS Clean Rooms
-
Sign in to the AWS Management Console and open the AWS Clean Rooms console
with your AWS account (if you haven't yet done so). -
In the left navigation pane, choose Collaborations.
-
On the With active membership tab, choose a collaboration.
-
On the ML Models tab, choose Create lookalike segment.
-
On the Create lookalike segment page, for Associated configured lookalike model, choose the associated configured lookalike model to use for this lookalike segment.
-
For Lookalike segment details enter a Name and optional Description.
-
For Seed profiles, choose your Seed method by selecting an option and then taking the recommended action.
Option Recommended action Amazon S3 path -
Select an Amazon S3 location.
-
(Optional) Choose Include seed profiles in the output.
SQL query Write a SQL query and use its results as the seed data. Analysis template Choose an analysis template from the dropdown list and use the results created by an analysis template. -
-
Choose the Worker type and Number of workers to use when creating this data channel.
-
For Service access, choose the Existing service role name that will be used to access this table.
-
If you want to enable Tags for the training dataset, choose Add new tag and then enter the Key and Value pair.
-
Choose Create lookalike segment.
For the corresponding API action, see StartAudienceGenerationJob.