JOIN
Performs a join operation on two datasets.
Parameters
-
joinKeys
— A JSON-encoded string representing a list of columns from each dataset to act as join keys. -
joinType
— The type of join to perform. Must be one of:INNER_JOIN
|LEFT_JOIN
|RIGHT_JOIN
|OUTER_JOIN
|LEFT_EXCLUDING_JOIN
|RIGHT_EXCLUDING_JOIN
|OUTER_EXCLUDING_JOIN
-
leftColumns
— A JSON-encoded string representing a list of columns from the current active dataset. -
rightColumns
— A JSON-encoded string representing a list of columns from another (secondary) dataset to join to the current one. -
secondInputLocation
— An Amazon S3 URL that resolves to the data file for the secondary dataset. -
secondaryDatasetName
— The name of the secondary dataset.
Example
{ "Action": { "Operation": "JOIN", "Parameters": { "joinKeys": "[{\"key\":\"assembly_session\",\"value\":\"assembly_session\"},{\"key\":\"state_code\",\"value\":\"state_code\"}]", "joinType": "INNER_JOIN", "leftColumns": "[\"year\",\"assembly_session\",\"state_code\",\"state_name\",\"all_votes\",\"yes_votes\",\"no_votes\",\"abstain\",\"idealpoint_estimate\",\"affinityscore_usa\",\"affinityscore_russia\",\"affinityscore_china\",\"affinityscore_india\",\"affinityscore_brazil\",\"affinityscore_israel\"]", "rightColumns": "[\"assembly_session\",\"vote_id\",\"resolution\",\"state_code\",\"state_name\",\"member\",\"vote\"]", "secondInputLocation": "s3://databrew-public-datasets-us-east-1/votes.csv", "secondaryDatasetName": "votes" } } }