FindMatches class
Package: com.amazonaws.services.glue.ml
object FindMatches
Def apply
def apply(frame: DynamicFrame,
transformId: String,
transformationContext: String = "",
callSite: CallSite = CallSite("Not provided", ""),
stageThreshold: Long = 0,
totalThreshold: Long = 0,
enforcedMatches: DynamicFrame = null): DynamicFrame,
computeMatchConfidenceScores: Boolean
Find matches in an input frame and return a new frame with a new column containing a unique ID per match group.
frame
— The DynamicFrame in which to find matches. Required.transformId
— A unique ID associated with the FindMatches transform to apply on the input frame. Required.transformationContext
— Identifier for thisDynamicFrame
. ThetransformationContext
is used as a key for the job bookmark state that is persisted across runs. Optional.callSite
— Used to provide context information for error reporting. These values are automatically set when calling from Python. Optional.stageThreshold
— The maximum number of error records allowed from the computation of thisDynamicFrame
before throwing an exception, excluding records present in the previousDynamicFrame
. Optional. The default is zero.totalThreshold
— The maximum number of total errors records before an exception is thrown, including those from previous frames. Optional. The default is zero.enforcedMatches
— The frame for enforced matches. Optional. The default isnull
.computeMatchConfidenceScores
— A Boolean value indicating whether to compute a confidence score for each group of matching records. Optional. The default is false.
Returns a new dynamic frame with a unique identifier assigned to each group of matching records.