示例输出 – 方法 –__call__apply name describeArgs describeReturn describeTransform describeErrors describe

FormatCase 类

FormatCase 转换会将列中的每个字符串更改为指定的大小写类型。

示例


from pyspark.context import SparkContext
from pyspark.sql import SparkSession
from awsgluedi.transforms import *

sc = SparkContext()
spark = SparkSession(sc)

datasource1 = spark.read.json("s3://${BUCKET}/json/zips/raw/data")

try:
    df_output = data_cleaning.FormatCase.apply(
        data_frame=datasource1,
        spark_context=sc,
        source_column="city",
        case_type="LOWER"
    )    
except:
    print("Unexpected Error happened ")
    raise

输出

FormatCase 转换会根据“case_type="LOWER"”参数将“city”列中的值转换为小写字母。生成的“df_output”DataFrame 将包含原始“datasource1”DataFrame 中的所有列，但“city”列的值为小写。

方法

__call__
apply
name
describeArgs
describeReturn
describeTransform
describeErrors
describe

call(spark_context, data_frame, source_column, case_type)

FormatCase 转换会将列中的每个字符串更改为指定的大小写类型。

source_column – 现有列的名称。
case_type – 支持的大小写类型为 CAPITAL、LOWER、UPPER、SENTENCE。

apply(cls, *args, **kwargs)

继承自 GlueTransform apply。

name(cls)

继承自 GlueTransform name。

describeArgs(cls)

继承自 GlueTransform describeArgs。

describeReturn(cls)

继承自 GlueTransform describeReturn。

describeTransform(cls)

继承自 GlueTransform describeTransform。

describeErrors(cls)

继承自 GlueTransform describeErrors。

describe(cls)

继承自 GlueTransform describe。

Javascript 在您的浏览器中被禁用或不可用。

要使用 Amazon Web Services 文档，必须启用 Javascript。请参阅浏览器的帮助页面以了解相关说明。

文档惯例

FormatPhoneNumber

FillWithMode