featuretools.primitives.UpperCaseCount#

class featuretools.primitives.UpperCaseCount[source]#

计算文本中的大写字母数量。

描述

给定字符串列表，确定每个字符串中大写的字符数量。单独计算每个字母，而不仅仅是包含大写字母的每个单词。

如果字符串缺失，返回 NaN

示例

>>> x = ['This IS a string.', 'This is a string', 'aaa']
>>> upper_case_count = UpperCaseCount()
>>> upper_case_count(x).tolist()
[3.0, 1.0, 0.0]

方法

`__init__`()
`flatten_nested_input_types`(input_types)	将嵌套的列模式输入展平为单个列表。
`generate_name`(base_feature_names)
`generate_names`(base_feature_names)
`get_args_string`()
`get_arguments`()
`get_description`(input_column_descriptions[, ...])
`get_filepath`(filename)
`get_function`()
`process_text`(text)

属性

`base_of`
`base_of_exclude`
`commutative`
`default_value`	如果未找到数据，此特征返回的默认值。
`description_template`
`input_types`	输入的 woodwork.ColumnSchema 类型
`max_stack_depth`
`name`	primitive 的名称
`number_output_features`	与此特征关联的特征矩阵中的列数
`return_type`	返回的 ColumnSchema 类型
`stack_on`
`stack_on_exclude`
`stack_on_self`
`uses_calc_time`
`uses_full_dataframe`