great_expectations.rule_based_profiler.parameter_builder.regex_pattern_string_parameter_builder

Module Contents

Classes

RegexPatternStringParameterBuilder(name: str, metric_domain_kwargs: Optional[Union[str, dict]] = None, metric_value_kwargs: Optional[Union[str, dict]] = None, threshold: Union[str, float] = 1.0, candidate_regexes: Optional[Union[str, Iterable[str]]] = None, evaluation_parameter_builder_configs: Optional[List[ParameterBuilderConfig]] = None, data_context: Optional[AbstractDataContext] = None)

Detects the domain REGEX from a set of candidate REGEX strings by computing the

great_expectations.rule_based_profiler.parameter_builder.regex_pattern_string_parameter_builder.logger
class great_expectations.rule_based_profiler.parameter_builder.regex_pattern_string_parameter_builder.RegexPatternStringParameterBuilder(name: str, metric_domain_kwargs: Optional[Union[str, dict]] = None, metric_value_kwargs: Optional[Union[str, dict]] = None, threshold: Union[str, float] = 1.0, candidate_regexes: Optional[Union[str, Iterable[str]]] = None, evaluation_parameter_builder_configs: Optional[List[ParameterBuilderConfig]] = None, data_context: Optional[AbstractDataContext] = None)

Bases: great_expectations.rule_based_profiler.parameter_builder.ParameterBuilder

Detects the domain REGEX from a set of candidate REGEX strings by computing the column_values.match_regex_format.unexpected_count metric for each candidate format and returning the format that has the lowest unexpected_count ratio.

CANDIDATE_REGEX :Set[str]
property metric_domain_kwargs(self)
property metric_value_kwargs(self)
property threshold(self)
property candidate_regexes(self)
_build_parameters(self, domain: Domain, variables: Optional[ParameterContainer] = None, parameters: Optional[Dict[str, ParameterContainer]] = None, recompute_existing_parameter_values: bool = False)

Builds ParameterContainer object that holds ParameterNode objects with attribute name-value pairs and details.

Check the percentage of values matching the REGEX string, and return the best fit, or None if no string exceeds the configured threshold.

Builds ParameterContainer object that holds ParameterNode objects with attribute name-value pairs and details.