pyvalues.dictionary_classifier

class pyvalues.dictionary_classifier.DictionaryClassifier(language: ~pydantic_extra_types.language_code.LanguageAlpha2, dictionaries: dict[str, str | ~typing.Tuple[str, float, float]] | ~io.FileIO, tokenize: ~typing.Callable[[str], list[str]] = <function simple_tokenize>, normalize_token: ~typing.Callable[[str, ~pydantic_extra_types.language_code.LanguageAlpha2], str] = <function normalize_dictionary_token>, score_threshold: float = 5e-324, max_values: int = 0)

Bases: object

Abstract base class for a classifier that assigns values based on a dictionary.

class pyvalues.dictionary_classifier.OriginalValuesDictionaryClassifier(language: ~pydantic_extra_types.language_code.LanguageAlpha2 | str, dictionaries: dict[str, str | ~typing.Tuple[str, float, float]] | ~io.FileIO, tokenize: ~typing.Callable[[str], list[str]] = <function simple_tokenize>, normalize_token: ~typing.Callable[[str, ~pydantic_extra_types.language_code.LanguageAlpha2], str] = <function normalize_dictionary_token>, score_threshold: float = 5e-324, max_values: int = 0)

Bases: DictionaryClassifier, OriginalValuesClassifier

Classifier that assigns values based on a dictionary.

classify_segments_for_original_values(segments: Iterable[str], language: LanguageAlpha2 | str = 'en') Generator[OriginalValues, None, None]

Classifies each segment.

Parameters:
  • segments (Iterable[str]) – The segments to classify

  • language (LanguageAlpha2 | str) – The language of the segments

Returns:

Classifications for each segment

Return type:

Generator[OriginalValues, None, None]

class pyvalues.dictionary_classifier.OriginalValuesWithAttainmentDictionaryClassifier(language: ~pydantic_extra_types.language_code.LanguageAlpha2 | str, dictionaries: dict[str, str | ~typing.Tuple[str, float, float]] | ~io.FileIO, tokenize: ~typing.Callable[[str], list[str]] = <function simple_tokenize>, normalize_token: ~typing.Callable[[str, ~pydantic_extra_types.language_code.LanguageAlpha2], str] = <function normalize_dictionary_token>, score_threshold: float = 5e-324, max_values: int = 0)

Bases: DictionaryClassifier, OriginalValuesWithAttainmentClassifier

Classifier that assigns values based on a dictionary.

classify_segments_for_original_values_with_attainment(segments: Iterable[str], language: LanguageAlpha2 | str = 'en') Generator[OriginalValuesWithAttainment, None, None]

Classifies each segment.

Parameters:
  • segments (Iterable[str]) – The segments to classify

  • language (LanguageAlpha2 | str) – The language of the segments

Returns:

Classification for each segment

Return type:

Generator[OriginalValuesWithAttainment, None, None]

class pyvalues.dictionary_classifier.RefinedCoarseValuesDictionaryClassifier(language: ~pydantic_extra_types.language_code.LanguageAlpha2 | str, dictionaries: dict[str, str | ~typing.Tuple[str, float, float]] | ~io.FileIO, tokenize: ~typing.Callable[[str], list[str]] = <function simple_tokenize>, normalize_token: ~typing.Callable[[str, ~pydantic_extra_types.language_code.LanguageAlpha2], str] = <function normalize_dictionary_token>, score_threshold: float = 5e-324, max_values: int = 0)

Bases: DictionaryClassifier, RefinedCoarseValuesClassifier

Classifier that assigns values based on a dictionary.

classify_segments_for_refined_coarse_values(segments: Iterable[str], language: LanguageAlpha2 | str = 'en') Generator[RefinedCoarseValues, None, None]

Classifies each segment.

Parameters:
  • segments (Iterable[str]) – The segments to classify

  • language (LanguageAlpha2 | str) – The language of the segments

Returns:

Classification for each segment

Return type:

Generator[RefinedCoarseValues, None, None]

class pyvalues.dictionary_classifier.RefinedCoarseValuesWithAttainmentDictionaryClassifier(language: ~pydantic_extra_types.language_code.LanguageAlpha2 | str, dictionaries: dict[str, str | ~typing.Tuple[str, float, float]] | ~io.FileIO, tokenize: ~typing.Callable[[str], list[str]] = <function simple_tokenize>, normalize_token: ~typing.Callable[[str, ~pydantic_extra_types.language_code.LanguageAlpha2], str] = <function normalize_dictionary_token>, score_threshold: float = 5e-324, max_values: int = 0)

Bases: DictionaryClassifier, RefinedCoarseValuesWithAttainmentClassifier

Classifier that assigns values based on a dictionary.

classify_segments_for_refined_coarse_values_with_attainment(segments: Iterable[str], language: LanguageAlpha2 | str = 'en') Generator[RefinedCoarseValuesWithAttainment, None, None]

Classifies each segment.

Parameters:
  • segments (Iterable[str]) – The segments to classify

  • language (LanguageAlpha2 | str) – The language of the segments

Returns:

Classifications for each segment

Return type:

Generator[RefinedCoarseValuesWithAttainment, None, None]

class pyvalues.dictionary_classifier.RefinedValuesDictionaryClassifier(language: ~pydantic_extra_types.language_code.LanguageAlpha2 | str, dictionaries: dict[str, str | ~typing.Tuple[str, float, float]] | ~io.FileIO, tokenize: ~typing.Callable[[str], list[str]] = <function simple_tokenize>, normalize_token: ~typing.Callable[[str, ~pydantic_extra_types.language_code.LanguageAlpha2], str] = <function normalize_dictionary_token>, score_threshold: float = 5e-324, max_values: int = 0)

Bases: DictionaryClassifier, RefinedValuesClassifier

Classifier that assigns values based on a dictionary.

classify_segments_for_refined_values(segments: Iterable[str], language: LanguageAlpha2 | str = 'en') Generator[RefinedValues, None, None]

Classifies each segment.

Parameters:
  • segments (Iterable[str]) – The segments to classify

  • language (LanguageAlpha2 | str) – The language of the segments

Returns:

Classifications for each segment

Return type:

Generator[RefinedValues, None, None]

class pyvalues.dictionary_classifier.RefinedValuesWithAttainmentDictionaryClassifier(language: ~pydantic_extra_types.language_code.LanguageAlpha2 | str, dictionaries: dict[str, str | ~typing.Tuple[str, float, float]] | ~io.FileIO, tokenize: ~typing.Callable[[str], list[str]] = <function simple_tokenize>, normalize_token: ~typing.Callable[[str, ~pydantic_extra_types.language_code.LanguageAlpha2], str] = <function normalize_dictionary_token>, score_threshold: float = 5e-324, max_values: int = 0)

Bases: DictionaryClassifier, RefinedValuesWithAttainmentClassifier

Classifier that assigns values based on a dictionary.

classify_segments_for_refined_values_with_attainment(segments: Iterable[str], language: LanguageAlpha2 | str = 'en') Generator[RefinedValuesWithAttainment, None, None]

Classifies each segment.

Parameters:
  • segments (Iterable[str]) – The segments to classify

  • language (LanguageAlpha2 | str) – The language of the segments

Returns:

Classifications for each segment

Return type:

Generator[RefinedValuesWithAttainment, None, None]