pyvalues.dictionary_classifier¶
- class pyvalues.dictionary_classifier.DictionaryClassifier(language: ~pydantic_extra_types.language_code.LanguageAlpha2, dictionaries: dict[str, str | ~typing.Tuple[str, float, float]] | ~io.FileIO, tokenize: ~typing.Callable[[str], list[str]] = <function simple_tokenize>, normalize_token: ~typing.Callable[[str, ~pydantic_extra_types.language_code.LanguageAlpha2], str] = <function normalize_dictionary_token>, score_threshold: float = 5e-324, max_values: int = 0)¶
Bases:
objectAbstract base class for a classifier that assigns values based on a dictionary.
- class pyvalues.dictionary_classifier.OriginalValuesDictionaryClassifier(language: ~pydantic_extra_types.language_code.LanguageAlpha2 | str, dictionaries: dict[str, str | ~typing.Tuple[str, float, float]] | ~io.FileIO, tokenize: ~typing.Callable[[str], list[str]] = <function simple_tokenize>, normalize_token: ~typing.Callable[[str, ~pydantic_extra_types.language_code.LanguageAlpha2], str] = <function normalize_dictionary_token>, score_threshold: float = 5e-324, max_values: int = 0)¶
Bases:
DictionaryClassifier,OriginalValuesClassifierClassifier that assigns values based on a dictionary.
- classify_segments_for_original_values(segments: Iterable[str], language: LanguageAlpha2 | str = 'en') Generator[OriginalValues, None, None]¶
Classifies each segment.
- Parameters:
segments (Iterable[str]) – The segments to classify
language (LanguageAlpha2 | str) – The language of the segments
- Returns:
Classifications for each segment
- Return type:
Generator[OriginalValues, None, None]
- class pyvalues.dictionary_classifier.OriginalValuesWithAttainmentDictionaryClassifier(language: ~pydantic_extra_types.language_code.LanguageAlpha2 | str, dictionaries: dict[str, str | ~typing.Tuple[str, float, float]] | ~io.FileIO, tokenize: ~typing.Callable[[str], list[str]] = <function simple_tokenize>, normalize_token: ~typing.Callable[[str, ~pydantic_extra_types.language_code.LanguageAlpha2], str] = <function normalize_dictionary_token>, score_threshold: float = 5e-324, max_values: int = 0)¶
Bases:
DictionaryClassifier,OriginalValuesWithAttainmentClassifierClassifier that assigns values based on a dictionary.
- classify_segments_for_original_values_with_attainment(segments: Iterable[str], language: LanguageAlpha2 | str = 'en') Generator[OriginalValuesWithAttainment, None, None]¶
Classifies each segment.
- Parameters:
segments (Iterable[str]) – The segments to classify
language (LanguageAlpha2 | str) – The language of the segments
- Returns:
Classification for each segment
- Return type:
Generator[OriginalValuesWithAttainment, None, None]
- class pyvalues.dictionary_classifier.RefinedCoarseValuesDictionaryClassifier(language: ~pydantic_extra_types.language_code.LanguageAlpha2 | str, dictionaries: dict[str, str | ~typing.Tuple[str, float, float]] | ~io.FileIO, tokenize: ~typing.Callable[[str], list[str]] = <function simple_tokenize>, normalize_token: ~typing.Callable[[str, ~pydantic_extra_types.language_code.LanguageAlpha2], str] = <function normalize_dictionary_token>, score_threshold: float = 5e-324, max_values: int = 0)¶
Bases:
DictionaryClassifier,RefinedCoarseValuesClassifierClassifier that assigns values based on a dictionary.
- classify_segments_for_refined_coarse_values(segments: Iterable[str], language: LanguageAlpha2 | str = 'en') Generator[RefinedCoarseValues, None, None]¶
Classifies each segment.
- Parameters:
segments (Iterable[str]) – The segments to classify
language (LanguageAlpha2 | str) – The language of the segments
- Returns:
Classification for each segment
- Return type:
Generator[RefinedCoarseValues, None, None]
- class pyvalues.dictionary_classifier.RefinedCoarseValuesWithAttainmentDictionaryClassifier(language: ~pydantic_extra_types.language_code.LanguageAlpha2 | str, dictionaries: dict[str, str | ~typing.Tuple[str, float, float]] | ~io.FileIO, tokenize: ~typing.Callable[[str], list[str]] = <function simple_tokenize>, normalize_token: ~typing.Callable[[str, ~pydantic_extra_types.language_code.LanguageAlpha2], str] = <function normalize_dictionary_token>, score_threshold: float = 5e-324, max_values: int = 0)¶
Bases:
DictionaryClassifier,RefinedCoarseValuesWithAttainmentClassifierClassifier that assigns values based on a dictionary.
- classify_segments_for_refined_coarse_values_with_attainment(segments: Iterable[str], language: LanguageAlpha2 | str = 'en') Generator[RefinedCoarseValuesWithAttainment, None, None]¶
Classifies each segment.
- Parameters:
segments (Iterable[str]) – The segments to classify
language (LanguageAlpha2 | str) – The language of the segments
- Returns:
Classifications for each segment
- Return type:
Generator[RefinedCoarseValuesWithAttainment, None, None]
- class pyvalues.dictionary_classifier.RefinedValuesDictionaryClassifier(language: ~pydantic_extra_types.language_code.LanguageAlpha2 | str, dictionaries: dict[str, str | ~typing.Tuple[str, float, float]] | ~io.FileIO, tokenize: ~typing.Callable[[str], list[str]] = <function simple_tokenize>, normalize_token: ~typing.Callable[[str, ~pydantic_extra_types.language_code.LanguageAlpha2], str] = <function normalize_dictionary_token>, score_threshold: float = 5e-324, max_values: int = 0)¶
Bases:
DictionaryClassifier,RefinedValuesClassifierClassifier that assigns values based on a dictionary.
- classify_segments_for_refined_values(segments: Iterable[str], language: LanguageAlpha2 | str = 'en') Generator[RefinedValues, None, None]¶
Classifies each segment.
- Parameters:
segments (Iterable[str]) – The segments to classify
language (LanguageAlpha2 | str) – The language of the segments
- Returns:
Classifications for each segment
- Return type:
Generator[RefinedValues, None, None]
- class pyvalues.dictionary_classifier.RefinedValuesWithAttainmentDictionaryClassifier(language: ~pydantic_extra_types.language_code.LanguageAlpha2 | str, dictionaries: dict[str, str | ~typing.Tuple[str, float, float]] | ~io.FileIO, tokenize: ~typing.Callable[[str], list[str]] = <function simple_tokenize>, normalize_token: ~typing.Callable[[str, ~pydantic_extra_types.language_code.LanguageAlpha2], str] = <function normalize_dictionary_token>, score_threshold: float = 5e-324, max_values: int = 0)¶
Bases:
DictionaryClassifier,RefinedValuesWithAttainmentClassifierClassifier that assigns values based on a dictionary.
- classify_segments_for_refined_values_with_attainment(segments: Iterable[str], language: LanguageAlpha2 | str = 'en') Generator[RefinedValuesWithAttainment, None, None]¶
Classifies each segment.
- Parameters:
segments (Iterable[str]) – The segments to classify
language (LanguageAlpha2 | str) – The language of the segments
- Returns:
Classifications for each segment
- Return type:
Generator[RefinedValuesWithAttainment, None, None]