Detectors
RangeValueDetector
RareCategoryDetector
RareCategoryDetector(min_occurrences: int = 1)
Detect rare categories.
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
min_occurrences
|
int
|
The minimum number of occurrences for a category to avoid detection. |
1
|
SpecialValuesDetector
SpecialValuesDetector(values: Sequence)
Detect specific given values.
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
values
|
Sequence
|
The sequence of values to be detected. |
required |
QuantileTailsDetector
ThresholdCategoryDetector
ThresholdCategoryDetector(threshold: float = 0.99)
Detect the lowest frequency categories in order to avoid detection of a given minimal fraction of the total number of categories.
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
threshold
|
float
|
The minimal fraction of the total number of categories to avoid detection. Must be a float between 0 and 1. |
0.99
|
DateTimeQuantileTailsDetector
DateTimeQuantileTailsDetector(
quantile: float = 0.005,
side: Side = BOTH,
fmt: str | None = None,
)
Detect datetime values in the tails of the data distribution.
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
quantile
|
float
|
The quantile defining the tails. Must be between 0 and 1. |
0.005
|
side
|
Side
|
The side of the distribution to consider. Should be a |
BOTH
|
fmt
|
str | None
|
The datetime format. If None, it will be automatically inferred. |
None
|