set default check n_failure_cases to None #784

cosmicBboy · 2022-03-05T15:09:43Z

Prior to this change, pandera reported only the first 10 unique failure cases for a given check. Users who wanted to collect all failure cases had to set `n_failure_cases=None` at the check level.

This PR makes it so that pandera collects all failure cases by default.
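
To illustrate the difference between a capped and an uncapped number of failure cases, here is a minimal standalone sketch. It does not use pandera and is not pandera's implementation: `collect_failure_cases` is a hypothetical helper, and only the parameter name `n_failure_cases` and the old cap of 10 come from the description above. It assumes that "first 10 unique failure cases" means unique failing values in order of first appearance.

```python
from typing import Callable, Hashable, Iterable, List, Optional


def collect_failure_cases(
    values: Iterable[Hashable],
    check_fn: Callable[[Hashable], bool],
    n_failure_cases: Optional[int] = None,
) -> List[Hashable]:
    """Return unique values that fail check_fn, in order of first appearance.

    Hypothetical helper for illustration only; not part of pandera.
    n_failure_cases=None means "keep every unique failure case".
    """
    seen = set()
    failures: List[Hashable] = []
    for value in values:
        # Skip values that pass the check or were already recorded.
        if check_fn(value) or value in seen:
            continue
        # Stop as soon as the cap (if any) has been reached.
        if n_failure_cases is not None and len(failures) >= n_failure_cases:
            break
        seen.add(value)
        failures.append(value)
    return failures


def is_non_negative(x: int) -> bool:
    return x >= 0


# 15 unique negative values, two of them repeated at the end.
data = list(range(-15, 5)) + [-1, -2]

capped = collect_failure_cases(data, is_non_negative, n_failure_cases=10)
everything = collect_failure_cases(data, is_non_negative)  # None: no cap

print(len(capped))      # 10
print(len(everything))  # 15
```

With a cap of 10, the five remaining failing values never appear in the report; with `None`, every unique failing value is collected, which is the behavior this PR makes the default.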

codecov · 2022-03-05T15:22:36Z

Codecov Report

Merging #784 (0de3882) into dev (ebfecc1) will decrease coverage by 0.09%.
The diff coverage is 100.00%.

@@            Coverage Diff             @@
##              dev     #784      +/-   ##
==========================================
- Coverage   97.71%   97.61%   -0.10%     
==========================================
  Files          45       44       -1     
  Lines        4026     4028       +2     
==========================================
- Hits         3934     3932       -2     
- Misses         92       96       +4

| Impacted Files | Coverage Δ | |
| --- | --- | --- |
| pandera/model_components.py | `95.57% <ø> (ø)` | |
| pandera/__init__.py | `94.28% <100.00%> (ø)` | |
| pandera/checks.py | `98.51% <100.00%> (ø)` | |
| pandera/check_utils.py | `90.00% <0.00%> (-6.67%)` | ⬇️ |
| pandera/engines/pandas_engine.py | `97.27% <0.00%> (+0.02%)` | ⬆️ |

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data

Last update ebfecc1...0de3882.

* add imports to fastapi docs * Add option to disallow duplicate column names (#758) * ENH: add duplicate detection to dataframeschema * ENH: propagate duplicate colnames check to schemamodel * Add getter setter property * make schemamodel actually work, update __str__ * fix __repr__ as well * fix incorrect default value * black formatting has changed * invert parameter naming convention * address other PR comments * fix doctests, comma in __str__ * maybe fix sphinx errors * fix ci and mypy tests * Update test_schemas.py * fix lint Co-authored-by: cosmicBboy <[email protected]> * Make SchemaModel use class name, define own config (#761) * Make SchemaModel use class name, define own config * fix * fix * fix * fix tests * fix lint and docs * add test Co-authored-by: cosmicBboy <[email protected]> * implement coercion-on-initialization for DataFrame[SchemaModel] types (#772) * implement coercion-on-initialization * pylint * Update tests/core/test_model.py Co-authored-by: Matt Richards <[email protected]> Co-authored-by: Matt Richards <[email protected]> * update conda install instructions (#776) * add documentation for pandas_engine.DateTime (#780) * add documentation for pandas_engine.DateTime * fix removed numpy_engine.Object doc * set default n_failure_cases to None (#784) * Update filtering columns for performance reasons. (#777) * Update filtering columns for performance reasons. 
* Update pandera/schemas.py * Update schemas.py * Update schemas.py * Bugfix in schemas.py Co-authored-by: Niels Bantilan <[email protected]> * implement pydantic model data type (#779) * make finding coerce failure cases faster (#792) * make finding coerce failure cases faster * fix tests * remove unneeded import * fix tests, coverage * update docs for 0.10.0 (#795) * add pyspark support, deprecate koalas (#793) * add support for pyspark.pandas, deprecate koalas * update docs * add type check in pandas generics * update docs * clean up ci * fix mypy, generics * fix generic hack * improve coverage * Add overloads to `schema.to_yaml` (#790) * Add overloads to `to_yaml` * Update schemas.py Co-authored-by: Niels Bantilan <[email protected]> * add support for logical data types * add initial support for decimal * fix dtype check * Feature: Add support for Generic to SchemaModel (#810) * Adapt SchemaModel so that it can inherit from typing.Generic * Extend SchemaModel to enable generic types in fields * fix linter Co-authored-by: Thomas Willems <[email protected]> Co-authored-by: cosmicBboy <[email protected]> * fix pandas_engine.DateTime.coerce_value not consistent with coerce (#827) * pyspark docs fixes * fix koalas link to pyspark * bump version 0.10.1 * fix pandas_engine.DateTime.coerce_value not consistent with coerce Co-authored-by: cosmicBboy <[email protected]> * Refactor logical type check method * add logical types tests * add back conftest * fix test_invalid_annotations * fix ray initialization in setup_modin_engine * fix logical type validation when output is an iterable * add Decimal data type to pandera.__init__ * remove DataType.is_logical * add logical types documentation * Update dtypes.rst * Update dtypes.rst * increase coverage * fix SchemaErrors.failure_cases with logical types * fix modin compatibility for logical type validation * fix prepare_series_check_output compatibility with pyspark * fix mypy error * Update dtypes.rst Co-authored-by: 
cosmicBboy <[email protected]> Co-authored-by: Matt Richards <[email protected]> Co-authored-by: Sean Mackesey <[email protected]> Co-authored-by: Ferdinand Hahmann <[email protected]> Co-authored-by: Robert Craigie <[email protected]> Co-authored-by: tfwillems <[email protected]> Co-authored-by: Thomas Willems <[email protected]>

set default n_failure_cases to None

0de3882

cosmicBboy merged commit 7e7d19f into dev Mar 6, 2022

cosmicBboy deleted the n-failure-cases-none branch March 19, 2022 17:56

cosmicBboy added a commit that referenced this pull request Apr 1, 2022

set default n_failure_cases to None (#784)

4efed31
