Published: Apr 26, 2023
NOT ON THE CURRENT EDITION
This blip is not on the current edition of the Radar. If it was on one of the last few editions, it is likely that it is still relevant. If the blip is older, it might no longer be relevant and our assessment might be different today. Unfortunately, we simply don't have the bandwidth to continuously review blips from previous editions of the Radar.
Apr 2023
Assess

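In previous Radars, we featured data validation and testing platforms such as Great Expectations, which can be used to validate assumptions and test the quality of input data used for training or classification. Sometimes, though, all you need is a simple code library to implement tests and quality checks directly in the pipeline. pandera is a Python library for testing and validating data across a variety of frame types, such as pandas, Dask or PySpark. pandera can implement simple assertions about fields or hypothesis tests based on statistical models. Because it supports such a wide range of frame libraries, tests can be written once and then applied to a variety of underlying data formats. pandera can also be used to generate synthetic data to test ML models.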
