Technology Radar

pandera

Published : Apr 26, 2023

NOT ON THE CURRENT EDITION

This blip is not on the current edition of the Radar. If it was on one of the last few editions, it is likely that it is still relevant. If the blip is older, it might no longer be relevant and our assessment might be different today. Unfortunately, we simply don't have the bandwidth to continuously review blips from previous editions of the Radar. Understand more

Apr 2023

Assess

在之前的雷达中，我们介绍了例如 Great Expectations 的数据验证和测试平台，其可用于验证假设并测试用于培训或分类的输入数据的质量。但是有时候，只需要一个简单的代码库就可以直接在流水线中实现测试和质量检查。pandera 是一个 Python 库，用于测试和验证跨各种框架类型的数据，例如 pandas，Dask 或者 PySpark。 pandera 可以实现关于字段的简单断言或基于统计模型的假设验证。其广泛支持的框架库意味着只需编写一次测试就可以应用于各种底层数据格式。此外，pandera 还可以用于生成测试 ML 模型的合成数据 synthetic data to test ML models.

Download the PDF

English | Português

Sign up for the Technology Radar newsletter

Subscribe now

行业

数字出版物和工具

所有洞见

pandera

Download the PDF

Sign up for the Technology Radar newsletter

Visit our archive to read previous volumes