Technology Radar

AWS Data Wrangler

Published : Apr 13, 2021

NOT ON THE CURRENT EDITION

This blip is not on the current edition of the Radar. If it was on one of the last few editions, it is likely that it is still relevant. If the blip is older, it might no longer be relevant and our assessment might be different today. Unfortunately, we simply don't have the bandwidth to continuously review blips from previous editions of the Radar. Understand more

Apr 2021

Trial

AWS Data Wrangler es una biblioteca de código abierto que amplía las capacidades de Pandas a AWS al conectar marcos de datos a los servicios de datos de AWS. Además de Pandas, esta biblioteca aprovecha las capacidades de Apache Arrow y Boto3 para exponer varias APIs para cargar, transformar y guardar datos provenientes de lagos y almacenes de datos. Una limitación importante de esta biblioteca es que no permite realizar pipelines distribuidos para grandes volúmenes de datos. Sin embargo, es capaz de aprovechar servicios de datos nativos, como Athena, Redshift y Timestream, para hacer el trabajo pesado y extraer datos y así expresar transformaciones complejas que se adapten bien a los marcos de datos. Hemos utilizado AWS Data Wrangler en producción y como tal, permite concentrarse en escribir transformaciones sin perder demasiado tiempo en la conectividad a los servicios de datos de AWS.

Download the PDF

English | Português

Sign up for the Technology Radar newsletter

Subscribe now

Industrias

Publicaciones Digitales y Herramientas

Todos los Insights

AWS Data Wrangler

Download the PDF

Sign up for the Technology Radar newsletter

Visit our archive to read the previous volumes