Menú

Data Engineer(Data & AI)

Data Engineer(Data & AI)2020-09-15T06:14:01-04:00<p><em><span style="font-weight: 400;">Are you at your most vibrant when you’ve successfully distilled data into its simplest, most meaningful form?</span></em></p> <p>&nbsp;</p> <p><span style="font-weight: 400;">ThoughtWorks is a global software consultancy with an aim to create a positive impact on the world through technology. Our community of technologists thinks disruptively to deliver pragmatic solutions for our clients' most complex challenges. We are curious minds who come together as collaborative and inclusive teams to push boundaries, free to be ourselves and make our mark in tech.</span></p> <p>&nbsp;</p> <p><span style="font-weight: 400;">Our developers have been contributing code to major organizations and open source projects for over 25 years. They’ve also been writing books, speaking at conferences and helping push software development forward, changing companies and even industries along the way.&nbsp;</span></p> <p>&nbsp;</p> <p><span style="font-weight: 400;">As consultants, we </span><a href="https://www.thoughtworks.com/careers/hub/consultant-life"><span style="font-weight: 400;">work onsite with our clients</span></a><span style="font-weight: 400;"> to ensure we’re evolving their technology and empowering adaptive mindsets to meet their business goals. You could influence the digital strategy of a retail giant, build a bold new mobile application for a bank or redesign platforms using event sourcing and intelligent data pipelines. You will learn to use the latest Lean and Agile thinking, create pragmatic solutions to solve mission-critical problems and challenge yourself every day.&nbsp;</span></p> <p>&nbsp;</p> <p><span style="font-weight: 400;">Data Engineers develop modern data architecture approaches to meet key business objectives and provide end-to-end data solutions. You might spend a few weeks with a new client on a deep technical review or a complete organizational review, helping them to understand the potential that data brings to solve their most pressing problems. On other projects, you might be acting as the architect, leading the design of technical solutions, or perhaps overseeing a program inception to build a new product. It could also be a software delivery project where you're equally happy coding and tech-leading the team to implement the solution.</span></p> <p><br><br></p> <p><strong>You’ll spend time on the following:</strong></p> <ul> <li style="font-weight: 400;"><span style="font-weight: 400;">You will partner with teammates to create complex data processing pipelines in order to solve our clients’ most ambitious challenges</span></li> <li style="font-weight: 400;"><span style="font-weight: 400;">You will collaborate with Data Scientists in order to design scalable implementations of their models</span></li> <li style="font-weight: 400;"><span style="font-weight: 400;">You will pair to write clean and iterative code based on TDD</span></li> <li style="font-weight: 400;"><span style="font-weight: 400;">Leverage various continuous delivery practices to deploy data pipelines</span></li> <li style="font-weight: 400;"><span style="font-weight: 400;">Advise and educate clients on how to use different distributed storage and computing technologies from the plethora of options available</span></li> <li style="font-weight: 400;"><span style="font-weight: 400;">Develop modern data architecture approaches to meet key business objectives and provide end-to-end data solutions</span></li> <li style="font-weight: 400;"><span style="font-weight: 400;">Create data models and speak to the tradeoffs of different modeling approaches</span></li> </ul> <p><strong>Here’s what we’re looking for:</strong></p> <p>&nbsp;</p> <ul> <li style="font-weight: 400;"><span style="font-weight: 400;">You have a good understanding of data modelling and experience with data engineering tools and platforms such as Kafka, Spark, and Hadoop</span></li> <li style="font-weight: 400;"><span style="font-weight: 400;">You have built large-scale data pipelines and data-centric applications using any of the distributed storage platforms such as HDFS, S3, NoSQL databases (Hbase, Cassandra, etc.) and any of the distributed processing platforms like Hadoop, Spark, Hive, Oozie, and Airflow in a production setting</span></li> <li style="font-weight: 400;"><span style="font-weight: 400;">Hands on experience in MapR, Cloudera, Hortonworks and/or cloud (AWS EMR, Azure HDInsights, Qubole etc.) based Hadoop distributions</span></li> <li style="font-weight: 400;"><span style="font-weight: 400;">You are comfortable taking data-driven approaches and applying data security strategy to solve business problems&nbsp;</span></li> <li style="font-weight: 400;"><span style="font-weight: 400;">Working with data excites you: you can build and operate data pipelines, and maintain data storage, all within distributed systems</span></li> <li style="font-weight: 400;"><span style="font-weight: 400;">Strong communication and client-facing skills with the ability to work in a consulting environment</span></li> </ul>ThoughtworksShenzhenChina