Job Description
DESCRIPTION
Are you eager to apply your data engineering skills in reshaping one of the world's largest data-driven logistics systems? Amazon's Modeling and Optimization team (MOP) is recruiting motivated individuals with strong data engineering skills to evolve the data infrastructure underlying one of the most sophisticated optimization decision-engines in the world.
Amazon’s extensive logistics system comprises thousands of fixed infrastructure nodes with millions of possible connections between them. Billions of packages flow through this network on a yearly basis, making the impact of decision improvements truly unparalleled. This magnificent challenge is a terrific opportunity for developing modern data infrastructures to support reshaping one of the world's most complex, automated, data-driven logistics systems.
In this role, you will drive the development of data engineering solutions from initial experimentation to production level deployment, including the following - Identify gaps and improvement opportunities in existing data infrastructure; Design, implement, and maintain a modern cloud-based data-infrastructure for large data-sets; Migrate existing data pipelines to your newly developed solutions; Create and manage large datasets by extracting, transforming, combining, and loading data from various heterogeneous data sources; Maintain data integrity, availability, and auditability; Manage AWS resources; Drive the adoption of new technologies and new best practices.
The successful candidate will combine strong technical abilities and leadership skills to deliver business value in a fast-paced, collaborative development environment. You will be self-driven, communicate and collaborate effectively with different technical and non-technical stake-holders, mentor junior colleagues, propagate best practices, drive innovation, and deliver successfully against high operational standards.
Key job responsibilities
You will build data infrastructure, metrics, reports and other data products on native AWS to monitor the overall health of Amazon's global supply chain and fulfillment network.
A day in the life
Build data infrastructure and data reports that will provide high-level as well as detailed views to leaders in Amazon Operations about key metrics and provide uninterrupted data access to research scientists in MOP team for key initiatives. You will also collaborate on building our next generation data lake for raw and processed weather data that will be shared with several consumers within Amazon Operations.
About the team
Our team is a mix of SDEs and Data Engineers and we build all our products from scratch on native AWS to solve business problems. Few of the systems that we invent and simplify include backend software and web apps to handle the impact of weather, simulation system to rethink Amazon inbound processes and order fulfillment and a brand new data infrastructure to generate map-based reports for key business metrics.
BASIC QUALIFICATIONS
- 1+ years of data engineering experience
- 1+ years of analyzing and interpreting data with Redshift, Oracle, NoSQL etc. experience
- Bachelor's degree in a quantitative/technical field such as computer science, engineering, statistics
- Knowledge of distributed systems as it pertains to data storage and computing
- Experience with data modeling, warehousing and building ETL pipelines
- Experience with one or more query language (e.g., SQL, PL/SQL, DDL, MDX, HiveQL, SparkSQL, Scala)
- Experience with one or more scripting language (e.g., Python, KornShell)
Job Tags
Full time,