About the job
Join Amazon's sustainability initiatives to work on environmental and social advancements that support Amazon's long-term worldwide sustainability strategy. At Amazon, we're working to be the most customer-centric company on Earth. To get there, we need exceptionally talented, bright, and driven people.
Responsibilities
Implementing and maintaining data and AI/ML infrastructure to support a wide variety of large and complex data sets, ensuring high performance, availability, and integrity.
Identifying and addressing data needs for Gen AI and for benchmarking tasks across the sustainability domain.
Developing and optimizing robust data pipelines for internal data from sources such as Product Lifecycle Management tools, Product Details (images, text, and structured data), Inventory Management Platforms, and financial systems.
Implementing web-scale data collection for images, text, and structured data across locations and sustainability domains.
Developing comprehensive monitoring, alarming, and data quality controls for all of the above.
Partnering with Scientists and Software Engineers to create our data collection strategy and MLOps best practices.
Qualifications
Minimum
3+ years of experience building models for business applications
PhD, or Master's degree and 4+ years of experience in CS, CE, ML, or a related field
Patents or publications at top-tier peer-reviewed conferences or journals
Experience programming in Java, C++, Python, or a related language
Experience in any of the following areas: algorithms and data structures, parsing, numerical optimization, data mining, parallel and distributed computing, high-performance computing
Preferred
Experience using Unix/Linux
Experience in professional software development