Machine Learning - Compiler Engineer , AWS Neuron, Annapurna Labs

Company: Amazon
Location: Cupertino
Posted on: April 3, 2026

Job Description:

Do you want to be part of AI revolution? At AWS our vision is to make deep learning pervasive for everyday developers and to democratize access to AI hardware and software infrastructure. In order to deliver on that vision, we’ve created innovative software and hardware solutions that make it possible. AWS Neuron is the SDK that optimizes the performance of complex ML models executed on AWS Inferentia and Trainium, our custom chips designed to accelerate deep-learning workloads. This role is for a software engineer in the Compiler team for AWS Neuron. As part of this role, you will be responsible for building next generation Neuron compiler which transforms ML models written in ML frameworks (e.g, PyTorch, TensorFlow, and JAX) to be deployed AWS Inferentia and Trainium based servers in the Amazon cloud. You will be responsible for solving hard compiler optimization problems to achieve optimum performance for variety of ML model families including massive scale large language models like Llama, Deepseek, and beyond as well as stable diffusion, vision transformers and multi-model models. You will be required to understand how these models work inside-out to make informed decisions on how to best coax the compiler to generate optimal implementation instruction. You will leverage your technical communications skill to partner with internal and external customers/stakeholders and will be involved in pre-silicon design, bringing new products/features to market, ultimately, making Neuron compiler highly performant and easy-to-use. Experience in object-oriented languages like C++/Java is a must, experience with compilers or building ML models using ML frameworks on accelerators (e.g., GPUs) is preferred but not required. Experience with technologies like OpenXLA, StableHLO, MLIR will be added bonus! Explore the product and our history! https://awsdocs-neuron.readthedocs-hosted.com/en/latest/neuron-guide/neuron-cc/index htmlhttps://aws.amazon.com/machine-learning/neuron/ https://github.com/aws/aws-neuron-sdk https://www.amazon.science/how-silicon-innovation-became-the-secret-sauce-behind-awss-success AWS Utility Computing (UC) provides product innovations — from foundational services such as Amazon’s Simple Storage Service (S3) and Amazon Elastic Compute Cloud (EC2), to consistently released new product innovations that continue to set AWS’s services and features apart in the industry. As a member of the UC organization, you’ll support the development and management of Compute, Database, Storage, Internet of Things (Iot), Platform, and Productivity Apps services in AWS, including support for customers who require specialized security solutions for their cloud services. Key job responsibilities You will design, implement, test, deploy and maintain innovative software solutions to transform Neuron compiler’s performance, stability and user-interface. You will work side by side with chip architects, runtime/OS engineers, scientists and ML Apps teams to seamlessly deploy state of the art ML models from our customers on AWS accelerators with optimal cost/performance benefits. You will have opportunity to work with open-source software (e.g., StableHLO, OpenXLA, MLIR) to pioneer optimizing advanced ML workloads on AWS software and hardware. You will also work on building innovative features that will deliver best possible experiences for our customers – developers across the globe. A day in the life As you design and code solutions to help our team drive efficiencies in compiler architecture, you’ll create compiler optimization and verification passes, build features surface features and peculiarities of AWS accelerators to developers, implement tools to analyze numerical errors, and resolve the root cause of compiler defects. You’ll also participate in design discussions, code review, and communicate with internal (other Neuron SDK and Amazon wide teams) and external stakeholders (open-source communities). Lastly, work in a startup-like development environment, where you’re always working on the most important stuff. About the team About the Team Our team is dedicated to supporting new members. We have a broad mix of experience levels and tenures, and we’re building an environment that celebrates knowledge-sharing and mentorship. Our senior members enjoy one-on-one mentoring and thorough, but kind, code reviews. We care about your career growth and strive to assign projects that help our team members develop your engineering expertise so you feel empowered to take on more complex tasks in the future. Diverse Experiences AWS values diverse experiences. Even if you do not meet all of the qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying. About AWS Amazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating — that’s why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses. Inclusive Team Culture Here at AWS, it’s in our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empower us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon conferences, inspire us to never stop embracing our uniqueness. Work/Life Balance We value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud. Mentorship & Career Growth We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional. - 3 years of non-internship professional software development experience - 2 years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience - Experience programming with at least one software programming language - Master's degree or PhD in Computer Science, or a related technical field. - 3 years of experience writing production grade code in object-oriented languages such as C++/Java. - Experience in compiler design for CPU/GPU/Vector engines/ML-accelerators. - Experience with OpenSource compiler toolset like LLVM/MLIR. - Experience with the following technologies: PyTorch, OpenXLA, StableHLO, JAX, TVM, deep learning models, and algorithms. - Experience with modern build systems like Bazel/CMake. Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status. Los Angeles County applicants: Job duties for this position include: work safely and cooperatively with other employees, supervisors, and staff; adhere to standards of excellence despite stressful conditions; communicate effectively and respectfully with employees, supervisors, and staff to ensure exceptional customer service; and follow all federal, state, and local laws and Company policies. Criminal history may have a direct, adverse, and negative relationship with some of the material job duties of this position. These include the duties and responsibilities listed above, as well as the abilities to adhere to company policies, exercise sound judgment, effectively manage stress and work safely and respectfully with others, exhibit trustworthiness and professionalism, and safeguard business operations and the Company’s reputation. Pursuant to the Los Angeles County Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records. Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit https://amazon.jobs/content/en/how-we-hire/accommodations for more information. If the country/region you’re applying in isn’t listed, please contact your Recruiting Partner. The base salary range for this position is listed below. Your Amazon package will include sign-on payments and restricted stock units (RSUs). Final compensation will be determined based on factors including experience, qualifications, and location. Amazon also offers comprehensive benefits including health insurance (medical, dental, vision, prescription, Basic Life & AD&D insurance and option for Supplemental life plans, EAP, Mental Health Support, Medical Advice Line, Flexible Spending Accounts, Adoption and Surrogacy Reimbursement coverage), 401(k) matching, paid time off, and parental leave. Learn more about our benefits at https://amazon.jobs/en/benefits . USA, CA, Cupertino - 165,200.00 - 223,600.00 USD annually

Keywords: Amazon, Hayward , Machine Learning - Compiler Engineer , AWS Neuron, Annapurna Labs, IT / Software / Systems , Cupertino, California

Didn't find what you're looking for? Search again!

Let Cupertino recruiters find you. Post your resume for free!

Get Cupertino IT / Software / Systems jobs via email.

View more Hayward IT / Software / Systems jobs

Other IT / Software / Systems Jobs

Technical Consultant
Description: I'm Hiring Qualifications: 2-5 years of relevant experience Excellent Linux skills
Company: Cardinal Search Group
Location: San Francisco
Posted on: 04/4/2026

Robotics Software Engineer (Perception & Localization)
Description: Orchard Robotics is a Series A startup backed by top VCs like Quiet Capital, Shine Capital, and General Catalyst. We're securing America s food supply by building the AI farmer that automates our nation (more...)
Company: Orchard Robotics
Location: San Francisco
Posted on: 04/4/2026

Senior Developer Productivity Engineer
Description: About the Role Together AI is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together (more...)
Company: Together AI
Location: San Francisco
Posted on: 04/4/2026

Salary in Hayward, California Area | More details for Hayward, California Jobs |Salary

Relationship Executive - Government Banking - Executive Director
Description: Description You are customer focused, enjoy building relationships and providing financial advice to your clients. A role as a Government Relationship Executive is for you. As a Relationship Executive (more...)
Company: JPMorganChase
Location: San Francisco
Posted on: 04/4/2026

HR Business Partner
Description: About Glean: Glean is the Work AI platform that helps everyone work smarter with AI. What began as the industry s most advanced enterprise search has evolved into a full-scale Work AI ecosystem, powering (more...)
Company: Glean
Location: San Francisco
Posted on: 04/4/2026

AI Applications Engineer
Description: About Us: Notion helps you build beautiful tools for your life s work. In today's world of endless apps and tabs, Notion provides one place for teams to get everything done, seamlessly connecting docs, (more...)
Company: Notion
Location: San Francisco
Posted on: 04/4/2026

Product Design Lead, AI Products
Description: About Glean: Glean is the Work AI platform that helps everyone work smarter with AI. What began as the industry s most advanced enterprise search has evolved into a full-scale Work AI ecosystem, powering (more...)
Company: Glean
Location: San Francisco
Posted on: 04/4/2026

Research Scientist
Description: About Snorkel At Snorkel, we believe meaningful AI doesn t start with the model, it starts with the data. We re on a mission to help enterprises transform expert knowledge into specialized AI at scale. (more...)
Company: Snorkel AI
Location: San Francisco
Posted on: 04/4/2026

Deployment Strategist
Description: About David AI David AI is the first audio data research company. We bring an R amp D approach to data developing datasets with the same rigor AI labs bring to models. Our mission is to bring AI into (more...)
Company: David AI
Location: San Francisco
Posted on: 04/4/2026

Quantitative UX Researcher, Search
Description: info outline X Applicants in San Francisco: Qualified applications with arrest or conviction records will be considered for employment in accordance with the San Francisco Fair Chance Ordinance for Employers (more...)
Company: Google
Location: San Francisco
Posted on: 04/4/2026

Loading more jobs...

Machine Learning - Compiler Engineer , AWS Neuron, Annapurna Labs

Didn't find what you're looking for? Search again!

Other IT / Software / Systems Jobs

Log In or Create An Account