Search for More Jobs
Get alerts for jobs like this Get jobs like this tweeted to you
Company: AMD
Location: New York, NY
Career Level: Director
Industries: Technology, Software, IT, Electronics

Description



WHAT YOU DO AT AMD CHANGES EVERYTHING

We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences – the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our mission is the AMD culture. We push the limits of innovation to solve the world's most important challenges. We strive for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives. 

AMD together we advance_



PMTS - GPU Performance Engineering 

 

THE ROLE: As Principal Member of Technical Staff in the GPU Libraries group, you will provide technical leadership and strategic support across the AMD Radeon Open Ecosystem (ROCm) ecosystem. This role focuses on the in-depth characterization, analysis, and strategic enablement of GPU kernel performance across various existing and upcoming hardware architectures. You will work to evaluate proof-of-concepts GPU kernels, create solution alternatives, and work to communicate performance across a diverse audience. This position requires a breadth of GPU kernel-level expertise, rich and varied high-performance computing experience, and applied experience working in enterprise technical computing environments. 
THE PERSON: To be successful in this role, you possess a diversity of experience with desktop, cluster, and enterprise systems performance engineering with a focus on GPU-accelerated high-performance computing. You have experience in working with very large sources of performance data, and use structured engineering approaches to discover, explain, and resolve key performance bottlenecks. You find excitement in working through the complexity of performance at the edge and appreciate all the gritty details that comes with it. You understand the right balance that teams must strike to hit performance and usability, and how to collaboratively sustain that across teams in the software lifecycle.

 

KEY RESPONSIBILITIES: 

  • Establish collaborative processes across libraries teams to design, implement, validate, and deliver key software library capabilities, ensuring successful integration with GPU kernels into broader systems. Establishes, validates, and updates the software roadmap in collaboration with key stakeholders.
  • Leads the collaborative engineering scoping of new architectures, design adaptations, and creates documentation efforts as they arise across teams. Establishes strategies and programs that provide technical governance for these efforts across engineering teams.
  • Review technical designs and proposed engineering approaches of the technical staff. Ensure that plans holistically consider constraints, performance, and roadmap commitments. Conduct research to explore new technologies and methodologies that can improve GPU performance and efficiency. Propose innovative solutions to meet evolving market demands. Use organizational sensing techniques to identify plans at risk and develop strategies to uplift teams when required. Provide support and guidance to technical staff in the execution of these strategies. Identify, capture, and transform tacit knowledge into self-service and digital forms.
  • Owns the coaching and mentoring framework, the library of engineering best-practices and is accountable for the adoption of these practices by technical teams. Participates in the hiring and identification of key engineering talent; a trusted partner of technical managers.

 

PREFERRED EXPERIENCE: 

  • Expert C++ and Python programming skills with relevant applied experience using CUDA, HIP, OpenMP, MPI, or OpenCL for accelerated computing on CPUs and GPUs. Deep familiarity with other languages or DSLs is also expected. Knowledge of AI/ML Frameworks, Operators, Patterns, and Architectures. Familiarity with applied use of TensorFlow, PyTorch, and JAX.  
  • Notable applied and demonstrated experience using performance engineering skills for both CPU and GPU optimization and technical delivery. Employing continuous improvement of the software lifecycle, including build robustness, regression and performance management, software deployment and orchestration management.
  • Demonstrated contributions to open-source software development projects and driving the maturity of teams using various methodologies such as Agile, Scrum, Kanban.  
  • Experience driving the maturity of organizations through best practices, technical governance, and coaching and mentoring frameworks.  
  • Demonstrated ability to independently conduct, direct, and mentor teams to perform structured research, produce relevant executive and/or peer-reviewed publications, and distilled and synthesized materials which accelerates learning for peers and colleagues.  
     

ACADEMIC CREDENTIALS: 

  • Bachelor's or Master's degree in Computer Science, Computer Engineering, Electrical Engineering, or equivalent

LOCATION: 

New York, USA or remote 

 



Benefits offered are described:  AMD benefits at a glance.

 

AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law.   We encourage applications from all qualified candidates and will accommodate applicants' needs under the respective laws throughout all stages of the recruitment and selection process.


 Apply on company website