
Description
WHAT YOU DO AT AMD CHANGES EVERYTHING
We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences – the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our mission is the AMD culture. We push the limits of innovation to solve the world's most important challenges. We strive for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives.
AMD together we advance_
THE ROLE:
We're seeking a highly technical, hands-on AI Technical Application Engineering Lead to drive the development and optimization of AI applications and frameworks for AMD's datacenter GPU products and the ROCm software ecosystem. You will lead the creation of cutting-edge application examples, contribute to open-source AI/ML projects, and ensure seamless integration with AMD ROCm software to maximize developer productivity.
We'd love to see what you've built! Please share your resume along with a link to your GitHub profile so we can check out your contributions to open-source projects. We're all about seeing your passion and technical skills in action.
THE PERSON:
In this position, you'll work with open-source projects like vLLM, SGLang, Unsloth, and other critical AI infrastructure projects, while developing comprehensive application examples that demonstrate AMD GPU capabilities. You'll work closely with framework maintainers and the broader AI community to implement new features, solve complex technical challenges, and support the open-source communities around these implementations. We're looking for someone who combines deep technical expertise with strong engineering leadership skills and thrives on solving complex AI acceleration problems. The successful candidate should demonstrate strong enthusiasm for community-building, creating content, and helping others grow, and should have the communication skills to break down tough technical topics into easy-to-understand concepts.
KEY RESPONSIBILITIES:
- Lead technical development of AI application examples and reference implementations showcasing AMD GPU performance across diverse AI/ML workloads
- Architect and implement ROCm integrations for key open-source projects including vLLM, SGLang, Unsloth, and emerging AI frameworks
- Support CI/CD and ongoing maintenance of the ROCm integrations in these projects
- Engage with the developer communities around these projects and provide the technical support they need
- Contribute high-quality code and new features to upstream open-source projects and maintain AMD's technical presence in these communities
- Develop and maintain comprehensive benchmarking suites and performance analysis tools for AI workloads on AMD hardware
- Collaborate with ROCm engineering teams to identify and resolve performance bottlenecks, API gaps, and integration challenges
- Design and implement end-to-end AI application pipelines demonstrating best practices for AMD GPU utilization in production environments
- Lead cross-functional engineering efforts to optimize AI/ML frameworks, ensuring optimal performance on AMD datacenter GPUs
- Mentor and guide a team of application engineers in advanced GPU programming, AI framework optimization, and open-source contribution practices
- Drive technical roadmap alignment between AMD hardware capabilities, ROCm software features, and emerging AI application requirements
- Analyze performance metrics, identify optimization opportunities, and implement solutions that deliver measurable improvements in AI workload throughput and efficiency
PREFERRED EXPERIENCE:
- Recent, in-depth, hands-on experience in AI/ML application development, GPU computing, and performance optimization, with significant contributions to open-source AI projects
- Proven track record of leading technical teams and delivering complex software projects in AI/ML domains, particularly involving GPU acceleration
- Expert-level proficiency in AI/ML frameworks (PyTorch, TensorFlow, JAX) with deep understanding of their internals, performance characteristics, and optimization techniques
- Strong systems programming skills in Python, C++, and CUDA/ROCm, with experience in low-level GPU programming and kernel optimization
- Demonstrated experience contributing to and maintaining open-source projects, with a portfolio of meaningful contributions to AI/ML infrastructure
- Deep technical knowledge of transformer architectures, large language models, and modern AI training/inference optimization techniques
- Experience with distributed computing, model parallelism, and scaling AI workloads across multi-GPU systems
- Strong analytical and debugging skills with ability to profile, optimize, and troubleshoot complex AI applications and system-level performance issues
- Bonus if you also have:
  - Experience with AMD GPUs and software tools for AI/ML development
  - Familiarity with cloud platforms and containers in AI/ML workflows
ACADEMIC CREDENTIALS:
- A master's degree in artificial intelligence, machine learning, or a related field.
LOCATION:
- San Jose, CA
#LI-MV1
#LI-HYBRID
Benefits offered are described in AMD benefits at a glance.
AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants' needs under the respective laws throughout all stages of the recruitment and selection process.