
Description
WHAT YOU DO AT AMD CHANGES EVERYTHING
We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences – the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our mission is the AMD culture. We push the limits of innovation to solve the world's most important challenges. We strive for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives.
AMD together we advance_
About Us
At AMD, we are at the forefront of the AI revolution, pioneering the future of accelerated computing. Our groundbreaking technologies are transforming industries, from scientific research and healthcare to autonomous vehicles and generative AI. We build the platforms that empower the world's brightest minds to solve their most challenging problems.
Our Global Partner Support team is the technical backbone for our ecosystem of strategic partners, including leading Cloud Service Providers (CSPs), Original Equipment Manufacturers (OEMs), and System Integrators. We are a team of trusted technical advisors dedicated to ensuring our partners' success.
The Role
We are seeking a highly skilled and motivated Field Applications Engineer (FAE) to join our elite Global Partner Support team. In this role, you will be the go-to technical expert for our most critical partners, focusing on debugging and resolving complex, system-level issues related to our Data Center GPUs (DCGPUs) and the AI software stack.
You are a deep-level problem solver who thrives on technical challenges. You will act as a bridge between our partners and our internal engineering teams, ensuring that issues are triaged, debugged, and resolved efficiently to maintain partner momentum and customer satisfaction. This is a hands-on, deeply technical role for someone passionate about the intersection of hardware, software, and artificial intelligence.
What You'll Be Doing
- Advanced Technical Debugging: Lead advanced debugging sessions for complex, multi-node issues involving GPUs, drivers, networking, and AI frameworks. You will be the last line of defense for our partners' toughest technical problems.
- Partner Enablement: Serve as the primary technical point of contact for key global partners, providing post-sales support, escalation management, and expert-level guidance.
- Issue Replication and Resolution: Reproduce complex customer and partner issues in our labs. Analyze logs, core dumps, and performance data to isolate root causes in hardware, firmware, drivers, or software stacks.
- Engineering Collaboration: Work closely with our core engineering, product, and QA teams to report, track, and drive bugs to resolution. You will provide critical field-level feedback to improve future products.
- Knowledge Creation: Develop and communicate solutions, workarounds, and best practices. Author knowledge base articles, application notes, and whitepapers to enable broader success.
- Proactive Support: Identify potential issues and trends from the field and work proactively with partners to implement preventative measures and best practices.
- Training and Education: Occasionally create and deliver technical training sessions for partner engineering teams on new products, features, and debugging methodologies.
What We Need to See (Required Qualifications)
- Bachelor's or Master's degree in Computer Science, Electrical Engineering, or a related technical field, or equivalent experience.
- Demonstrated experience in a technical support, solutions architecture, or systems engineering role.
- Deep expertise in Linux systems administration, performance tuning, and debugging (e.g., strace, gdb, kernel logs).
- Strong hands-on experience with Data Center GPUs (e.g., AMD MI300X, NVIDIA H100) and an understanding of GPU architecture.
- Proficiency with major AI/ML frameworks (e.g., TensorFlow, PyTorch) and an understanding of their operational lifecycle.
- Strong scripting and programming skills for automation and debugging (Python and Bash are essential).
- Excellent problem-solving, analytical, and communication skills, with a proven ability to interact effectively with highly technical customer/partner teams.
Ways to Stand Out (Preferred Qualifications)
- Hands-on experience with CUDA programming and debugging tools (e.g., cuda-gdb, Nsight Systems, Nsight Compute).
- Experience with C/C++ programming and debugging.
- Familiarity with high-performance networking technologies (InfiniBand, RoCE) and protocols.
- Experience with containerization and orchestration technologies (Docker, Kubernetes, Slurm).
- In-depth knowledge of server hardware architecture, including PCIe, NVLink/NVSwitch, and memory subsystems.
- Direct experience debugging large-scale HPC or AI training/inference clusters.
- Previous experience working directly with OEMs, CSPs, or large enterprise customers.
What We Offer
- A competitive salary, equity, and bonus package.
- Comprehensive health, dental, and vision insurance.
- Flexible work environment and a culture of trust and autonomy.
- A unique opportunity to work on the cutting-edge of AI and accelerated computing.
- Budget for professional development and continuous learning.
Location: Austin, TX
#LI-RF1
Benefits offered are described: AMD benefits at a glance.
AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants' needs under the respective laws throughout all stages of the recruitment and selection process.
Apply on company website