GPU Kernel Developer - AI Models Advanced Micro Devices
Advanced Micro Devices
Office Location
Full Time
Experience: 3 - 3 years required
Pay: INR 50000 - INR 50000 /Month
Type: Full Time
Location: Karnataka
Skills: HIP, CUDA, OpenCL, Triton, Hardware architecture, software design, GPU kernel development, AIHPC applications, GPU Computing, CC, Python Programming
About Advanced Micro Devices
Job Description
GPU Kernel Developer - AI Models Bangalore, India Engineering 65519 Job Description WHAT YOU DO AT AMD CHANGES EVERYTHING We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our mission is the AMD culture. We push the limits of innovation to solve the worlds most important challenges. We strive for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives. AMD together we advance_ GPU Kernel Developer - AI Models THE ROLE: AMD is looking for a GPU kernel development engineer who is talented in developing high performance kernels for state-of-the-art and upcoming GPU hardware. You will be a member of a core team of incredibly talented industry specialists and will work with the very latest hardware and software technology. THE PERSON: Experienced in GPU kernel development and optimization for AI/HPC applications. Strong technical and analytical skills in GPU computing, hardware architecture, and deep understanding of HIP/CUDA/OpenCL/Triton development. Ability to work as part of a team, deliver to project scope, and communicate to a technical/non-technical audience. KEY RESPONSIBILITIES: Develop high performance GPU kernels for key AI operators on AMD GPUs Optimize GPU code using structured and disciplined methodology - profiling to identify gaps, roofline-analysis on hardware, identify key set of optimizations, establish uplift and line-of-sight, prototype and develop optimizations Support mission-critical workloads in NLP/LLM, Recommendation, Vision and Audio Collaborate and interact with system level performance architects, GPU hardware specialists, power/clock tuning teams, performance validation teams, and performance marketing teams to analyze and optimize training and inference for AI Work with open-source framework maintainers to understand their requirements and have your code changes integrated upstream Debug, maintain and optimize GPU kernels, understand and drive AI operator performance (GEMM, Attention, Distributed scale-up/out communication, etc.) Apply your knowledge of software engineering best practices PREFERRED EXPERIENCE: Knowledge of GPU computing (HIP, CUDA, OpenCL, Triton) Knowledge and experience in optimizing GPU kernels Expertise in using profiling, debugging tools Core understanding of GPU hardware Excellent C/C++/Python programming and software design skills, including debugging, performance analysis, and test design. ACADEMIC CREDENTIALS: Masters or PhD or equivalent experience in Computer Science, Computer Engineering, or related field #LI-PK1 ,