AI Software Product Engineer

3 days ago


Shanghai, Shanghai, China AMD Full time

WHAT YOU DO AT AMD CHANGES EVERYTHING
At AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you'll discover the real differentiator is our culture. We push the limits of innovation to solve the world's most important challenges—striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond.
Together, we advance your career.
The Role
"AI Product Applications Engineer (Solution Architect) - China" position is in the AMD AI group, located in China.

Preferred Experience

  • Strong proficiency in at least one programming language — C++, Python, or Golang — with solid coding standards and software engineering practices.
  • Proven experience developing high-performance backend services and conducting performance optimization.
  • Deep understanding of GPU cloud-native technologies, with hands-on experience in Kubernetes, Argo, and related open-source orchestration frameworks (including custom extensions).
  • Familiarity with microservices architecture and distributed systems design. Experience with AIOps middleware and designing systems that are highly available and fault-tolerant.
  • Strong analytical and problem-solving skills, capable of diagnosing and resolving complex production issues.
  • Experience building end-to-end AI pipelines and working with interactive development tools such as Jupyter, Code Server, or Colab-style IDEs is a strong plus.

The Person
Success in this role requires hands-on experience with GPU cloud-native technologies and AI workloads such as LLMs, Generative AI, and Transformers. The ideal candidate is proficient in frameworks like PyTorch, Triton, vLLM, and SGLang, and has built scalable, end-to-end AI pipelines across cloud and edge environments.

Key Responsibilities

  • Architect and implement a unified GPU cloud platform to handle large-scale concurrent workloads and dynamic load balancing.
  • Contribute to the development of enterprise-grade GPU development and runtime environment, enabling efficient resource allocation and intelligent scheduling.
  • Design and develop GPU virtualization and instance management services to ensure system reliability, scalability, and operational excellence.

Academic Credentials

  • List any desired degrees, certifications, etc.
  • Use the words preferred or desired, instead of required

LOCATION:
Shanghai/Shenzhen/Beijing

Hands-on experiences with AI tools (e.g. Pytroch, vLLM, Megatron-LM, Tensorflow, Deepspeed, TensorRT-LLM, TensorRT).

  • Experience implementing LLMs, Generative AI, transformers, end to end pipeline.
  • Solid communication skills to position the architecture proposal and value proposition.
  • Familiar with AMD MI GPU architecture, ROCm AI SW, will be preferred …
  • BS required. MS preferred with 6+ years of relevant industry experience.

Benefits offered are described:
AMD benefits at a glance.

AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants' needs under the respective laws throughout all stages of the recruitment and selection process.



  • Shanghai, Shanghai, China Kong Inc. Full time

    Are you ready to power the World's connections?If you don't think you meet all of the criteria below but are still interested in the job, please apply. Nobody checks every box - we're looking for candidates that are particularly strong in a few areas, and have some interest and capabilities in others.About The RoleYou will report to the Kong AI Gateway...


  • Shanghai, Shanghai, China Intel Corporation Full time

    Job Details:Job Description:  Job Description Intel Neural Compressor team is looking for a highly motivated talent to join usResponsibilities includes:• Develop Intel Neural Compressor product and related tools to support Intel AI platform, including CPU, GPU and AI Accelerator• Research and implement quantization and compression techniques for large...


  • Shanghai, Shanghai, China Intel Corporation Full time

    Job Details:Job Description:Job DescriptionIntel Neural Compressor team is looking for a highly motivated talent to join usResponsibilities includes:Develop Intel Neural Compressor product and related tools to support Intel AI platform, including CPU, GPU and AI AcceleratorResearch and implement quantization and compression techniques for large language...


  • Shanghai, Shanghai, China Intel Corporation Full time

    Job DetailsJob Description:Intel NPU organization is dedicated to research and development for the future of AI - unprecedented scale for enabling machine intelligence on Edge, desktop, and mobile computers. While achieving a minimal power consumption and tremendous computing power, Intel AI accelerators are targeting daily use for millions of devices. Join...


  • Shanghai, Shanghai, China Intel Corporation Full time

    Job Details:Job Description: Conducts design and development to build and optimize AI software. Designs, develops, and optimizes for AI frameworks (e.g., OpenVINO) and to contribute to external frameworks (e.g., TensorFlow, PyTorch). Implements various distributed algorithms such as model/data parallel frameworks, parameter servers, dataflow based...


  • Shanghai, Shanghai, China Intel Corporation Full time

    Job DetailsJob Description:Conducts design and development to build and optimize AI software. Designs, develops, and optimizes for AI frameworks (e.g., OpenVINO) and to contribute to external frameworks (e.g., TensorFlow, PyTorch). Implements various distributed algorithms such as model/data parallel frameworks, parameter servers, dataflow based asynchronous...


  • Shanghai, Shanghai, China Qualcomm Full time

    Company: Qualcomm ChinaJob Area:Engineering Group, Engineering Group > Software Applications EngineeringGeneral Summary:GENERAL SUMMARY:Support Qualcomm Automotive platform customer projects, work for AI SDK Model Compiling, Performance Benchmark, AI application integration.Play consultant/expert role to help customer to deploy AI models, work with customer...


  • Shanghai, Shanghai, China Qualcomm Full time

    CompanyQualcomm ChinaJob AreaEngineering Group, Engineering Group > Software Applications EngineeringGeneral SummaryGENERAL SUMMARY:Support Qualcomm Automotive platform customer projects, work for AI SDK Model Compiling, Performance Benchmark, AI application integration.Play consultant/expert role to help customer to deploy AI models, work with customer on...


  • Shanghai, Shanghai, China Intel Corporation Full time

    Job DetailsJob Description:Responsible for the design, development, testing, and performance tuning of AI kernels based on Intel GPU products; work will cover AI applications, algorithm research, kernel development, middleware, frameworks, operating systems, drivers, etc.QualificationsUndergraduate or graduate students majoring in Computer Science or related...


  • Shanghai, Shanghai, China Intel Corporation Full time

    Job DetailsJob Description:Intel Neural Compressor team is looking for a highly motivated talend to join usResponsibilities IncludesDevelope Intel Neural Compressor product and related tools to support Intel AI platform, including CPU, GPU and AI AcceleratorResearch and implement quantization and compression techniques for large language models (LLMs) and...