Senior AI Training Performance Engineer

6 months ago


Shanghai, China NVIDIA Full time

We are now looking for a Senior AI Training Performance Engineer

NVIDIA is seeking senior engineers who are obsessed with performance analysis and optimization to help us squeeze every last clock cycle out of AI training, one of the most important workloads in the world. If you are unafraid to work across all layers of the hardware/software stack, from GPU architecture to deep learning frameworks, to achieve peak performance, we want to hear from you. This role offers the opportunity to directly impact the hardware and software roadmap in a fast-growing technology company that leads the AI revolution, while helping deep learning users around the globe enjoy ever-higher training speeds.

What you will be doing:

  • Understand, analyze, profile, and optimize AI and deep learning training workloads on state-of-the-art hardware and software platforms.

  • Understand the big picture of training performance on GPUs, prioritizing and then solving problems across many dozens of state-of-the-art neural networks.

  • Implement production-quality software in multiple layers of NVIDIA's deep learning platform stack, from drivers to DL frameworks.

  • Implement key DL training workloads in NVIDIA's proprietary processor and system simulators to enable future architecture studies.

  • Build tools to automate workload analysis, workload optimization, and other critical workflows.

What we want to see:

  • PhD (or equivalent experience) in CS, EE, or CSEE with 5+ years of relevant work experience, or MS with 8+ years.

  • Strong background in deep learning and neural networks, in particular training.

  • Deep understanding of computer architecture, and familiarity with the fundamentals of GPU architecture.

  • Proven experience analyzing and tuning application performance.

  • Experience with processor and system-level performance modelling.

  • Programming skills in C++, Python, and CUDA.

Intelligent machines powered by AI computers that can learn, reason and interact with people are no longer science fiction. Today, a self-driving car powered by artificial intelligence can meander through a country road at night and find its way. An AI-powered robot can learn motor skills through trial and error. This is truly an extraordinary time. The era of AI has begun, and we are powering it. NVIDIA is increasingly known as the AI Computing company and is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people on the planet working for us. Are you passionate about performance? Are you interested in working on industry-leading Deep Learning products? Come, join our Deep Learning Architecture team, where you can help build real-time, cost-effective computing platforms driving our success in this exciting and rapidly growing field.



  • Shanghai, Shanghai, China NVIDIA Full time

    Unlock the Power of AI Training: NVIDIA is seeking a senior engineer to join our team and help us optimize AI training performance. As a key member of our Deep Learning Architecture team, you will work closely with our engineers to analyze, profile, and optimize AI and deep learning training workloads on state-of-the-art hardware and software platforms. Key...


  • Shanghai, Shanghai, China NVIDIA Full time

    Transform AI Training Performance: NVIDIA is seeking senior engineers who excel at performance analysis and optimization to drive AI training efficiency. If you're passionate about squeezing every last clock cycle out of AI training, we want to hear from you. This role offers the opportunity to directly impact the hardware and software roadmap in a...


  • Shanghai, Shanghai, China NVIDIA Full time

    We are seeking a highly skilled Senior AI Training Performance Engineer to join our team at NVIDIA. This role offers the opportunity to directly impact the hardware and software roadmap in a fast-growing technology company that leads the AI revolution. About the Role: The successful candidate will have a strong background in deep learning and neural networks,...


  • Shanghai, Shanghai, China NVIDIA Full time

    We are seeking a highly skilled AI Performance Optimization Engineer to join our team at NVIDIA. As a key member of our research and development team, you will be responsible for developing and optimizing deep learning software for inference on NVIDIA GPUs. Key Responsibilities: Develop highly optimized deep learning kernels for inference. Perform performance...


  • Shanghai, Shanghai, China NVIDIA Full time

    We are expanding our research and development for Inference at NVIDIA. We seek excellent Software Engineers and Senior Software Engineers to collaborate with the deep learning community. Key Responsibilities: Develop highly optimized deep learning kernels for inference. Perform performance optimization, analysis, and tuning. Work with cross-collaborative teams...

  • AI Senior Manager

    1 month ago


    Shanghai, Shanghai, China Porsche Engineering Group Full time

    Job Summary: We are seeking a highly skilled Senior Manager of AI to lead our AI initiatives and drive business growth through the development and execution of AI strategies. Key Responsibilities: Lead the development and execution of AI strategies, including identifying opportunities for AI integration and leveraging AI technologies to drive business growth and...

  • Senior Data Scientist

    4 weeks ago


    Shanghai, China Bosch Group Full time

    Job Description: We are seeking a highly skilled and experienced Senior Data Scientist / AI Engineer to join our team. As a Senior Data Scientist / AI Engineer, you will be responsible for developing and implementing cutting-edge solutions using Python, Machine Learning, LLMs, and classical NLP methodologies. You will work closely with our team of experts to...

  • Senior Data Scientist

    4 weeks ago


    Shanghai, China Bosch Full time

    Job Description: We are seeking a highly skilled and experienced Senior Data Scientist / AI Engineer to join our team. As a Senior Data Scientist / AI Engineer, you will be responsible for developing and implementing cutting-edge solutions using Python, Machine Learning, LLMs, and classical NLP methodologies. You will work closely with our team of experts...


  • Shanghai, Shanghai, China TÜV Rheinland Full time

    Job Description: As a Senior Performance Engineer, you will be responsible for handling high-performance projects, achieving targets, and maintaining work quality, efficiency, and project lead time. You will work closely with safety labs and subcontractors to ensure seamless project execution. Key Responsibilities: Handle high-performance projects and achieve...


  • Shanghai, Shanghai, China NVIDIA Full time

    We are seeking a skilled Deep Learning Performance Software Engineer to expand our research and development in Inference. This role involves developing highly optimized deep learning kernels for inference, working with cross-collaborative teams, and occasionally traveling to conferences and customers. As a Deep Learning Performance Software Engineer at...


  • Shanghai, Shanghai, China NVIDIA Full time

    NVIDIA is seeking a senior distributed systems engineer to work on our AI infrastructure team. Our team is responsible for enabling NVIDIA and our customers to scale up machine learning workflows. We are building and optimizing human-in-the-loop flows that enable massive state-of-the-art systems in artificial intelligence and machine learning. Key...

  • AI Solutions Engineer

    6 months ago


    Shanghai, China Thermo Fisher Scientific Full time

    Explore New Capabilities: Stay updated with the latest advancements in OpenAI and other LLMs, including China Local AI. Conduct research and experiments to identify new capabilities and potential applications for our organization. Evaluate the feasibility and impact of integrating these technologies into our existing systems. Collaboration: Work...


  • Shanghai, Shanghai, China Porsche Engineering Group Full time

    Main Responsibilities: Lead the development and execution of AI strategies, including identifying opportunities for AI integration and leveraging AI technologies to drive business growth and innovation. Oversee the design and implementation of AI-powered systems, such as predictive analytics, natural language processing, machine learning, and computer vision,...


  • Shanghai, Shanghai, China NVIDIA Full time

    Develop Innovative Speech Solutions with NVIDIA: We're seeking a highly skilled Master Speech AI Engineer to join our team at NVIDIA. As a leading technology company, we're committed to pushing the boundaries of what's possible with AI. About the Role: This is an exciting opportunity to contribute to the development of cutting-edge speech AI solutions. You'll be...


  • Shanghai, Shanghai, China Bosch Full time

    Job Summary: As a Senior Generative AI Developer at Bosch, you will be responsible for developing and implementing complex use cases and algorithms using Generative AI and other technologies. Your expertise in AI will enable you to design, test, and optimize models and use cases for performance and scalability. Key Responsibilities: Develop and implement complex...


  • Shanghai, Shanghai, China RE Info Tech-Shanghai branch Company Full time

    About the Role: The Python Engineering Lead role involves leading complex research, design, and software development tasks within a software functional area or product line. This role requires direct input to project plans, schedules, and methodology in the development of cross-functional software products. Key responsibilities include designing, architecting,...


  • Shanghai, Shanghai, China Bosch Full time

    Job Summary: Bosch is seeking a highly skilled Senior Database Architect to join our team in developing cutting-edge AI solutions. As a key member of our database team, you will be responsible for designing and implementing robust database models that support our AI applications. About the Role: In this role, you will collaborate with cross-functional teams to...


  • Shanghai, Shanghai, China NVIDIA Full time

    NVIDIA is seeking a senior software engineer to join our AI Infrastructure team. Our team is responsible for building and optimizing human-in-the-loop flows which enable massive state-of-the-art systems in Artificial Intelligence / Machine Learning at NVIDIA and for our customers in many application spaces including medical imagery and autonomous driving. Key...


  • Shanghai, China SAP Full time

    We help the world run better. At SAP, we enable you to bring out your best. Our company culture is focused on collaboration and a shared passion to help the world run better. How? We focus every day on building the foundation for tomorrow and creating a workplace that embraces differences, values flexibility, and is aligned to our purpose-driven and...


  • Shanghai, Shanghai, China Amazon Information Service (Beijing) Co., Ltd. (Shanghai Branch) Full time

    As a highly skilled Applied Scientist, you will be joining the vibrant team at the AWS Shanghai AI Lab. This innovation center is dedicated to long-term research projects across machine learning, computer vision, natural language processing, and open-source AI systems. Your contributions will power products across various AWS services. Key...