Deep Learning Inference Software Engineer

4 weeks ago


Shanghai, Shanghai, China NVIDIA Full time

We are seeking a talented Software Engineer to join our team at NVIDIA

We are rapidly growing our research and software development for Inference. Our team specializes in developing GPU-accelerated Deep Learning software. Researchers around the world are using NVIDIA GPUs to power a revolution in deep learning, enabling breakthroughs in numerous areas. Collaborate with the deep learning community to implement the latest algorithms for public release in TensorRT.

Key Responsibilities:

  • Develop deeply optimized deep learning kernels for inference.
  • Perform performance analysis and modeling to understand the performance limiter of current software stack as well as underlying hardware architecture.
  • Collaborate with different teams to improve both the software and architectures to extend the state of the art in performance, efficiency, reliability, and programmability.
  • Work with cross-collaborative teams across automotive, image understanding, and speech understanding to develop creative solutions.

Requirements:

  • Strong C/C++ programming and software design skills.
  • Python experience a plus.
  • Performance modeling, profiling, debug, and code optimization or architectural knowledge of CPU and GPU.
  • GPU programming experience (CUDA or OpenCL) desired.
  • Expertise in characterizing and modeling system-level performance, executing comparison studies, and documenting and publishing results.

NVIDIA is a leader in the technology industry, and we have some of the most forward-thinking and talented people in the world working for us. If you're creative and autonomous, we want to hear from you.



  • Shanghai, Shanghai, China NVIDIA Full time

    We are seeking a talented Software Engineer to join our team in developing GPU-accelerated Deep Learning software.NVIDIA is a leader in the field of Deep Learning, and we are rapidly growing our research and software development for Inference. As a member of our team, you will be responsible for developing deeply optimized deep learning kernels for...


  • Shanghai, Shanghai, China NVIDIA Full time

    We are currently seeking a skilled Deep Learning Software Engineer to join our team based in Shanghai. This is a highly competitive role that offers a salary range of $120,000 - $180,000 per year.NVIDIA's history dates back to 1999 when we invented the GPU, which revolutionized computer graphics and parallel computing. Today, our GPUs power modern AI...


  • Shanghai, Shanghai, China NVIDIA Full time

    We are seeking a highly skilled Deep Learning Optimization Engineer to join our team at NVIDIA.About the Role:This is an exciting opportunity for a talented software engineer to develop deeply optimized deep learning kernels for inference. You will be responsible for analyzing and modeling performance to identify areas of improvement in our software stack...


  • Shanghai, Shanghai, China NVIDIA Full time

    We are seeking an ambitious and innately curious individual to be our Sr. Technical Program Manager for Deep Learning Software. You will collaborate with engineering and product leaders on planning and execution of large-scale programs to develop and publish software for training and inference applications using various types of neural networks.Key...


  • Shanghai, Shanghai, China NVIDIA Full time

    We are seeking a highly skilled Technical Program Manager to lead the development and publication of software for training and inference applications using various types of neural networks. This role will involve working closely with engineering and product leaders to plan and execute large-scale programs, driving the development process and coordinating...


  • Shanghai, Shanghai, China NVIDIA Full time

    We are seeking a skilled Expert Software Quality Assurance Engineer to join NVIDIA's Deep Learning SWQA team.This position is part of the NVIDIA Deep Learning Software Quality Assurance team that defines, develops and performs tests to validate robustness and measure the performance of NVIDIA's Deep Learning software and GPU Infrastructure for autonomous...


  • Shanghai, Shanghai, China NVIDIA Full time

    We are seeking an exceptional Software Engineer to join our team at NVIDIA, working on our GPU-accelerated library of primitives for deep neural networks. The ideal candidate will have a strong background in software development, particularly in C/C++ and CUDA development, and experience with linear algebra, machine learning, and computer...


  • Shanghai, Shanghai, China NVIDIA Full time

    NVIDIA, a pioneer in the field of computer graphics and parallel computing, continues to shape the future of technology.What You'll Be Doing:Design and develop the next generation of NVDLA architectureWork on deep-learning algorithms, software development, and performance optimizationCreate function and power models for NVDLACollaborate with hardware teams...


  • Shanghai, Shanghai, China NVIDIA Full time

    NVIDIA is continuously driving innovation in the field of deep learning. As a Senior NVIDIA Deep Learning Architect, you will play a crucial role in shaping the future of AI computing.Key Responsibilities:Design and develop next-generation NVDLA architectureWork on deep-learning algorithms and software developmentDevelop function/performance/power models for...


  • Shanghai, Shanghai, China Optiver Full time

    About Us:Optiver is a global market maker with a presence in multiple continents. Our company was founded in 1986 and has since grown to become a leading liquidity provider with a diverse range of products, including listed derivatives, cash equities, ETFs, bonds, and foreign currencies.Our Vision:We aim to become the trusted partner in the development of...


  • Shanghai, Shanghai, China NVIDIA Full time

    NVIDIA is seeking a highly skilled expert in deep learning system performance to join our AI performance projection and analysis efforts.Key Responsibilities:Analyze state-of-the-art AI models on various GPU hardware platforms.Identify performance bottlenecks and propose optimizations.Perform deep learning workload analysis.Requirements:BS, MS or PhD in...


  • Shanghai, Shanghai, China Optiver Full time

    About Optiver:We are a global market maker with offices worldwide, dedicated to improving the market through competitive pricing, execution, and risk management. Our commitment is to provide liquidity on multiple exchanges across the globe in various financial instruments.Our Shanghai office has been rapidly growing since its establishment in 2012, trading...


  • Shanghai, Shanghai, China Optiver Full time

    OverviewAt Optiver, we are seeking a highly skilled Senior C++ Software Engineer to join our team. As a key member of our development team, you will be responsible for designing, developing, and deploying high-performance trading systems that drive our business forward.Key ResponsibilitiesDesign and develop large-scale trading systems using C++Collaborate...


  • Shanghai, Shanghai, China Optiver Full time

    Optiver is a global market maker with offices worldwide, seeking an exceptional machine learning engineer with a PhD degree to join the China research platform team.The ideal candidate will have an advanced understanding of neural networks and related machine learning technologies, with experience in implementing and training complex deep learning models.As...


  • Shanghai, Shanghai, China NVIDIA Full time

    We are seeking a highly skilled AI Performance Optimization Engineer to join our team at NVIDIA. As a key member of our research and development team, you will be responsible for developing and optimizing deep learning software for inference on NVIDIA GPUs.Key Responsibilities:Develop highly optimized deep learning kernels for inferencePerform performance...


  • Shanghai, Shanghai, China Optiver Full time

    About the RoleAt Optiver, we are seeking a highly skilled Senior C++ Developer to join our team. As a key member of our trading platform development team, you will be responsible for designing, developing, and deploying high-performance trading systems that drive our business forward.Key ResponsibilitiesDesign and develop large-scale trading systems using...


  • Shanghai, Shanghai, China NVIDIA Full time

    NVIDIA is driving innovation in processor and system architectures that accelerate various deep learning applications. As a leader in AI technology, we are seeking an expert deep learning system performance architect to join our AI performance projection and analysis efforts. This role presents a unique opportunity to work on performance projection,...


  • Shanghai, Shanghai, China Optiver Full time

    WHO WE ARE:At Optiver, we're a global market maker with a presence in multiple continents. Founded in 1986, we've grown to become a leading liquidity provider with a flat organizational structure, empowering our employees to make a significant impact.We provide liquidity to financial markets using our own capital, taking calculated risks to ensure efficient...


  • Shanghai, Shanghai, China Optiver Full time

    Accelerate your career in the dynamic field of machine learning with Optiver.As a Machine Learning Engineer, you'll leverage AI and deep learning to thrive in a fast-paced, cutting-edge environment. Collaborate closely with researchers and traders on new experiments, capabilities, and data sources.Key Responsibilities:Design and implement improvements to...


  • Shanghai, Shanghai, China NVIDIA Full time

    NVIDIA is a leader in the field of Artificial Intelligence and High-Performance Computing. We are seeking a highly skilled Speech AI Research Scientist to join our team and contribute to the development of our cutting-edge Speech AI product, Riva. The ideal candidate will have a strong background in deep learning and speech processing, with experience in...