Machine Learning Software Platform Architect

3 weeks ago


Shanghai, China NVIDIA Full time

Widely considered to be one of the technology world’s most desirable employers, NVIDIA is an industry leader with groundbreaking developments in High-Performance Computing, Artificial Intelligence and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery and powers what were once science fiction inventions from artificial intelligence to autonomous cars.

NVIDIA is seeking a highly skilled and experienced Large Language Model (LLM) based Application Infrastructure engineer to join our growing team. The successful candidate will work at the intersection of GPU chip design and AI, you will be responsible for the design, development, and maintenance of the infrastructure around NVIDIA's internal large language model aimed at facilitating chip design.

What you'll be doing:

  • Develop and maintain the infrastructure for managing large language models (LLMs) based application specifically adapted for the chip design and hardware domain.

  • Develop and maintain LLM based applications to serve hardware engineers, such as LLM based QA bot, code generator etc.

  • Collaborate with HW chip designers and LLM research teams to understand the specific needs and challenges of GPU design and ensure the LLM infrastructure is well-suited to these needs.

  • Collaborate with LLM research teams to collect & organize training / fine-tuning data to train hardware specific language model

  • Optimize the infrastructure for performance, scalability, and reliability, and ensure the secure and efficient management of data.

  • Stay updated with the latest industry trends in AI and machine learning, and continuously look for opportunities to apply these advancements to improve the LLM infrastructure.

What we need to see:

  • 5+ years work experience in developing and maintaining AI or machine learning infrastructure, preferably in the context of large language models.

  • BS in computer science or related or equivalent experience

  • Strong proficiency in Python and web development, and familiarity with LLM related techniques e.g., langchain, vector database, prompt engineering, etc.

  • Understanding of chip design and related computational and data challenges.

  • Experience with data management, including doc cleaning, transformation, and secure storage.

  • Excellent problem-solving skills and the ability to work effectively in a team.

  • In depth understanding of Machine Learning / Deep Learning / NLP concepts.

Ways to stand out from the crowd:

  • You crafted & developed production quality microservices

  • Strong technical background in cloud/distributed infrastructure

  • An excellent plus if you are familiar with front-end development using React or Vue.js

  • Strong understanding of SQL & NoSQL Data platforms.

NVIDIA offers highly competitive salaries and a comprehensive benefits package. We have some of the most forward-thinking and hardworking people in the world working for us and, due to outstanding growth, our exclusive engineering teams are rapidly growing. Are you a creative and passionate about applying Machine Learning to solve remarkably interesting problems? Are you interested in being involved in state-of-the-art development in the field of AI & love a challenge? If so, we want to hear from you



  • Shanghai, China NVIDIA Full time

    NVIDIA is seeking a highly skilled and experienced Large Language Model (LLM) based Application Infrastructure engineer to join our growing team. The successful candidate will work at the intersection of GPU chip design and AI, you will be responsible for the design, development, and maintenance of the infrastructure around Nvidia's internal large language...


  • Shanghai, China NVIDIA Full time

    Do you love writing fast code and crafting software systems to solve complex problems? We are looking for hardworking software engineers to help design, build, and ship cuDNN: our GPU-accelerated library of primitives for deep neural networks. Intelligent machines powered by AI computers that can learn, reason, and interact with people are no longer science...


  • Shanghai, China NVIDIA Full time

    We are looking for a first-class Deep Learning Performance architect to join in us to drive the performance analysis, modelling and optimization of top Datacenter, Automotive and Client AI networks. Help building and enhancing our performance analysis infrastructure. In this role, you will analyze top inference networks, identify, prototype or model perf...

  • Software Architect

    2 weeks ago


    Shanghai, China Bose Full time

    You know the moment. It’s the first notes of that song you love, the intro to your favorite movie, or simply the sound of someone you love saying “hello.” It’s in these moments that sound matters most.At Bose, we believe sound is the most powerful force on earth. We’ve dedicated ourselves to improving it for nearly 60 years. And we’re passionate...


  • Shanghai, China NVIDIA Full time

    The Autonomous Vehicles Platform team is searching for engineers to develop and bring NVIDIA's automotive platform out to the world. You will participate in a focused effort to develop and productize ground-breaking solutions that will redefine the world of transportation and the growing field of self-driving cars. Work with hardworking and dedicated...


  • Shanghai, China NVIDIA Full time

    NVIDIA is developing processor and system architectures that accelerate deep learning and high-performance computing applications. We are looking for an expert deep learning system performance architect to join our AI performance projection and analysis efforts. In this position, you will have a chance to work on performance projection, analysis, and...

  • Embedded SW

    3 weeks ago


    Shanghai, China Bose Full time

    Job DescriptionAs Principle Embedded Software Engineer in Bose Software Platform Team / ASD , you will work with the entire platform team to architect design, implement and verify the software solutions for Bose amplifiers products. You should have a deep understanding of the Software Development Life Cycle (SDLC), and be comfortable in working in a...

  • Software Architect

    3 weeks ago


    Shanghai, China Henkel Full time

    At Henkel, you can be a game changer and craft your career. Unleash your entrepreneurial spirit by bringing your ideas to life within a global team. Our leading brands and technologies, along with our high-performing businesses will provide you with countless opportunities to develop your skills and explore new paths. Your career at Henkel will contribute...


  • Shanghai, China DNV Full time

    The successful candidate will contribute to research and development projects within the Artificial Intelligence Research Centre. She/He will report to the head of the AIRC and be responsible for the following tasks: Identify, explore and develop new research opportunities for creating value to DNV’s business activities and future strategy. Develop...


  • Shanghai, China Mercedes-Benz Full time

    Tätigkeitsbereich:Forschung & Entwicklung incl. DesignFachabteilung:Research & Development SoftwareGesellschaft:Mercedes-Benz Group China Ltd.Standort:Shanghai, CNStartdatum:sofortVeröffentlichungsdatum:..4Stellennummer:MER2XUC Join usAufgaben Job Objective Leads the advanced research and development of SW architecture in China, leads the solution...


  • Shanghai, China NVIDIA Full time

    NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology—and amazing people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers,...


  • Shanghai, China NVIDIA Full time

    NVIDIA is searching for senior web engineers to work in our AI Infrastructure. Our team is enabling NVIDIA and our customers to more easily scale up machine learning workflows - machine learning at scale requires a new vocabulary for organizing and managing data, jobs and users. We are building and optimizing human-in-the-loop flows which enable massive...


  • Shanghai, China NVIDIA Full time

    NVIDIA's GPUs and SOCs are the world leaders in performance and efficiency, and we are continually innovating in creative and unique ways to improve our ability to deliver outstanding solutions in a wide range of sectors. We are seeking Platform and Silicon Validation Tools Engineers who are passionate about what they do and are committed to making a...

  • Solution Architect

    3 weeks ago


    Shanghai, China Roche Full time

    The Position Solution Architect - China Data Management & Analytics Platform Product team Location: China The position We are looking for a highly motivated individual for the position of Solution Architect focused on Data to join the China Data Management & Analytics Platform Product team. Someone who influences their own development...


  • Shanghai, China Logitech Full time

    Description Lead Software Engineer | Platform | C++ The Role The Logitech Gaming Team is growing. We are seeking an experienced hands-on desktop application developer to join our Logitech China Software Tribe. This position, located in Shanghai, focuses on Logitech GHub programming, which encompasses developing and maintaining software...


  • Shanghai, China Mercedes-Benz Full time

    Tätigkeitsbereich:Forschung & Entwicklung incl. DesignFachabteilung:Advanced Design Center ChinaGesellschaft:Mercedes-Benz Group China Ltd.Standort:Shanghai, CNStartdatum:sofortVeröffentlichungsdatum:..4Stellennummer:MERSDArbeitszeit:Vollzeit Join usAufgaben Job Objective - Design and implement platform architecture based on SOC + Virtualization +...


  • Shanghai, China Mercedes-Benz Full time

    Tätigkeitsbereich:Forschung & Entwicklung incl. DesignFachabteilung:Middleware, BSW and IntegrationGesellschaft:Mercedes-Benz Group China Ltd.Standort:Shanghai, CNStartdatum:sofortVeröffentlichungsdatum:..4Stellennummer:MERSFArbeitszeit:Vollzeit Join usAufgaben Job Objective - Responsible for development of basic/platform software based on Adaptive...


  • Shanghai, China Amazon Information Service (Beijing) Co., Ltd. (Shanghai Branch) Full time

    Amazon Web Services, an Amazon.com Company, has been the world’s leading cloud provider for more than 17 years with the most mature, comprehensive, and broadly adopted cloud platform. We have over 200 fully featured cloud services, managed from 99 availability zones within 31 geographic regions across the globe. Millions of customers in over 240 countries...


  • Shanghai, China Amazon Connect Technology Services (Beijing) Co., Ltd. Full time

    Amazon Web Services, an Amazon.com Company, has been the world’s leading cloud provider for more than 17 years with the most mature, comprehensive, and broadly adopted cloud platform. We have over 200 fully featured cloud services, managed from 99 availability zones within 31 geographic regions across the globe. Millions of customers in over 240 countries...

  • Machine Operator

    3 weeks ago


    Shanghai, China Air Products Full time

    Machine Operator AS-CN-Shanghai Chemical Park Job Description and Qualifications Machine Operator Purpose Complete production task under Operating machine such as NC Lathe, wire-cutting machine. Etc. Nature and Scope Complete the task according to production requirement. PRINCIPAL ACCOUNTABILITIES Operate NC lathe machine and other...