Machine Learning Software Platform Architect

4 months ago


Shanghai, China NVIDIA Full time

Widely considered to be one of the technology world’s most desirable employers, NVIDIA is an industry leader with groundbreaking developments in High-Performance Computing, Artificial Intelligence and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery and powers what were once science fiction inventions from artificial intelligence to autonomous cars.

NVIDIA is seeking a highly skilled and experienced Large Language Model (LLM) based Application Infrastructure engineer to join our growing team. The successful candidate will work at the intersection of GPU chip design and AI, you will be responsible for the design, development, and maintenance of the infrastructure around NVIDIA's internal large language model aimed at facilitating chip design.

What you'll be doing:

  • Develop and maintain the infrastructure for managing large language models (LLMs) based application specifically adapted for the chip design and hardware domain.

  • Develop and maintain LLM based applications to serve hardware engineers, such as LLM based QA bot, code generator etc.

  • Collaborate with HW chip designers and LLM research teams to understand the specific needs and challenges of GPU design and ensure the LLM infrastructure is well-suited to these needs.

  • Collaborate with LLM research teams to collect & organize training / fine-tuning data to train hardware specific language model

  • Optimize the infrastructure for performance, scalability, and reliability, and ensure the secure and efficient management of data.

  • Stay updated with the latest industry trends in AI and machine learning, and continuously look for opportunities to apply these advancements to improve the LLM infrastructure.

What we need to see:

  • 5+ years work experience in developing and maintaining AI or machine learning infrastructure, preferably in the context of large language models.

  • BS in computer science or related or equivalent experience

  • Strong proficiency in Python and web development, and familiarity with LLM related techniques e.g., langchain, vector database, prompt engineering, etc.

  • Understanding of chip design and related computational and data challenges.

  • Experience with data management, including doc cleaning, transformation, and secure storage.

  • Excellent problem-solving skills and the ability to work effectively in a team.

  • In depth understanding of Machine Learning / Deep Learning / NLP concepts.

Ways to stand out from the crowd:

  • You crafted & developed production quality microservices

  • Strong technical background in cloud/distributed infrastructure

  • An excellent plus if you are familiar with front-end development using React or Vue.js

  • Strong understanding of SQL & NoSQL Data platforms.

NVIDIA offers highly competitive salaries and a comprehensive benefits package. We have some of the most forward-thinking and hardworking people in the world working for us and, due to outstanding growth, our exclusive engineering teams are rapidly growing. Are you a creative and passionate about applying Machine Learning to solve remarkably interesting problems? Are you interested in being involved in state-of-the-art development in the field of AI & love a challenge? If so, we want to hear from you



  • Shanghai, Shanghai, China NVIDIA Full time

    About NVIDIANVIDIA is a leading technology company that has made groundbreaking developments in High-Performance Computing, Artificial Intelligence, and Visualization. Our work enables amazing creativity and discovery, and powers what were once science fiction inventions from artificial intelligence to autonomous cars.Job SummaryWe are seeking a highly...


  • Shanghai, Shanghai, China NVIDIA Full time

    About NVIDIANVIDIA is a leading technology company that has revolutionized the industry with groundbreaking developments in High-Performance Computing, Artificial Intelligence, and Visualization. Our GPU, the visual cortex of modern computers, serves as the heart of our products and services, enabling amazing creativity and discovery and powering what were...

  • Cloud Data Architect

    3 weeks ago


    Shanghai, Shanghai, China Amazon Information Service (Beijing) Co., Ltd. (Shanghai Branch) Full time

    About the RoleWe are seeking a highly skilled Cloud Computing Architect to join our team at Amazon Information Service (Beijing) Co., Ltd. (Shanghai Branch). As a Cloud Computing Architect, you will be responsible for designing and implementing cloud-based solutions for our customers, with a focus on data analytics and machine learning.Key...


  • Shanghai, Shanghai, China Mercedes-Benz Full time

    Job Title: Platform Software ArchitectWe are seeking a highly skilled Platform Software Architect to join our team at Mercedes-Benz Group China Ltd. in Shanghai, China. As a key member of our Advanced Design Center, you will be responsible for designing and implementing platform architecture for autonomous driving products.Key Responsibilities:Design and...


  • Shanghai, China Optiver Full time

    WHO WE ARE: Optiver is a global market maker with offices in Amsterdam, London, Chicago, Austin, Sydney, Shanghai, Hong Kong, Singapore and Taipei. Founded in 1986, today we are a leading liquidity provider, with close to 2,000 employees in offices around the world, united in our commitment to improve the market through competitive pricing, execution and...


  • Shanghai, Shanghai, China NVIDIA Full time

    About the RoleNVIDIA is seeking a highly skilled Solutions Architect to collaborate with our largest global alliance partners and enable them with our portfolio of GPU Accelerated Computing solutions, specifically Generative AI and Machine Learning.This individual will be a technical data science and accelerated computing platform advisor, responsible for...


  • Shanghai, Shanghai, China NVIDIA Full time

    NVIDIA is seeking a highly skilled Solutions Architect to collaborate with our largest global alliance partners to enable them with our portfolio of GPU Accelerated Computing solutions, specifically Generative AI and Machine Learning. This individual will work in a fast-evolving technological environment, staying on the cutting edge of AI and Accelerated...


  • Shanghai, China NVIDIA Full time

    NVIDIA is looking for a Solutions Architect or Data Scientist to work with our largest global alliance partners to enable them with our portfolio of GPU Accelerated Computing solutions i.e. Machine Learning and Deep Learning, specifically Generative AI and to build comprehensive multi-cloud, hybrid and on-prem solutions.We need a passionate, hard-working,...


  • Shanghai, Shanghai, China Optiver Full time

    About UsOptiver is a leading global market maker with a presence in multiple continents. Founded in 1986, we have grown to become a prominent liquidity provider, with a team of over 2,000 employees worldwide. Our mission is to improve the market through competitive pricing, execution, and risk management.Our Shanghai OfficeSince its establishment in 2012,...


  • Shanghai, Shanghai, China Mercedes-Benz Full time

    About the RoleWe are seeking a highly skilled Android Software Architect to join our team at Mercedes-Benz Group China Ltd. in Shanghai, CN.Key ResponsibilitiesParticipate in the evaluation of products, planning, and discussion of functions, as well as implementation and release according to project needs;Review the development process, estimate personnel...


  • Shanghai, China NVIDIA Full time

    Do you love writing fast code and crafting software systems to solve complex problems? We are looking for hardworking software engineers to help design, build, and ship cuDNN: our GPU-accelerated library of primitives for deep neural networks. Intelligent machines powered by AI computers that can learn, reason, and interact with people are no longer science...


  • Shanghai, Shanghai, China NVIDIA Full time

    NVIDIA is seeking a highly skilled deep learning system performance architect to join our AI performance projection and analysis efforts. As a key member of our team, you will have the opportunity to work on performance projection, analysis, and optimization on state-of-the-art hardware architectures for various AI workloads.Key Responsibilities:Analyze and...


  • Shanghai, Shanghai, China Mercedes-Benz Full time

    Job DescriptionWe are seeking a highly skilled Android Software Architect to join our team at Mercedes-Benz Group China Ltd. in Shanghai, CN.Key Responsibilities:Participate in the evaluation of products, planning and discussion of functions, as well as implementation and release according to project needs;Participate in reviewing the development process,...


  • Shanghai, Shanghai, China Mercedes-Benz Full time

    Job Title: QNX SW ArchitectWe are seeking a highly skilled QNX SW Architect to join our team at Mercedes-Benz Group China Ltd.Job SummaryThe successful candidate will be responsible for designing and developing platform software for various vehicle products, including BSP, QNX/Linux, and middleware.Key ResponsibilitiesDesign and develop platform software for...

  • Software Architect

    4 months ago


    Shanghai, China Bose Full time

    You know the moment. It’s the first notes of that song you love, the intro to your favorite movie, or simply the sound of someone you love saying “hello.” It’s in these moments that sound matters most.At Bose, we believe sound is the most powerful force on earth. We’ve dedicated ourselves to improving it for nearly 60 years. And we’re passionate...


  • Shanghai, Shanghai, China NVIDIA Full time

    Role OverviewAs a Senior Solutions Architect for the Omniverse Platform, you will play a pivotal role in shaping the future of our technology solutions. Your expertise will be crucial in driving innovative approaches and ensuring the successful implementation of our platform.Key ResponsibilitiesTechnical Proficiency: Demonstrate extensive experience with...

  • Software Architect

    1 week ago


    Shanghai, Shanghai, China Bose Full time

    Job DescriptionWe are seeking a highly skilled Software Architect to join our Automotive Software Group. As a key member of our engineering team, you will be responsible for designing and developing high-performance, integrated software-hardware systems that deliver amazing experiences for our customers.Key ResponsibilitiesCollaborate with customers and...


  • Shanghai, China NVIDIA Full time

    The Autonomous Vehicles Platform team is searching for engineers to develop and bring NVIDIA's automotive platform out to the world. You will participate in a focused effort to develop and productize ground-breaking solutions that will redefine the world of transportation and the growing field of self-driving cars. Work with hardworking and dedicated...


  • Shanghai, Shanghai, China Microsoft Full time

    About the RoleWe are seeking a highly motivated and passionate Cloud Solution Architect to drive high-priority customer initiatives on the Microsoft Azure Platform in collaboration with customers and the Microsoft field in Enterprise accounts segment of our business.Key ResponsibilitiesUnderstand customers' overall data estate, IT and business priorities and...


  • Shanghai, Shanghai, China NVIDIA Full time

    NVIDIA is on the lookout for experienced web software engineers to join our AI Infrastructure team. Our mission is to empower NVIDIA and our clients to efficiently scale machine learning workflows. This requires a fresh approach to organizing and managing data, tasks, and users. We are developing and refining human-in-the-loop processes that facilitate the...