Deep Learning Solutions Architect

3 weeks ago


Shanghai, Shanghai, China NVIDIA Full time

NVIDIA is seeking a highly skilled Solutions Architect to collaborate with our largest global alliance partners in leveraging our portfolio of GPU Accelerated Computing solutions, including Machine Learning and Deep Learning, specifically Generative AI. This individual will be responsible for architecting and creating prototypes, building, pre-training, fine-tuning, and p-tuning LLMs, and finding the best deployment scenarios for customers.

Key Responsibilities:

  • Designing and implementing Proof of Concept and Demos that require presentation skills, explanation of complex topics, writing Python code to execute data pipelines and train ML/DL models, and deploying on container-based orchestrators.
  • Staying up-to-date on the state of the art in the production Deep Learning and Machine Learning ecosystem and helping to architect and scale high-performance, distributed AI deployments built on the latest NVIDIA GPU supercomputers.
  • Documenting knowledge and guiding others through building targeted training for partners and other Solutions Architects, writing whitepapers, blogs, and wiki articles, and working through challenging problems with a partner on a whiteboard.
  • Providing mentorship and answering questions, working with Partner Business Managers to assist partners and customers on their mission-critical projects, and helping them build their GPU-enabled Accelerated Compute datacenters and get the most out of their investment.
  • Utilizing conferencing tools and requiring some travel for this role, with the freedom to find the best way to get the job done and make our partners successful.

Requirements:

  • BS or MS in Engineering, Mathematics, Physics, or Computer Science (or equivalent experience).
  • 5+ years of work-related experience in Deep Learning and Machine Learning, including deep learning frameworks TensorFlow or PyTorch, GPU and CUDA experience extremely helpful.
  • Experience working with DevOps, including but not limited to Docker/Containers, Kubernetes, and Data Center deployments.
  • Deep understanding of dense datacenter design including compute, storage, and networking.
  • Ability to multitask effectively in a dynamic environment.
  • Strong analytical and problem-solving skills.
  • Clear written and oral communication skills with the ability to effectively collaborate with management and engineering.
  • Strong desire to share knowledge with clients, partners, and co-workers.

Preferred Qualifications:

  • Extensive knowledge and hands-on experience with recent advancements in LLMs and GenAI.
  • Experience developing with ML/DL frameworks and MLOps ecosystem of partners and solutions in the cloud and on-prem.
  • Background with cloud-based solution designing, APIs, and Microservices, orchestration platforms, storage solutions, and data migration techniques.
  • Experience or good knowledge of server architectures, PCIe topologies, Infiniband, and other networking technologies and Operating systems.
  • Willingness and ability to dig into unfamiliar territories to tackle complex problems and be a great listener.

NVIDIA is a leader in the technology world, and we have some of the most forward-thinking and hardworking individuals in the world working for us. If you're creative and autonomous, we want to hear from you.



  • Shanghai, Shanghai, China NVIDIA Full time

    NVIDIA is continuously driving innovation in the field of deep learning. As a Senior NVIDIA Deep Learning Architect, you will play a crucial role in shaping the future of AI computing.Key Responsibilities:Design and develop next-generation NVDLA architectureWork on deep-learning algorithms and software developmentDevelop function/performance/power models for...


  • Shanghai, Shanghai, China NVIDIA Full time

    Job DescriptionNVIDIA is seeking an experienced Solutions Architect or Data Scientist to collaborate with our largest global alliance partners. This role requires a passion for working with cutting-edge AI and Accelerated Computing technologies, specifically Generative AI.The successful candidate will be a technical advisor to our partners, architecting and...


  • Shanghai, Shanghai, China NVIDIA Full time

    NVIDIA is seeking a highly skilled expert in deep learning system performance to join our AI performance projection and analysis efforts.Key Responsibilities:Analyze state-of-the-art AI models on various GPU hardware platforms.Identify performance bottlenecks and propose optimizations.Perform deep learning workload analysis.Requirements:BS, MS or PhD in...


  • Shanghai, Shanghai, China NVIDIA Full time

    NVIDIA is driving innovation in processor and system architectures that accelerate various deep learning applications. As a leader in AI technology, we are seeking an expert deep learning system performance architect to join our AI performance projection and analysis efforts. This role presents a unique opportunity to work on performance projection,...


  • Shanghai, Shanghai, China Goodyear Full time

    Job SummaryWe are seeking a highly skilled AI Solutions Architect to join our team at Goodyear. As an AI Solutions Architect, you will play a pivotal role in designing and implementing AI solutions that optimize operations, improve tire quality, and propel us ahead of the competition.Key ResponsibilitiesIdentify strategic opportunities for AI implementation...

  • Solutions Architect

    4 weeks ago


    Shanghai, Shanghai, China Amazon Information Service (Beijing) Co., Ltd. (Shanghai Branch) Full time

    About the RoleWe are seeking a highly skilled Solutions Architect to join our team at Amazon Web Services (AWS). As a Solutions Architect, you will play a critical role in helping our customers succeed in building applications and services on the AWS platform.Key ResponsibilitiesOwn the technical engagement and ultimate success around specific implementation...


  • Shanghai, Shanghai, China NVIDIA Full time

    We are currently seeking a skilled Deep Learning Software Engineer to join our team based in Shanghai. This is a highly competitive role that offers a salary range of $120,000 - $180,000 per year.NVIDIA's history dates back to 1999 when we invented the GPU, which revolutionized computer graphics and parallel computing. Today, our GPUs power modern AI...


  • Shanghai, Shanghai, China Optiver Full time

    About Optiver:We are a global market maker with offices worldwide, dedicated to improving the market through competitive pricing, execution, and risk management. Our commitment is to provide liquidity on multiple exchanges across the globe in various financial instruments.Our Shanghai office has been rapidly growing since its establishment in 2012, trading...


  • Shanghai, Shanghai, China Optiver Full time

    WHO WE ARE:At Optiver, we're a global market maker with a presence in multiple continents. Founded in 1986, we've grown to become a leading liquidity provider with a flat organizational structure, empowering our employees to make a significant impact.We provide liquidity to financial markets using our own capital, taking calculated risks to ensure efficient...


  • Shanghai, Shanghai, China NVIDIA Full time

    We are seeking a highly skilled Deep Learning Optimization Engineer to join our team at NVIDIA.About the Role:This is an exciting opportunity for a talented software engineer to develop deeply optimized deep learning kernels for inference. You will be responsible for analyzing and modeling performance to identify areas of improvement in our software stack...


  • Shanghai, Shanghai, China Optiver Full time

    About Us:Optiver is a global market maker with a presence in multiple continents. Our company was founded in 1986 and has since grown to become a leading liquidity provider with a diverse range of products, including listed derivatives, cash equities, ETFs, bonds, and foreign currencies.Our Vision:We aim to become the trusted partner in the development of...


  • Shanghai, Shanghai, China Microsoft Full time

    About the RoleWe're seeking a highly motivated and passionate Cloud Solution Architect to drive high-priority customer initiatives on the Microsoft Azure Platform. As a key member of our team, you'll collaborate with customers and the Microsoft field to deliver innovative Data Platform and Advanced Analytics/Artificial Intelligence solutions.Key...


  • Shanghai, Shanghai, China Optiver Full time

    Optiver is a global market maker with offices worldwide, seeking an exceptional machine learning engineer with a PhD degree to join the China research platform team.The ideal candidate will have an advanced understanding of neural networks and related machine learning technologies, with experience in implementing and training complex deep learning models.As...


  • Shanghai, Shanghai, China Microsoft Full time

    Job SummaryWe are seeking a highly motivated and passionate Cloud Solution Architect to drive high-priority customer initiatives on the Microsoft Azure Platform. This is a customer-facing role, owning the technical relationship between the customer and Microsoft Data, Advanced Analytics, and Artificial Intelligence Platform.Key ResponsibilitiesUnderstand...


  • Shanghai, Shanghai, China NVIDIA Full time

    We are seeking an ambitious and innately curious individual to be our Sr. Technical Program Manager for Deep Learning Software. You will collaborate with engineering and product leaders on planning and execution of large-scale programs to develop and publish software for training and inference applications using various types of neural networks.Key...


  • Shanghai, Shanghai, China NVIDIA Full time

    We are seeking an experienced Robotics Solutions Architect to lead our technical engagements within the Industrial/Manufacturing community. As a key member of our team, you will be responsible for evangelizing NVIDIA technologies and accelerating their adoption.The ideal candidate will have strong technical competence and leadership skills to function...


  • Shanghai, Shanghai, China NVIDIA Full time

    We are seeking a talented Software Engineer to join our team in developing GPU-accelerated Deep Learning software.NVIDIA is a leader in the field of Deep Learning, and we are rapidly growing our research and software development for Inference. As a member of our team, you will be responsible for developing deeply optimized deep learning kernels for...


  • Shanghai, Shanghai, China NVIDIA Full time

    We are seeking a talented Software Engineer to join our team at NVIDIA We are rapidly growing our research and software development for Inference. Our team specializes in developing GPU-accelerated Deep Learning software. Researchers around the world are using NVIDIA GPUs to power a revolution in deep learning, enabling breakthroughs in numerous areas....


  • Shanghai, Shanghai, China NVIDIA Full time

    We are seeking an exceptional Software Engineer to join our team at NVIDIA, working on our GPU-accelerated library of primitives for deep neural networks. The ideal candidate will have a strong background in software development, particularly in C/C++ and CUDA development, and experience with linear algebra, machine learning, and computer...


  • Shanghai, Shanghai, China NVIDIA Full time

    We are seeking a highly skilled Technical Program Manager to lead the development and publication of software for training and inference applications using various types of neural networks. This role will involve working closely with engineering and product leaders to plan and execute large-scale programs, driving the development process and coordinating...