Deep Learning Solutions Architect

24 hours ago


Shanghai, Shanghai, China NVIDIA Full time
About the Role

NVIDIA is seeking a highly skilled Solutions Architect to join our team and work with our largest global alliance partners to enable them with our portfolio of GPU Accelerated Computing solutions, including Machine Learning and Deep Learning. As a Solutions Architect, you will be responsible for architecting and creating prototypes, building, pre-training, fine-tuning, and p-tuning LLMs, and finding the best deployment scenarios for customers.

Key Responsibilities
  • Creating or running Proof of Concept and Demos that require presentation skills, explanation of complex topics, writing Python code to execute data pipelines and train ML/DL models, and deploy on container-based orchestrators.
  • Keeping up to date on the state of the art in the production Deep Learning and Machine Learning ecosystem and helping to architect and scale high-performance, distributed AI deployments built on the latest NVIDIA GPU supercomputers.
  • Documenting knowledge and guiding others through building targeted training for partners and other Solutions Architects, writing whitepapers, blogs, and wiki articles, and working through challenging problems with a partner on a whiteboard.
  • Answering questions and providing mentorship, working with Partner Business Managers to assist partners and customers on their mission-critical projects, and helping them build their GPU-enabled Accelerated Compute datacenters and get the most out of their investment.
  • Using conferencing tools and some travel is required for this role, and you are empowered to find the best way to get your job done and do what it takes to make our partners successful.
Requirements
  • BS or MS in Engineering, Mathematics, Physics, or Computer Science (or equivalent experience).
  • 5+ years of work-related experience in Deep Learning and Machine Learning, including deep learning frameworks TensorFlow or PyTorch, GPU and CUDA experience extremely helpful.
  • Experience working with DevOps, including but not limited to Docker/Containers, Kubernetes, and Data Center deployments.
  • Deep understanding of dense datacenter design including compute, storage, and networking.
  • Ability to multitask effectively in a dynamic environment.
  • Strong analytical and problem-solving skills.
  • Clear written and oral communication skills with the ability to effectively collaborate with management and engineering.
  • Strong desire to share knowledge with clients, partners, and co-workers.
Preferred Qualifications
  • Extensive knowledge and hands-on experience with recent advancements in LLMs and GenAI.
  • Experience developing with ML/DL frameworks and MLOps ecosystem of partners and solutions in the cloud and on-prem.
  • Background with cloud-based solution designing, APIs, and Microservices, orchestration platforms, storage solutions, and data migration techniques.
  • Experience or good knowledge of server architectures, PCIe topologies, Infiniband, and other networking technologies and Operating systems.
  • Willingness and ability to dig into unfamiliar territories to tackle complex problems and be a great listener.
About NVIDIA

NVIDIA is widely considered to be one of the technology world's most desirable employers. We have some of the most forward-thinking and hardworking individuals in the world working for us. If you're creative and autonomous, we want to hear from you.



  • Shanghai, Shanghai, China NVIDIA Full time

    About the RoleNVIDIA is seeking a highly skilled Solutions Architect to collaborate with our largest global alliance partners and enable them with our portfolio of GPU Accelerated Computing solutions, specifically Generative AI and Machine Learning.This individual will be a technical data science and accelerated computing platform advisor, responsible for...


  • Shanghai, Shanghai, China NVIDIA Full time

    NVIDIA is seeking a highly skilled Solutions Architect to collaborate with our largest global alliance partners to enable them with our portfolio of GPU Accelerated Computing solutions, specifically Generative AI and Machine Learning. This individual will work in a fast-evolving technological environment, staying on the cutting edge of AI and Accelerated...


  • Shanghai, Shanghai, China NVIDIA Full time

    Unlock the Power of Deep Learning with NVIDIAWe are seeking a talented Deep Learning Performance Architect to join our team at NVIDIA. As a leader in the field of deep learning, we are expanding our research and development for inference and seeking excellent software engineers and senior software engineers to collaborate with our team.About the Role:Develop...


  • Shanghai, Shanghai, China NVIDIA Full time

    Deep Learning Performance Architect InternNVIDIA is pushing the boundaries of deep learning performance by developing innovative processor and system architectures. We are seeking a talented deep learning system performance architect to contribute to our AI performance projection and analysis efforts.Key Responsibilities:Analyze state-of-the-art AI models on...


  • Shanghai, Shanghai, China NVIDIA Full time

    NVIDIA is seeking a highly skilled deep learning system performance architect to join our AI performance projection and analysis efforts. As a key member of our team, you will have the opportunity to work on performance projection, analysis, and optimization on state-of-the-art hardware architectures for various AI workloads.Key Responsibilities:Analyze and...


  • Shanghai, Shanghai, China NVIDIA Full time

    Deep Learning Performance Software EngineerWe are expanding our research and development for Inference at NVIDIA, and we seek excellent Software Engineers and Senior Software Engineers to collaborate with our team.Key Responsibilities:Develop highly optimized deep learning kernels for inferencePerform performance optimization, analysis, and tuningWork with...


  • Shanghai, Shanghai, China NVIDIA Full time

    NVIDIA has continuously reinvented itself over two decades, driving innovation in the tech industry. Our invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI - the next era of computing.Key Responsibilities:Design...


  • Shanghai, Shanghai, China Optiver Full time

    About Us:Optiver is a global market maker with a presence in multiple continents. Founded in 1986, we have grown to become a leading liquidity provider with a team of over 2,000 employees worldwide. Our mission is to improve the market through competitive pricing, execution, and risk management.Our Shanghai Office:Established in 2012, our Shanghai office is...


  • Shanghai, Shanghai, China NVIDIA Full time

    Job Title: Senior Technical Program Manager, Deep Learning SoftwareWe are seeking a highly skilled and experienced Senior Technical Program Manager to lead our deep learning software development efforts. As a key member of our team, you will be responsible for planning, executing, and delivering large-scale programs that enable NVIDIA's customers to employ...


  • Shanghai, Shanghai, China NVIDIA Full time

    Deep Learning Inference Software Internship OpportunityWe are seeking a highly skilled and motivated individual to join our team as a Deep Learning Inference Software Intern. As a member of our research and software development team, you will be responsible for developing and optimizing deep learning inference software for NVIDIA GPUs.Key...


  • Shanghai, Shanghai, China NVIDIA Full time

    Job SummaryWe are seeking a highly skilled Technical Program Manager to lead our Deep Learning Software initiatives. As a key member of our team, you will be responsible for planning and executing large-scale programs to develop and publish software for training and inference applications using various types of neural networks.Key ResponsibilitiesDefine and...


  • Shanghai, Shanghai, China Amazon Connect Technology Services (Beijing) Co., Ltd. - C46 Full time

    About the RoleWe are seeking a highly skilled Cloud Solutions Architect to join our team at Amazon Connect Technology Services (Beijing) Co., Ltd. - C46. As a Cloud Solutions Architect, you will play a key role in helping our customers build and deploy cloud-based solutions on the Amazon Web Services (AWS) platform.Key ResponsibilitiesDesign and implement...


  • Shanghai, Shanghai, China Amazon Connect Technology Services (Beijing) Co., Ltd. - C46 Full time

    About the RoleWe are seeking a highly skilled Cloud Solutions Architect to join our team at Amazon Connect Technology Services (Beijing) Co., Ltd. - C46. As a Cloud Solutions Architect, you will play a key role in shaping and delivering our cloud strategy, working closely with customers to understand their needs and develop tailored solutions to meet their...


  • Shanghai, Shanghai, China Microsoft Full time

    About the RoleWe are seeking a highly motivated and passionate Cloud Solution Architect to drive high-priority customer initiatives on the Microsoft Azure Platform in collaboration with customers and the Microsoft field in Enterprise accounts segment of our business.Key ResponsibilitiesUnderstand customers' overall data estate, IT and business priorities and...


  • Shanghai, Shanghai, China Microsoft Full time

    Job Title: Cloud Solution Architect - Data & AIWe are seeking a highly motivated and passionate Cloud Solution Architect to drive high-priority customer initiatives on the Microsoft Azure Platform in collaboration with customers and the Microsoft field in Enterprise accounts segment of our business.Key Responsibilities:Design and implement data platform and...


  • Shanghai, Shanghai, China NVIDIA Full time

    Job Opportunity: Deep Learning Inference Software Engineer InternWe are seeking a highly skilled Deep Learning Inference Software Engineer Intern to join our team at NVIDIA. As a leading technology company, we are committed to advancing the field of deep learning and developing innovative software solutions.Key Responsibilities:Develop highly optimized deep...


  • Shanghai, Shanghai, China NVIDIA Full time

    About the RoleWe are seeking a highly skilled and experienced Technical Program Manager to lead our Deep Learning Software development efforts. As a key member of our team, you will be responsible for planning, executing, and delivering large-scale software programs that enable our customers to employ industry-leading AI and ML in their products.Key...


  • Shanghai, Shanghai, China Microsoft Full time

    Job Title: Cloud Solution Architect - Data and AI ExpertWe are seeking a highly motivated and passionate Cloud Solution Architect to drive high-priority customer initiatives on the Microsoft Azure Platform in collaboration with customers and the Microsoft field in Enterprise accounts segment of our business.Key Responsibilities:Design and implement data...


  • Shanghai, Shanghai, China NVIDIA Full time

    About NVIDIANVIDIA is a leading company in the field of artificial intelligence computing. Our employees are passionate about AI, high-performance computing, visualization, and gaming. Our Solution Architecture team focuses on bringing NVIDIA's new technology to various industries, helping to design the architecture of AI computing platforms, and analyzing...


  • Shanghai, Shanghai, China Tencent Full time

    About the RoleTencent is seeking a highly skilled Cloud Solutions Architect to join our team. As a Cloud Solutions Architect, you will be responsible for providing technical consulting services to our clients in the internet industry, identifying their cloud computing needs, and designing cloud architecture solutions that meet their business requirements.Key...