NVIDIA

Senior Software Engineer, AI Networking

Reposted 4 Days Ago
In-Office
Seattle, WA, USA
152K-288K Annually
Senior level
The Senior Software Engineer will use machine learning techniques to optimize AI workloads on large-scale GPU and CPU clusters, focusing on resource allocation, performance analysis, and collaboration across hardware and software teams.
The summary above was generated by AI

NVIDIA seeks a senior software engineer to join the AI Networking co-design and benchmark R&D team. In this pivotal role, the candidate is responsible for building and productizing machine learning tools, including tools that use ML-based combinatorial optimization and design space exploration (DSE) techniques. These tools will be used to optimize AI workloads across large GPU and CPU clusters, ensuring efficient and productive utilization of system resources at data center scale. The role involves working on distributed Deep Learning, particularly within LLM training and inference stacks. A strong passion for collective communication and networking is desirable.

The candidate will interact with diverse hardware and platforms, such as Host Channel Adapters (HCAs), switches, CPUs, GPUs, and complete systems. The role also requires deep dives across the software stack, from LLM applications and machine learning frameworks down to communication and computing libraries. The candidate will develop ML-based tools and methodologies for comprehensive performance analysis and optimization, potentially incorporating learning-based agentic techniques. This position offers a distinct opportunity to support the core infrastructure powering the next generation of large-scale AI systems.

What you'll be doing:

  • Design and implement resource allocation and combinatorial optimization techniques (e.g., reinforcement learning, LLM agents for DSE, Bayesian optimization, and other multi-objective optimization techniques) to optimize LLMs at data center scale.

  • Research, develop, and deploy AI/ML techniques to optimize large-scale Deep Learning (LLM) training and inference on NVIDIA supercomputers and distributed systems. This includes a focus on high-performance networking and NVIDIA communication libraries.

  • Build and productionize ML-based tools for performance prediction and optimization, with a strong emphasis on networking aspects.

  • Develop and deploy a scalable, reliable data curation pipeline capable of handling complex data types, such as time series and PyTorch model graphs, to effectively support the training of high-performance Machine Learning models.

  • Collaborate across hardware and software teams to deliver valuable performance analysis insights.

  • Lead performance test planning, establish performance targets for new technologies and solutions, and drive efforts to achieve those performance goals.

What we need to see:

  • PhD or Master's degree in Computer Science, Software Engineering, or equivalent experience.

  • 4+ years of experience applying machine learning techniques to computer architecture and system optimization problems. Desired experience involves applying ML at the intersection of at least two of the following areas: HPC, networking, and AI applications.

  • Hands-on experience developing and deploying various learning algorithms (e.g., reinforcement learning, offline RL, supervised learning) to tackle optimization challenges within computer architecture, system design, or networking domains.

  • Proficiency in building and using ML models with leading frameworks such as PyTorch, TensorFlow, or JAX.

  • Proven ability to apply GNN- or transformer-based optimization to PyTorch model graphs and Kineto execution traces.

  • Expertise combining knowledge of NVIDIA GPUs, CUDA, and deep learning frameworks (TensorFlow/PyTorch) with networking concepts, including collective communication libraries (like NCCL) and protocols (such as RoCE and RDMA).

  • Strong programming capabilities in Python, Bash, and C++.

  • A collaborative teammate with effective communication and interpersonal abilities.

Ways to stand out from the crowd:

  • In-depth knowledge and experience with machine learning/reinforcement learning and frameworks.

  • Comprehensive understanding of computer architecture, system architecture and networking.

  • Extensive experience in applying machine learning techniques such as GNNs or related graph-based models.

  • Knowledge of PyTorch, CUDA, and NCCL.

  • Proven software engineering/development skills.

With competitive salaries and a comprehensive benefits package, NVIDIA is widely regarded as one of the most desirable technology employers in the world. Our teams are composed of some of the most forward-thinking and driven engineers in the industry, and we continue to grow rapidly. If you are a senior software engineer passionate about optimizing AI workloads at data center scale, we'd love to hear from you.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 152,000 USD - 241,500 USD for Level 3, and 184,000 USD - 287,500 USD for Level 4.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until May 18, 2026.

This posting is for an existing vacancy. 

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

HQ

NVIDIA Seattle, Washington, USA Office

4545 Roosevelt Way NE 6th Floor, Seattle, Washington, United States, 98105

What you need to know about the Seattle Tech Scene

Home to tech titans like Microsoft and Amazon, Seattle punches far above its weight in innovation. But its surrounding mountains, sprinkled with world-famous hiking trails and climbing routes, make the city a destination for outdoorsy types as well. Established as a logging town before shifting to shipbuilding and logistics, the Emerald City is now known for its contributions to aerospace, software, biotech and cloud computing. And its status as a thriving tech ecosystem is attracting out-of-town companies looking to establish new tech and engineering hubs.

Key Facts About Seattle Tech

  • Number of Tech Workers: 287,000; 13% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Amazon, Microsoft, Meta, Google
  • Key Industries: Artificial intelligence, cloud computing, software, biotechnology, game development
  • Funding Landscape: $3.1 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Madrona, Fuse, Tola, Maveron
  • Research Centers and Universities: University of Washington, Seattle University, Seattle Pacific University, Allen Institute for Brain Science, Bill & Melinda Gates Foundation, Seattle Children’s Research Institute
