NVIDIA Logo

NVIDIA

Senior Software Engineer, CUTLASS Performance

Posted 5 Days Ago
Be an Early Applicant
In-Office
Redmond, WA, USA
152K-288K Annually
Senior level
In-Office
Redmond, WA, USA
152K-288K Annually
Senior level
The role involves benchmarking deep learning models on NVIDIA GPUs, improving software performance, and developing automation tools for performance optimization.
The summary above was generated by AI

NVIDIA's high-performance computing platforms are powering the AI revolution across many applications and industries. Within our software stack, CUTLASS stands out as a popular open-source ecosystem dedicated to high-performance linear algebra and Tensor Core primitives. Since 2017, it has provided the community with C++ and Python abstractions to implement custom matrix multiply (GEMM) and related math and deep learning computations on NVIDIA GPUs.

If you are enthusiastic about performance and eager to help bridge the gap between current performance and what’s theoretically possible, apply to join the CUTLASS team today!

What you'll be doing:

  • Benchmark the performance of state-of-the-art deep learning models’ inference and training passes to identify key GPU kernel and fusion opportunities.

  • Identify gaps between theoretical and realized performance, and suggest software improvements or model adjustments to resolve them.

  • Develop tooling to automate the benchmarking, analysis, and performance optimization loop to push the limit of CUTLASS kernel performance within DL networks.

  • Be the authoritative resource on kernel performance in the team and engage with teams across NVIDIA including GPU architecture, DL frameworks, and QA as the performance representative for the CUTLASS team.

What we need to see:

  • Masters or PhD degree in Computer Science, Computer Engineering, or related field (or equivalent experience).

  • 3+ years of relevant industry experience.

  • Strong programming skills in Python and C++.

  • Experience in software performance analysis and optimization.

  • Deep understanding of computer architecture and familiarity with GPUs or similar parallel processing architectures.

Ways to stand out from the crowd:

  • Deep understanding of state-of-the art DL model architectures.  

  • Hands-on experience with performance benchmarking of DL frameworks like PyTorch, JAX, SGLang, vLLM, TRT-LLM, or others.

  • Experience in developing performance models and performance regression systems.

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hard working people in the world working for us. If you're creative, autonomous, and love a challenge, consider joining our Deep Learning Library team and help us build the real-time, cost-effective computing platform driving our success in this exciting and quickly growing field.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 152,000 USD - 241,500 USD for Level 3, and 184,000 USD - 287,500 USD for Level 4.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until June 5, 2026.

This posting is for an existing vacancy. 

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering an inclusive work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

HQ

NVIDIA Seattle, Washington, USA Office

4545 Roosevelt Way NE 6th Floor, Seattle, Washington, United States, 98105

Similar Jobs

An Hour Ago
Remote or Hybrid
United States
67K-101K Annually
Junior
67K-101K Annually
Junior
Artificial Intelligence • Automotive • Greentech • Information Technology • Machine Learning • Software • Cybersecurity
Provide tactical HR support for Manheim Shared Services including employee relations, program implementation, talent and workforce initiatives, data analysis and reporting, and continuous improvement. Advise managers on policies, coordinate HR program logistics, conduct exit interviews, and partner with HRBPs and COEs to improve employee experience and organizational effectiveness. Up to 25% travel; US remote.
Top Skills: Excel
An Hour Ago
Remote or Hybrid
United States
92K-154K Annually
Senior level
92K-154K Annually
Senior level
Artificial Intelligence • Automotive • Greentech • Information Technology • Machine Learning • Software • Cybersecurity
The Customer Success Manager is responsible for driving customer outcomes across a portfolio, managing relationships and retention, and collaborating across various teams to ensure value realization and maximize customer satisfaction.
Top Skills: AICloudCustomer SuccessManaged ServicesSaaS
An Hour Ago
Hybrid
67K-101K Annually
Junior
67K-101K Annually
Junior
Artificial Intelligence • Automotive • Greentech • Information Technology • Machine Learning • Software • Cybersecurity
Provide onsite HR support for the Manheim Dallas Market Center, advising managers on HR policies and employee relations, implementing people programs, supporting talent and workforce initiatives, conducting exit interviews, analyzing HR data to inform leaders, and partnering with COEs to improve HR service delivery and processes.
Top Skills: Excel

What you need to know about the Seattle Tech Scene

Home to tech titans like Microsoft and Amazon, Seattle punches far above its weight in innovation. But its surrounding mountains, sprinkled with world-famous hiking trails and climbing routes, make the city a destination for outdoorsy types as well. Established as a logging town before shifting to shipbuilding and logistics, the Emerald City is now known for its contributions to aerospace, software, biotech and cloud computing. And its status as a thriving tech ecosystem is attracting out-of-town companies looking to establish new tech and engineering hubs.

Key Facts About Seattle Tech

  • Number of Tech Workers: 287,000; 13% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Amazon, Microsoft, Meta, Google
  • Key Industries: Artificial intelligence, cloud computing, software, biotechnology, game development
  • Funding Landscape: $3.1 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Madrona, Fuse, Tola, Maveron
  • Research Centers and Universities: University of Washington, Seattle University, Seattle Pacific University, Allen Institute for Brain Science, Bill & Melinda Gates Foundation, Seattle Children’s Research Institute

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account