NVIDIA Logo

NVIDIA

Senior Software Engineer, Deep Learning Inference, TensorRT

Sorry, this job was removed Sorry, this job was removed at 08:11 p.m. (PST) on Friday, May 30, 2025
Be an Early Applicant
Santa Clara, CA
Santa Clara, CA

Similar Jobs

An Hour Ago
Remote
Hybrid
8 Locations
185K-327K Annually
Senior level
185K-327K Annually
Senior level
eCommerce • Fintech • Hardware • Payments • Software • Financial Services
The Enterprise Account Executive is responsible for selling the Square platform to enterprise clients, leading complex deal negotiations, and representing the company at industry events. They must build strong relationships with C-level executives, oversee the sales cycle, and collaborate across various internal teams.
Top Skills: SaaS
An Hour Ago
Remote
Hybrid
8 Locations
161K-284K Annually
Senior level
161K-284K Annually
Senior level
eCommerce • Fintech • Hardware • Payments • Software • Financial Services
The Senior Machine Learning Engineer will develop end-to-end ML solutions for risk management, collaborating with various teams to enhance effectiveness and efficiency.
Top Skills: AirflowAWSGCPGcp Vertex AiMlflowPrefectPysparkPyTorchSagemaker
An Hour Ago
Remote
Hybrid
San Francisco, CA, USA
123K-223K Annually
Senior level
123K-223K Annually
Senior level
eCommerce • Fintech • Hardware • Payments • Software • Financial Services
Manage BPO programs for Risk Operations, ensuring production quality and efficiency through vendor relationships and collaboration. Define performance metrics and drive continuous improvement initiatives.
Top Skills: Operational ManagementRisk ManagementVendor Management

We are now looking for a Senior Software Engineer for Deep Learning Inference! Would you like to make a big impact in Deep Learning by helping build a state-of-the-art inference framework for accelerating Deep Learning models, especially Large Language Models, on NVIDIA GPUs? We are now welcoming exceptional software engineers to apply to Senior Engineering positions in the Deep Learning Inference TensorRT software team.

What you’ll be doing:

  • Develop components of TensorRT, NVIDIA’s SDK for high-performance deep learning inference.

  • Use C++ and Python to build graph parsers, optimizers, and tools for effective deployment of trained deep learning models.

  • Collaborate with teams of deep learning experts, GPU architects and DevOps engineers across diverse teams.

What we need to see:

  • A Bachelor's, Master's, PhD or equivalent experience in Computer Science, Computer Engineering, Electrical Engineering or related field.

  • 3+ years of software development experience.

  • Strong experience with C++11/C++14/C++17.

  • Strong grasp of Machine Learning concepts.

  • Experience and knowledge in Computer Architecture, Data Structures, Algorithms.

  • Excellent communication skills, and an aptitude for collaboration and teamwork.

Ways to stand out from the crowd:

  • Experience developing System Software.

  • Proficiency in Python as well as Background in GPU kernel programming using CUDA or OpenCL.

  • Experience in software performance benchmarking, profiling, and optimizations.

  • Background in compiler development

  • Experience in working with TensorRT, PyTorch, TensorFlow, ONNX Runtime or other ML frameworks.

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative, autonomous and love a challenge, we want to hear from you. Come, join our TensorRT Workflows team and help build the real-time, cost-effective computing platform driving our success in this exciting and quickly growing field.

#LI-Hybrid

The base salary range is 148,000 USD - 287,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

HQ

NVIDIA Seattle, Washington, USA Office

4545 Roosevelt Way NE 6th Floor, Seattle, Washington, United States, 98105

What you need to know about the Seattle Tech Scene

Home to tech titans like Microsoft and Amazon, Seattle punches far above its weight in innovation. But its surrounding mountains, sprinkled with world-famous hiking trails and climbing routes, make the city a destination for outdoorsy types as well. Established as a logging town before shifting to shipbuilding and logistics, the Emerald City is now known for its contributions to aerospace, software, biotech and cloud computing. And its status as a thriving tech ecosystem is attracting out-of-town companies looking to establish new tech and engineering hubs.

Key Facts About Seattle Tech

  • Number of Tech Workers: 287,000; 13% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Amazon, Microsoft, Meta, Google
  • Key Industries: Artificial intelligence, cloud computing, software, biotechnology, game development
  • Funding Landscape: $3.1 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Madrona, Fuse, Tola, Maveron
  • Research Centers and Universities: University of Washington, Seattle University, Seattle Pacific University, Allen Institute for Brain Science, Bill & Melinda Gates Foundation, Seattle Children’s Research Institute
By clicking Apply you agree to share your profile information with the hiring company.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account