Baseten

Senior Software Engineer - Infrastructure

Reposted 4 Days Ago
In-Office or Remote
3 Locations
200K-270K Annually
Senior level
Architect and lead the development of ML inference platforms, optimizing infrastructure and Kubernetes deployments, while mentoring junior engineers.
The summary above was generated by AI

ABOUT BASETEN

Baseten powers inference for the world's most dynamic AI companies, like OpenEvidence, Clay, Mirage, Gamma, Sourcegraph, Writer, Abridge, Bland, and Zed. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies operating at the frontier of AI to bring cutting-edge models into production. With our recent $150M Series D funding, backed by investors including BOND, IVP, Spark Capital, Greylock, and Conviction, we’re scaling our team to meet accelerating customer demand.

THE ROLE

As a Senior Infrastructure Software Engineer at Baseten, you'll architect and lead development of our ML inference platform that powers production AI applications. You'll make key technical decisions for the infrastructure enabling developers to deploy, scale, and monitor ML models with high performance and reliability.

EXAMPLE INITIATIVES

You'll get to work on these types of projects as part of our Infrastructure team:

  • Multi-cloud capacity management

  • Inference on B200 GPUs

  • Multi-node inference

  • Fractional H100 GPUs for efficient model serving

RESPONSIBILITIES

  • Design and architect scalable infrastructure systems for our ML inference platform

  • Lead optimization of Kubernetes deployments for efficient, cost-effective model serving

  • Drive enhancements to our inference orchestration layer for complex model deployments

  • Define monitoring strategies for model performance, latency, and resource utilization

  • Develop advanced solutions for GPU capacity management and throughput optimization

  • Establish infrastructure automation standards to streamline ML deployment workflows

  • Partner with other engineers to translate complex inference requirements into technical solutions

  • Make critical architectural decisions balancing performance with system reliability

  • Lead technical discussions and mentor junior engineers on infrastructure best practices

  • Contribute to long-term technical strategy and infrastructure roadmap

REQUIREMENTS

  • Bachelor's degree or higher in Computer Science or related field

  • 5+ years of experience building production infrastructure systems

  • Expert-level proficiency in Go, with Python experience a plus

  • Deep expertise with Kubernetes in production environments

  • Extensive experience with major cloud providers (AWS, GCP), with neo-cloud provider experience (Crusoe, DigitalOcean, Nebius) a plus

  • Advanced understanding of distributed systems concepts and performance tuning

  • Proven experience designing observability systems

  • Track record of leading technical initiatives and mentoring engineers

  • Experience with ML/AI workloads and MLOps platforms highly valued

BENEFITS

  • Competitive compensation, including meaningful equity

  • 100% coverage of medical, dental, and vision insurance for employees and dependents

  • Generous PTO policy, including a company-wide Winter Break (our offices are closed from Christmas Eve to New Year's Day!)

  • Paid parental leave

  • Company-facilitated 401(k)

  • Exposure to a variety of ML startups, offering unparalleled learning and networking opportunities

Apply now to embark on a rewarding journey in shaping the future of AI! If you are a motivated individual with a passion for machine learning and a desire to be part of a collaborative and forward-thinking team, we would love to hear from you.

At Baseten, we are committed to fostering a diverse and inclusive workplace. We provide equal employment opportunities to all employees and applicants without regard to race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, or veteran status.

Top Skills

AWS
Distributed Systems
GCP
Go
Kubernetes
Python
