VAST Data

Senior Solutions Engineer, AI Infrastructure

Posted 15 Days Ago
Remote or Hybrid
Hiring Remotely in United States
Senior level
The Senior Solutions Engineer will design and implement infrastructure for AI and HPC workloads, engage with customers, and lead technical discovery and architecture design.
The summary above was generated by AI
Description

We're looking for a deeply technical Solutions Architect to help customers design, evaluate, and deploy infrastructure for large-scale AI, HPC, analytics, and data-intensive workloads.

This is a customer-facing technical role for someone who has lived inside production infrastructure. You may have been a platform engineer, infrastructure engineer, SRE, MLOps engineer, AI infrastructure engineer, storage engineer, cloud engineer, or HPC systems engineer. What matters most is that you have built, operated, or architected real systems, and can bring that credibility into customer conversations.

Our customers are building infrastructure at serious scale: GPU clusters, high-performance storage systems, Kubernetes platforms, distributed training environments, inference platforms, data pipelines, lakehouses, and large enterprise systems. You'll help them reason about architectures involving 10,000+ GPUs, 100PB+ of storage, high-performance networking, distributed filesystems, orchestration layers, and demanding production workloads.

You'll own technical discovery, architecture design, PoC planning, competitive positioning, and customer technical strategy. You'll work from the first whiteboard session through evaluation, deployment planning, and production success. You'll also partner closely with product and engineering teams to bring field feedback into the roadmap.

We're looking for someone who can go deep technically, communicate clearly, operate without a rigid playbook, and translate complex infrastructure into customer outcomes.

Responsibilities

  • Lead technical discovery with customers across infrastructure, platform, ML, data, and executive stakeholders.
  • Design architectures for large-scale AI, HPC, analytics, and enterprise data workloads.
  • Help customers evaluate infrastructure involving GPUs, storage, networking, orchestration, and data movement.
  • Translate complex technical requirements into clear solution designs, reference architectures, and deployment guidance.
  • Debug customer issues across Linux, storage, networking, Kubernetes, schedulers, GPUs, and application workloads.
  • Build technical assets, runbooks, and field guidance for repeatable customer engagements.
  • Partner with product and engineering to communicate customer requirements, gaps, and roadmap opportunities.
  • Help customers move from architecture design to production deployment.

Requirements

  • 8 to 12+ years of technical experience, with significant hands-on infrastructure experience.
  • Experience building, operating, or architecting production platform infrastructure.
  • Strong understanding of Linux kernel internals, distributed systems (including consensus protocols such as Paxos and Raft), storage implementation details such as NAND flash behavior and write amplification, store-and-forward networking, load-balancing designs, and production operations.
  • Experience with one or more of: GPU infrastructure, large-scale HPC systems, Kubernetes platforms built from scratch, MLOps, storage systems, cloud infrastructure, data platforms, or large-scale enterprise infrastructure.
  • Ability to communicate credibly with engineers, architects, technical executives, and business stakeholders.
  • Strong discovery, problem-solving, and systems debugging skills.
  • Comfort operating in ambiguous, fast-moving environments.
  • Interest in customer-facing technical work, solution design, and business outcomes.

Preferred Experience

  • Experience with large-scale GPU clusters, distributed training, inference infrastructure, or AI platforms.
  • Experience with petabyte-scale storage or high-performance data systems.
  • Experience with Kubernetes, Slurm, Ray, Spark, or other orchestration / scheduling systems.
  • Domain expertise with one or more of: Lustre, Ceph, Weka, BeeGFS, GPFS, VAST, object storage, or other distributed filesystems.
  • Experience with large-scale InfiniBand, RoCE, RDMA, high-performance Ethernet, or NVIDIA/Mellanox networking.
  • Direct experience with CUDA, NCCL, DCGM, GPUDirect, checkpointing, dataset staging, or model-serving infrastructure.
  • Experience across multiple industries or customer environments.
