We are hiring a Staff Network Engineer to help build and operate the backbone of a carrier-grade, high-performance AI infrastructure. As a key technical contributor, you will design, deploy, and support large-scale network systems that connect our GPU clusters, high-throughput storage, and compute environments across geographically distributed data centers.
You will work alongside Principal Engineers and cross-functional teams to deliver automation-driven, low-latency networking designed for the scale and intensity of AI workloads and HPC environments.
Key Responsibilities
- Implement and maintain high-throughput, low-latency networks supporting AI Factory workloads and distributed training infrastructure.
- Work hands-on to deploy, configure, and troubleshoot routing, switching, optics, and interconnect systems across data centers.
- Operate and optimize Layer 2/3 network services: BGP, EVPN/VXLAN, OSPF, MPLS, QoS, and ACLs.
- Work with InfiniBand networking systems and NVIDIA Fabric Manager (UFM).
- Develop and maintain network automation (e.g., Ansible, Python, Terraform) for provisioning, compliance, and operational workflows.
- Monitor network health and performance using telemetry tools and help scale observability platforms.
- Participate in the incident response rotation and perform root cause analysis on service-impacting events.
- Maintain configuration standards, documentation, and change management in line with infrastructure governance processes.
- Collaborate with the Principal Network Engineer on architectural decisions and vendor evaluations.
Qualifications
Required:
- 5-8+ years of hands-on experience in large-scale network engineering, data center networks, or service provider infrastructure
- Strong knowledge of IP networking, BGP, OSPF, EVPN/VXLAN, and L2/L3 design principles
- Experience configuring and operating Arista, Juniper, or Cisco platforms in production environments
- Proficiency in scripting or automation (e.g., Python, Bash, Ansible)
- Solid troubleshooting skills and experience with real-time diagnostics and packet analysis
- Familiarity with monitoring and telemetry tools (e.g., Prometheus, Grafana, sFlow, InfluxDB)
Preferred:
- Experience in AI, HPC, or GPU-based infrastructure
- Exposure to carrier-grade architectures, DCI, and optical transport systems
- Exposure to NVIDIA InfiniBand networking systems and components
- Understanding of network segmentation, security policies, and zero-trust principles
- Comfort working in 24/7 operational environments and on-call rotations
Voltage Park is an equal opportunity employer and makes employment decisions on the basis of merit. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, protected veteran status, or any other characteristic protected under federal, state, or local law. If you require an accommodation during the job application process, please notify your recruiter.
Top Skills
ACLs
AI Infrastructure
Ansible
BGP
EVPN
Grafana
InfiniBand
InfluxDB
MPLS
NVIDIA Fabric Manager
OSPF
Prometheus
Python
QoS
sFlow
Terraform
VXLAN
Voltage Park Redmond, Washington, USA Office
15809 Bear Creek Pkwy Suite 300, Redmond, WA, United States, 98052

