Infrastructure Engineer - Hardware Fleet

| Remote
Sorry, this job was removed at 11:00 a.m. (PST) on Wednesday, February 2, 2022
Find out who’s hiring remotely
See all Remote jobs
Apply
By clicking Apply Now you agree to share your profile information with the hiring company.

OctoML is a fast-growing startup developing the industry's leading machine learning deployment platform. We enable customers to take ML models to production faster and with greater performance. OctoML's mission is to make AI sustainable and accessible so it can be used to thoughtfully improve lives.We are founded by the creators of Apache TVM, the open-source stack for ML performance and portability. TVM automates the optimization of machine learning models on CPUs, GPUs, edge devices, and specialized accelerators. Building on the success of TVM, our cloud-based OctoML Platform provides choice, automation, and performance to organizations that are taking their trained models to production.Our team consists of experts in machine learning, hardware, cloud services, and compilers. We have secured over $130M in venture capital funding and our team will more than double in size over the next year. We're based largely in Seattle, but have a remote-first culture with people working all over the US and elsewhere in the world.We dream big but execute with focus and believe in creativity, productivity, and a balanced life. We value diversity in all dimensions and are always looking for talented people to join our team!Overview:OctoML is seeking an Infrastructure Engineer to help build, operate, and support physical and cloud infrastructure for the OctoML SaaS platform. A successful candidate will leverage their experience with hardware, Linux administration, infrastructure as code, networking, and scripting to build platforms and services that are secure, observable, and reliable. Our on-prem hardware includes ARM and x86 devices and is a mixture of enterprise and hobbyist SKUs (Raspberry Pi / Arduino / STM-32). We leverage the cloud for anything \"normal,\" so what goes into our on-prem fleet is the miscellaneous and nonstandard hardware. Future work includes specialized AI acceleration hardware e.g. TPU and mobile devices. Experience with small Arm devices, microcontrollers, bootloaders, the PXE/TFTP stack, serial/JTAG, and other technologies will be an asset.Keywords: Linux, Arm, Ansible, Ubuntu, PXE, networking, TCP/IP, Bash, bootstrap, Packer, on-premLanguages (in no particular order): Bash, Python, YAML, GoNote that few candidates will have all of these qualifications. We encourage you to apply and are always looking for the right candidate!As an Infrastructure Engineer, you will:

  • Design, build, operate, and maintain the hardware fleet behind the OctoML SaaS platform.
  • Build out networking, load and manage Linux systems, and build and operate automation for these tasks.
  • Build and manage systems across multiple environments with a focus on configuration as code and platform automation.
  • Build and improve internal developer tools and help drive Continuous Integration and Continuous Delivery to increase productivity across the engineering organization.
  • Assist with running cloud infrastructure for the OctoML SaaS platform.
  • Participate in an on-call rotation with other Infrastructure engineers.

Our ideal infrastructure engineer will have:

  • Expert understanding of Linux systems and networking
  • Experience with small Arm devices, bootloaders, the PXE/TFTE stack, serial/JTAG interfaces, and other embedded and SoC dev boards
  • On-prem/datacenter experience desirable but not required
  • Command of a scripting language such as Python or Bash, as well as Git
  • Proficiency and experience with infrastructure as code/configuration management tools, such as Ansible or Terraform desirable but not required.
  • Computer science, electrical engineering degree or related experience desirable but not required
  • Experience working with cloud providers such as AWS, GCP, or Azure desirable but not required
  • Excellent verbal and written communication skills
  • Passion for documenting work for future engineers
  • Ability to empathize with co-workers and customers
  • Collaborative working style; able to self manage your time effectively

Key Technologies We use: Linux, Ubuntu, ARM, Raspberry Pi, GKE, GCP, EKS, AWS, VPC, Kubernetes, Docker, Terraform, Atlantis, Packer, Python, GitLab, IPsec, RabbitMQ, CockroachDB, GolangOur Benefits:OctoML aims to provide the resources that employees need to be healthy and comfortable.

  • 100% employer paid premium (for employee and dependents) with a low-deductible plan
  • Remote and telework setups for employees (post-COVID)
  • Flexible work hours
  • 4 weeks paid personal time off + company paid holidays and company downtime 2x per year
  • Family & Medical Paid Time Off (includes Maternity, Paternity, Adoption, among others)

OctoML is committed to creating a diverse environment and is proud to be an equal opportunity employer. We hire based on an evaluation of abilities and effectiveness. We don't discriminate against employees on the basis of any other personal characteristic or any classification protected by federal, state or local law. All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, disability, age, or veteran status.

Read Full Job Description
Apply Now
By clicking Apply Now you agree to share your profile information with the hiring company.

Location

OctoML is a remote-first company based in Seattle, WA. Our office is located in Freemont, a place where you can find a troll, a drawbridge, a rocket, dinosaurs, statues, and art for you to dress up - plus numerous restaurants, bars, places to shop, live, and stay!

Similar Jobs

Apply Now
By clicking Apply Now you agree to share your profile information with the hiring company.
Learn more about OctoAIFind similar jobs