ElastixAI
Teams at ElastixAI
Recently posted jobs
Artificial Intelligence • Hardware • Machine Learning • Generative AI
Design, develop, and maintain core ML inference platform components including model deployment, optimization pipelines, and benchmarking/simulation workflows. Collaborate with systems and cloud engineers, and build APIs/tools to ensure scalable, reliable, and hardware-efficient inference solutions.
Artificial Intelligence • Hardware • Machine Learning • Generative AI
Design and implement IR transformations, graph optimizations, kernel lowering, and code generation for novel hardware. Decompose LLM/transformer workloads into primitives, build performance models and profiling tools, collaborate with ML and hardware teams, prototype end-to-end improvements from framework passes to custom kernels, and shape system architecture for an inference engine.
Artificial Intelligence • Hardware • Machine Learning • Generative AI
Design, operate, and evolve ElastixAI's Kubernetes and multi-cloud inference infrastructure. Run accelerated ML workloads at scale, build deployment and automation tooling, harden AWS/GCP/on-prem systems, partner with ML/runtime teams to productionize models, optimize costs and reliability, and participate in on-call rotation.
