Software Engineer 3 - Linux/Python/Kubernetes/PyTorch/Helm

Job expired!
Build Something to Be Proud Of. Captivation Software has built a reputation on providing customers exactly what is needed in a timely manner. Our team of engineers take pride in what they develop and constantly innovate to provide the best solution. Captivation Software is looking for a senior level software engineer who will be responsible for designing, developing, and sustaining pipelines that enable machine-learning model training as well as large-scale inference in a Kubernetes-based environment. Responsibilities: - Development of data-aware model training pipelines to facilitate unique customer requirements surrounding model provenance - Development of scalable, Kubernetes-based inference pipelines that compliantly handle in-flight data - Configuring and maintaining custom metrics to enable tuning of running pipelines Requirements: Security Clearance: - Must currently hold a Top Secret / SCI U.S. Government security clearance with a favorable Polygraph, therefore all candidates must be a U.S. citizen Minimum Qualifications: - Master's degree in Computer Science or related discipline from an accredited college or university, plus five (5) years of experience as a SWE, in programs and contracts of similar scope, type, and complexity. - Bachelor's degree in Computer Science or related discipline from an accredited college or university, plus seven (7) years of experience as a SWE, in programs and contracts of similar scope, type, and complexity - Nine (9) years of experience as a SWE, in programs and contracts of similar scope, type, and complexity. Required Skills: - Experience using the Linux CLI - Experience developing with Python - Experience developing and deploying containerized applications - Experience writing and deploying Kubernetes resources - Experience writing and deploying Helm charts Desired Skills: - Experience developing with Go - Experience with CI/CD concepts & implementations (Gitlab, Flux CD, etc) - Experience working with, and debugging, GPU-enabled applications - Experience with Policy Management Tools such as Kyverno - Experience using Imagine Builder Tools for Go such as Ko - Experience using a machine-learning framework (PyTorch, TensorFlow, etc) - Experience with other ML pipelines/frameworks like KubeFlow, NeMo, PyTorch Lightning - Experience with metrics and monitoring tools such as Prometheus and Grafana - Experience with the Atlassian suite of tools This position is open for direct hires only. We will not consider candidates from third party staffing/recruiting firms. Benefits: - Annual Salary: $125,000 - $250,000 (Depends on the years of experience) - Up to 20% 401k contribution (no matching required) - Above market hourly rates - $3,000 HSA Contribution - 5 Weeks Paid Time Off - Company Paid Employee Medical / Dental / Vision Insurance / Life Insurance / Short-Term & Long-Term Disability / AD&D