Software Engineer 2 - Linux/Python/Kubernetes/Helm/Git/PyTorch

Build Something to Be Proud Of.

Captivation Software has built a reputation on providing customers exactly what is needed in a timely manner. Our engineers take pride in what they develop and constantly innovate to provide the best solution.

Captivation Software is looking for a mid-level software engineer who will be responsible for designing, developing, and sustaining pipelines that enable machine-learning model training as well as large-scale inference in a Kubernetes-based environment.

Responsibilities:
- Development of data-aware model training pipelines to facilitate unique customer requirements surrounding model provenance
- Development of scalable, Kubernetes-based inference pipelines that compliantly handle in-flight data
- Configuring and maintaining custom metrics to enable tuning of running pipelines

Requirements

Security Clearance:
- Must currently hold a Top Secret / SCI U.S. Government security clearance with a favorable Polygraph; therefore, all candidates must be U.S. citizens

Minimum Qualifications (one of the following):
- Master's degree in Computer Science or related discipline from an accredited college or university, plus three (3) years of experience as a SWE, in programs and contracts of similar scope, type, and complexity
- Bachelor's degree in Computer Science or related discipline from an accredited college or university, plus five (5) years of experience as a SWE, in programs and contracts of similar scope, type, and complexity
- Seven (7) years of experience as a SWE, in programs and contracts of similar scope, type, and complexity

Required Skills:
- Experience using the Linux CLI
- Experience developing with Python
- Experience developing and deploying containerized applications
- Experience writing and deploying Kubernetes resources
- Experience writing and deploying Helm charts

Desired Skills:
- Experience developing with Go
- Experience with CI/CD concepts and implementations (GitLab, Flux CD, etc.)
- Experience working with, and debugging, GPU-enabled applications
- Experience using a machine-learning framework (PyTorch, TensorFlow, etc.)
- Experience with other ML pipelines/frameworks such as Kubeflow, NeMo, and PyTorch Lightning
- Experience with metrics and monitoring tools such as Prometheus and Grafana
- Experience with the Atlassian suite of tools

This position is open for direct hires only. We will not consider candidates from third-party staffing/recruiting firms.

Benefits
- Annual Salary: $125,000 - $250,000 (depends on years of experience)
- Up to 20% 401k contribution (no matching required)
- Above market hourly rates
- $3,000 HSA Contribution
- 5 Weeks Paid Time Off
- Company Paid Employee Medical / Dental / Vision Insurance / Life Insurance / Short-Term & Long-Term Disability / AD&D