Software Engineer - Cloud Engineering
Mountain View, CA
Kumo
Kumo is a platform that uses artificial intelligence to help businesses make predictions from their data, like customer churn or product purchases.As a key team member, you will architect scalable systems for the Kumo platform, making it the top choice for Big Data and AI workloads. Joining early, you'll design the platform to handle large datasets, enhancing productivity for engineers and users. Collaborating with ML scientists, product engineers, and leaders, you'll influence scaling ML tech, develop tools for speed, and craft full-stack experiences. Engineers at Kumo wear many hats, leading the design of core systems from scratch and shaping product direction. You'll dive into foundational work, managing model lifecycles, ML Ops, CI/CD, and deployment strategies.
The Value You'll Add:
- Build and extend components of the core Kumo Cloud Infrastructure and Kumo infrastructure
- Define a culture of engineering excellence and operational efficiency, especially as it relates to development and productization
- Build and automate CI-CD pipelines, release tooling to support continuous delivery, and true zero-downtime deployments across different cloud providers using the latest cloud-native technologies
- Work on advanced tools developed for the world’s leading cloud-native machine learning engine that uses graph deep learning technology
- Develop the infrastructure microservices for features such as usage tracking, diagnostics, monitoring, and alerting at the cloud scale
- Lead automation efforts to streamline global deployment effort
- Build the Kumo ML Ops platform, which will be able to data drift, track model versions, report on production model performance, alert the team of any anomalous model behavior, and run programmatic A/B tests on production models.
Your Foundation:
- BS (preferred MS, PhD.) in Computer Science or a related field
- 3+ years of experience writing production code in C++, Python, Go, or similar languages.
- Experience with Infrastructure-as-Code development (e.g., Terraform, CloudFormation, Ansible, Chef, Bash scripting, etc.)
- Experience with B2B SaaS and architecting experience in building a large-scale distributed system at scale
- Experience with productionizing cloud applications, including Docker and Kubernetes
- Experience with CI/CD and advanced packaging, versioning, and deployment strategies
- Hands-on experience with Kubernetes (e.g., EKS, GKS, AKS, or OpenSource) on public clouds (AWS, GCP) at scale
Your Extra Special Sauce:
- Experience with popular MLOps tooling from cloud vendors like GCP (Vertex AI), AWS (SageMaker), or Azure Machine Learning, MLFlow, Kubeflow, etc.
- Experience with managing popular Data platforms such as AWS EMR, Snowflake, Databricks, etc.
- Experience with industry standard security practices, such as security testing, vulnerability assessments, ISO27001, GRC, and risk under compliance
- Extensive experience with Docker/Containers, Jenkins/Flux/Argo, and Terraform in a Linux environment
- Experience with monitoring tools such as Prometheus, Grafana, etc.
- Proficiency in developing customer-facing Web Front Ends or public APIs/SDKs for the application
Benefits:
- Stock
- Competitive Salaries
- Medical Insurance
- Dental Insurance
We are an equal opportunity employer and value diversity at our company. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: A/B testing Ansible APIs AWS Azure Big Data CI/CD CloudFormation Computer Science Databricks Deep Learning Docker Engineering GCP Grafana ISO 27001 Jenkins Kubeflow Kubernetes Linux Machine Learning Microservices MLFlow MLOps PhD Pipelines Python SageMaker Security Snowflake Terraform Testing Vertex AI
Perks/benefits: Career development Insurance
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.