DevOps engineer
Menlo Park
Lamini
Lamini is the enterprise LLM platform for existing software teams to quickly develop and control their own LLMs. Lamini has built-in best practices for specializing LLMs on billions of proprietary documents to improve performance, reduce...About the Role: We are seeking a highly skilled and motivated DevOps Engineer to join our team at a Senior or Staff level. The ideal candidate will be instrumental in managing cloud infrastructure, improving internal development workflows, and ensuring seamless release and delivery of the Lamini Platform to enterprise customers. This role involves collaborating with cross-functional teams and leveraging cutting-edge technologies to enhance our platform's reliability, scalability, and performance.
Key Responsibilities:
- Software Deployment and Delivery on Kubernetes platform:
- Design and implement robust software deployment processes for delivering high-quality platforms to enterprise customers.
- Work with on-prem and managed Kubernetes environments on Cloud to drive product architecture design.
- Internal Infrastructure Support:
- Maintain and enhance internal ML infrastructure on GCP VertexAI, AWS Bedrock, and private data center GPU servers.
- Support the engineering team by improving the development environment (GitHub, Cloud, local setups).
- Customer Support and Troubleshooting:
- Diagnose and resolve issues related to deploying Lamini Platform in customer on-prem environments.
- Ensure the reliability and performance of the platform and contribute to its continuous improvement.
- Data Center Server Management:
- Collaborate with data center vendors to manage GPU servers.
- Utilize Infrastructure as Code (IaC) principles to automate provisioning and configuration management.
- Team Collaboration:
- Partner with cross-functional teams to ensure reliability and scalability are embedded in the design of new features and services.
- Document systems, processes, and findings to maintain transparency and knowledge sharing.
Desired Fit:
- Continuous Improvement: Proactively identify and address issues in Lamini Platform, to ensure a delightful experience of deploying Lamini Platform for customers.
- Principled Approach: Advocate and implement best practices like Infrastructure as Code (IaC) IaC to ensure system reliability and consistency.
- Collaborative Mindset: Work seamlessly across teams, supporting colleagues and contributing to team success.
- Ownership: Take initiative to own problems end-to-end, learning new skills as needed to deliver solutions.
- Technical Savvy: Experience using AI-assisted programming tools such as Copilot and Cursor is a plus.
Qualifications:
- Bachelor’s degree in Computer Science, or a related field (or equivalent work experience).
- Proven expertise in DevOps tools and platforms, with hands-on experience building workflows and pipelines.
- Deep knowledge of Docker, Kubernetes, Observability, CI/CD, cloud platforms (AWS/GCP), and related tools (docker, helm, prometheus, git, terraform etc).
- Proficiency in programming languages such as Python, Go, and shell scripting (e.g., Bash, Awk).
- Strong problem-solving skills with the ability to thrive in a fast-paced environment.
- Excellent communication skills for engaging with stakeholders and documenting technical processes.
At Lamini AI, we are committed to providing an environment of mutual respect where equal employment opportunities are available to all applicants without regard to race, color, religion, sex, pregnancy (including childbirth, lactation and related medical conditions), national origin, age, physical and mental disability, marital status, sexual orientation, gender identity, gender expression, genetic information (including characteristics and testing), military and veteran status, and any other characteristic protected by applicable law. Lamini AI believes that diversity and inclusion among our employees is critical to our success as a company, and we seek to recruit, develop and retain the most talented people from a diverse candidate pool. Selection for employment is decided on the basis of qualifications, merit, and business need.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Architecture AWS CI/CD Computer Science Copilot DevOps Docker Engineering GCP Git GitHub GPU Helm Kubernetes Machine Learning ML infrastructure Pipelines Python Security Shell scripting Terraform Testing Vertex AI
Perks/benefits: Career development Transparency
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.