Backend Team Lead - Runtime Team - MLOps Group
Tel Aviv / Netanya, Israel
JFrog
The JFrog Platform gives you an end-to-end pipeline to control the flow of your binaries from build to production. Power your software updates to the edge.
Company Overview:
At JFrog, we’re reinventing DevOps to help the world’s greatest companies innovate – and we want you along for the ride. This is a special place with a unique combination of brilliance, spirit and just all-around great people. Here, if you’re willing to do more, your career can take off. And since software plays a central role in everyone’s lives, you’ll be part of an important mission. Thousands of customers, including the majority of the Fortune 100, trust JFrog to manage, accelerate, and secure their software delivery from code to production – a concept we call “liquid software”. Wouldn't it be amazing if you could join us on our journey?
We are looking for an experienced, hands-on team lead to lead an innovative and talented R&D team, responsible for delivering fast and high-quality code using the latest frameworks and technologies.
Our team plays a critical role in enabling ML innovation by developing and maintaining robust ML capabilities, one of the top-selling products within our group.
Position Overview:
We are looking for a strong team lead to head our Runtime team and help drive forward our MLOps domain in a fast-paced startup environment. The Runtime team focuses on developing and optimizing our AI/ML infrastructure, ensuring models are efficiently deployed, monitored, and scaled to handle millions of requests across multiple cloud platforms, including GPU-powered machines.
As a key member of our growing team, you will tackle interesting and meaningful challenges while contributing to our evolving MLOps ecosystem.
Responsibilities:
- Take a significant part in shaping our cross-company engineering culture.
- Lead and mentor a growing team of skilled engineers.
- Maintain and evolve our Runtime team products, ensuring reliability and scalability.
- Own the execution and delivery of the team’s roadmap, ensuring high-quality and timely releases.
- Work closely with product, data, infrastructure, and other engineering teams to align priorities and technical requirements.
- Build and develop a full hosting system, with various types of inference, analytics, monitoring, distributions and more, that enables customers to run large-scale real-time/batch/streaming pipelines for ML workloads.
- Define and enforce coding standards, methodologies, and technical direction for the team.
- Proactively identify opportunities for improvement in workflows, architecture, and team processes.
- Improve the efficiency, scalability, and stability of various system components.
Qualifications:
- 3+ years of experience as a Software Engineering Team Lead, with proven hands-on experience.
- Excellent communication skills, with the ability to articulate complex ideas clearly and concisely.
- A broad mindset, capable of understanding technical, product, and business perspectives.
- A proactive, initiative-driven approach, able to identify and solve problems independently.
- Proven hands-on experience in containerized environments, microservices and Kubernetes.
- Experienced in designing, developing, and debugging complex, distributed systems (microservices, event-driven).
- Working knowledge of message brokers (e.g., Kafka, RabbitMQ).
- Hands-on experience with Java, Go, and Python.
- Experienced with at least one of the main cloud provider platforms (e.g., AWS, GCP).