Machine Learning Infrastructure Engineer
REMOTE
Applications have closed
Flip
Download Flip App for free today to discover, learn & shop your favorite brands all in one place. Share and monetize your video reviews.About Flip.shop:Welcome to Flip.shop, where innovation meets the social commerce revolution! Fresh off our Series C funding round, we've raised $144 million, propelling our valuation to an impressive $1.05 billion. We’re redefining the shopping experience by giving consumers a voice in a space dominated by tech giants. Join us on this exhilarating journey where your technical skills will play a pivotal role in shaping the future of social commerce!
Why Join Us?At Flip.shop, you’ll be at the forefront of innovation in social commerce. This isn’t just a job—it’s a chance to build infrastructure that empowers our AI-driven platform to scale and deliver personalized shopping experiences. You will have the opportunity to directly partner, work with and learn from the very best engineers and scientists who joined us from some of the leading big-tech companies! If you thrive in a fast-paced, collaborative environment where you can develop high-performance systems, we want to hear from you!
Role Overview:We are seeking a Machine Learning Infrastructure Engineer to design, build, and optimize the infrastructure that powers our machine learning systems. You’ll ensure the efficient deployment, scaling, and monitoring of machine learning models, and will help streamline the development lifecycle. This role offers the opportunity to create scalable, production-level systems that support real-time recommendations and drive business growth.
Responsibilities:
- Infrastructure Development: Design and implement scalable infrastructure for deploying, monitoring, and maintaining machine learning models in production environments.
- Tooling & Automation: Build tools to automate workflows for model training, testing, and deployment, ensuring that machine learning models can move quickly from development to production.
- Cloud Infrastructure: Leverage cloud platforms to create efficient, scalable systems for large-scale machine learning workloads.
- Performance Optimization: Ensure the infrastructure supports high-performance model inference at scale, with a focus on minimizing latency and maximizing throughput.
- Collaboration: Work closely with data scientists, machine learning engineers, and DevOps teams to create seamless integration between development and production environments.
- Monitoring & Maintenance: Build robust monitoring systems to track model performance and infrastructure health, ensuring reliability and uptime of machine learning services.
- Security & Compliance: Implement best practices in infrastructure security, data privacy, and compliance, particularly when handling sensitive user data.
Requirements:
- Experience: 3+ years in infrastructure engineering, DevOps, or similar roles, with a focus on supporting machine learning workflows in production.
- Technical Skills: Strong proficiency in cloud platforms (AWS, GCP, or Azure), containerization (Docker, Kubernetes), CI/CD pipelines, and infrastructure-as-code tools (Terraform, Ansible). Experience with SageMaker is a bonus.
- ML Workflow Knowledge: Experience working with machine learning frameworks (TensorFlow, PyTorch, or similar) and familiarity with MLOps practices.
- Performance & Scalability: Proven track record of optimizing infrastructure for performance, scalability, and reliability in production environments.
- Collaboration: Strong teamwork skills, with the ability to partner with ML engineers and data scientists to streamline workflows.
- Communication: Ability to communicate complex infrastructure solutions to technical and non-technical stakeholders.
- Problem-Solving: Passion for solving infrastructure challenges that support real-time machine learning at scale.
Ready to Build the Future?If you're passionate about building scalable infrastructure and driving innovation in machine learning at scale, join us at Flip.shop! Let’s redefine the future of online shopping together.
Compensation & Benefits:Base salary and total compensation will vary based on factors including but not limited to location, experience, and performance. Please note the base salary is just one component of the company’s total rewards package for exempt employees. Other rewards may include equity, bonuses, long term incentives, a PTO policy, and other progressive benefits.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Ansible AWS Azure CI/CD DevOps Docker Engineering GCP Kubernetes Machine Learning ML infrastructure ML models MLOps Model inference Model training Pipelines Privacy PyTorch SageMaker Security TensorFlow Terraform Testing
Perks/benefits: Career development Equity / stock options Health care Salary bonus Startup environment
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.