SRE/DevOps
Kraków, Poland
Infotree Global Solutions
Award winning global supplier of Contract Staffing, Independent Contractor Solutions, Permanent Placement, Global Payroll & Employer of Record Solutions.We are enabling the transition to software-defined vehicles supported by electrified and intelligently connected architectures – which will combine to power the future of mobility.
We’re seeking a highly motivated technical lead to establish a Cloud Site Reliability Engineering practice within the Active Safety and User Experience division which is responsible for building the next generation of Autonomous Driving solutions for some of the biggest Car Manufacturers in the world.
In this role, you will provide guidance on the best practices, tools, and processes for a world-leading Cloud Site Reliability Engineering (SRE) function and lead applying SRE solutions to critical workloads across CICD, Simulation, and AI.
ROLES AND RESPONSIBILITIES
• Design, build, and operate cloud-based software engineering tools that are elastic, resilience and secure
• Design integrations with industry-leading observability platforms and implement AI-based alerting systems.
• Implement cybersecurity best practices for threat detection and multi-platform Identity Management.
• Identify and resolve performance bottlenecks and other issues that affect the reliability and scalability of our systems
• Develop and maintain automation scripts and tools to streamline operations
• Collaborate with development teams to ensure that our systems and services meet their needs and are easy to use
• Work with other SREs to design and implement solutions for monitoring, logging, and alerting of our cloud infrastructure and services
• Continuously evaluate and improve our cloud infrastructure and processes to ensure that they are efficient, scalable, and secure
• Actively participate in project team meetings to ensure standard practices are followed and any concerns are quickly addressed.
• Keep up to date on the latest industry trends in technologies.
EXPERIENCE / SKILL REQUISITES:
• Bachelor's degree in Computer Science or related field, or equivalent experience
• 5+ years of experience in a DevOps, MLOps, or Cloud Site Reliability Engineering or similar role e.g. Platform Engineering, Cloud Operations, etc
• Proficiency in one or more programming/scripting languages (Python, Go, Perl, etc.)
• Strong Experience with cloud-based infrastructure platforms such as AWS, GCP, or Azure
• Experience with containerization technologies such as Docker and Kubernetes
• Good Experience with one or more Infrastructure as Code technologies i.e. Ansible, Terraform, Cloud Formation
• Working knowledge of monitoring, logging, and alerting tools such as Datadog, Prometheus, Grafana, and Splunk
• Strong problem-solving and troubleshooting skills
• Excellent communication and collaboration skills
• Excellent problem-solving and debugging skills and Agile development practices
• Good team player and should follow agile development methodologies
• Good interpersonal and communication skills, English language proficiency is a must.
• Ability to learn and adapt new technologies, passion for continuous improvement.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Agile Ansible Architecture Autonomous Driving AWS Azure Computer Science DevOps Docker Engineering GCP Grafana Kubernetes MLOps Perl Python Splunk Terraform
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.