Systems Reliability Engineer, Enterprise AI

Tokyo

Woven by Toyota

Woven by Toyota will help Toyota to develop next-generation cars and to realize a mobility society in which everyone can move freely, happily and safely.

View all jobs at Woven by Toyota

Apply now Apply later

About Woven by ToyotaWoven by Toyota, a part of the Toyota Group, is challenging the current state of mobility through human-centric innovation and empowering mobility transformation. Through our AD/ADAS technology, our automotive software development platform Arene OS, our mobility test course Toyota Woven City, and Toyota’s growth fund, Woven Capital, we are pioneering the movement of people, goods, information, and energy, weaving a future of enhanced safety, connectivity and well-being for all.
=========================================================================
TEAMThe Enterprise AI Team’s mission is to give Toyota a platform for AI growth. The team works to simplify training, inference, MLops and other aspects of AI development. This is used internally to build safety, convenience and autonomy for the Toyota vehicles.
As part of Enterprise AI, the SRE team will provide incident and release processes, monitoring, debugging and alerting tools, and on-call support for Enterprise AI. The SRE team is critical to ensure that we provide a reliable experience to our customers without Woven by Toyota and the larger Toyota group.
WHO ARE WE LOOKING FOR?As a member of the Enterprise AI SRE team, you will be responsible for maintaining the reliability of our services and end user experience. You will have a solid technical background and hands-on experience in on-call rotations, debugging production service issues, and monitoring/observability. You have knowledge of SRE technologies, processes, and best practices, to work closely with other engineering teams.

RESPONSIBILITIES

  • Develop and maintain site reliability tools and processes within SRE and Enterprise AI’s engineering and support teams
  • Implement new technologies and infrastructure upgrades and configuration
  • Collaborate with other site reliability and support  teams at Woven to build and maintain an integrated system
  • Take part in on-call rotations to ensure platform availability

MINIMUM QUALIFICATIONS

  • 3+ years of experience in software engineering, with at least 2 years in a site reliability engineering or related role
  • Kubernetes cloud infrastructure and services in AWS and/or GCP, Python/Go, and Terraform experience
  • On-call support and monitoring/alerting tools (such as Pagerduty, Statuspage, Grafana, etc.),  processes (such as on-call, incident management, post-mortems, release/change management, etc.), structure, and best practice experience
  • Business level English or higher

NICE TO HAVES

  • Experience working with customers and clients in Japan
  • Experience working with machine learning and/or AI
  • Japanese language skills
=========================================================================Important Points・All interviews will be arranged via Google Meet, unless otherwise stated.・The same job descriptions are available in both English and Japanese; therefore, we kindly ask that you apply to only one version.・We kindly request that you submit your resume in English, if possible. However, Japanese resumes are also acceptable. Please note that, depending on the English proficiency requirements of the role, we may request an English version of your resume later in the process.
WHAT WE OFFER・Competitive Salary - Based on experience・Work Hours - Flexible working time・Paid Holiday - 20 days per year (prorated)・Sick Leave - 6 days per year (prorated)・Holiday - Sat & Sun, Japanese National Holidays, and other days defined by our company・Japanese Social Insurance - Health Insurance, Pension, Workers’ Comp, and Unemployment Insurance, Long-term care insurance・Housing Allowance・Retirement Benefits・Rental Cars Support・In-house Training Program (software study/language study)
Our Commitment・We are an equal opportunity employer and value diversity.・Any information we receive from you will be used only in the hiring and onboarding process. Please see our privacy notice for more details.
Apply now Apply later

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Job stats:  0  0  0

Tags: AWS Engineering GCP Grafana Kubernetes Machine Learning MLOps Privacy Python Terraform

Perks/benefits: Career development Competitive pay Flex hours Health care

Region: Asia/Pacific
Country: Japan

More jobs like this