Stability and Resilience Engineer with Python Dev Experience

Alpharetta, GA

Synechron

Synechron is an innovative global consulting firm delivering industry-leading digital solutions to transform and empower businesses.

View all jobs at Synechron

Apply now Apply later

We are

At Synechron, we believe in the power of digital to transform businesses for the better. Our global consulting firm combines creativity and innovative technology to deliver industry-leading digital solutions. Synechron’s progressive technologies and optimization strategies span end-to-end Artificial Intelligence, Consulting, Digital, Cloud & DevOps, Data, and Software Engineering, servicing an array of noteworthy financial services and technology firms. Through research and development initiatives in our FinLabs we develop solutions for modernization, from Artificial Intelligence and Blockchain to Data Science models, Digital Underwriting, mobile-first applications and more. Over the last 20+ years, our company has been honored with multiple employer awards, recognizing our commitment to our talented teams. With top clients to boast about, Synechron has a global workforce of 14,000+, and has 55 offices in 20 countries within key global markets.

Our challenge

We are seeking a highly skilled Stability and Resilience Engineer to join our dynamic team. This role is crucial for ensuring the stability, reliability, and resilience of our systems and applications, with a focus on proactive measures to prevent downtime and mitigate risks. The ideal candidate will have a strong technical background, a passion for problem-solving, and a commitment to excellence in system performance.

Additional Information* 

The base salary for this position will vary based on geography and other factors. In accordance with law, the base salary for this role if filled within Alpharetta, GA is $100K - $110K/year & benefits (see below).

The Role

Responsibilities:

1.       System Monitoring and Analysis:

  • Implement and manage monitoring solutions to track system performance and health.

  • Analyze system metrics to identify patterns and irregularities that may indicate potential failures.

2.       Incident Management:

  • Develop and execute incident response plans to address system outages and failures.

  • Conduct root cause analysis to understand incidents and implement corrective actions.

3.       Collaboration:

  • Work closely with development, operations, and other engineering teams to ensure systems are designed with stability and resilience in mind.

  • Participate in architectural reviews and design discussions to provide insights on resilience best practices.

4.       Documentation and Reporting:

  • Maintain thorough documentation of system configurations, incident reports, and resilience testing results.

  • Prepare reports and presentations to communicate system performance, risks, and improvement strategies to stakeholders.

5.       Continuous Improvement:

  • Stay current with industry trends, tools, and techniques in system stability and resilience.

  • Propose and implement innovations to enhance system performance and reliability.

Requirements:

You are:

  • Bachelor’s degree in Computer Science, Engineering, or related field; Master’s degree preferred.

  • Proven experience in systems engineering, reliability engineering, or a similar role.

  • Strong understanding of system architecture, cloud infrastructure, and networking concepts.

  • Proficiency in monitoring tools and techniques (e.g., Prometheus, Grafana, Dynatrace, etc.).

  • Experience with incident management and response frameworks (ITIL, SRE principles).

  • Knowledge of scripting and programming languages (e.g., Python, Bash, Go).

  • Familiarity with DevOps practices and CI/CD pipelines.

  • Excellent problem-solving skills and the ability to work under pressure.

  • Strong communication and collaboration skills, with an ability to work effectively in a team environment.

It would be great if you also had:

  • Relevant certifications (e.g., AWS Certified Solutions Architect, Google Cloud Professional DevOps Engineer, Certified Kubernetes Administrator).

  • Experience with container orchestration and microservices architecture.

  • Previous involvement in disaster recovery planning and execution.

We can offer you:

  • A highly competitive compensation and benefits package

  • A multinational organization with 55 offices in 20 countries and the possibility to work abroad

  • Laptop and a mobile phone

  • 10 days of paid annual leave (plus sick leave and national holidays)

  • Maternity & Paternity leave plans

  • A comprehensive insurance plan including: medical, dental, vision, life insurance, and long-/short-term disability (plans vary by region)

  • Retirement savings plans

  • A higher education certification policy

  • Commuter benefits (varies by region)

  • Extensive training opportunities, focused on skills, substantive knowledge, and personal development.

  • On-demand Udemy for Business for all Synechron employees with free access to more than 5000 curated courses 

  • Coaching opportunities with experienced colleagues from our Financial Innovation Labs (FinLabs) and Center of Excellences (CoE) groups

  • Cutting edge projects at the world’s leading tier-one banks, financial institutions and insurance firms

  • A flat and approachable organization

  • A truly diverse, fun-loving and global work culture

S​YNECHRON’S DIVERSITY & INCLUSION STATEMENT
 

Diversity & Inclusion are fundamental to our culture, and Synechron is proud to be an equal opportunity workplace and is an affirmative action employer. Our Diversity, Equity, and Inclusion (DEI) initiative ‘Same Difference’ is committed to fostering an inclusive culture – promoting equality, diversity and an environment that is respectful to all. We strongly believe that a diverse workforce helps build stronger, successful businesses as a global company. We encourage applicants from across diverse backgrounds, race, ethnicities, religion, age, marital status, gender, sexual orientations, or disabilities to apply. We empower our global workforce by offering flexible workplace arrangements, mentoring, internal mobility, learning and development programs, and more.


All employment decisions at Synechron are based on business needs, job requirements and individual qualifications, without regard to the applicant’s gender, gender identity, sexual orientation, race, ethnicity, disabled or veteran status, or any other characteristic protected by law.

Candidate Application Notice

Apply now Apply later
  • Share this job via
  • 𝕏
  • or
Job stats:  0  0  0

Tags: Architecture AWS Blockchain CI/CD Computer Science Consulting Consulting firm DevOps Engineering GCP Google Cloud Grafana ITIL Kubernetes Microservices Pipelines Python Research Testing

Perks/benefits: Career development Competitive pay Equity / stock options Gear Health care Insurance Medical leave Parental leave

Region: North America
Country: United States

More jobs like this