Site Reliability Engineer - Data Analytics
Issaquah, WA, US
Full Time Senior-level / Expert USD 85K - 160K
Costco Wholesale
Costco IT is responsible for the technical future of Costco Wholesale, the third largest retailer in the world with wholesale operations in fourteen countries. Despite our size and explosive international expansion, we continue to provide a family-oriented, employee-centric atmosphere in which our employees thrive and succeed.
This is an environment unlike anything in the high-tech world and the secret of Costco’s success is its culture. The value Costco puts on its employees is well documented in articles from a variety of publishers including Bloomberg and Forbes. Our employees and our members come FIRST. Costco is well known for its generosity and community service and has won many awards for its philanthropy. The company joins with its employees to take an active role in volunteering by sponsoring many opportunities to help others.
Come join the Costco Wholesale IT family. Costco IT is a dynamic, fast-paced environment, working through exciting transformation efforts. We are building the next generation retail environment where you will be surrounded by dedicated and highly professional employees.
Data Engineers are responsible for developing and operationalizing data pipelines/integrations to make data available for consumption (e.g., Reporting, Data Science/Machine Learning, and Data APIs). This includes data ingestion, data transformation, data validation/quality, data pipeline optimization, and orchestration, as well as deploying code to production via CI/CD. The Data Engineer role requires knowledge of software development/programming methodologies, various data sources (relational databases, flat files (CSV, delimited), APIs, XML, JSON, etc.), and data access (SQL, Python, etc.), followed by expertise in data modeling, cloud architectures/platforms, data warehousing, and data lakes. This role will also partner closely with Product Owners, Data Architects, Platform/DevOps Engineers, etc. to design, build, test, implement, and maintain data pipelines.
This position serves as a Data Engineer and subject matter expert. The role works directly with managers, architects, engineers, and developers, providing hands-on technical cloud solutions. The position requires hands-on technical design and implementation of data applications such as Power BI, Cognos, Alteryx, BW, BODS, WhereScape, and IICS environments. This position will bring real-world Microsoft Power BI engineering and solution experience and will function as part of a highly skilled, agile development and data team supporting major data and analytics programs.
Engineers have deep knowledge and hands-on experience in enterprise-wide platforms, and solve technical problems while working on technology initiatives. Engineers will be required to be available for 24x7 support as needed and be on call on a rotational basis. Engineers have strong architectural, leadership, and technical skills. Engineers should have high-level skills in Power BI, Databricks, Azure, Alteryx, UC4, and Webi. Experience with system administration, monitoring, and observability is a plus. Engineers interact in a highly effective manner with other team members and management, drive innovation, and influence delivery and performance.
The Site Reliability Engineer (SRE) will be responsible for maintaining and improving the availability, performance, and capacity of Data Analytics Engineering Operations Problem and Incident Resolution Support. The SRE will translate Costco’s goals and strategies for system availability, performance, and capacity into designs and plans for technical solutions. The SRE will work with other Costco teams to resolve issues affecting the availability, performance, or capacity of Data Analytics Engineering Operations Problem and Incident Resolution Support.
The SRE will also work with other Costco teams to identify upcoming events that could affect demand on system performance and prepare mitigation plans. The SRE will work with teams and System Architects to implement, maintain, and validate disaster recovery plans and other solutions to avoid or mitigate service interruptions.
The SRE will monitor the availability, performance, and capacity of Data Analytics Engineering Operations Problem and Incident Resolution Support to identify trends and concerns. The SRE will create and disseminate system reliability reports to Costco management in support of planning and decision making. The SRE will also assist in the development of policies, standards, and guidelines for the maintenance and operation of Costco’s overall Data Analytics Engineering Operations Problem and Incident Resolution Support Solutions.
Additionally, this role will work closely with other members of DAEO, Operations teams, the Quality Assurance team, Software Development teams, Support teams, and management to achieve team goals.
If you want to be a part of one of the worldwide BEST companies “to work for”, simply apply and let your career be reimagined.
ROLE
● Builds prototypes of potential features.
● Enhances automation of applications, systems, and platforms and identifies opportunities for streamlining, and continuous process improvement.
● Applies knowledge to practical and sustainable applications and capabilities.
● Contributes, interprets, and communicates enterprise, technical, project, and operational strategies to the team.
● Formulates and directs activities that align short term goals and long term initiatives while providing accurate and timely estimates of work breakdown schedules.
● Influences and drives adoption of best practices and high quality standards throughout the division.
● Integrates diverse solution components across multiple platforms using industry standard interfaces.
● Tests and resolves problems, performs root cause analysis, identifies gaps, recommends solutions and preventative measures, and leads team members to solution delivery plans.
● Runs proofs of concept and uses diagnostic/debugging skills to solve current challenges in multi-platform systems.
● Orchestrates reviews for system additions and/or enhancements.
● Promotes and supports a culture of compliance, risk avoidance/mitigation, and corporate accountability throughout the organization through technical leadership, knowledge of business need, development and communication of policies, procedures, and plans, and assurance of solution designs that are in compliance with architecture standards, technology guardrails, security, and operational guidelines.
● Optimizes team efficiency and performance through high level technical direction.
● Provides technical leadership in implementation of applications, strategic planning sessions, documentation requirements, tool implementation, database query languages, and programming languages.
● Uses subject matter expertise to support industry standard source control and source change management techniques.
● Presents technical designs and solutions to executives, management, and other audiences to gain consensus and/or project approval.
● Provides technical leadership in implementation of cloud applications using cloud technologies (PaaS, IaaS), database query languages and web programming languages.
● Leads technical expertise in solving complex problems with cloud solutions.
● Works closely with cloud infrastructure teams, architects, Dev / QA and engineers to design, implement and manage secure, scalable and reliable cloud infrastructure environments.
● Proposes and implements cloud infrastructure and automation to drive efficiency.
● Develops monitoring rules and manages resources including Azure Load Balancers, VNet, Subnets, Resource Groups, and Network Security Groups.
● Solves highly technical and complex problems on multiple projects, and provides consultative support to internal staff.
● Monitors and manages Azure Load Balancers, VNet, Subnets, Resource Groups, and Network Security Groups.
● Manages and fixes platform-related issues and supports operating system patching for the Azure DevOps environment.
● Implements, sets up, and documents the DevOps CI/CD pipeline.
● Optimizes cloud resource costs (snoozing / shutting down) to drive financial efficiency.
● Performs monitoring, alerting, logging, and analytics using appropriate Azure-native tools.
● Monitors third-party services and tools provided by Costco for Network, Compute, and Storage.
● Creates, manages and maintains Operations Runbook consisting of standard operating procedures, configurations, lessons learned, root cause analysis, diagnostic steps and solutions to resolve future incidents.
● Assists with, manages, and monitors Azure backup and restore processes.
● Documents Azure operational services baseline.
● Interacts with business customers to formulate and define system scope and objectives.
● Conducts data modeling, configuration, specification development, coding, testing, and documentation.
● Conducts cloud capacity planning and applies best practices in security for cloud applications.
● Runs proofs of concept and advises on the best use of Azure services to solve current challenges.
● Reviews completion and implementation of cloud system additions and/or enhancements and makes recommendations to management and/or clients.
● Develops documentation on new or existing cloud systems.
● Maintains current knowledge of relevant technology as assigned (e.g., Microsoft Azure).
● Participates in special projects as required.
● Works closely with PMO to scope the next iteration and assist in implementation.
● Identifies automation tools to automate build and release pipelines to improve the speed and reliability of the release process.
● Ensures version control for added traceability and reliability of code to track code evolutions.
● Facilitates post-release retrospectives.
REQUIRED
● 3 to 5 years’ scripting to automate operational tasks.
● Ability to mentor and guide multiple members of the team across several simultaneous projects.
● Familiarity with cloud methodologies.
● Passion for cloud technologies.
Recommended
● Microsoft Azure certification preferred.
● Expertise with the Azure console and portal.
● Familiarity with XML, SOAP, REST, PowerShell and Puppet.
● 3+ years’ experience deploying enterprise data technologies.
● 5+ years’ experience with enterprise software systems.
● Proficient in Google Workspace applications, including Sheets, Docs, Slides, and Gmail.
Required Documents
● Cover Letter
● Resume
California applicants, please click here to review the Costco Applicant Privacy Notice.
Pay Ranges:
Level 1 - $85,000 - $110,000
Level 2 - $105,000 - $135,000
Level 3 - $130,000 - $160,000
We offer a comprehensive package of benefits to eligible employees, including paid time off, health benefits (medical, dental, vision, hearing aid, pharmacy, behavioral health, and employee assistance), a health care reimbursement account, a dependent care assistance plan, short-term and long-term disability insurance, AD&D insurance, life insurance, 401(k), and a stock purchase plan.
Costco is committed to a diverse and inclusive workplace. Costco is an equal opportunity employer. Qualified applicants will receive consideration for employment without regard to race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or any other legally protected status. If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request to IT-Recruiting@costco.com.
If hired, you will be required to provide proof of authorization to work in the United States. In some cases, applicants and employees for selected positions will not be sponsored for work authorization, including, but not limited to, H-1B visas.
Tags: Agile APIs Architecture Azure CI/CD CSV Data Analytics Databricks Data pipelines Data Warehousing DevOps Engineering JSON Machine Learning Pipelines Power BI Privacy Puppet Python RDBMS Security SQL Testing XML
Perks/benefits: Career development Equity / stock options Gear Health care Insurance Team events