Site Reliability Engineer
Remote
Beacon Biosignals
AI + EEG to change the way patients are treated for disorders of the brain.
Beacon Biosignals is on a mission to revolutionize precision medicine for the brain. We are the leading at-home EEG platform supporting clinical development of novel therapeutics for neurological, psychiatric, and sleep disorders. Our FDA 510(k)-cleared Dreem EEG headband and AI algorithms enable quantitative biomarker discovery and implementation. Beacon’s Clinico-EEG database contains EEG data from nearly 100,000 patients, and our cloud-native analytics platform powers large-scale RWD/RWE retrospective and predictive studies. Beacon Biosignals is changing the way that patients are treated for any disorder that affects brain physiology.
Beacon Biosignals is seeking a skilled Site Reliability Engineer to join our Platform team and help ensure the reliability, availability, and security of the cloud infrastructure that supports large-scale machine learning on terabytes of biosignal data. In this role, you'll be responsible for building and maintaining critical systems, such as the Kubernetes clusters that power our data scientists' hefty distributed numerical workloads, and the observability infrastructure that makes it easy for users to monitor and trace their workloads and to identify bugs and resource utilization issues.
At Beacon, we've found that cultural and scientific impact is driven most by those who lead by example. As such, we're always seeking new contributors whose work demonstrates an avid curiosity, a bias towards simplicity, an eye for composability, a self-service mindset, and - most of all - a deep empathy towards colleagues, stakeholders, users, and patients. We believe a diverse team builds more robust systems and achieves higher impact.
Beacon's robust asynchronous work practices ensure a first-class remote work experience, but we also have in-person office hubs in Boston, New York, and Paris.
What success looks like:
- Design and implement infrastructure-as-code solutions that improve the reliability, security, and maintainability of our cloud infrastructure
- Lead and execute major infrastructure initiatives including cluster upgrades, security improvements, and architectural changes
- Develop and maintain CI/CD pipelines that enable teams to deploy safely and efficiently
- Improve observability across our systems through enhanced monitoring, logging, and alerting
- Participate in an on-call rotation and lead incident response efforts when issues arise
- Collaborate with development teams to improve application reliability and performance
- Maintain and enhance our security posture through infrastructure hardening and automation
- Create and maintain documentation for infrastructure, deployment processes, and incident response procedures
What you will bring:
- Strong experience with Kubernetes administration, including cluster management, security, and troubleshooting
- Proven track record of implementing infrastructure as code using Terraform or similar tools
- Experience building and maintaining CI/CD pipelines, particularly with GitHub Actions and ArgoCD
- Solid understanding of container technologies and build processes, especially Docker
- Strong knowledge of a cloud provider (e.g. AWS), including networking, security, and infrastructure services
- Experience with incident response and on-call responsibilities in a production environment
- Deep experience with Linux systems administration and debugging
- Proficiency in at least one programming language (Python, Go, TypeScript, etc.)
- Understanding of security and networking concepts, including OAuth2/OIDC, DNS, TLS, TCP/UDP, etc.
- Approximate experience: Bachelor's degree plus 5-8 years in SRE, DevOps, or a similar professional role
Perks/benefits: Career development, equity / stock options