Datadog Lead/Architect
Edison, NJ · Remote, US
Orion Innovation
Orion delivers digital transformative business solutions rooted in digital strategy, experience design, and engineering, enabling our clients with digital transformation to operate with agility at scale.Orion Innovation is a premier, award-winning, global business and technology services firm. Orion delivers game-changing business transformation and product development rooted in digital strategy, experience design, and engineering, with a unique combination of agility, scale, and maturity. We work with a wide range of clients across many industries including financial services, professional services, telecommunications and media, consumer products, automotive, industrial automation, professional sports and entertainment, life sciences, ecommerce, and education.
About the Role:
We are seeking a highly skilled Datadog Lead/Architect to design, implement, and optimize monitoring, observability, and AIOps solutions using Datadog. This role requires deep expertise in cloud infrastructure, application performance monitoring (APM), log management, and automation to ensure end-to-end observability across complex enterprise environments.
Key Responsibilities:
Design & Architecture:
- Architect and implement Datadog-based monitoring solutions for cloud, hybrid, and on-prem environments.
- Define best practices for instrumentation, telemetry collection, and visualization.
- Design scalable and cost-efficient monitoring strategies to optimize performance and reliability.
Implementation & Integration:
- Lead Datadog integrations with cloud platforms (AWS, Azure, GCP), Kubernetes, databases, and third-party services.
- Oversee APM, log management, network monitoring, security monitoring, and synthetic monitoring implementations.
- Develop custom dashboards, alerts, and reports to improve visibility and troubleshooting efficiency.
Automation & Optimization:
- Automate observability using Terraform, Ansible, or scripting languages (Python, Bash, etc.).
- Optimize log ingestion, metric collection, and high-cardinality data management to reduce costs.
- Establish usage governance, anomaly detection, and auto-scaling recommendations.
Collaboration & Leadership:
- Work closely with DevOps, SRE, Security, and Engineering teams to align monitoring with business needs.
- Mentor and guide teams on Datadog best practices and performance tuning.
- Lead troubleshooting efforts for complex incidents using observability insights.
Required Skills & Experience:
- 5+ years of experience in observability, monitoring, or cloud operations.
- Deep expertise in Datadog setup, architecture, and optimization.
- Strong understanding of cloud-native technologies (Kubernetes, Docker, Serverless, CI/CD pipelines, etc.).
- Experience with APM, log management, network & infrastructure monitoring.
- Proficiency in infrastructure-as-code (IaC) tools like Terraform, Ansible, or CloudFormation.
- Strong hands-on knowledge of scripting languages (Python, Bash, PowerShell, etc.).
- Ability to analyze complex system behavior and troubleshoot performance issues.
- Experience integrating security and compliance monitoring into observability frameworks.
Preferred Qualifications:
- Datadog certifications (Datadog Certified Deploy or Advanced Observability).
- Experience in AIOps, anomaly detection, and ML-based monitoring.
- Background in SRE, DevOps, or Performance Engineering.
Orion is an equal opportunity employer, and all qualified applicants will receive consideration for employment without regard to race, color, creed, religion, sex, sexual orientation, gender identity or expression, pregnancy, age, national origin, citizenship status, disability status, genetic information, protected veteran status, or any other characteristic protected by law.
Candidate Privacy Policy
Orion Systems Integrators, LLC and its subsidiaries and its affiliates (collectively, “Orion,” “we” or “us”) are committed to protecting your privacy. This Candidate Privacy Policy (orioninc.com) (“Notice”) explains:
- What information we collect during our application and recruitment process and why we collect it;
- How we handle that information; and
- How to access and update that information.
Your use of Orion services is governed by any applicable terms in this notice and our general Privacy Policy.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: AIOps Ansible Architecture AWS Azure CI/CD CloudFormation Data management DevOps Docker E-commerce Engineering GCP Industrial Kubernetes Machine Learning Pipelines Privacy Python Security Terraform
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.