Staff Software Engineer, Data Ingestion
Hyderabad
BrightEdge
BrightEdge is the leading SEO solution and content performance marketing platform, helping more than 1,700 customers generate more organic traffic. BrightEdge is based in San Mateo, CA with offices in New York, Seattle, Chicago, Cleveland,...
The Staff Software Engineer, Data Ingestion will be a critical individual contributor responsible for designing collection strategies, developing, and maintaining robust and scalable data pipelines. This role is at the heart of our data ecosystem, deliver new analytical software solution to access timely, accurate, and complete data for insights, products, and operational efficiency.
Key Responsibility
- Design, develop, and maintain high-performance, fault-tolerant data ingestion pipelines using Python.
- Integrate with diverse data sources (databases, APIs, streaming platforms, cloud storage, etc.).
- Implement data transformation and cleansing logic during ingestion to ensure data quality.
- Monitor and troubleshoot data ingestion pipelines, identifying and resolving issues promptly.
- Collaborate with database engineers to optimize data models for fast consumption.
- Evaluate and propose new technologies or frameworks to improve ingestion efficiency and reliability.
- Develop and implement self-healing mechanisms for data pipelines to ensure continuity.
- Define and uphold SLAs and SLOs for data freshness, completeness, and availability.
- Participate in on-call rotation as needed for critical data pipeline issues.
Required Skills
- 6+ years experience in software development industry from computer science background
- Extensive Python Expertise: Extensive experience in developing robust, production-grade applications with Python.
- Data Collection & Integration: Proven experience collecting data from various sources (REST APIs, OAuth, GraphQL, Kafka, S3, SFTP, etc.).
- Distributed Systems & Scalability: Strong understanding of distributed systems concepts, designing for scale, performance optimization, and fault tolerance.
- Cloud Platforms: Experience with major cloud providers (AWS or GCP) and their data-related services (e.g., S3, EC2, Lambda, SQS, Kafka, Cloud Storage, GKE).
- Database Fundamentals: Solid understanding of relational databases (SQL, schema design, indexing, query optimization). OLAP database experience is a plus (Hadoop)
- Monitoring & Alerting: Experience with monitoring tools (e.g., Prometheus, Grafana) and setting up effective alerts.
- Version Control: Proficiency with Git.
- Containerization (Plus): Experience with Docker and Kubernetes.
- Streaming Technologies (Plus): Experience with real-time data processing using Kafka, Flink, Spark Streaming
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Job stats:
0
0
0
Categories:
Engineering Jobs
Leadership Jobs
Tags: APIs AWS Computer Science Data pipelines Data quality Distributed Systems Docker EC2 Flink GCP Git Grafana GraphQL Hadoop Kafka Kubernetes Lambda OLAP Pipelines Python RDBMS Spark SQL Streaming
Region:
Asia/Pacific
Country:
India
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.
Power BI Developer jobsBI Developer jobsPrincipal Data Engineer jobsSr. Data Engineer jobsStaff Data Scientist jobsPrincipal Software Engineer jobsStaff Machine Learning Engineer jobsData Science Intern jobsData Science Manager jobsDevOps Engineer jobsJunior Data Analyst jobsData Manager jobsSoftware Engineer II jobsData Analyst Intern jobsLead Data Analyst jobsAccount Executive jobsBusiness Data Analyst jobsStaff Software Engineer jobsSr. Data Scientist jobsData Specialist jobsAI/ML Engineer jobsSenior Backend Engineer jobsData Governance Analyst jobsBusiness Intelligence Analyst jobsData Engineer III jobs
Consulting jobsMLOps jobsAirflow jobsEconomics jobsOpen Source jobsLinux jobsKPIs jobsTerraform jobsJavaScript jobsKafka jobsGitHub jobsData Warehousing jobsPostgreSQL jobsRDBMS jobsComputer Vision jobsNoSQL jobsGoogle Cloud jobsPrompt engineering jobsClassification jobsScikit-learn jobsStreaming jobsBanking jobsPhysics jobsRAG jobsHadoop jobs
Oracle jobsData warehouse jobsBigQuery jobsPandas jobsR&D jobsdbt jobsLooker jobsGPT jobsReact jobsScala jobsDistributed Systems jobsPySpark jobsScrum jobsCX jobsIndustrial jobsELT jobsMicroservices jobsLangChain jobsJira jobsRedshift jobsSAS jobsOpenAI jobsJenkins jobsTypeScript jobsModel training jobs