SDE - Infrastructure Engineer
Tamil Nadu, Chennai, India
Bungee Tech
Simplify your pricing and category operations with a retail price optimization platform fueled by competitive intelligence. Book a demo.Company Description:
At Bungee Tech, we help retailers and brands meet customers everywhere and, on every occasion, they are in. We believe that accurate, high-quality data matched with compelling market insights empowers retailers and brands to keep their customers in the centre of all innovation and value they are delivering.
We provide a clear and complete omnichannel picture of their competitive landscape to retailers and brands. We collect billions of data points every day and multiple times in a day from publicly available sources. Using high-quality extraction, we uncover detailed information on products or services, which we automatically match, and then proactively track for price, promotion, and availability. Plus, anything we do not match helps to identify a new assortment opportunity.
Empowered with this unrivalled intelligence, we unlock compelling analytics and insights that once blended with verified partner data from trusted sources such as Nielsen, paints a complete, consolidated picture of the competitive landscape.
Why This Role Matters:
Data is the foundation of our business, and your work will ensure that we continue to deliver high-quality competitive intelligence at scale. Web platforms are constantly evolving, deploying sophisticated anti-bot measures—your job is to stay ahead of them. If you thrive on solving complex technical challenges and enjoy working with real-world data at an immense scale, this role is for you.
We seek a Software Development Engineer with expertise in cloud infrastructure, Big Data and web crawling technologies. This role bridges system reliability engineering with scalable data extraction solutions, ensuring our infrastructure remains robust and capable of handling high-volume data collection. You will design resilient systems, optimize automation pipelines, and tackle challenges posed by advanced bot-detection mechanisms.
- Architect, deploy, and manage scalable cloud environments (AWS/GCP/DO) to support distributed data processing solutions to handle terabyte-scale datasets and billions of records efficiently
- Automate infrastructure provisioning, monitoring, and disaster recovery using tools like Terraform, Kubernetes, and Prometheus.
- Optimize CI/CD pipelines to ensure seamless deployment of web scraping workflows and infrastructure updates.
- Develop and maintain stealthy web scrapers using Puppeteer, Playwright, and headless chromium browsers.
- Reverse-engineer bot-detection mechanisms (e.g., TLS fingerprinting, CAPTCHA solving) and implement evasion strategies.
- Monitor system health, troubleshoot bottlenecks, and ensure 99.99% uptime for data collection and processing pipelines.
- Implement security best practices for cloud infrastructure, including intrusion detection, data encryption, and compliance audits.
- Partner with data collection, ML and SaaS teams to align infrastructure scalability with evolving data needs
- Research emerging technologies to stay ahead of anti-bot trends including technologies like Kasada, PerimeterX, Akamai, Cloudflare, and more.
- 4–6 years of experience in site reliability engineering and cloud infrastructure management .
- Proficiency in Python, JavaScript for scripting and automation .
- Hands-on experience with Puppeteer/Playwright, headless browsers, and anti-bot evasion techniques .
- Knowledge of networking protocols, TLS fingerprinting, and CAPTCHA-solving frameworks .
- Experience with monitoring and observability tools such as Grafana, Prometheus, Elasticsearch, and familiarity with monitoring and optimizing resource utilization in distributed systems.
- Experience with data lake architectures and optimizing storage using formats such as Parquet, Avro, or ORC.
- Strong proficiency in cloud platforms (AWS, GCP, or Azure) and containerization/orchestration (Docker, Kubernetes).
- Deep understanding of infrastructure-as-code tools (Terraform, Ansible) .
- Deep experience in designing resilient data systems with a focus on fault tolerance, data replication, and disaster recovery strategies in distributed environments.
- Experience implementing observability frameworks, distributed tracing, and real-time monitoring tools.
- Excellent problem-solving abilities, with a collaborative mindset and strong communication skills.
At Bungee Tech, you’ll be at the forefront of innovation in the data engineering space, working with cutting-edge technologies and a talented team. If you're passionate about building scalable systems, handling large-scale distributed data, and solving complex data challenges, we’d love to have you on board.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Ansible Architecture Avro AWS Azure Big Data CI/CD Distributed Systems Docker Elasticsearch Engineering GCP Grafana JavaScript Kubernetes Machine Learning Parquet Pipelines Playwright Python Research Security Terraform
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.