Principal Data Engineer
BENGALURU, KARNATAKA, India
Oracle
Oracle offers a comprehensive and fully integrated stack of cloud applications and cloud platform services.In the role of Data Engineer, you will be responsible for defining, designing and delivery of cloud-native, high-throughput, scalable and distributed data centric products and services. The role focuses on creating value chain to help address the challenges of acquiring the large complex data, evaluating its value, distilling, and analyzing. This individual will be responsible to develop robust, scalable, and maintainable data systems with a combination of in-house tools and standard technologies.
Career Level - IC4
Oracle Customer Success Services
Oracle Customer Success Services assists customers with their requirements for some of the most cutting-edge applications and solutions by utilizing the strengths of more than two decades of expertise in developing mission-critical solutions for enterprise customers and combining it with cutting-edge technology to provide our customers' speed, flexibility, resiliency, and security to enable customers to optimize their investment, minimize risk, and achieve more. The business was established with an entrepreneurial mindset and supports a vibrant, imaginative, and highly varied workplace. We are free of obligations, so we'll need your help to turn it into a premier engineering hub that prioritizes quality.
Role Description
In the role of Data Engineer, you will be responsible for defining, designing and delivery of cloud-native, high-throughput, scalable and distributed data centric products and services. The role focuses on creating value chain to help address the challenges of acquiring the large complex data, evaluating its value, distilling, and analyzing. This individual will be responsible to develop robust, scalable, and maintainable data systems with a combination of in-house tools and standard technologies.
Responsibilities
- Lead the development and implementation of data centric products, apply data engineering concepts, data architecture design and systems performance tuning.
- Design and implement high throughput, distributed data pipelines, real-time data analytics processing, interactive dashboards and ML/AI based services.
- Partner with product, and engineering stakeholders to identify software requirements, translate to technical design and implementation plan.
- Develop and maintain technical documentation, including architecture diagrams, design specifications, and system diagrams.
- Work with development teams to ensure software projects are delivered on time, within budget, and to the required quality standards.
- Provide guidance and mentorship to junior developers.
- Stay up to date with industry trends and developments in software architecture and development practices.
Qualifications
- 8+ Years of expertise in designing and implementing data architectures, including data lakes, data warehouses, distributed and real-time data processing systems.
- Experience with modern data stack: data ingress/egress, ETL/ELT, DataOps, Apache Spark, Kafka, Flink, NiFi
- Experience with SQL, NoSQL databases and data warehouse solutions: Oracle ADW, MySQL, MongoDB, Cassandra, etc.
- Knowledge of data-at-scale processing tools: Oracle Datalake, Databricks, Cloudera, etc.
- Demonstrated ability in building and deploying software applications on one or more public cloud providers such as OCI, AWS, Azure, GCP, or equivalent.
- Hands-on programming skills using Python, Java, SQL, PL/SQL
- Experience with data modeling concepts and tools: Data Modeler, dbt, Apache Avro, Parquet.
- Experience in DevOps practices involving containers in Kubernetes, CI/CD and Canary Deployments.
- Experience with Microservice architecture patterns such as but not limited to API Gateways, Event Driven & Reactive Architecture, etc.
- Proficiency in using data analytics tools, experience creating data visualizations, dashboards, and reports: Oracle Business Intelligence, (BI), Tableau, Power BI, Looker, Grafana, D3.js, Plotly.
- Conceptual knowledge and practical experience with statistical analysis and machine learning frameworks: Scikit-learn, TensorFlow, PyTorch.
- Proficiency in using Jupyter Notebooks or similar environments for model development.
- Experience with MLOps practices for managing the lifecycle of ML models including versioning, deployment, monitoring, and governance: Oracle AI Service, Oracle Data Science Services, MLflow, Kubeflow, Apache Airflow, SageMaker or equivalent.
- Conceptual knowledge and experience with generative AI models, techniques, and tools: Large Language Models (LLM), Vector DBs, LangChain, LlamaIndex, Hugging Face Transformers.
- Excellent communication skills to convey technical concepts to non-technical stakeholders.
- Strong collaboration skills to work with data scientists, analysts, and business stakeholders.
- Proven leadership skills in managing and mentoring a team of data engineers.
- Ability to influence and drive cross-functional teams towards a unified data strategy.
- Strong analytical and problem-solving skills to address complex data challenges.
- Ability to translate business requirements into data solutions.
As a world leader in cloud solutions, Oracle uses tomorrow’s technology to tackle today’s problems. True innovation starts with diverse perspectives and various abilities and backgrounds.
When everyone’s voice is heard, we’re inspired to go beyond what’s been done before. It’s why we’re committed to expanding our inclusive workforce that promotes diverse insights and perspectives.
We’ve partnered with industry-leaders in almost every sector—and continue to thrive after 40+ years of change by operating with integrity.
Oracle careers open the door to global opportunities where work-life balance flourishes. We offer a highly competitive suite of employee benefits designed on the principles of parity and consistency. We put our people first with flexible medical, life insurance and retirement options. We also encourage employees to give back to their communities through our volunteer programs.
We’re committed to including people with disabilities at all stages of the employment process. If you require accessibility assistance or accommodation for a disability at any point, let us know by calling +1 888 404 2494, option one.
Disclaimer:
Oracle is an Equal Employment Opportunity Employer*. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans’ status, or any other characteristic protected by law. Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.
* Which includes being a United States Affirmative Action Employer
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Airflow APIs Architecture Avro AWS Azure Business Intelligence Cassandra CI/CD D3 Data Analytics Databricks DataOps Data pipelines Data strategy Data warehouse dbt DevOps ELT Engineering ETL Flink GCP Generative AI Grafana Java Jupyter Kafka Kubeflow Kubernetes LangChain LLMs Looker Machine Learning MLFlow ML models MLOps MongoDB MySQL NiFi NoSQL Oracle Parquet Pipelines Plotly Power BI Python PyTorch SageMaker Scikit-learn Security Spark SQL Statistics Tableau TensorFlow Transformers
Perks/benefits: Career development Insurance
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.