Gen Ai - Data Engineer - Senior Associate
Bangalore (SDC) - Bagmane Tech Park, India
PwC
We are a community of solvers combining human ingenuity, experience and technology innovation to help organisations build trust and deliver sustained outcomes.Line of Service
AdvisoryIndustry/Sector
Not ApplicableSpecialism
Advisory - OtherManagement Level
Senior AssociateJob Description & Summary
At PwC, our people in data and analytics engineering focus on leveraging advanced technologies and techniques to design and develop robust data solutions for clients. They play a crucial role in transforming raw data into actionable insights, enabling informed decision-making and driving business growth.Those in intelligent automation at PwC will focus on conducting process mining, designing next generation small- and large-scale automation solutions, and implementing intelligent process automation, robotic process automation and digital workflow solutions to help clients achieve operational efficiencies and reduce costs.
Focused on relationships, you are building meaningful client connections, and learning how to manage and inspire others. Navigating increasingly complex situations, you are growing your personal brand, deepening technical expertise and awareness of your strengths. You are expected to anticipate the needs of your teams and clients, and to deliver quality. Embracing increased ambiguity, you are comfortable when the path forward isn’t clear, you ask questions, and you use these moments as opportunities to grow.
Examples of the skills, knowledge, and experiences you need to lead and deliver value at this level include but are not limited to:
- Respond effectively to the diverse perspectives, needs, and feelings of others.
- Use a broad range of tools, methodologies and techniques to generate new ideas and solve problems.
- Use critical thinking to break down complex concepts.
- Understand the broader objectives of your project or role and how your work fits into the overall strategy.
- Develop a deeper understanding of the business context and how it is changing.
- Use reflection to develop self awareness, enhance strengths and address development areas.
- Interpret data to inform insights and recommendations.
- Uphold and reinforce professional and technical standards (e.g. refer to specific PwC tax and audit guidance), the Firm's code of conduct, and independence requirements.
Design, develop, and maintain data pipelines and ETL processes for GenAI projects.
- Collaborate with data scientists and software engineers to implement machine learning models and algorithms.
- Optimize data infrastructure and storage solutions to ensure efficient and scalable data processing.
- Implement event-driven architectures to enable real-time data processing and analysis.
- Utilize containerization technologies like Kubernetes and Docker for efficient deployment and scalability.
- Develop and maintain data lakes for storing and managing large volumes of structured and unstructured data.
- Implement and integrate LLM frameworks (Langchain, Semantic Kernel) for advanced language processing and analysis.
- Collaborate with cross-functional teams to design and implement solution architectures for GenAI projects.
- Utilize cloud computing platforms such as Azure or AWS for data processing, storage, and deployment.
- Monitor and troubleshoot data pipelines and systems to ensure smooth and uninterrupted data flow.
- Stay up-to-date with the latest advancements in GenAI technologies and recommend innovative solutions to enhance data engineering processes.
- Collaborate with cross-functional teams to understand business requirements and translate them into technical solutions.
- Document data engineering processes, methodologies, and best practices.
- Maintain solution architecture certificates and stay current with industry best practices.
Requirements:
Python Proficiency: Minimum 3 years of hands-on experience building applications with Python.
Scalable System Design: Solid understanding of designing and architecting scalable Python applications, particularly for Gen AI use cases, with a strong understanding of various components and systems architecture patterns to make cohesive and decoupled, scalable applications.
Web Frameworks: Familiarity with Python web frameworks (Flask, FastAPI) for building web applications around AI models.
Modular Design & Security: Demonstrated ability to design applications with modularity, reusability, and security best practices in mind (session management, vulnerability prevention, etc.,).
Cloud-Native Development: Familiarity with cloud-native development patterns and tools (e.g., REST APIs, microservices, serverless functions).
Cloud Deployments: Experience deploying and managing containerized applications on Azure/AWS (Azure Kubernetes Service, Azure Container Instances, or similar).
Version Control (Git): Strong proficiency in Git for effective code collaboration and management.
CI/CD: Knowledge of continuous integration and deployment (CI/CD) practices on cloud platforms.
3-5 years of relevant technical/technology experience, with a focus on GenAI projects.
Strong programming skills in Python.
Experience with data processing frameworks like Apache Spark or similar.
Proficiency in SQL and database management systems.
Preferred Skills:
Gen AI Frameworks: Experience with LLM frameworks or tools for interacting with LLMs such as LangChain, Semantic Kernel, LlamaIndex
Data Pipelines: Experience in setting up data pipelines for model training and real-time inference.
If you are passionate about GenAI technologies and have a proven track record in data engineering, join PwC US-Acceleration Center and be part of a dynamic team that is shaping the future of GenAI solutions. We offer a collaborative and innovative work environment where you can make a significant impact.
Education (if blank, degree and/or field of study not specified)
Degrees/Field of Study required:Degrees/Field of Study preferred:Certifications (if blank, certifications not specified)
Required Skills
Optional Skills
Accepting Feedback, Accepting Feedback, Active Listening, Agile Methodology, Analytical Thinking, Automation Algorithms, Automation Engineering, Automation Framework Design and Development, Automation Programming, Automation Solutions, Automation Studio, Automation System Efficiency, Blue Prism, Business Analysis, Business Performance Management, Business Process Analysis, Business Process Automation (BPA), Business Transformation, Business Value Optimization, C++ Programming Language, Cognitive Automation, Communication, Conducting Discovery, Configuration Management (CM), Continuous Process Improvement {+ 36 more}Desired Languages (If blank, desired languages not specified)
Travel Requirements
Available for Work Visa Sponsorship?
Government Clearance Required?
Job Posting End Date
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Agile APIs Architecture AWS Azure CI/CD Data pipelines Docker Engineering ETL FastAPI Flask Generative AI Git Kubernetes LangChain LLMs Machine Learning Microservices ML models Model training Pipelines Python Robotics RPA Security Spark SQL Unstructured data
Perks/benefits: Career development
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.