Gen Ai Data Engineer - Manager
Bangalore (SDC) - Bagmane Tech Park, India
PwC
We are a community of solvers combining human ingenuity, experience and technology innovation to help organisations build trust and deliver sustained outcomes.Line of Service
AdvisoryIndustry/Sector
Not ApplicableSpecialism
Advisory - OtherManagement Level
ManagerJob Description & Summary
At PwC, our people in data and analytics engineering focus on leveraging advanced technologies and techniques to design and develop robust data solutions for clients. They play a crucial role in transforming raw data into actionable insights, enabling informed decision-making and driving business growth.Those in intelligent automation at PwC will focus on conducting process mining, designing next generation small- and large-scale automation solutions, and implementing intelligent process automation, robotic process automation and digital workflow solutions to help clients achieve operational efficiencies and reduce costs.
Enhancing your leadership style, you motivate, develop and inspire others to deliver quality. You are responsible for coaching, leveraging team member’s unique strengths, and managing performance to deliver on client expectations. With your growing knowledge of how business works, you play an important role in identifying opportunities that contribute to the success of our Firm. You are expected to lead with integrity and authenticity, articulating our purpose and values in a meaningful way. You embrace technology and innovation to enhance your delivery and encourage others to do the same.
Examples of the skills, knowledge, and experiences you need to lead and deliver value at this level include but are not limited to:
- Analyse and identify the linkages and interactions between the component parts of an entire system.
- Take ownership of projects, ensuring their successful planning, budgeting, execution, and completion.
- Partner with team leadership to ensure collective ownership of quality, timelines, and deliverables.
- Develop skills outside your comfort zone, and encourage others to do the same.
- Effectively mentor others.
- Use the review of work as an opportunity to deepen the expertise of team members.
- Address conflicts or issues, engaging in difficult conversations with clients, team members and other stakeholders, escalating where appropriate.
- Uphold and reinforce professional and technical standards (e.g. refer to specific PwC tax and audit guidance), the Firm's code of conduct, and independence requirements.
Job Description: GenAI Data Engineer - Senior Associate (PwC US AC)
PwC US - Acceleration Center is seeking a highly skilled and experienced GenAI Data Engineer to join our team at Senior Associate level. As a GenAI Data Engineer, you will be responsible for developing and maintaining data pipelines, implementing machine learning models, and optimizing data infrastructure for our GenAI projects. The ideal candidate should have a strong background in data engineering, with a focus on GenAI technologies, and possess a solid understanding of data processing, event-driven architectures, containerization, and cloud computing.
Responsibilities:
- Design, develop, and maintain data pipelines and ETL processes for GenAI projects.
- Collaborate with data scientists and software engineers to implement machine learning models and algorithms.
- Optimize data infrastructure and storage solutions to ensure efficient and scalable data processing.
- Implement event-driven architectures to enable real-time data processing and analysis.
- Utilize containerization technologies like Kubernetes and Docker for efficient deployment and scalability.
- Develop and maintain data lakes for storing and managing large volumes of structured and unstructured data.
- Implement and integrate LLM frameworks (Langchain, Semantic Kernel) for advanced language processing and analysis.
- Collaborate with cross-functional teams to design and implement solution architectures for GenAI projects.
- Utilize cloud computing platforms such as Azure or AWS for data processing, storage, and deployment.
- Monitor and troubleshoot data pipelines and systems to ensure smooth and uninterrupted data flow.
- Stay up-to-date with the latest advancements in GenAI technologies and recommend innovative solutions to enhance data engineering processes.
- Collaborate with cross-functional teams to understand business requirements and translate them into technical solutions.
- Document data engineering processes, methodologies, and best practices.
- Maintain solution architecture certificates and stay current with industry best practices.
Requirements:
Python Proficiency: Minimum 3 years of hands-on experience building applications with Python.
Scalable System Design: Solid understanding of designing and architecting scalable Python applications, particularly for Gen AI use cases, with a strong understanding of various components and systems architecture patterns to make cohesive and decoupled, scalable applications.
Web Frameworks: Familiarity with Python web frameworks (Flask, FastAPI) for building web applications around AI models.
Modular Design & Security: Demonstrated ability to design applications with modularity, reusability, and security best practices in mind (session management, vulnerability prevention, etc.,).
Cloud-Native Development: Familiarity with cloud-native development patterns and tools (e.g., REST APIs, microservices, serverless functions).
Cloud Deployments: Experience deploying and managing containerized applications on Azure/AWS (Azure Kubernetes Service, Azure Container Instances, or similar).
Version Control (Git): Strong proficiency in Git for effective code collaboration and management.
CI/CD: Knowledge of continuous integration and deployment (CI/CD) practices on cloud platforms.
3-5 years of relevant technical/technology experience, with a focus on GenAI projects.
Strong programming skills in Python.
Experience with data processing frameworks like Apache Spark or similar.
Proficiency in SQL and database management systems.
Preferred Skills:
Gen AI Frameworks: Experience with LLM frameworks or tools for interacting with LLMs such as LangChain, Semantic Kernel, LlamaIndex
Data Pipelines: Experience in setting up data pipelines for model training and real-time inference.
If you are passionate about GenAI technologies and have a proven track record in data engineering, join PwC US-Acceleration Center and be part of a dynamic team that is shaping the future of GenAI solutions. We offer a collaborative and innovative work environment where you can make a significant impact.
Education (if blank, degree and/or field of study not specified)
Degrees/Field of Study required:Degrees/Field of Study preferred:Certifications (if blank, certifications not specified)
Required Skills
Optional Skills
Accepting Feedback, Accepting Feedback, Active Listening, Agile Methodology, Analytical Thinking, Automation Algorithms, Automation Engineering, Automation Framework Design and Development, Automation Programming, Automation Solutions, Automation Studio, Automation System Efficiency, Blue Prism, Business Analysis, Business Performance Management, Business Process Analysis, Business Process Automation (BPA), Business Transformation, Business Value Optimization, C++ Programming Language, Coaching and Feedback, Cognitive Automation, Communication, Conducting Discovery, Configuration Management (CM) {+ 41 more}Desired Languages (If blank, desired languages not specified)
Travel Requirements
Available for Work Visa Sponsorship?
Government Clearance Required?
Job Posting End Date
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Agile APIs Architecture AWS Azure CI/CD Data pipelines Docker Engineering ETL FastAPI Flask Generative AI Git Kubernetes LangChain LLMs Machine Learning Microservices ML models Model training Pipelines Python Robotics RPA Security Spark SQL Unstructured data
Perks/benefits: Career development
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.