Senior Associate
Pune, India
TIAA
At TIAA, we believe everyone deserves the chance for a secure retirement. Explore our annuity, financial planning advice and investing solutions.The role leverage big data tools and programming frameworks to ensure that the raw data gathered from data pipelines are redefined as data science models that are ready to scale as needed.
Key Responsibilities and Duties
- Machine learning engineers feed data into models defined by data scientists.
- They’re also responsible for taking theoretical data science models and helping scale them out to production-level models that can handle terabytes of real-time data.
- University (Degree) Preferred
- 5+ Years Required; 7+ Years Preferred
- Physical Requirements: Sedentary Work
Career Level
8IC
Position Summary: Describe below the primary purpose and function of this job
We're seeking a Senior Platform Reliability Engineer who reports to AI COE lead to maintain our enterprise-scale Generative AI platform on AWS. This role serves as a critical bridge between AI development teams and platform operations, ensuring reliable deployment and scaling of LLM services, vector databases, and associated infrastructure. The position focuses on maintaining high availability of AI services while optimizing cost and performance
Key Duties & Responsibilities: List up to 5 key duties and responsibilities, management responsibilities and time spent (if applicable)
AI Platform Infrastructure (35%)
Primary Focus Areas:
- Maintain scalable Kubernetes clusters for LLM deployments
- Manage vector store infrastructure (Pinecone/Weaviate/Faiss)
- Optimize DynamoDB performance for high-throughput AI operations
- Configure and maintain S3 data lakes for model artifacts
- Implement efficient model serving architectures
Development Support & Integration (25%)
- Collaborate with ML engineers on model deployment pipelines
- Maintain APIs for model inference services
- Implement A/B testing infrastructure for model variants
- Create developer tooling for model deployment and monitoring
- Support integration of new LLM models into production2
Observability & Performance (20%)
Monitoring & Metrics
Implement custom metrics for LLM performance
Monitor vector store query latencies
Track model inference costs
Set up distributed tracing for API calls
Optimization
Tune Kubernetes resources for model serving
Optimize vector store query performance
Implement caching strategies for frequent queries
Manage auto-scaling policies3
Security & Compliance (20%)
- Implement IAM roles and security policies
- Manage API authentication and rate limiting
- Ensure data privacy compliance for AI operations
- Monitor and prevent token/cost abuse
- Implement model access controls
Management/Leadership Responsibility: Is management of people a primary focus of the role? If so, how many direct and indirect employees are managed? Do any of them manage a function or process?
NA
Budget Responsibility: Does the position have responsibility for Revenue, Operating (expense) Budget, etc.? If so, what is the scope?
N/A
Impact:
NA
NA
Business or Industry Expertise: Describe the degree of knowledge and understanding required of TIAA’s business and industry, commercial environment and of competitors products and services.
Interactions / Interpersonal Skills: Describe the nature and level of interactions this job has with others, both internally and externally. Explain any specific interpersonal skills necessary to successfully perform this role (i.e., negotiation skills, represents business at external events or to governmental bodies, etc. ).
Job Requirements And Qualifications: Indicate the minimum and preferred education and experience for the job and any licenses and certifications required
Required Education:
Masters
Preferred Education:
Masters
Skills and Abilities:
- Must have 9-13 Yrs of relevant experience.
- Team Player – ability to work in global team environment.
- Collaboration skills with business-driven team, business development and Stakeholder
Technical Requirements
Core Skills
- 5+ years experience with AWS services
- Deep expertise in Kubernetes administration
- Strong Python programming skills
- Experience with Infrastructure as Code (Terraform)
- Understanding of ML/LLM deployment patterns
Required AWS Experience
- Primary Services:
- EKS (Kubernetes)
- S3 & DynamoDB
- VPC & Networking
- IAM & Security
- CloudWatch & Monitoring
- API Gateway
AI Infrastructure Experience
- Vector database deployment
- LLM serving frameworks
- API development and gateway management
Required Licenses/Certifications:
Licenses/Certifications
- AWS Certified DevOps / Solutions Architect
- CKA Certified.
_____________________________________________________________________________________________________
Company Overview
TIAA Global Capabilities was established in 2016 with a mission to tap into a vast pool of talent, reduce risk by insourcing key platforms and processes, as well as contribute to innovation with a focus on enhancing our technology stack. TIAA Global Capabilities is focused on building a scalable and sustainable organization , with a focus on technology , operations and expanding into the shared services business space.
Working closely with our U.S. colleagues and other partners, our goal is to reduce risk, improve the efficiency of our technology and processes and develop innovative ideas to increase throughput and productivity.
We are an Equal Opportunity/Affirmative Action Employer. We consider all qualified applicants for employment regardless of age, race, color, national origin, sex, religion, veteran status, disability, sexual orientation, gender identity, or any other protected status.
Accessibility Support
TIAA offers support for those who need assistance with our online application process to provide an equal employment opportunity to all job seekers, including individuals with disabilities.
If you are a U.S. applicant and desire a reasonable accommodation to complete a job application please use one of the below options to contact our accessibility support team:
Phone: (800) 842-2755
Email: accessibility.support@tiaa.org
Privacy Notices
For Applicants of TIAA, Nuveen and Affiliates residing in US (other than California), click here.
For Applicants of TIAA, Nuveen and Affiliates residing in California, please click here.
For Applicants of TIAA Global Capabilities, click here.
For Applicants of Nuveen residing in Europe and APAC, please click here.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: A/B testing API Development APIs Architecture AWS Big Data Business Intelligence Data pipelines Data visualization DevOps DynamoDB Engineering FAISS Generative AI Kubernetes LLMs Machine Learning ML infrastructure Model deployment Model inference Pinecone Pipelines Predictive modeling Privacy Python Security Statistics Terraform Testing Weaviate
Perks/benefits: Career development Team events
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.