AI Platform Engineering Specialist
Montreal 700, Canada
ā ļø We'll shut down after Aug 1st - try fooš¦ for all jobs in tech ā ļø
Morgan Stanley
Discover how we help individuals, families, institutions and governments raise, manage and distribute the capital they need to achieve their goals.- Develop tooling and self-service capabilities for deploying AI solutions for the firm leveraging Kubernetes/OpenShift, Python, authentication solutions, APIs, REST framework, etc.
- Develop Terraform modules and Cloud architecture to enable secure AI cloud service deployment and consumption at scale.
- Have a platform mindset and build common, reusable solutions to scale Generative AI use cases using pre-trained models as well as fine-tuned models.Ā
- Leverage Kubernetes/OpenShift to develop modern containerized workloads.
- Integrate with capabilities such as large-scale vector stores for embeddings.Ā
- Author best practices on the Generative AI ecosystem, when to use which tools, available models such as GPT, Llama, Hugging Face etc. and libraries such as Langchain.Ā
- Analyze, investigate, and implement GenAI solutions focusing on Agentic Orchestration and Agent Builder frameworks.
- Author and publish architecture decision records to capture major design decisions and product selection for building Generative AI solutions.Ā Inclusive of app authentication, service communication, state externalization, container layering strategy and immutability.
- Ensure AI platform are reliable, scalable, and operational; (e.g. blueprints for upgrade/release strategies (E.g. Blue/Green); logging/monitoring/metrics; automation of system management tasks)
- Participate in all teamās Agile/ Scrum ceremonies.
- Participate in teamās oncall rotation in build/run team model
- Bachelorās or Masterās degree in Computer Science or related field, or equivalent job experienceĀ
- 5 years of experience in software engineering, design and development
- Strong hands-on Application Development background in at least one prominent programming language, preferably PythonĀ Flask or FAST Api.
- Broad understanding of data engineering (SQL, NoSQL, Big Data, Kafka, Redis), data governance, data privacy and security.
- Experience in development, management, and deployment of Kubernetes workloads, preferably on OpenShift.
- Experience with designing, developing, and managing RESTful services for large-scale enterprise solutions.
- Experience deploying applications on Azure, AWS, and/or GCP using IaC (Terraform)
- Hands-on experience with multiprocessing, multithreading, asynchronous I/O, performance profiling in at least one prominent programming language, preferably python.
- Ability to articulate technical concepts effectively to diverse audiences.
- Excellent communication skills.
- Demonstrated ability to work effectively and collaboratively in a global organization, across time zones, and across organizations
- Demonstrated experience in DevOps, understanding of CI/CD (Jenkins) and GitOps.
- Knowledge of DevOps and Agile practices.
Nice to have
- Practitioner of unit testing, performance testing and BDD/acceptance testing.
- Understanding of OAuth 2.0 protocol for secure authorization.
- Proficiency with Open Telemetry tools including Grafana, Loki, Prometheus, and Cortex.
- Good knowledge of Microservice based architecture, industry standards, for both public and private cloud.
- Good understanding of modern Application configuration techniques.
- Hands on experience with Cloud Application Deployment patterns like Blue/Green.
- Good understanding of State sharing between scalable cloud components (Kafka, dynamic distributed caching).
- Good knowledge of various DB engines (SQL, Redis, Kafka, etc) for cloud app storage.
- Experience building AI applications, preferably Generative AI and LLM based apps.Ā
- Deep understanding of AI agents, Agentic Orchestration, Multi-Agent Workflow Automation, along with hands-on experience in Agent Builder frameworks such Lang Chain and Lang Graph.
- Experience working with Generative AI development, embeddings, fine tuning of Generative AI models.Ā
- Understanding of ModelOps/ ML Ops/ LLM Op.
- Understanding of SRE techniques.
WHAT YOU CAN EXPECT FROM MORGAN STANLEY:
We are committed to maintaining the first-class service and high standard of excellence that have defined Morgan Stanley for over 89 years. Our values - putting clients first, doing the right thing, leading with exceptional ideas, committing to diversity and inclusion, and giving back - arenāt just beliefs, they guide the decisions we make every day to do what's best for our clients, communities and more than 80,000 employees in 1,200 offices across 42 countries. At Morgan Stanley, youāll find an opportunity to work alongside the best and the brightest, in an environment where you are supported and empowered. Our teams are relentless collaborators and creative thinkers, fueled by their diverse backgrounds and experiences. We are proud to support our employees and their families at every point along their work-life journey, offering some of the most attractive and comprehensive employee benefits and perks in the industry. Thereās also ample opportunity to move about the business for those who show passion and grit in their work.
To learn more about our offices across the globe, please copy and paste https://www.morganstanley.com/about-us/global-officesā into your browser.
Morgan Stanley is an equal opportunities employer. We work to provide a supportive and inclusive environment where all individuals can maximize their full potential.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index š°
Tags: Agile APIs Architecture AWS Azure Big Data CI/CD Computer Science Data governance DevOps Engineering Flask GCP Generative AI GPT Grafana Jenkins Kafka Kubernetes LangChain LLaMA LLMs Machine Learning NoSQL Privacy Python Scrum Security SQL Terraform Testing
Perks/benefits: Career development
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.