Senior AI Infrastructure Engineer (Databricks, AWS, Python)
Brazil
⚠️ We'll shut down after Aug 1st - try foo🦍 for all jobs in tech ⚠️
Solvd
An AI-first advisory and digital engineering firm that guides brands through digital transformation and delivers measurable impact | Solvd
Solvd is an AI-first advisory and digital engineering firm delivering measurable business impact through strategic digital transformation. Taking an AI-first approach, we bridge the critical gap between experimentation and real ROI, weaving artificial intelligence into everything we do and helping clients at all stages accelerate AI integration into each process layer. Our mission is to empower passionate people to thrive in the era of AI while maintaining rigorous ethical AI standards. We’re supported by a global team with offices in the USA, Poland, Ukraine, Georgia and LATAM.
We’re looking for a Senior AI Infrastructure Engineer to help design, build, and scale our AI and data infrastructure. In this role, you’ll focus on architecting and maintaining cloud-based MLOps pipelines to enable scalable, reliable, and production-grade AI/ML workflows, working closely with AI engineers, data engineers, and platform teams. Your expertise in building and operating modern cloud-native infrastructure will help enable world-class AI capabilities across the organization.
If you are passionate about building robust AI infrastructure, enabling rapid experimentation, and supporting production-scale AI workloads, we’d love to talk to you.
We’re looking for a Senior AI Infrastructure Engineer to help design, build, and scale our AI and data infrastructure. In this role, you’ll focus on architecting and maintaining cloud-based MLOps pipelines to enable scalable, reliable, and production-grade AI/ML workflows, working closely with AI engineers, data engineers, and platform teams. Your expertise in building and operating modern cloud-native infrastructure will help enable world-class AI capabilities across the organization.
If you are passionate about building robust AI infrastructure, enabling rapid experimentation, and supporting production-scale AI workloads, we’d love to talk to you.
Responsibilities
- Design, implement, and maintain cloud-native infrastructure to support AI and data workloads, with a focus on AI and data platforms such as Databricks and AWS Bedrock.
- Build and manage scalable data pipelines to ingest, transform, and serve data for ML and analytics.
- Develop infrastructure-as-code using tools like Cloudformation, AWS CDK to ensure repeatable and secure deployments.
- Collaborate with AI engineers, data engineers, and platform teams to improve the performance, reliability, and cost-efficiency of AI models in production.
- Drive best practices for observability, including monitoring, alerting, and logging for AI platforms.
- Contribute to the design and evolution of our AI platform to support new ML frameworks, workflows, and data types.
- Stay current with new tools and technologies to recommend improvements to architecture and operations.
- Integrate AI models and large language models (LLMs) into production systems to enable use cases using architectures like retrieval-augmented generation (RAG).
Mandatory Requirements
- 7+ years of professional experience in software engineering and infrastructure engineering.
- Extensive experience building and maintaining AI/ML infrastructure in production, including model, deployment, and lifecycle management.
- Strong knowledge of AWS and infrastructure-as-code frameworks, ideally with CDK.
- Expert-level coding skills in TypeScript and Python building robust APIs and backend services.
- Production-level experience with Databricks MLFlow, including model registration, versioning, asset bundles, and model serving workflows.
- Expert level understanding of containerization (Docker), and hands on experience with CI/CD pipelines, orchestration tools (e.g., ECS) is a plus.
- Proven ability to design reliable, secure, and scalable infrastructure for both real-time and batch ML workloads.
- Ability to articulate ideas clearly, present findings persuasively, and build rapport with clients and team members.
- Strong collaboration skills and the ability to partner effectively with cross-functional teams.
- Familiarity with emerging LLM frameworks such as DSPy for advanced prompt orchestration and programmatic LLM pipelines.
- Understanding of LLM cost monitoring, latency optimization, and usage analytics in production environments.
- Knowledge of vector databases / embeddings stores (e.g., OpenSearch) to support semantic search and RAG.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Job stats:
0
0
0
Categories:
Deep Learning Jobs
Engineering Jobs
Tags: APIs Architecture AWS CI/CD CloudFormation Databricks Data pipelines Docker ECS Engineering LLMs Machine Learning MLFlow ML infrastructure MLOps OpenSearch Pipelines Python RAG TypeScript
Region:
South America
Country:
Brazil
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.
Data Scientist II jobsSr. Data Engineer jobsBusiness Intelligence Developer jobsPrincipal Data Engineer jobsBI Developer jobsStaff Data Scientist jobsPrincipal Software Engineer jobsStaff Machine Learning Engineer jobsDevOps Engineer jobsData Science Intern jobsJunior Data Analyst jobsSoftware Engineer II jobsData Manager jobsStaff Software Engineer jobsAI/ML Engineer jobsData Science Manager jobsLead Data Analyst jobsData Analyst Intern jobsBusiness Data Analyst jobsSr. Data Scientist jobsData Specialist jobsBusiness Intelligence Analyst jobsData Governance Analyst jobsData Engineer III jobsSenior Backend Engineer jobs
Consulting jobsMLOps jobsAirflow jobsOpen Source jobsEconomics jobsKafka jobsLinux jobsGitHub jobsKPIs jobsTerraform jobsJavaScript jobsPrompt engineering jobsPostgreSQL jobsRAG jobsBanking jobsStreaming jobsScikit-learn jobsClassification jobsNoSQL jobsData Warehousing jobsRDBMS jobsPhysics jobsComputer Vision jobsdbt jobsPandas jobs
Google Cloud jobsHadoop jobsScala jobsLangChain jobsGPT jobsR&D jobsBigQuery jobsData warehouse jobsMicroservices jobsCX jobsELT jobsDistributed Systems jobsReact jobsScrum jobsOracle jobsLooker jobsIndustrial jobsPySpark jobsOpenAI jobsJira jobsRedshift jobsRobotics jobsSAS jobsTypeScript jobsUnstructured data jobs