Lead AI Platform Engineer
Tasks
- Build internal services and CLI tools for AI developers
- Create workflows and guidance for no-code and low-code agent platforms
- Define tooling and policies for safe local agent usage
- Deploy services to Kubernetes
- Design CI/CD and training pipelines
- Design scalable cloud infrastructure using infrastructure as code
- Develop reusable model serving patterns
- Ensure training data quality using a feature store
- Implement LLM and agent tracing
- Implement model drift monitoring
- Manage GPU and TPU resource allocation
- Manage vector databases and embedding pipelines
- Mitigate cold-start scaling bottlenecks
- Optimize GPU utilization and cloud spend
- Optimize inference for lower latency and higher throughput
- Support AI agent deployment with service templates and tooling
Perks/Benefits
Skills/Tech-stack
AI Pipelines | AWS | CI/CD | Cloud infrastructure | Cost Optimization | Data Pipelines | Drift monitoring | Embeddings | Feature Store | GCP | GPU Utilization | GitHub Actions | Inference Optimization | Infrastructure as Code | Knowledge Distillation | Kubernetes | Language Models | Large Language Models | Latency optimization | MLflow | Machine Learning | Model Drift | Model Serving | Model drift monitoring | Observability | Quantization | RAG | Resource Utilization Monitoring | Resource utilization | Retrieval-Augmented Generation | Serverless computing | Terraform | Throughput Optimization | Vector Database | Vector embeddings | Vector indexing | Vertex AI | Vertex AI pipelines
Education
N/A