AI Platform & Inference Suite Engineer (Staff/Senior Staff level) - Riyadh, KSA
Tasks
- Advise customers on model selection and deployment feasibility
- Apply model quantization and mixed precision optimization
- Build, test, and deploy scalable inference pipelines
- Convert deep learning models
- Design end-to-end AI pipelines with preprocessing and postprocessing
- Design multi-model workflows for detection, tracking, and recognition
- Develop and maintain ML application pipelines with customer frameworks
- Integrate models with runtime and optimization
- Integrate video pipelines into inference pipelines
- Optimize LLM and GenAI workloads for multi-SoC and multi-card architectures
- Optimize inference for throughput, latency, and accuracy
- Perform hardware sizing and architecture alignment
- Plan capacity and infrastructure optimization
- Port deep learning models to accelerator-based data center platforms
- Produce technical documentation and runbooks
- Profile and debug performance bottlenecks in compute, memory, and runtime
- Select video processing pipeline components
- Serve models with NVIDIA Triton Inference Server
- Serve models with TensorFlow Serving
- Serve models with vLLM
- Validate model capability in deployment environments
Perks/Benefits
- Customer-facing technical advisory role
- Technical documentation and training opportunities
- Travel opportunities
Skills/Tech-stack
C# | C++ | Capacity Planning | Concurrency | Container Orchestration | Data Preprocessing | Data Postprocessing | Docker | FFmpeg | GStreamer | Git | INT8 | Inference Optimization | Inference Server | Kubernetes | Latency Optimization | Linux | Mixed Precision | Model Conversion | Model Quantization | NVIDIA Triton Inference Server | ONNX | Performance Profiling | PyTorch | Python | Runtime Integration | TensorFlow | TensorFlow Serving | Throughput Optimization | vLLM
Roles
AI | AI Engineer | Engineer | Learning Engineer | Machine Learning Engineer