AI Engineer - Model Performance
Tasks
- Benchmark quantization strategies
- Build fine tuning pipelines
- Debug production inference quality regressions
- Evaluate model serving frameworks
- Optimize GPU selection and batching strategies
- Optimize model inference performance
- Perform GPU profiling and performance analysis
- Reduce model serving cost and latency
Perks/Benefits
Skills/Tech-stack
Attention Backend | Audio Processing | Batching | CUDA | CUDA graph | Cost modeling | DPO | Data Preparation | FP8 | GPU Profiling | JSONL | KV cache | LLM Inference | Learning Rate | Learning Rate Scheduling | LoRA | Modal | Model Serving | Multimodal Models | Performance Engineering | Python | QLoRA | Quantization | Ray Serve | SFT | SGLang | Speculative decoding | TensorRT-LLM | Torch compile | Training Data Preparation | Training data | VLLM
Education
N/A
Roles
AI | AI Engineer | Engineer | Model Performance Engineer | Performance Engineer
Regions
Countries
States
Related jobs
-
Principal Data Engineer USD 200K-240KAWS | Agentic Workflows | Anomaly Detection | Batch pipelines | CCPA401k plan | Commuter benefits | Flexible vacation | Life insurance | Long-term disabilitySenior-level Full TimeBoulder, Colorado or New York City, … R5h ago
-
Senior Machine Learning Engineer USD 200K-230KBatching | Cloud Inference | Computer Vision | Deep learning | Edge ComputingDental insurance | Flexible PTO | Health insurance | Remote work | Vision insuranceSenior-level Full TimeRemote, US or Canada - NYC … R12h ago
-
Senior Data Engineer USD 150K-165KAPIs | AWS | Automation | CI/CD | Data Pipelines401k matching | Birthday day off | Fitness stipend | Floating holidays | Health benefitsSenior-level Full TimeUnited States R12h ago
-
Senior Embedded Systems Engineer USD 170K-226KACAP | Ansible | Bash | Cellular networking | DNS401k match | Dental insurance | Medical insurance | PTO | Sick days without limitSenior-level Full TimeChicago / Remote R13h ago
-
Senior Data Engineer, Data Foundations & AI Platform USD 153K-207KAPIs | Alerting | Apache Spark | CI/CD | Data Lineage401k with company match | Disability insurance | Flexible time off | Health, dental, and vision insurance | Leave of absenceSenior-level Full TimeUnited States (Remote) R13h ago
-
AI/ML Engineering Manager USD 140K-215KAWS | AWS CDK | AWS CloudFormation | AWS Glue | Agent systems401k plan | Company laptop | Dental insurance | Equipment and office stipend | Flexible spending accountMid-level Full TimeUSA R13h ago
-
Senior Data Engineer USD 95K-135KAWS | Airflow | C++ | Cassandra | Cloud platform401k matching | Community service days | Dental insurance | Disability benefits | Fertility and adoption benefitsSenior-level Full TimeChicago, IL R15h ago
-
Senior Data Engineer USD 95K-135KAWS | Airflow | C++ | Cassandra | Cloud platform401k matching | Community service days | Dental insurance | Disability benefits | Fertility and adoption benefitsSenior-level Full TimeDenver, CO R15h ago
-
Senior Data Engineer USD 137K-170KAWS | Airflow | Apache Spark | Azure | C++401k matching | Community service days | Dental insurance | Disability benefits | Fertility and adoption benefitsSenior-level Full TimeHouston, TX R15h ago
-
Senior Data Engineer USD 137K-170KAWS | Airflow | Apache Spark | C plus plus | Cassandra401k matching | Community service days | Dental insurance | Disability benefits | Fertility and adoption benefitsSenior-level Full TimeDallas, TX R15h ago
-
Statistics & Python Expert - Freelance AI Trainer USD 146K-146KCombinatorics | Graph theory | MATLAB | NumPy | Number theoryFlexible hours | Freelance opportunities | Project based workSenior-level FreelanceNew York, New York, United States … R1d ago
-
Statistics & Python Expert - Freelance AI Trainer USD 146K-146KCombinatorics | Graph theory | Mathematics | NumPy | Number theoryFreelance opportunity | Part-time project-based workSenior-level FreelanceFlorida, United States - Remote R1d ago
-
Statistics & Python Expert - Freelance AI Trainer USD 146K-146KCMME | MATLAB | NumPy | Pandas | PythonFlexible schedule | Part-time project-based work | Project-based compensationSenior-level FreelanceUnited States - Remote R1d ago
-
Statistics & Python Expert - Freelance AI Trainer USD 146K-146KC# | Combinatorics | Graph theory | MATLAB | NumPyFlexible hours | Part-time opportunities | Project based workSenior-level FreelanceTexas, United States - Remote R1d ago
-
Statistics & Python Expert - Freelance AI Trainer USD 146K-146KC# | MATLAB | NumPy | Pandas | PythonPart-time availability | Project based workSenior-level FreelanceMichigan, United States - Remote R1d ago
-
Technical Architect - Machine Learning USD 165K-200KAKS | AWS | Async Programming | Autogen | AzureSenior-level Full TimeUSA - Remote, United States R1d ago
-
Senior Machine Learning Engineer USD 150K-185KAWS | Artifact management | Auditability | Azure | Batch inferenceDental insurance | Disability insurance | EAP | Life insurance | Medical insuranceSenior-level Full TimeCHICAGO, IL, USA R1d ago
-
AI/ML Developer II (Remote) USD 90K-130KAlerting | Authentication | Authorization | Automl | AzureRemote workMid-level Full TimeGEORGIA - VIRTUAL - GA01, United … R1d ago
-
AI Agents | API | Agent systems | Debugging | Git401k | Dental insurance | Employee stock purchase plan | FSA/HSA | Life insuranceEntry-level Full Time InternshipRemote (US), United States R1d ago
-
ML Ops Engineer USD 174K-226KAWS | Cloud infrastructure | Cost Optimization | Data Ingestion | GCPHybrid work schedule | In-office at least 3 days per weekMid-level Full TimeSan Francisco HQ Office R1d ago
-
Machine Learning Engineer - 1 USD 130K-228KCNN | Cross-validation | Data Pipelines | Deep learning | Document processingEquity options | Flexible-hybrid work | Medical, dental & vision coverage | Professional development budget | Team offsitesNone Full TimeHybrid - San Mateo, California R1d ago
-
Lead AI Engineer - AI & Credit Analytics USD 156K-234KAWS | CI/CD | Data Governance | Generative AI | LLMOpsFlexible time off | Flexible work environment | Hybrid work option | Matching 401k | Medical/Dental/Vision insuranceSenior-level Full TimeCosta Mesa, CA, United States R1d ago
-
A/B | A/B Testing | AWS | AWS SageMaker | Apache Spark401k matching | Commuter benefit | Dental insurance | FSA | Flexible time-off policiesSenior-level Full TimeLos Angeles, California, United States; San … R1d ago
-
Staff AI Engineer United States |Remote USD 174K-209KAPI | Access Control | Agent systems | Audit trails | BigQueryCareer growth pathways | In-person onboarding | RSUs | Remote workSenior-level Full TimeUnited States (Remote) R1d ago
-
Senior Software Engineer, Data Ingestion Platform USD 185K-326KAWS | Apache Airflow | Apache Iceberg | Apache Kafka | Apache SparkFlexible time off | Medical insurance | Modern family planning | Remote work | Retirement savings plansSenior-level Full TimeBay Area, CA, United States of … R1d ago