AI Optimization Engineer - ONSITE
USD 150K-234K (estimate) | Senior-level | Contract
Tasks
- Collaborate with infrastructure and ML teams to improve scalability
- Conduct exploratory data analysis and model performance evaluation
- Deploy ML models using containerized microservices
- Deploy and manage LLMs in production environments
- Develop automated job scheduling with SLURM and Flask APIs
- Implement model optimization techniques: pruning, quantization, and knowledge distillation
- Monitor system performance with Prometheus and Grafana
- Optimize AI/ML workloads on GPU HPC clusters
- Optimize inference pipelines with Triton Inference Server and TensorRT-LLM (TRTLLM)
Perks/Benefits
- N/A
Skills/Tech-stack
CNN | CentOS | Deep learning | Docker | Enroot | Flask | GPU Computing | GitHub | Grafana | HPC | Hugging Face | Jenkins | Keras | Knowledge Distillation | Kubernetes | Linux | MLflow | Machine Learning | Matplotlib | Model Inference | Model Training | NLP | NumPy | Plotly | Podman | Prometheus | Pruning | PyTorch | Python | Pyxis | Quantization | REST API | RHEL | Scikit-learn | Seaborn | Slurm | TRTLLM | TensorFlow | Terraform | Transformers | Triton Inference Server | Vector embeddings
Education
N/A
Roles
AI | AI Optimization Engineer | Engineer | Optimization Engineer