SR Principal Software Engineer - LLM Engineering
USD 175K-215K (estimate) Senior-level Full Time
Tasks
- Advise on model serving strategy and architecture
- Build reusable ML engineering platform frameworks
- Collaborate with stakeholders to prioritize AI and ML capabilities
- Define MLOps and LLMOps lifecycle management
- Deploy and optimize model inference servers
- Implement automation, CI/CD, and infrastructure-as-code
- Optimize model inference for high throughput and low latency
- Oversee AI workload operations with monitoring incident response security and compliance
- Productionize models on AWS with observability reliability and cost efficiency
- Translate technical concepts into executive strategies
Perks/Benefits
- Backup childcare
- Financial coaching
- Health care coverage
- Mental health support
- Retirement savings plan
- Tuition reimbursement
Skills/Tech-stack
AWS | Amazon Bedrock | Amazon EKS | Amazon SageMaker | Amazon SageMaker Pipelines | Autoscaling | Azure Machine Learning | CI/CD | Caching | CloudFormation | Compliance | Distributed Systems | Docker | Google Cloud | Google Cloud Vertex | Google Cloud Vertex AI | Governance | Hugging Face | Hugging Face Transformers | Incident Response | Inference Server | Infrastructure as Code | Java | Kubeflow | Kubernetes | MLflow | Machine Learning | Model Parallelism | Monitoring | Observability | PyTorch | Python | Quantization | SLI | SLO | Sagemaker Pipelines | Security | TensorFlow | Terraform | Throughput Optimization | Triton Inference | Triton Inference Server | VLLM | Vertex AI | “as-code”
Education
Regions
Countries
States
Cities
Related jobs
-
Infrastructure Engineer - Storage USD 100K-120KAnsible | Azure | Azure Blob | Azure Blob Storage | Azure Files401k plan | Bereavement | Disability insurance | Employee assistance program | Employee discount programMid-level Full TimeSt. Louis, MO, United States5h ago
-
Senior Infrastructure Kafka Engineer USD 125K-186KAWS | Alerting | Apache Kafka | Bash | Confluent KafkaContract-to-hire | Hybrid work model | Remote work optionSenior-level Full TimePhoenix, AZ6h ago
-
Senior-level Full TimeHerndon, VA7h ago
-
Senior Platform AI Engineer USD 119K-180KAPI Design | Asynchronous programming | Authentication | Concurrency | Distributed SystemsSenior-level Full TimeCenter, Center District, IL7h ago
-
Senior-level Full TimeCenter, Center District, IL7h ago
-
Senior-level ContractJersey City, United States8h ago
-
AWS Lambda | Amazon DynamoDB | Amazon Kinesis | Amazon SNS | Amazon SQSHybrid workSenior-level ContractSeattle, United States8h ago
-
Lead Software Engineer - Java/Python - Learn AI / LLM USD 175K-215KAgile | Amazon Web Services | Application Resiliency | Artificial Intelligence | CI/CDBackup childcare | Financial coaching | Health care coverage | Mental health support | On-site health and wellness centersSenior-level Full TimeNew York, NY, United States9h ago
-
Quant Analytics [Multiple Positions Available] USD 150K-185KAWS Redshift | CTE | Data Aggregation | Data Enrichment | Data TransformationBackup childcare | Financial coaching | Health care coverage | Mental health support | On-site wellness centersSenior-level Full TimePlano, TX, United States9h ago
-
Benchmarking | CUDA | Communication optimization | Data parallelism | Deep learningMid-level Full TimeSeattle, Washington, United States10h ago
-
Machine Learning Engineer USD 130K-194KAI machine learning | AWS AI | AWS AI Machine Learning | Amazon DynamoDB | Amazon EC2Professional development | Work from homeMid-level Full TimeRemote, NY, US R10h ago
-
Software Engineer III - Data, AWS, ETL, Java/Python, USD 173K-185KAPIs | AWS | Agile methodologies | Apache Airflow | Apache FlinkBackup childcare | Financial coaching | Health care coverage | Mental health support | On-site health and wellness centersSenior-level Full TimePlano, TX, United States10h ago
-
Algorithms Engineer USD 72K-120KARIMA | Anomaly Detection | Causal Inference | Causal forests | Change point detectionEntry-level Full TimeCenter, Center District, IL10h ago
-
Data parallelism | Deep learning | Distributed Training | GPU Acceleration | Model BenchmarkingMid-level Full TimeSan Jose, California, United States10h ago
-
A/B | A/B Testing | B testing | Computer Vision | Deep learningEntry-level Full TimeSeattle, Washington, United States10h ago
-
Computer Vision | Deep learning | Information Retrieval | Language Processing | Machine LearningEntry-level Full TimeSan Jose, California, United States10h ago
-
Partner Engineer, Generative AI USD 173K-247KAWS | Agent Orchestration | Azure | Bias Mitigation | C plus plusSenior-level Full TimeMenlo Park, CA11h ago
-
AI Research Scientist, SysML - FAIR USD 143K-208KArtificial Intelligence | C# | C++ | Co-design | Compiler designMid-level Full TimeMenlo Park, CA | Boston, MA …11h ago
-
Data Engineer, Analytics (Technical Leadership) USD 175K-242KDashboards | Data Architecture | Data Governance | Data Marts | Data ModelingSenior-level Full TimeMenlo Park, CA | New York, …11h ago
-
AI Research Engineer, FAIR Chemistry USD 141K-208KApplied Mathematics | Artificial Intelligence | Computational statistics | Data Science | Density Functional TheorySenior-level Full TimeSan Francisco, CA11h ago
-
IP Validation Engineer - Machine Learning Accelerators USD 142K-203KAHB | APB | AXI | Android | C#Cross-functional collaboration | On device AI work | Prototype and silicon developmentMid-level Full TimeSunnyvale, CA | Burlingame, CA11h ago
-
Mid-level Full TimeMenlo Park, CA11h ago
-
Research Engineer - Perception and Machine Learning USD 177K-251KC++ | Computer Vision | Data Pipelines | Knowledge Distillation | Language ModelsSenior-level Full TimeRedmond, WA | Menlo Park, CA …11h ago
-
Research Engineer - Computer Vision and Robotics USD 141K-208K3D Reconstruction | C plus plus | Computational imaging | Computer Vision | Data AnalysisMid-level Full TimeRedmond, WA11h ago
-
Data Engineer, PAR USD 173K-242KAgent Orchestration | C# | C++ | Data Architecture | Data GovernanceCareer growth | Mentorship | Skill developmentSenior-level Full TimeMenlo Park, CA11h ago