Forward Deployment Engineer (Inference & RL POC)
USD 150K-230K (estimate) Mid-level Full Time
Tasks
- Deploy and optimize LLM inference
- Deploy and optimize reinforcement learning training
- Deploy post training workflows
- Diagnose GPU networking bottlenecks
- Feed customer learnings back into platform and APIs
- Integrate SDKs training APIs and cluster resources
- Optimize latency throughput GPU utilization
- Own customer Proof of Concepts end-to-end
- Run benchmarks profiling and stress tests
- Stand up and tune inference stacks
- Translate customer requirements into system designs
Perks/Benefits
Skills/Tech-stack
DeepSpeed | Distributed Systems | Fine Tuning | GPU Performance | GPU Utilization | GPU Utilization Optimization | GPU performance profiling | Go | Human Feedback | Kubernetes | LLM Inference | Latency optimization | Learning from Human Feedback | Machine Learning | MegatronLM | Multi-GPU | Multi-node | NVIDIA Triton | Networking | Performance Profiling | PyTorch | Python | Ray Serve | Reinforcement Learning | Reinforcement Learning Training | Reinforcement Learning from Human Feedback | Rust | SGLang | Supervised Fine Tuning | Throughput Optimization | Utilization Optimization | VLLM
Education
N/A
Regions
Countries
States
Related jobs
-
Silicon Engineer, Digital Research, Quantum AI USD 163K-237KASIC development | Analog design | Cadence Genus | Cadence Innovus | Cell ModelingMid-level Full TimeGoleta, CA, USA; Mountain View, CA, …1h ago
-
Software Engineer, YouTube Ads, Machine Learning USD 147K-211KData Processing | Debugging | Distributed Computing | Language Processing | Machine LearningBonus | Career development | Equity | Health insurance | Paid time offMid-level Full TimeMountain View, CA, USA1h ago
-
Software Engineer, BigQuery Metadata USD 147K-211KBigQuery | C++ | Cloud platform | Data Storage | Database systemsMid-level Full TimeSunnyvale, CA, USA1h ago
-
Senior Software Engineer, Machine Learning, Vertex AI USD 174K-252KCloud Computing | Data Privacy | Data Processing | Debugging | Fine TuningSenior-level Full TimeSunnyvale, CA, USA1h ago
-
Software Engineer, AI/ML, Search USD 174K-252KC++ | Data Processing | Data Structures | Data Structures and Algorithms | DebuggingMid-level Full TimeMountain View, CA, USA1h ago
-
Autotuning | Benchmarking | C++ | CUDA | Code generationSenior-level Full TimeSunnyvale, CA, USA1h ago
-
Staff Data Engineer USD 114K-171KCloud Platforms | Data Modeling | Data Pipelines | Data Warehousing | Data integrationDental insurance | Health care | Paid time off | Retirement plan | Sick leaveSenior-level Full TimeResidence Based, Residence Based, US4h ago
-
Software Engineer, Infrastructure USD 150K-252KArtificial Intelligence | Language Processing | Machine Learning | Natural Language | Natural Language ProcessingEntry-level Full TimeSan Francisco Bay Area7h ago
-
Infrastructure Software Engineer, Energy Storage USD 180K-237KDeployment Automation | Distributed Systems | Kubernetes | Network Security | Operating SystemsMid-level Full TimeSan Francisco, California, United States8h ago
-
Staff Partner Engineer, Azure USD 150K-206KAI Platform | AI Platform Integration | API Integration | Apache Spark | Architecture DiagramsAnnual performance bonus | EquitySenior-level Full TimeNew York City, New York; San …9h ago
-
Senior-level Full TimeNew York City, New York9h ago
-
Senior-level Full TimeNew York City, New York9h ago
-
Senior-level Full TimeNew York9h ago
-
Senior-level Full TimeNew York9h ago
-
Fabric - Sr Data Engineer USD 112K-185KAPI-based integration | Access Control | Active Directory | Azure | Azure Active DirectorySenior-level Full TimeUnited States10h ago
-
Agile | Algorithms | CI/CD | Data Engineering | Data Structures401k plan | Accident insurance | Dependent care FSA plan | HSA and FSA | Hospital indemnity insuranceSenior-level Full TimeAustin, Texas or Remote R10h ago
-
Pre-Sales Data Scientist USD 70K-90KAPIs | AUC | Credibility Testing | Cross-validation | Data AnalysisCompany sponsored volunteering days | Discounted private health insurance | Extra paid time off | Fully remote within continental United States | Generous parental leaveMid-level Full TimeCarlsbad, CA, United States R10h ago
-
Applied Scientist, Amazon Prime, Prime AI/ML Science USD 136K-184KAmazon DynamoDB | Amazon EMR | Amazon Redshift | Amazon S3 | Amazon SageMakerSenior-level Full TimeSeattle, Washington, USA13h ago
-
GCP Data Engineer USD 53K-122KAirflow | Apache Beam | BigQuery | CI/CD | Cloud StorageCompany-Paid Holidays | Employee assistance program | Life and disability insurance | Medical, dental, and vision coverage | Paid time offMid-level Full TimeNashville, TN, US13h ago
-
GCP Associate Data Engineer USD 43K-105KAI machine learning | Airflow | Apache Beam | BigQuery | CI/CD401k retirement savings | Company holidays | Disability insurance | Employee assistance program | Life insuranceMid-level Full TimeBridgewater, NJ, US13h ago
-
Data Engineer, Integrations USD 140K-177KAWS | Airflow | Azure | Boomi | CCPA401k match | Dental insurance | Life insurance | Medical insurance | Tuition reimbursementMid-level Full TimeWaltham Office, United States13h ago
-
Freelance Creative Technologist, Applied AI USD 150K-190KAPI | Agentic Workflows | ComfyUI | ControlNet | Embeddings401k match | Dental | Healthcare | Paid Holidays | Paid time offMid-level FreelanceUnited States R13h ago
-
Data Engineer USD 90K-126KAgile | Azure Data | Azure Data Factory | Change Data Capture | Crystal Reports401k matching | Loan forgiveness | Tuition reimbursementMid-level Full TimeChicago, IL, United States13h ago
-
Staff Software Engineer AI/ML USD 141K-219KAgentic AI | Computer Vision | End to End | End-to-end machine learning | Experimentation401k | Adoption support | Charitable giving match | Fertility care stipend | Gym accessSenior-level Full TimeSan Jose, California, United States14h ago
-
AWS | Ansible | Azure | Ceph | DockerEquity | Health insurance | Remote work flexibilityMid-level Full TimeSan Francisco14h ago