Software Engineer, Inference - Multi Modal
Tasks
- Collaborate with researchers and product engineers to deploy capabilities
- Design inference infrastructure for multimodal models
- Enable experimental research workflows for production
- Improve GPU utilization tensor parallelism and hardware abstraction layers
- Optimize systems for high throughput low latency delivery
Perks/Benefits
- N/A
Skills/Tech-stack
Distributed Systems | GPU | High Throughput | Inference | Language Models | Large Language Models | Low Latency | Machine Learning | Model Parallelism | Multimodal | Networking | Tensor Parallelism | TensorRT-LLM | VLLM
Education
N/A
Roles
Regions
Countries
States
Related jobs
-
Software Engineer III, AI/ML Computer Vision, AR USD 147K-211KC++ | Computer Vision | Data Processing | Data Structures | Data Structures and AlgorithmsSenior-level Full TimeSan José, CA, USA1h ago
-
Agentic AI | C plus plus | C# | Cloud services | Data ProcessingMid-level Full TimeSan Francisco, CA, USA1h ago
-
Supply Chain Data Engineer Ii USD 94K-118KDBT | Data Governance | Data Modeling | Data Pipelines | Data Quality401k | Disability insurance | Employee stock purchase plan | Health insurance | Life insuranceMid-level Full TimeWayne, PA, US, 190876h ago
-
AI/ML Engineer 2 USD 101K-165KAI Agents | API Development | AWS | Azure | CI/CDDisability insurance | Family leave | Flexible spending accounts | Life and AD D Insurance | Medical/Dental/Vision insuranceSenior-level Full TimePhiladelphia, PA, US, 191037h ago
-
Data Scientist (Generative AI) USD 125K-160KAWS | AWS Bedrock | AWS SageMaker | Adversarial Networks | Attention MechanismsEntry-level Full TimeMcLean, VA, United States1d ago
-
AWS | Amazon S3 | Cloud Storage | Cloud platform | Dataset PipelinesOn-site work environment | Visa sponsorship availableMid-level Full TimeGreenwich, Connecticut, United States1d ago
-
Agents | Amazon Web Services | Artificial Intelligence | Cloud platform | Dataset PipelinesMid-level Full TimeManhattan, Nevada, United States1d ago
-
Mid-level Full TimeNew Jersey, New Jersey, United States1d ago
-
AWS | Cloud platform | Deep learning | Django | DockerBonus | Equity | Onsite work | Visa sponsorship availableMid-level Full TimeCalifornia, California, United States1d ago
-
AWS | Agents | Amazon S3 | Cloud Storage | DjangoBonus | Equity | On-site work | Visa sponsorshipMid-level Full TimeGoldens Bridge, New York, United States1d ago
-
AWS | ArangoDB | Asynchronous programming | Context engineering | Distributed Systems401k | Health, dental, vision coverage | Learning stipend | Relocation assistanceSenior-level Full TimeGeorgia, Georgia, United States1d ago
-
AWS | Adapters | ArangoDB | Asynchronous programming | Context engineering401k | Dental insurance | Health insurance | Learning stipend | Relocation assistanceSenior-level Full TimeJacksonville, Florida, United States1d ago
-
AWS | ArangoDB | Asynchronous programming | Context engineering | Distributed Systems401k | Dental insurance | Health insurance | Learning stipend | On site work 5 days per weekSenior-level Full TimeMenlo Park, California, United States1d ago
-
AWS | Adapters | Asynchronous programming | Context engineering | Distributed Systems401k | Dental insurance | Health insurance | Learning stipend | Relocation assistanceSenior-level Full TimeWashington D.C., District of Columbia, United …1d ago
-
AWS | ArangoDB | Asynchronous programming | Context engineering | Distributed Systems401k | Dental insurance | Health insurance | Relocation assistance | Unlimited learning stipendSenior-level Full TimeCharlotte, North Carolina, United States1d ago
-
AWS | Adapters | ArangoDB | Asynchronous programming | Context engineering401k | Health, dental, vision coverage | Learning stipend | Relocation assistance | Visa sponsorshipSenior-level Full TimeMountain View, California, United States1d ago
-
Senior Machine Learning Engineer USD 152K-250KAutomation | Distributed Training | Distributed inference | GPU | Go401k | Employee assistance program | Flexible PTO | Flexible spending account | Health savings account contributionsSenior-level Full TimeLas Vegas, Nevada1d ago
-
APIs | Agent workflows | Authentication | Debugging | Distributed SystemsSenior-level Full TimeNew York City1d ago
-
A/B | A/B Testing | Agentic coding | B testing | Context SummarizationSenior-level Full TimeSan Francisco1d ago
-
Applied AI Engineer - Bay Area USD 211K-263KArtificial Intelligence | C plus plus | C# | Embeddings | Feature Engineering401k | Comprehensive health and wellness benefits | Learning and development opportunities | Unlimited time offMid-level Full TimeHQ (San Francisco)1d ago
-
Sr. Platform Software Engineer – AI and Innovation USD 142K-200KAgile | Apollo GraphQL | Cloud Architecture | Continuous Delivery | Distributed SystemsSenior-level Full TimeOak Brook, IL, United States1d ago
-
SYSTEM ENGINEER - Computer Network Support - AI/ML - 15+ yrs of Experience - TS/SCI w/Poly clearance is required - ES A USD 205K-211KAgile Development | Artificial Intelligence | Confluence | Jira | LLM401k retirement plan | Dental insurance | Federal Holidays | Health insurance | Life insuranceMid-level Full TimeFort George G Meade, United States1d ago
-
Software Engineer, Machine Learning USD 213K-293KAI ethics | API Design | Agent Orchestration | Artificial Intelligence | Bias MitigationSenior-level Full TimeSunnyvale, CA | Remote, US | … R2d ago
-
Cloud Computing | Computer Vision | Data Processing | Data Structures | Data Structures and AlgorithmsSenior-level Full TimeKirkland, WA, USA; Seattle, WA, USA2d ago
-
Senior Software Engineer, Data Cloud Frontier AI USD 174K-252KComputer Vision | Data Processing | Data Storage | Debugging | Distributed ComputingSenior-level Full TimeKirkland, WA, USA; Sunnyvale, CA, USA2d ago