AI Infra Engineer - Large Model Inference Systems (Multimodal/LLM/VLM)
San Jose, California, United States
USD 198K-387K (estimate) Mid-level Full Time
Tasks
- Build inference systems
- Develop high performance inference kernels
- Implement intelligent operations for inference platforms
- Improve throughput and latency
- Optimize distributed inference
Perks/Benefits
- N/A
Skills/Tech-stack
Attention Mechanisms | Batching | CUDA | DP | Distributed Systems | EP | Latency optimization | Load Balancing | Mixture of Experts | Multimodal fusion | TP | Throughput Optimization | Triton
Education
N/A
Roles
AI | AI Infrastructure Engineer | Engineer | Infrastructure Engineer
Related jobs
-
Agent Orchestration | Airflow | Argo Workflows | Artifact versioning | Autonomous workflowsRemote work flexibilitySenior-level Full TimeRemote - United States R8h ago
-
Practice Manager - AI & Data USD 160K-190KAWS | Agent systems | Amazon SageMaker | Apache Spark | AzureCareer growth opportunities | Comprehensive benefits | MentorshipSenior-level Full TimeBroomfield, CO. Greensboro, NC. Troy, Michigan14h ago
-
Data Engineer-Secret Clearance Required USD 100K-127KAWS | AWS Glue | AWS Redshift | Azure | Azure Data401k match | Bereavement leave | Disability insurance | Employee assistance program | Employee discount programSenior-level Full TimeRemote - Nationwide, United States R15h ago
-
AI Solutions Architect - Central Region USD 185K-235KAI Enterprise | AI architecture | AI/ML | AWS | As-a-Service401k plan with company matching | Dental insurance | Employee assistance program | Health insurance | Life and disability insuranceSenior-level Full TimeChicago, IL, United States15h ago
-
AI Solutions Architect - East Region USD 185K-235KAI Enterprise | AWS | Artificial Intelligence | Azure | BOM401k matching | Bereavement leave | Dental insurance | Disability insurance | Employee assistance programSenior-level Full TimeHartford, CT, United States15h ago
-
AI Solutions Architect - Global USD 185K-235KAI Enterprise | AWS | As-a-Service | Azure | CUDA401k plan | Bereavement leave | Dental insurance | Disability insurance | Employee assistance programSenior-level Full TimeHartford, CT, United States15h ago
-
AI Solutions Architect - West Region USD 185K-235KAI Enterprise | AI architecture | AWS | Artificial Intelligence | As-a-Service401k with company matching | Employee assistance program | Employee discount program | Health and wellbeing benefits | Life and disability insuranceSenior-level Full TimePhoenix, AZ, United States15h ago
-
AI Development Specialist - East Region USD 150K-180KAI Governance | AI Workbench | AI-assisted coding | API Integration | Amazon Q401k plan with company matching | Bereavement | Employee assistance program | Employee discount program | Health dental vision careSenior-level Full TimeHartford, CT, United States15h ago
-
AI Development Specialist - Central & West Regions USD 150K-180KAI Workbench | AI code review | AI-assisted coding | AI-native | AI-native engineering401k plan with company matching | Bereavement | Employee assistance program | Employee discount program | Health dental vision careSenior-level Full TimePhoenix, AZ, United States15h ago
-
Senior Software Engineer - San Francisco (Onsite) USD 130K-220KAWS | Amazon EMR | Amazon S3 | Apache Flink | Apache SparkFast-paced startup environment | Onsite work environment | Rapid hiring process feedback | Relocation supportSenior-level Full TimeSan Francisco, CA, US18h ago
-
Bash | Data Pipelines | Distributed Systems | Docker | GCPAccess to cutting-edge technologies | Autonomy | Bonus | Collaborative culture | Distributed-first environmentMid-level Full TimeCanada R19h ago
-
APIs | CI/CD | Cloud platform | Compliance | ContainersAnnual leave | Dental coverage | Health coverage | High autonomy | Home office setup supportSenior-level Full TimeCanada R21h ago
-
A/B | A/B Testing | Asynchronous programming | B testing | Canary ReleasesEngineering autonomy | Inclusive work environment | Mentorship | Remote-first flexibility | Technical leadership developmentSenior-level Full TimeCanada21h ago
-
Networking AI Technical Lead USD 207K-301KAlgorithms | Artificial Intelligence | C++ | Compute Technologies | Data StructuresSenior-level Full TimeSunnyvale, CA, USA; Cambridge, MA, USA21h ago
-
Edge AI Engineer USD 100K-150KBenchmarking | Bias Evaluation | C++ | Core ML | Data PrivacyCareer growth | Health benefits | Remote workSenior-level Full TimeUnited States - Remote R1d ago
-
Sales Data & Analytics Product Manager USD 131K-190KAmazon Web Services | Analytics reporting | Apache Airflow | Apache Kafka | Apache Spark401k | Bonus | Dental coverage | Holidays | Medical coverageMid-level Full TimeUS, MA, Wilmington, United States1d ago
-
Founding AI Engineer / Member of Technical Staff USD 125K-190KAPIs | Data Modeling | Data Pipelines | Deep learning | Distributed SystemsDental insurance | Health insurance | Paid Long Term Disability | Paid Short Term Disability | Paid life insuranceSenior-level Full TimeNew York, NY1d ago
-
Robotics Systems Field Engineer , Amazon RIVR USD 129K-174KC# | C++ | Cause analysis | Connectivity | Control SystemsHealth insurance | Paid time off | Parental leaveEntry-level Full TimeAustin, Texas, USA1d ago
-
Robotics Systems Field Engineer , Amazon RIVR USD 129K-174KC# | C++ | Cause analysis | Connectivity | Controls401k matching | Dental insurance | EAP mental health support | Health insurance | Paid time offEntry-level Full TimeAustin, Texas, USA1d ago
-
Backend Software Engineer (Evals) USD 230K-385KAPIs | Agent systems | Data Processing | Data Processing Pipelines | Distributed SystemsMid-level Full TimeSan Francisco2d ago
-
Staff AI engineer USD 167K-250KAI Evaluation | AWS | Agent Orchestration | Caching | Data PipelinesEquity participation | Flexible working hours | Hybrid work culture | Unlimited time offSenior-level Full TimeSan Francisco2d ago
-
API Gateway | Apache Spark | Azure Data | Azure Data Factory | Azure Data LakeSenior-level Full TimeHouston, TX, United States2d ago
-
ML Engineer, Generative Video USD 175K-275KAutoregressive models | CUDA | Debugging | Deep learning | Diffusion Models401k match | Catered lunch | Commuter benefits | Dinner stipend | Grubhub subscriptionMid-level Full TimeUnion Square, New York City2d ago
-
Software Engineer, Inference Platform USD 220K-260KC++ | CI/CD | Debugging | Distributed Systems | GoMid-level Full TimeSunnyvale, CA3d ago
-
Staff Software Engineer, Inference Platform USD 180K-260KActive/Active | Alerting | C++ | CI/CD | DebuggingSenior-level Full TimeSunnyvale, CA3d ago