AI Software Engineer
Tasks
- Customize inference frameworks
- Design high performance inference serving systems
- Drive technical design for inference engineering practices
- Implement and tune inference optimizations
- Own end to end model deployment
- Translate model architecture changes into inference implementations
- Write and profile CUDA kernels and custom ops
Perks/Benefits
Skills/Tech-stack
C++ | CUDA | Computer Vision | Continuous batching | FP8 | INT4) | INT8 | KV cache | Language Processing | Multimodal Models | NVIDIA Nsight | Natural Language | Natural Language Processing | ONNX Runtime | Python | Quantization | Real Time | Real-Time Communication | SGLang | Speculative decoding | TensorRT-LLM | Transformer Models | VLLM
Education
Associate Degree | Bachelor of Engineering | Bachelor of Science | Master of Science | PhD
Related jobs
-
Featured Feat. Applied AI Engineer - Bay Area USD 211K-263KArtificial Intelligence | C plus plus | C# | Embeddings | Feature Engineering401k | Comprehensive health and wellness benefits | Learning and development opportunities | Unlimited time offMid-level Full TimeHQ (San Francisco)25d ago
-
AI Engineer USD 141K-236KAzure OpenAI | Azure Security | Azure Security Center | Azure Synapse | DatabricksDisability insurance | Health insurance | Holiday pay | Learning and development | Life insuranceSenior-level Full TimeUSA-DC-Washington3h ago
-
Senior Software Engineer – Data Strategy USD 160K-240KAPIs | Algorithms | Apache Flink | Apache Iceberg | Automation401k match | Dental insurance | Life insurance | Medical insurance | Paid HolidaysSenior-level Full TimeNew York3h ago
-
Robotics Engineer II USD 88K-167KArtificial Intelligence | Automation engineering | Computer Vision | Control Systems | Control Theory401K Company Funding | Career development and training | Education assistance | Fitness reimbursement | Flexible work schedulesMid-level Full TimeUS-Louisiana-New Orleans4h ago
-
Distributed Systems | Machine Learning | Model Serving | Monitoring | Online InferenceSenior-level Full TimeSan Jose, California, United States4h ago
-
Lead Machine Learning Engineer (Multiple Positions) USD 194K-355KAlgorithm Optimization | Code review | Data Pipelines | Infrastructure Optimization | MLOpsSenior-level Full TimeSan Jose, California, United States4h ago
-
Senior Machine Learning Engineer (Multiple Positions) USD 194K-355KA/B | A/B Testing | B testing | Data Analysis | Data PipelinesSenior-level Full TimeSan Jose, California, United States4h ago
-
Senior Machine Learning Engineer (Multiple Positions) USD 194K-355KApache Spark | Big Data | Data Preparation | Deep learning | Experiment designSenior-level Full TimeSeattle, Washington, United States4h ago
-
A/B | A/B Testing | B testing | Big Data | ClassifiersSenior-level Full TimeSan Jose, California, United States4h ago
-
Big Data | Classifiers | Computer Vision | Data Mining | ExperimentationSenior-level Full TimeSan Jose, California, United States4h ago
-
AI Risk | AI Risk Assessment | Bias Mitigation | C# | C++Senior-level Full TimeBellevue, WA | Menlo Park, CA …5h ago
-
AI workflows | Bias Mitigation | C++ | Capacity Planning | Data ModelingSenior-level Full TimeMenlo Park, CA | Seattle, WA …5h ago
-
Production Engineer (University Grad) USD 177K-200KAI tool integration | APIs | Agent Orchestration | C plus plus | CDNSenior-level Full TimeMenlo Park, CA | Burlingame, CA5h ago
-
C++ | Data Storage | Data transfer | Device Drivers | Distributed SystemsSenior-level Full TimeSunnyvale, CA, USA5h ago
-
Staff Software Engineer, AI Interactions for iOS, XR USD 207K-301KArtificial Intelligence | Dart | Data Storage | Distributed Computing | FlutterSenior-level Full TimeSan Jose, CA, USA; New York, …5h ago
-
C++ | Data Processing | Debugging | Fine Tuning | JAXSenior-level Full TimeMountain View, CA, USA5h ago
-
Software Engineer III, AI/ML, Display Ads USD 147K-211KAlgorithms | C++ | Data Analysis | Data Processing | Data StructuresSenior-level Full TimeMountain View, CA, USA5h ago
-
Senior Software Engineer, AI/ML, Google Workspace USD 174K-253KCode review | Data Analysis | Data Processing | Debugging | Distributed ComputingSenior-level Full TimeSunnyvale, CA, USA5h ago
-
Staff Software Engineer, ML Fleet Systems USD 207K-301KC++ | Cluster management | Data Structures | Data Structures and Algorithms | DebuggingBonus | Equity | Health benefits | Paid time off | Professional developmentSenior-level Full TimeSunnyvale, CA, USA5h ago
-
ACE | APB | ARM | AXI | Constrained randomSenior-level Full TimeMountain View, CA, USA5h ago
-
Senior Software Engineer, Embedded, Pixel Graphics USD 174K-253KC# | C++ | Device Drivers | Embedded Systems | Embedded operating systemsSenior-level Full TimeMountain View, CA, USA; San Diego, …5h ago
-
Staff Software Engineer, Embedded Systems/Firmware, XR USD 207K-301KC++ | Cross-Functional Collaboration | Cross-functional | Data Structures | Data Structures and AlgorithmsSenior-level Full TimeMiami, FL, USA5h ago
-
Software Engineer, AI/ML, Google Workspace USD 147K-211KData Processing | Debugging | Distributed Computing | Fine Tuning | Generative AIEmployee assistance | Health insurance | Paid time off | Retirement planMid-level Full TimeSunnyvale, CA, USA5h ago
-
Senior Software Engineer, AI Interactions, XR USD 174K-253KAgentic development | Algorithms | App Development | Artificial Intelligence | Data StorageSenior-level Full TimeSan Jose, CA, USA; New York, …5h ago
-
Security Engineer, Data Center Network Device Security USD 147K-211KARM Assembly | Assembly | C# | C++ | CodingBonus | Employee stock options | Health insurance | Paid time off | Retirement planMid-level Full TimeSunnyvale, CA, USA5h ago