AI Software Engineer
Tasks
- Customize inference frameworks for production requirements
- Design inference serving systems for large scale transformer and multimodal models
- Drive technical design and inference engineering best practices
- Implement and tune inference optimizations
- Monitor latency SLOs and respond to incidents
- Own end to end model deployment and serving API design
- Translate model architecture changes into inference efficient implementations
- Write and profile CUDA kernels and custom operations
Perks/Benefits
- Community involvement
- Health benefits
- Hybrid work options
- In-person work options
- Remote work options
- Wellbeing support
- Work-life balance
Skills/Tech-stack
C++ | CUDA | CUDA kernels | CUDA profiling | Cache Management | Continuous batching | FP8 | GPU Memory Optimization | GPU memory | INT4) | INT8 | KV cache | KV-cache management | Memory Optimization | Nsight | ONNX Runtime | Prefill Decode | Prefill decode disaggregation | Python | Quantization | SGLang | Speculative decoding | TensorRT-LLM | VLLM
Education
Bachelor of Engineering | Bachelor of Science | Master of Science | PhD
Related jobs
-
Computer Scientist II USD 120K-144KAPI Design | Agile | Angular | Azure DevOps | Azure DevOps Pipelines401k employer match | Dental insurance | Disability insurance | Health insurance | Health savings accountMid-level Full TimeLas Vegas, NV, United States7h ago
-
Computer Scientist III USD 97K-130KC++ | Configuration Management | Integration Testing | Linux | Python401k plan | Dental insurance | Disability insurance | Health savings account | Life insuranceSenior-level Full TimeNorth Las Vegas, NV, United States7h ago
-
Computer Scientist I USD 123K-145KC++ | Configuration Management | Development Lifecycle | Integration Testing | Linux401k match | Disability insurance | Health savings accounts | Life insurance | Paid time offMid-level Full TimeEdwards AFB, CA, United States7h ago
-
Computer Scientist I USD 122K-150KASP Classic | C# | CSS | Django | HTML401k match | Dental insurance | Disability insurance | Flexible spending account | Health savings accountMid-level Full TimeEdwards AFB, CA, United States7h ago
-
Computer Scientist III USD 91K-130KC# | C++ | Data Structures | Documentation | Hardware Design401k match | Dental insurance | Disability insurance | Health savings account | Immediate vestingSenior-level Full TimeLas Vegas, NV, United States7h ago
-
Computer Scientist I USD 120K-144KC# | C++ | Development Lifecycle | Hardware documentation | Integration Testing401k match | Dental insurance | Disability insurance | Flexible spending account | Health savings accountMid-level Full TimeLas Vegas, NV, United States7h ago
-
ABB | C++ | Docker | Doosan | Electrical wiringCareer growth | Technical leadership path | Travel opportunitiesSenior-level Full TimeChicago, IL, United States8h ago
-
Early-Career Network Engineer (RAN Optimization) USD 82K-128K4G | 5G | Automation | C Band | CBRSEducational assistance | Matching gifts | Paid sick time | Paid vacation | Parental leaveMid-level Full TimePlano,Texas,United States R8h ago
-
Data Engineer / BI Developer USD 91K-130KAmazon Web Services | Apache Airflow | Cloud platform | DBT | Data ModelingMid-level Full TimeCenter, Center District, IL8h ago
-
Embedded Software Engineer - Body Control Modules USD 79K-178KA/D | ASIL A D | AUTOSAR | C# | CI/CDAdoption and surrogacy expense reimbursement | Employee resource groups | Fertility treatments | Flexible family care days | Medical, dental & vision coverageSenior-level Full TimeDearborn, MI, United States10h ago
-
API Integration | Agent Orchestration | Database Design | Fine Tuning | JavaScriptMid-level Full TimeNew York, New York, United States11h ago
-
Software Engineer, Video USD 141K-251KAV1 | AV2 | Artificial Intelligence | Audio CODEC | Automated testingMid-level Full TimeBellevue, WA | Menlo Park, CA …12h ago
-
Software Engineer, AI System Hacker, GenAI, DeepMind USD 174K-253K2D Games | 3D Games | AI Agents | C++ | CSSMid-level Full TimeMountain View, CA, USA12h ago
-
Staff Software Engineer, Data Center Orchestration USD 207K-301KC++ | Data Storage | Data integration | Distributed Systems | Generative AISenior-level Full TimePittsburgh, PA, USA12h ago
-
Software Engineer, AI/ML, Ads Data USD 147K-211KC++ | Data Structures | Data Structures and Algorithms | Debugging | Distributed ComputingMid-level Full TimeLos Angeles, CA, USA12h ago
-
Customer Engineer, Data Analytics, Financial Services USD 152K-222KApache Spark | Batch Processing | Big Data | Cloud Computing | Data GovernanceSenior-level Full TimeNew York, NY, USA; Atlanta, GA, …12h ago
-
Software Engineer, Data Acquisition USD 147K-211KAlgorithms | Apache Flume | C++ | Data analytics | Distributed ComputingMid-level Full TimeSan Jose, CA, USA12h ago
-
Senior Software Engineer, Embedded Systems and Firmware USD 174K-253KAndroid | Android build | C++ | CHRE | Embedded LinuxSenior-level Full TimeMountain View, CA, USA12h ago
-
Lead GenAI Forward Deployed Engineer, YouTube USD 186K-270KAI Safety | Agent systems | Agentic Frameworks | Applied Artificial Intelligence | Artificial IntelligenceSenior-level Full TimeSan Bruno, CA, USA; Mountain View, …12h ago
-
Software Engineer III, AI Analytics, Core USD 147K-211KArtificial Intelligence | Data Structures | Data Structures and Algorithms | Language Processing | Natural LanguageSenior-level Full TimeSan Jose, CA, USA12h ago
-
Senior Software Engineer, Gen AI GCP Data Analytics USD 174K-253KAgents | BigQuery | Cloud platform | Data Warehousing | DebuggingSenior-level Full TimeKirkland, WA, USA12h ago
-
Applied AI Engineer - AI Solutions USD 172K-300KAgentic Workflows | Airflow | Apache Spark | Chroma | CrewAIAnnual travel up to 25% | Employee stock options | Hybrid work | Professional developmentMid-level Full TimeNew York City, NY (Hybrid); Redwood … R19h ago
-
AI Solutions Engineer, Talent Acquisition USD 129K-171KAPIs | Access Control | Agentic Workflows | Audit trails | AuthenticationMid-level Full TimeSeattle, Washington, United States21h ago
-
Network Engineer, Supercomputing USD 350K-475KCUDA | Congestion Control | Container Orchestration | Debugging | Deep learningDental benefits | Health benefits | Paid parental leave | Relocation support | Unlimited PTOSenior-level Full TimeSan Francisco22h ago
-
Product Analytics Engineer USD 130K-140KA/B | A/B Testing | Airflow | B testing | DBT401k retirement savings plan | Employer-sponsored healthcare | Flexible spending account | Health savings account | Paid parental leaveSenior-level Full TimeRemote, USA R23h ago