Inference Intern
Tasks
- Add robust error handling
- Build programming abstractions for model porting
- Build runtime for transformer inference
- Co-design hardware instructions and model operations
- Debug performance and correctness issues
- Develop testing capabilities for model porting
- Enhance intra node execution
- Enhance multi node inference
- Implement high performance software components for model toolkit
- Implement state management
- Optimize routing and communication layers
- Perform performance profiling
- Support model porting to accelerator architecture
- Use collective communication for inference performance
Perks/Benefits
Skills/Tech-stack
C++ | Collective communication | Compilers | Consensus Protocols | Consistency models | Debugging | Distributed Systems | Error Handling | High speed | High-speed interconnect | Infiniband | Intra Node Execution | JAX | Kernel Networking | Linux | MOE | Mixture of Experts | Multi node inference | Multi-node | NVLink | Performance Profiling | PyTorch | Python | Rust | SGLang | SIMD | State management | Transformer Architecture | User Space Networking | User space | VLLM
Education
Regions
Countries
States
Cities
Related jobs
-
Quantum Software Engineer USD 120K-150KAWS | Access Control | C++ | CI/CD | Cirq401k match | Employee assistance program | Employer paid medical/dental/vision | Flexible savings account | Health savings accountMid-level Full TimeChicago, Illinois, United States1d ago
-
Agile | Azure | Azure DevOps | C plus plus | DAQ401k matching | Dental insurance | Employee assistance program | HSA option | Health insuranceSenior-level Full TimeAustin, TX, United States1d ago
-
Supercomputing Engineer (Test) USD 150K-275KBash | Benchmarking | CI/CD | Containerization | Data AnalysisDaily meals | Housing subsidy | Medical, dental & vision coverage | Relocation support | Unlimited compute budgetMid-level Full TimeSan Jose1d ago
-
Supercomputing Engineer (Network) USD 150K-275KArista EOS | Bash | Benchmarking | C# | C++Daily lunch dinner | Housing subsidy | Medical, dental & vision coverage | Relocation support | Unlimited compute budgetMid-level Full TimeSan Jose1d ago
-
Space Operations Engineer (Embedded Software) USD 100K-160KAPIs | ARM | Algorithm Optimization | C# | C++Mid-level Full TimeSan Francisco, CA1d ago
-
Senior Embedded Software Engineer USD 166K-200KAbstraction layer | Automated testing | Bootloader | C# | C++401k | Dental insurance | Free lunch | Health insurance | Paid time offSenior-level Full TimeAlameda HQ1d ago
-
Mid-level Full TimeKing George, VA, United States1d ago
-
AI & Data Solutions Architect USD 150K-200KAPI Integration | AWS | Apache Spark | Azure | CI/CDRemote work | Travel requiredSenior-level Full TimeSeattle, United States1d ago
-
Senior Software Engineer, AI Platform Engineering USD 160K-240KAWS | Amazon SageMaker | Containerization | Docker | EC2401k matching | Dental insurance | Life insurance | Medical insurance | Paid HolidaysSenior-level Full TimeNew York1d ago
-
Software Engineer, Databases (Technical Leadership) USD 160K-293KAI | Automation | Consensus Protocols | Data Integrity | Database InternalsSenior-level Full TimeBellevue, WA | Menlo Park, CA1d ago
-
C++ | Data Preparation | Data Processing | Debugging | GenAISenior-level Full TimeMountain View, CA, USA2d ago
-
Data Processing | Data Structures | Debugging | Distributed Systems | EmbeddingSenior-level Full TimeMountain View, CA, USA; San Bruno, …2d ago
-
C++ | Data Processing | Debugging | Embedding | Information RetrievalSenior-level Full TimeMountain View, CA, USA; San Bruno, …2d ago
-
Algorithms | C++ | Data Processing | Data Structures | DebuggingSenior-level Full TimeMountain View, CA, USA; San Bruno, …2d ago
-
AI Builder Intern USD 74K-111KAPI Integration | Anthropic API | Autogen | CrewAI | JavaScriptCommuter stipend | Comprehensive health dental and vision | Generous PTO | Learning and development stipend | Retirement benefitsEntry-level InternshipSan Francisco, CA; New York, NY2d ago
-
Audio Processing | Automatic gain control | Backend Infrastructure | C++ | Echo cancellationSenior-level Full TimeSan Francisco2d ago
-
Member of Technical Staff (AI Software Engineer, Agents) USD 220K-405KAI Evaluation | Browser technologies | CDP | Code Quality | Context engineeringSenior-level Full TimeSan Francisco2d ago
-
Embedded Software Engineer Intern (Fall 2026) USD 108K-108KADC | C# | C++ | CAN | Clock synchronizationHousing stipend | Overtime pay | Paid sick time | Relocation supportEntry-level InternshipSouth San Francisco, California, USA2d ago
-
ADAS | Autonomous Vehicles | C++ | Camera | Data ProcessingCompany benefits program | Company bonus | Equity incentive plan | Hybrid work scheduleSenior-level Full TimeMountain View, CA, USA; San Francisco, …2d ago
-
Senior Software Engineer, Data Platform USD 166K-220KAWS | Apache Iceberg | Athena | Containerization | DBTSenior-level Full TimeCosta Mesa, California, United States2d ago
-
Staff Software Engineer, Robotics USD 220K-292KAlgorithms | Anomaly Detection | C++ | Code optimization | Computer VisionCompetitive benefits | Health insurance | Paid time offSenior-level Full TimeIrvine, California, United States2d ago
-
Senior Software Engineer, Robotics USD 191K-253KAlgorithms | Anomaly Detection | C++ | Computer Vision | ConcurrencyHealth benefits | Recovery supportSenior-level Full TimeIrvine, California, United States2d ago
-
Forward Deployed Engineer USD 120K-158KAngular | Code Reviews | Customer enablement | Documentation | GitMid-level Full TimeAtlanta, Georgia, United States; Chicago, Illinois, …2d ago
-
Software Engineer, Propulsion Simulation & Data Analysis USD 125K-175K.NET | Angular | C# | C++ | Combustion Engineering401k retirement plan | Dental insurance | Employee stock purchase plan | Health insurance | Paid HolidaysSenior-level Full TimeHawthorne, CA2d ago
-
.NET | Angular | C# | C++ | CI/CD401k retirement plan | Company stock options | Dental insurance | Employee stock purchase plan | Life insuranceSenior-level Full TimeHawthorne, CA2d ago