AI Infra Engineer - Large Model Training Infrastructure (LLM/VLM/Agent RL)
San Jose, California, United States
USD 208K-300K (estimate) | Senior-level | Full Time
Tasks
- Build training infrastructure for large models
- Design and optimize distributed training strategies
- Develop reinforcement learning training and evaluation systems
- Enable multimodal training across image, text, audio, and video
Perks/Benefits
- N/A
Skills/Tech-stack
Attention Mechanisms | Deep learning | Distributed Training | Language Models | Large Language Models | Learning algorithms | Mixture of Experts | PyTorch | Python | Reinforcement Learning | Reinforcement learning algorithms | Vision Language Models | Vision-language
Education
N/A