Staff Software Engineer - AI Research Infrastructure
New York City, New York; San Francisco, California
USD 199K-270K | Senior-level | Full Time
Tasks
- Build CI testing infrastructure for research code
- Build services for scheduling and orchestration
- Convert experimental workloads into robust repeatable pipelines
- Create abstractions for job submission and management
- Design infrastructure for large scale experiments
- Develop monitoring and observability for workloads
- Develop workflows that reduce iteration time
- Improve research developer productivity tooling
- Mentor engineers on compute infra and AI systems
Perks/Benefits
- N/A
Skills/Tech-stack
Backend Services | CI | Cluster Management | Data Pipelines | Distributed Systems | Distributed Training | Fine Tuning | GPU Computing | High-Performance Computing | Job Scheduling | Kubernetes | Model Evaluation | Model Parallelism | Monitoring | Observability | Ray | Resource Management | Slurm | Testing