Member of Technical Staff (Infrastructure): World Models
Tasks
- Allocate GPU and cluster resources
- Build operate and scale GPU infrastructure
- Collaborate with researchers and engineers on workload requirements
- Coordinate GPU provider relationships
- Design scheduling for inference and training coexistence
- Develop automation tooling and observability
- Drive architecture decisions for compute and storage systems
- Improve reliability through incident response
- Monitor GPU utilization and cost
- Participate in on-call rotation
- Set scheduling policy
- Validate infrastructure fit for evolving workloads
Perks/Benefits
- Fully Distributed Async First Culture
- Hardware setup of your choice
- Internet stipend
- Meals stipend
- Pension contribution
- Phone stipend
- Private health coverage
Skills/Tech-stack
Automation | Distributed Computing | Distributed Storage | Distributed Systems | GPU infrastructure | Incident Response | Kernel debugging | Kubernetes | Linux | Monitoring | Networking | Observability | Resource allocation | Scheduling | Slurm | Storage
Education
N/A
Related jobs
-
Senior AI Software Engineer/Python EUR 50K-81KAWS | Azure | Azure AI | Azure AI Suite | Azure DevOps23 vacation days | Accelerated professional development | December 24 off | December 31 off | Flexible scheduleSenior-level Full TimeTeletrabajo R5h ago
-
AI Solutions Engineer AUD 100K-160KAI Agents | API Integration | Automation | Documentation | LLM APIsMid-level Full TimeHíbrido (1004, Buenos Aires, Buenos Aires, … R5h ago
-
Mid-level Full TimeRemote, Guadalajara, Mexico R6h ago
-
APIs | AWS | Automation | Azure | Azure MonitorRemote workMid-level Full TimeRemote R7h ago
-
Agent systems | Agentic Workflows | Autonomous Agents | Chatbots | Conversational AIInternal mobility | Professional development | Remote-friendly culture | Work-life balanceSenior-level Full TimePoland, REMOTE, Poland R9h ago
-
AI Engineer EUR 45K-72KAI Services | Agentic Architectures | Azure | Azure AI | Azure AI ServicesDevelopment focused meetups | Health benefits | Real ownership | Speed coaching | Sports benefitsMid-level Full TimeFully Remote R10h ago
-
AI machine learning | Azure | DAX | Data Modeling | Data QualityDiscounts on training and programs | Flexible benefits | Flexible work hours | Health insurance | Intensive work scheduleMid-level Full TimeBarcelona, Spain R13h ago
-
Software Engineer II - Abnormal Data Platform USD 149K-214KAerospike | Amazon DynamoDB | Apache Spark | Data Storage | DatabricksDistributed team collaboration | Remote work | Technical mentorshipMid-level Full TimeRemote - USA R17h ago
-
AWS EC2 | AWS Glue | AWS Lambda | AWS Step Functions | Amazon DynamoDBAnnual time off | Continuous development opportunities | Employee insurance coverage | Global Connected Culture | Remote work flexibilitySenior-level Full TimeMexico, Remote R18h ago
-
AWS Glue | AWS Lambda | AWS Step Functions | Amazon DynamoDB | Amazon EC2Annual time off | Continuous development opportunities | Employee insurance coverage | Global Connected Culture | Remote work flexibilitySenior-level Full TimeMexico, Remote R20h ago
-
Senior Software Engineer – Backend (Python / Typescript / Big Data / AWS / Kubernetes) MXN 1040K-1300KAWS | AWS Glue | Amazon EMR | Apache Kafka | Apache SparkContinuous development opportunities | Employee insurance coverage | Paid time off | Remote work flexibility | Wellness programsSenior-level Full TimeMexico, Remote R20h ago
-
AWS | AWS EMR | AWS Glue | AWS Lambda | AWS Step FunctionsAnnual time off | Continuous development opportunities | Employee insurance coverage | Global Connected Culture | Remote work flexibilitySenior-level Full TimeMexico, Remote R20h ago
-
API Security | Access Control | Airflow | Amazon Redshift | BigQueryFlexible hours | Remote workSenior-level Full TimePortugal - Remote R21h ago
-
Access Control | Airflow | Amazon Redshift | BigQuery | CI/CDRemote work flexibilitySenior-level Full TimePakistan - Remote R21h ago
-
AWS CloudFormation | Airflow | Amazon Kinesis | Amazon Redshift | BigQueryFlexible hours | Remote workMid-level Full TimeBrazil - Remote R21h ago
-
Principal Machine Learning Engineer USD 245K-393KCloud infrastructure | Data Science | Distributed Systems | Infrastructure as Code | ML pipelinesSenior-level Full TimeChicago, Illinois, USA R23h ago
-
Sr Sales Engineer, West USD 160K-196KAnalytics | Apache Spark | Artificial Intelligence | Dataiku | Kubernetes401k match | Dental insurance | Employer paid disability coverage | Flexible spending accounts | Medical insuranceSenior-level Full TimeUnited States, Remote R1d ago
-
Senior Security Engineer, Incident Response GBP 91K-110KAWS | Access Control | Azure | Cloud Security | DFIRSenior-level Full TimeAmsterdam, Netherlands; Berlin, Germany; London, United … R1d ago
-
AI Engineer USD 53K-119KAPI Design | Cost Optimization | Embeddings | Evaluation | JSONDental insurance | Gym stipend | Health insurance | Medical membership | Offsite retreatsSenior-level Full TimeRemote, US R1d ago
-
Android Development | C Sharp | C plus plus | C# | Command LineMid-level Full TimeMountain View, CA, US; Redmond, WA, … R1d ago
-
Machine Learning Engineer, Chakra INR 2000K-4600KBenchmarking | Conversational AI | Data Pipelines | Deep learning | DockerMid-level Full TimeHybrid in Bangalore, India R1d ago
-
AI Engineer - kf USD 150K-225KAPIs | Agent Orchestration | Authentication | Databricks | Distributed SystemsBirthday off | English lessons | Extra vacation week | Food credits | Referral bonusesMid-level Full TimeRemote R1d ago
-
Machine Learning Engineer, Integrity USD 120K-235KAdversarial Machine Learning | Anomaly Detection | Audio analysis | Behavioral analytics | BenchmarkingMid-level Full TimeHybrid in Santa Clara, CA R1d ago
-
AWS RDS | AWS Security | Amazon Web Services | Apache Spark | AutomationEquipment and office stipend | Flexible PTO | Laptop and tools | Learning and development stipend | Paid exams and certificationsSenior-level Full TimeARGENTINA R1d ago
-
Senior Data Analytics Engineer USD 145K-180KAWS Glue | AWS S3 | Ad Spend | Amazon Athena | Amazon RedshiftPaid time off | Remote workSenior-level Full TimeRemote job R1d ago