LLM Pre-training & Distributed Engineer (AI Infrastructure)
Tasks
- Automate checkpointing
- Implement failure recovery
- Optimize InfiniBand networking and RDMA
- Optimize memory management
- Orchestrate distributed training runs
Perks/Benefits
- N/A
Skills/Tech-stack
3D parallelism | C++ | CUDA | Data parallelism | DeepSpeed | GPU clusters | InfiniBand | Kubernetes | Megatron-LM | Pipeline parallelism | PyTorch | Python | RDMA | Slurm | Tensor parallelism
Education
- N/A