Inference Systems Backend Engineer - ARK Large Model Platform (Singapore)
Tasks
- Build distributed LLM inference systems
- Develop large model training and inference systems
- Implement elastic scheduling for distributed GPU clusters
- Integrate heterogeneous accelerators with training and inference frameworks
- Optimize compilers for model workloads
- Optimize model computation and training performance
- Orchestrate training and inference tasks
- Quantize models for inference
- Research and design training and inference architectures
- Schedule large scale inference traffic
- Solve high concurrency, reliability, and scalability issues
- Tune large GPU training clusters
Perks/Benefits
- N/A
Skills/Tech-stack
Compiler optimization | Distributed Systems | Elastic Scheduling | GPU scheduling | High Performance | High-Performance Computing | Language Models | Large Language Models | Machine Learning | Model Quantization | NPU | Performance Computing | TPU | Task Orchestration
Education
N/A
Related jobs
-
Computer Vision | Generative AI | Information Retrieval | Language Models | Language ProcessingSenior-level Full TimeSingapore, Singapore10h ago
-
Backend Development | High Availability | Inference Optimization | Machine Learning | Model InferenceSenior-level Full TimeSingapore, Singapore10h ago
-
Backend Development | Data Engineering | Distributed Systems | Java | PythonHands-on experience | Internship onboardingEntry-level InternshipSingapore, Singapore10h ago
-
Machine Learning Operations Manager (Safety Model Operations) - AI Data Service & Operations SGD 99K-139KActive Learning | Alerting | CI/CD | Data Drift | Data IngestionMid-level Full TimeSingapore, Singapore10h ago
-
Deep learning | Distributed Systems | Inference Optimization | Information Retrieval | JavaSenior-level Full TimeSingapore, Singapore10h ago
-
C++ | Deep learning | Distributed Systems | Docker | ElasticsearchMid-level Full TimeSingapore, Singapore10h ago
-
Backend Development | Cloud infrastructure | Data Processing | Debugging | Distributed SystemsMid-level Full TimeSingapore, Singapore10h ago
-
Automation | C# | C++ | Cloud Networking | Code debuggingOn-call rotation | Travel up to 15 percent in regionSenior-level Full TimeSingapore11h ago
-
Forward Deployed Engineer V, GenAI, Google Cloud SGD 74K-130KAPI Integration | Agent systems | Agentic Workflows | Cloud platform | CrewAISenior-level Full TimeSingapore11h ago
-
AI compute | AI compute clusters | AI hardware | Chip interconnects | Collective communicationSenior-level Full TimeSingapore11h ago
-
Senior-level Full TimeFab 10A, Singapore22h ago
-
Robotics Engineer USD 137K-187KBenchmarks | Calibration | Computer Vision | Data logging | Data synchronizationEntry-level Full TimeSF Bay Area, CA, Remote, International, … R1d ago
-
Clustering | Data Lake | Data Modeling | Data Transformation | Data WarehousingInterpersonal support | Knowledge transfer sessions | Training and consultancyExecutive-level Full TimeITE-HQ (Headquarters), Singapore1d ago
-
Mid-level Full TimeSingapore1d ago
-
Senior Specialist, AI Engineer SGD 92K-170KAWS | Agentic AI | Audit trails | Azure | Azure DatabricksSenior-level Full TimeSG-Collyer Quay, Singapore1d ago
-
Android | C# | Computer Vision | Data Preparation | FlutterSenior-level Full TimeRepublic Polytechnic, Singapore1d ago
-
Intern - AI / Data Science Engineering SGD 54K-66KAnomaly Detection | Data Engineering | Data Visualization | Hypothesis Testing | Machine LearningEntry-level Full Time InternshipFab 10N/X, Singapore1d ago
-
Container Orchestration | Distributed Computing | GPU | HPC | KubernetesMid-level Full TimeSingapore2d ago
-
AI Engineer(Senior Level) SGD 147K-180KFaiss | Information Retrieval | Intent Classification | Java | Machine LearningSenior-level Full Time新加坡3d ago
-
Machine Learning LLM Application Intern (Global LIVE Operation Intelligence) - 2026 Start (BS/MS) SGD 42K-57KImage Generation | Language Models | Language Processing | Large Language Models | Machine LearningEntry-level InternshipSingapore, Singapore3d ago
-
Deep learning | Information Retrieval | Machine Learning | Multimodal Learning | Ranking algorithmsEntry-level Full TimeSingapore, Singapore4d ago
-
AI Research Scientist SGD 54K-60KAWS | Azure | Computer Vision | Data Engineering | Data ModelingFlexible work arrangements | Hybrid work | International projects | Project based rotations | Referral programEntry-level Full TimeSGP - Singapore - Singapore (Boulevard …4d ago
-
Senior Staff Software Engineer SGD 150K-204KApache Flink | Apache Hive | Apache Iceberg | Apache Kafka | Apache SparkCareer development | Employee resource groups | Flexible WFH | Generous PTO | Paid volunteer timeSenior-level Full TimeSingapore-Singapore4d ago
-
Big Data | Data integration | Distributed Systems | Java | Metadata ManagementEntry-level InternshipSingapore, Singapore5d ago
-
Agent systems | Cloud platform | CrewAI | Data Pipelines | Google CloudSenior-level Full TimeSingapore5d ago