aijobs.net

AI Systems & Platform Internals - Technical Architect

San Francisco, California, United States

USD 90K-144K (estimate) Entry-level Full Time

Apply Save
Found 1d ago
Tasks
Perks/Benefits
Skills/Tech-stack

Async execution | Batch inference | C++ | CUDA | Caching | Context Pruning | Deep learning | Distributed Systems | Evaluation | Fault Tolerance | GPU Computing | Go | Inference | Java | Job Scheduling | KV cache | Logging | Machine Learning | Microservices | Model Serving | Model sharding | NCCL | Observability | Orchestration | Pipeline parallelism | Profiling | Prompt Compression | Python | RCCL | Release Management | Response Caching | Retrieval-Augmented Generation | Rust | SLOs | Safety testing | Semantic Caching | Tensor Parallelism | Tracing | Triton | TypeScript

Education

N/A

Roles

Architect | Systems Architect | Technical Architect

Regions

North America

Countries

United States

States

California, US

Cities

San Francisco, California, US

Apply Save
Language: en Views: 0 Clicks: 0 Saves: 0

Related jobs