Solutions Architect, LLM Model Builder
US, CA, Santa Clara, United States
USD 152K-241K Senior-level Full Time
Tasks
- Advise partners on foundation model architectures
- Benchmark foundation model solutions
- Create TCO calculators and sizing models
- Define evaluation workflows and validation recipes
- Design inference architecture for prefill and decode
- Develop reference architectures and benchmark recipes
- Guide fine tuning distillation quantization and compression
- Optimize batching routing and serving efficiency
- Perform production readiness testing
- Plan compute and cluster sizing
- Select GPU network storage and memory configurations
- Translate model and infrastructure topics for partners and customers
Perks/Benefits
- N/A
Skills/Tech-stack
Benchmarking | CUDA | Compression | Distillation | Evaluation | Fine Tuning | GPU infrastructure | Inference Optimization | Infiniband | JAX | Latency optimization | MPI | NCCL | NEMO | NVIDIA Triton | NVLink | Nemotron | PyTorch | Python | Quantization | Reinforcement Learning | Synthetic data | TensorFlow | TensorRT-LLM | Throughput Optimization | VLLM
Education
Regions
Countries
States
Cities
Related jobs
-
Autonomous Driving – Internship in Machine Learning USD 68K-136KAzure Machine Learning | Computer Vision | Convolutional Neural Networks | Data Pipelines | Data PreprocessingEntry-level InternshipSunnyvale, CA, United States7h ago
-
Staff Software Engineer- GEN AI USD 131K-210KAngular | Application Security | Authentication | C# | C++401k | Dental insurance | Health insurance | Paid time off | Vision insuranceSenior-level Full TimeAustin, TX, United States12h ago
-
Sr. Data and AI Engineer USD 180K-200KAgile | Amazon Web Services | Azure | Big Data | Data ArchitecturePublic trust clearance support | Remote workSenior-level Full TimeWork from home, VA, United States R12h ago
-
Business Intelligence | Dashboards | Data Modeling | Data Visualization | ETLSenior-level Full TimeBoston, MA, United States13h ago
-
Data Engineer ID50062 USD 148K-170KAmazon Web Services | Apache Airflow | Apache Spark | Avro | Cloud platformEducation budget | Fitness budget | Flexible schedule | Mentorship | Remote work optionSenior-level Full TimeSan Francisco, United States14h ago
-
DevOps Engineer - Project Delivery Senior Analyst USD 107K-173KAnsible | Argo CD | Bash | CI/CD | DockerSenior-level Full TimeDallas, Texas, United States15h ago
-
Senior Software Engineer, Cross Platform Applications USD 212K-387KArtificial Intelligence | Automation | Code Analysis | Dynamic analysis | JavaScriptSenior-level Full TimeSan Jose, California, United States16h ago
-
Machine Learning Engineer, AI Coding Tools USD 156K-387KDeep learning | GPU clusters | Inference acceleration | Language Models | Large Language ModelsMid-level Full TimeSan Jose, California, United States16h ago
-
Automated Regression | Automated regression testing | BM25 | Benchmarking | Code ExecutionSenior-level Full TimeSeattle, Washington, United States16h ago
-
Agent architecture | Automated Regression | Automated regression testing | BM25 | BenchmarkingSenior-level Full TimeSan Jose, California, United States16h ago
-
Data Engineer USD 62K-62KAzure Data | Azure Data Factory | DBT | Data Factory | Data Modeling401k | Dental insurance | Disability insurance | Flexible spending account | Internal promotion opportunitiesMid-level Full TimeKS, Leawood16h ago
-
Lead Solution Architect (AI & Data Applications)_U.S USD 175K-225KAngular | Autogen | CI/CD | Databricks | Databricks AppsSenior-level Full TimeJersey City, NJ, United States16h ago
-
Backend Software Engineer - Security Data USD 122K-316KApache Kafka | Apache Spark | Data Modeling | Data Quality | ETLMid-level Full TimeSan Jose, California, United States16h ago
-
C++ | Cloud platform | Conversational AI | Document AI | Evaluation FrameworksConferences and industry events participation | Industry thought leadership | Technical product briefingsSenior-level Full TimeReston, VA, USA; Boulder, CO, USA17h ago
-
Senior Software Engineer, AI/ML GenAI USD 174K-252KC++ | Capacity Management | Cloud platform | Computer Vision | Data ProcessingSenior-level Full TimeSunnyvale, CA, USA17h ago
-
C++ | Data Processing | Debugging | Generative AI | Language ModelsSenior-level Full TimeMountain View, CA, USA17h ago
-
C++ | Data Processing | Data Structures | Data Structures and Algorithms | DebuggingSenior-level Full TimeMountain View, CA, USA17h ago
-
Senior Software Engineer, AI/ML, Search Ads USD 174K-252KAds bidding | C++ | Data Science | Data Structures | Data Structures and AlgorithmsSenior-level Full TimeNew York, NY, USA; Mountain View, …17h ago
-
Staff Datacloud Blackbelt Engineer, Data and AI USD 183K-265KAI Engineering | AI/ML | AI/ML workflows | BigQuery | Cloud ArchitectureSenior-level Full TimeSunnyvale, CA, USA17h ago
-
Accelerator development | Co-design | Compilation Optimization | Compiler development | Compute architectureSenior-level Full TimeMountain View, CA, USA; Kirkland, WA, …17h ago
-
Staff Software Engineer, Generative AI, Core ML USD 207K-300KAI Feedback | Computer Vision | Data Processing | Deep learning | Digital TwinSenior-level Full TimeMountain View, CA, USA17h ago
-
Customer Engineer III, Applied AI, Google Cloud USD 174K-252KAgent tooling | C++ | Cloud Native | Cloud Native Architecture | Conversational AISenior-level Full TimeNew York, NY, USA; Chicago, IL, …17h ago
-
APIs | Agent systems | CrewAI | Fine Tuning | Hugging FaceSenior-level Full TimeAddison, TX, USA; Austin, TX, USA17h ago
-
Software Engineer III, AI/ML GenAI, YouTube USD 147K-211KC++ | Computer Vision | Data Processing | Debugging | Distributed ComputingSenior-level Full TimeMountain View, CA, USA17h ago
-
Principal Engineer, Autonomous Cloud USD 307K-427KAgent Frameworks | Artificial Intelligence | Distributed Systems | Evaluation | GenAISenior-level Full TimeSunnyvale, CA, USA; Kirkland, WA, USA17h ago