Senior / Staff AI Research Engineer, Real-Time Inference
Tasks
- Apply structured sparsity for efficient inference
- Benchmark latency, throughput, power, and accuracy
- Collaborate on model architectures for real-time control-loop latency
- Compile hardware-aware inference graphs
- Deploy ML models to edge compute platforms
- Develop inference pipelines for embodied AI models
- Implement model compression, including quantization, pruning, and distillation
- Optimize CUDA kernels and memory layout
- Profile and debug inference stacks
- Use TensorRT for performance optimization
- Use Triton for kernel optimization
Perks/Benefits
- 401k plan
- Dental insurance
- Equity program
- Fully stocked kitchen
- Green card support
- Health insurance
- Lunches and dinners in office
- Stock options
- Team building events
- Visa sponsorship
- Vision insurance
Skills/Tech-stack
C++ | CUDA | CUDA kernels | Edge Computing | Embedded Systems | FP16 | GPU Architecture | Graph compilation | INT4 | INT8 | Knowledge Distillation | Layout optimization | Memory layout | Memory layout optimization | Model Compression | ONNX Runtime | Profiling | Pruning | Python | Quantization | Structured Sparsity | TVM | TensorRT | Triton