Research, Mid-Training
Tasks
- Build training evaluations
- Debug distributed training at scale
- Design late stage data mix and quality uplift
- Develop synthetic data pipelines
- Implement context length extension methods
- Measure scaling with compute and data
- Research and optimize learning rate schedules
- Translate research insights into measurable gains
Perks/Benefits
- N/A
Skills/Tech-stack
Context Length Extension | Data Filtering | Data weighting | Deep learning | Distributed Training | Language Models | Large Language Models | Learning Rate | Learning Rate Scheduling | Learning Theory | Machine Learning | Machine learning theory | Optimization | Positional encoding | PyTorch | Python | Statistics | Synthetic data | Warmup
Education
Related jobs
-
AWS Data Architect USD 140K-216KAPI Integration | AWS CloudFormation | AWS Glue | AWS Lake Formation | AWS LambdaOnsite daily at officeSenior-level Full TimeFremont, United States1h ago
-
Data Engineer ID50062 USD 148K-164KAWS | AWS SageMaker | AWS SageMaker Studio | Apache Airflow | AvroFlexible schedule | Mentorship | Office options | Personalized growth roadmaps | Remote work optionsSenior-level Full TimeTampa, United States2h ago
-
Data Engineer ID50062 USD 148K-164KAWS | AWS SageMaker | AWS SageMaker Studio | Airflow | Apache SparkEducation budget | Fitness budget | Flexible schedule | Mentorship | Office workSenior-level Full TimeFort Lauderdale, United States2h ago
-
Data Engineer ID50062 USD 148K-164KAWS | AWS SageMaker | AWS SageMaker Studio | Airflow | Apache SparkEducation budget | Fitness budget | Flextime with remote and office options | Mentorship | Personalized growth roadmapsSenior-level Full TimeJacksonville, United States2h ago
-
Data Engineer ID50062 USD 148K-164KAWS | Airflow | Amazon SageMaker | Apache Spark | AvroEducation budget | Fitness budget | Flexible schedule | Mentorship | Office optionsSenior-level Full TimeOrlando, United States2h ago
-
Data Engineer ID50062 USD 148K-164KAWS | AWS SageMaker | Airflow | Apache Spark | AvroFlexible schedule | Mentorship | Personalized growth roadmaps | Remote and office options | TechtalksSenior-level Full TimeTallahassee, United States2h ago
-
Data Engineer ID50062 USD 148K-164KAWS | Airflow | Apache Spark | Avro | DuckDBFlexible schedule | Mentorship | Office options | Personalized growth roadmaps | Remote optionsSenior-level Full TimeBaltimore, United States2h ago
-
Data Engineer ID50062 USD 148K-164KAWS | Airflow | Apache Spark | Avro | Data LakeEducation budget | Fitness budget | Flexible schedule | Mentorship | Office optionsSenior-level Full TimeRichmond, United States2h ago
-
Data Engineer ID50062 USD 148K-164KAWS SageMaker | Amazon Web Services | Apache Airflow | Apache Spark | AvroFlexible schedule | Mentorship | Remote and office options | TechtalksSenior-level Full TimeMiami, United States2h ago
-
Data Engineer ID50062 USD 148K-164KAWS | AWS SageMaker | Airflow | Apache Spark | AvroEducation budget | Fitness budget | Flexible schedule | Mentorship | Personalized growth roadmapsSenior-level Full TimeAustin, United States2h ago
-
Data Engineer ID50062 USD 148K-164KAWS | Airflow | Apache Spark | Avro | Big DataFlexible schedule | Mentorship | Personalized growth roadmaps | Remote work options | TechtalksSenior-level Full TimeHouston, United States2h ago
-
Data Engineer ID50062 USD 148K-164KAWS | Airflow | Apache Spark | Avro | Data LakehouseFlextime | Mentorship | Office options | Personalized growth roadmaps | Remote work optionsSenior-level Full TimeBlacksburg, United States2h ago
-
Data Engineer ID50062 USD 148K-164KAWS | Airflow | Avro | Data Lake | Data LakehouseFlextime | Mentorship | Office work options | Personalized growth roadmaps | Remote work optionsSenior-level Full TimeWest Palm Beach, United States2h ago
-
Data Engineer ID50062 USD 148K-164KAWS | AWS SageMaker | AWS SageMaker Studio | Apache Airflow | Apache SparkEducation budget | Fitness budget | Flexible schedule | Mentorship | Personalized growth roadmapsSenior-level Full TimeSan Francisco, United States2h ago
-
Data Engineer ID50062 USD 148K-164KAWS | AWS SageMaker | Airflow | Avro | Big DataEducation budget | Fitness budget | Flextime | Mentorship | Personalized growth roadmapsSenior-level Full TimeNew York, United States2h ago
-
Data Engineer ID50062 USD 148K-164KAWS | AWS SageMaker | Airflow | Apache Spark | AvroFlextime | Mentorship | Office work options | Personalized growth roadmaps | Remote work optionsSenior-level Full TimeBoca Raton, United States2h ago
-
Data Engineer USD 94K-157KAWS | AWS Glue | Apache Kafka | Apache NiFi | Apache SparkHealth insurance | Holiday pay | Learning and development | Life insurance | Long-term disabilitySenior-level Full TimeUSA-Remote Work R3h ago
-
Senior Data Engineer, Machine Learning USD 205K-235KApache Spark | Data Lakes | Data Modeling | Data Privacy | Data SecuritySenior-level Full TimeMenlo Park, CA4h ago
-
Software Engineer, AI/ML, Platforms and Devices USD 147K-211KData Processing | Data Structures | Data Structures and Algorithms | Debugging | Distributed SystemsMid-level Full TimeMountain View, CA, USA4h ago
-
Senior Software Engineer, AI/ML GenAI, Google Cloud USD 174K-252KC++ | Code Reviews | Computer Vision | Data Processing | Data StorageSenior-level Full TimeSunnyvale, CA, USA4h ago
-
Computer Vision | Data Processing | Debugging | Distributed Computing | Fine TuningSenior-level Full TimeSunnyvale, CA, USA4h ago
-
Staff Software Engineer, YouTube Ads Machine Learning USD 207K-300KC plus plus | Data Processing | Data Storage | Data Structures | Data Structures and AlgorithmsSenior-level Full TimeMountain View, CA, USA4h ago
-
Staff Software Engineer, AI/ML GenAI, Google Cloud AI USD 207K-300KAlgorithms | Computer Vision | Data Preparation | Data Processing | Data StructuresSenior-level Full TimeSunnyvale, CA, USA4h ago
-
Senior Software Engineer, Machine Learning, Core ML USD 174K-252KC++ | Compiler optimization | Data Processing | Data parallelism | DebuggingSenior-level Full TimeMountain View, CA, USA4h ago
-
Software Engineer III, AI/ML Computer Vision, AR USD 147K-211KC++ | Computer Vision | Data Processing | Debugging | Image classificationSenior-level Full TimeSan Jose, CA, USA4h ago