Senior Machine Learning Engineer- Data
Palo Alto, California
Full Time Senior-level / Expert USD 180K - 250K
As MLE on Luma's Data team you are responsible for raising the bar for our data quality. Data is the critical foundation of our products, and we are looking for individuals who can identify creative approaches to data and captioning and then implement solutions for processing at PB scale. Good candidates should have exceptional general python engineering skills alongside a combination of industry ML experience, Data experience, and passion for building AI products.
Responsibilities
- Design data pipelines, including finding appropriate data sources, scraping, filtering, post-processing, de-duplicating, and versioning. The system should be robust and scalable for production use.
- Design and implement frameworks to evaluate the effectiveness of our models and data. For example, set up the standards for an automated evaluation pipeline to run before any new model gets deployed into the API.
- Work closely with others who might be data contributors or consumers or both to incorporate their data usage needs on a variety of tasks and domains.
- Work with human labeling vendors to refine the procedure and guidelines to collect high-quality human annotation data.
- Conduct open-ended research to improve the quality of collected data, including but not limited to, semi-supervised learning, human-in-the-loop machine learning and fine-tuning with human feedback.
Experience
- 5+ years of relevant experience or demonstration of high impact projects as a Data Engineer, Machine Learning Engineer, or Data Scientist, dealing with large amounts of data on a daily basis.
- Have a strong belief in the criticality of high-quality data and are highly motivated to work with the associated challenges.
- Have experience working in large distributed systems.
- Strong generalist python and pytorch skills
- Experience using SQL, Spark, or other tools for processing large amounts of data.
- Please note this role is not meant for recent grads.
Compensation
- The pay range for this position in California is $180,000 - $250,000yr; however, base pay offered may vary depending on job-related knowledge, skills, candidate location, and experience. We also offer competitive equity packages in the form of stock options and a comprehensive benefits plan.
Job stats:
0
0
0
Categories:
Engineering Jobs
Machine Learning Jobs
Tags: APIs Data pipelines Data quality Distributed Systems Engineering Machine Learning Pipelines Python PyTorch Research Spark SQL
Perks/benefits: Career development Competitive pay Equity / stock options
Region:
North America
Country:
United States
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.
Staff Machine Learning Engineer jobsStaff Data Scientist jobsBI Developer jobsData Scientist II jobsPrincipal Data Engineer jobsData Manager jobsJunior Data Analyst jobsResearch Scientist jobsData Science Manager jobsBusiness Data Analyst jobsData Engineer III jobsSenior AI Engineer jobsLead Data Analyst jobsData Specialist jobsData Science Intern jobsSr. Data Scientist jobsPrincipal Software Engineer jobsData Analyst Intern jobsSoftware Engineer II jobsData Analyst II jobsBI Analyst jobsAzure Data Engineer jobsSoftware Engineer, Machine Learning jobsJunior Data Engineer jobsSenior Data Scientist, Performance Marketing jobs
Snowflake jobsEconomics jobsLinux jobsOpen Source jobsBanking jobsHadoop jobsComputer Vision jobsRDBMS jobsJavaScript jobsPhysics jobsMLOps jobsKafka jobsData Warehousing jobsKPIs jobsAirflow jobsGoogle Cloud jobsNoSQL jobsR&D jobsStreaming jobsScala jobsData warehouse jobsOracle jobsClassification jobsGitHub jobsPostgreSQL jobs
Scikit-learn jobsSAS jobsCX jobsTerraform jobsPySpark jobsScrum jobsPandas jobsData Mining jobsDistributed Systems jobsIndustrial jobsBigQuery jobsRobotics jobsLooker jobsJira jobsJenkins jobsUnstructured data jobsE-commerce jobsRedshift jobsdbt jobsData strategy jobsPharma jobsReact jobsMicroservices jobsMySQL jobsNumPy jobs