Machine Learning Engineer Intern (Training Pre-processing) - 2025 Summer (PhD)
San Jose, California, United States
⚠️ We'll shut down after Aug 1st - try foo🦍 for all jobs in tech ⚠️
Team Introduction:
The TikTok Flink Ecosystem Team plays a critical role in delivering real-time computing capabilities to power TikTok’s massive-scale recommendation, search, and advertising systems. This team is focused on building the infrastructure for stream processing at exabyte scale — enabling ultra-low-latency, high-reliability, and cost-efficient real-time data transformations.
We are deeply involved in developing and optimizing Apache Flink and surrounding components like connectors, state backends, and runtime execution models to meet TikTok’s rapidly evolving data needs at EB-level throughput and scale.
We also collaborate closely with ML infrastructure teams to bridge real-time stream processing and machine learning. This includes integrating Velox to accelerate model training, building multimodal data pipelines, and utilizing frameworks like Ray to orchestrate large-scale distributed ML workflows.
Responsibilities:
- Design and develop core Flink operators, connectors, or runtime modules to support TikTok’s exabyte-scale real-time processing needs.
- Build and maintain low-latency, high-throughput streaming pipelines powering online learning, recommendation, and ranking systems.
- Collaborate with ML engineers to design end-to-end real-time ML pipelines, enabling efficient feature generation, training data streaming, and online inference.
- Leverage Velox for compute-optimized ML data transformation and training acceleration on multimodal datasets (e.g., video, audio, and text).
- Use Ray to coordinate distributed machine learning workflows and integrate real-time feature pipelines with ML model training/inference.
- Optimize Flink job performance, diagnose bottlenecks, and deliver scalable solutions across EB-scale streaming workloads.
The TikTok Flink Ecosystem Team plays a critical role in delivering real-time computing capabilities to power TikTok’s massive-scale recommendation, search, and advertising systems. This team is focused on building the infrastructure for stream processing at exabyte scale — enabling ultra-low-latency, high-reliability, and cost-efficient real-time data transformations.
We are deeply involved in developing and optimizing Apache Flink and surrounding components like connectors, state backends, and runtime execution models to meet TikTok’s rapidly evolving data needs at EB-level throughput and scale.
We also collaborate closely with ML infrastructure teams to bridge real-time stream processing and machine learning. This includes integrating Velox to accelerate model training, building multimodal data pipelines, and utilizing frameworks like Ray to orchestrate large-scale distributed ML workflows.
Responsibilities:
- Design and develop core Flink operators, connectors, or runtime modules to support TikTok’s exabyte-scale real-time processing needs.
- Build and maintain low-latency, high-throughput streaming pipelines powering online learning, recommendation, and ranking systems.
- Collaborate with ML engineers to design end-to-end real-time ML pipelines, enabling efficient feature generation, training data streaming, and online inference.
- Leverage Velox for compute-optimized ML data transformation and training acceleration on multimodal datasets (e.g., video, audio, and text).
- Use Ray to coordinate distributed machine learning workflows and integrate real-time feature pipelines with ML model training/inference.
- Optimize Flink job performance, diagnose bottlenecks, and deliver scalable solutions across EB-scale streaming workloads.
Job stats:
3
0
0
Categories:
Deep Learning Jobs
Engineering Jobs
Machine Learning Jobs
Tags: Data pipelines Flink Machine Learning ML infrastructure Model training PhD Pipelines Streaming
Region:
North America
Country:
United States
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.
Data Scientist II jobsSr. Data Engineer jobsBusiness Intelligence Developer jobsPrincipal Data Engineer jobsBI Developer jobsStaff Data Scientist jobsPrincipal Software Engineer jobsStaff Machine Learning Engineer jobsDevOps Engineer jobsData Science Intern jobsJunior Data Analyst jobsSoftware Engineer II jobsData Manager jobsStaff Software Engineer jobsAI/ML Engineer jobsData Science Manager jobsLead Data Analyst jobsData Analyst Intern jobsBusiness Data Analyst jobsSr. Data Scientist jobsData Specialist jobsBusiness Intelligence Analyst jobsData Governance Analyst jobsData Engineer III jobsSenior Backend Engineer jobs
Consulting jobsMLOps jobsAirflow jobsOpen Source jobsEconomics jobsKafka jobsLinux jobsGitHub jobsKPIs jobsTerraform jobsJavaScript jobsPrompt engineering jobsPostgreSQL jobsRAG jobsBanking jobsStreaming jobsScikit-learn jobsClassification jobsNoSQL jobsData Warehousing jobsRDBMS jobsPhysics jobsComputer Vision jobsdbt jobsPandas jobs
Google Cloud jobsHadoop jobsScala jobsLangChain jobsGPT jobsR&D jobsBigQuery jobsData warehouse jobsMicroservices jobsCX jobsELT jobsDistributed Systems jobsReact jobsScrum jobsOracle jobsLooker jobsIndustrial jobsPySpark jobsOpenAI jobsJira jobsRedshift jobsRobotics jobsSAS jobsTypeScript jobsUnstructured data jobs