Software Engineer Intern - Data Infrastructure
Santa Clara, CA
Plus
Plus is an AI-first global autonomous trucking technology company safely developing autonomous trucks for the roads of the world.
Ready to get hands-on with real-world, large-scale data challenges? We’re seeking a Software Engineer Intern to help build and improve an event mining framework used for uncovering key insights in massive datasets. In this internship, you will work with our dynamic team to scale our systems, harness the power of the cloud, and leverage Spark to make our data processing pipelines faster and more efficient.Expect to dive into distributed computing, and cloud technologies while closely collaborating with experienced engineers. You’ll have the chance to see your work directly impact a high-growth environment, all while learning best practices in modern data engineering.
Responsibilities:
- Learn & Contribute: Get up to speed on our existing event mining framework, pipelines, and tools.
- Build & Scale: Assist in enhancing the framework to handle larger datasets and more complex workloads. Collaborate on implementing Spark-based solutions in the cloud for better scalability.
- Migrate & Modernize: Support migrating components from our legacy system into the newly developed infrastructure, ensuring minimal disruption and improved functionality.
- Research & Experiment: Help mine interesting events in the dataset and explore new ways to expand our framework’s capabilities.
Required Skills:
- Proficiency in Python
- Strong analytical thinking and eagerness to learn
- Basic knowledge of cloud platforms (AWS, Azure, or GCP)
Preferred Skills:
- Experience with Spark or other big data technologies
Job stats:
5
2
0
Category:
Engineering Jobs
Tags: AWS Azure Big Data Engineering GCP Pipelines Python Research Spark
Perks/benefits: Career development Team events
Region:
North America
Country:
United States
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.
BI Developer jobsData Engineer II jobsSr. Data Engineer jobsPrincipal Data Engineer jobsStaff Data Scientist jobsBusiness Intelligence Analyst jobsStaff Machine Learning Engineer jobsData Science Manager jobsPrincipal Software Engineer jobsData Manager jobsData Science Intern jobsJunior Data Analyst jobsSoftware Engineer II jobsDevOps Engineer jobsData Analyst Intern jobsData Specialist jobsBusiness Data Analyst jobsSr. Data Scientist jobsStaff Software Engineer jobsLead Data Analyst jobsAI/ML Engineer jobsResearch Scientist jobsSenior Backend Engineer jobsData Engineer III jobsBI Analyst jobs
NLP jobsAirflow jobsOpen Source jobsEconomics jobsMLOps jobsTerraform jobsKPIs jobsNoSQL jobsKafka jobsLinux jobsJavaScript jobsComputer Vision jobsData Warehousing jobsRDBMS jobsGoogle Cloud jobsPostgreSQL jobsPhysics jobsBanking jobsGitHub jobsScikit-learn jobsHadoop jobsScala jobsStreaming jobsData warehouse jobsPandas jobs
R&D jobsOracle jobsdbt jobsCX jobsBigQuery jobsClassification jobsLooker jobsReact jobsDistributed Systems jobsPySpark jobsScrum jobsRAG jobsRedshift jobsJira jobsELT jobsRobotics jobsPrompt engineering jobsMicroservices jobsIndustrial jobsGPT jobsSAS jobsMySQL jobsData Mining jobsNumPy jobsTypeScript jobs