Software Engineering Intern

New York

⚠️ We'll shut down after Aug 1st - try foo🦍 for all jobs in tech ⚠️

Baseten

Effortlessly serve optimized open source & custom models on the fastest, most reliable model delivery network

View all jobs at Baseten

Apply now Apply later

ABOUT BASETEN

Baseten provides the infrastructure, tooling, and expertise needed to bring great AI products to market - fast. Backed by top investors including IVP, Spark Capital, Greylock, and Conviction, we’re trusted by leading AI-driven innovators like Writer, Abridge, Bland, Patreon, Descript, Retool, and Zed to deliver industry-leading performance, security, and reliability for their mission-critical workloads. With our recent $75M Series C funding, we’re growing fast to make AI accessible across all products.

THE ROLE

As an intern at Baseten, you’ll work on real projects and contribute to systems and products that help our users ship their ML products. You won't be off in a corner doing "intern work" – we're moving too fast and have too much to build for that. Instead, you'll receive hands-on support and mentorship to help you grow fast and make a real impact quickly.

Engineering interns can join one of our four teams:

  1. Core Product: You’ll help build the core Baseten developer workflows that enable users to get value out of ML models. The Core Product team is at the forefront of new product development across a large surface area, including model APIs, training, and dedicated deployment.

  2. Forward Deployed Engineering: You will partner closely with our customers to understand their problems and engineer ML solutions. This role provides a unique front-row view into the opportunities and challenges facing companies implementing ML and AI solutions at scale.

  3. Model Performance: You will implement, refine, and productionize cutting-edge techniques (e.g. quantization, speculative decoding, KV cache reuse, chunked prefill, and LoRA) for ML model inference and infrastructure.

  4. Infrastructure: You'll architect and support development of our ML inference platform that powers production AI applications. You'll make technical decisions for the infrastructure enabling developers to deploy, scale, and monitor ML models with high performance and reliability.

RESPONSIBILITIES

  • Own small projects end-to-end, functioning as both an engineer and a project manager, with a focus on user empathy, project specification, and end-to-end execution

  • Navigate ambiguity and exercise good judgment on tradeoffs and tools needed to solve problems, avoiding unnecessary complexity

  • Fix bugs and resolve customer issues with urgency

  • Help drive long-term improvements to reliability of systems and velocity of development

  • Demonstrate pride, ownership, and accountability for your work, expecting the same from your teammates

REQUIREMENTS

  • 2+ prior internships or research experiences

  • Minimum of 3-months commitment required to intern

  • Working towards a Bachelor's, Master's, or Ph.D. degree in Computer Science, Engineering, Mathematics, or related field. Advanced coursework in machine learning, systems, and infrastructure is a plus. Graduating in 2025 or 2026.

  • A 5-day workweek, during which you will be in-office in San Francisco or New York a minimum of 3 days a week

  • Familiarity with building tools for technical audiences

  • Proficient coding abilities in one or more popular programming or scripting languages

BENEFITS

  • Competitive compensation package.

  • This is a unique opportunity to be part of a rapidly growing startup in one of the most exciting engineering fields of our era.

  • An inclusive and supportive work culture that fosters learning and growth.

  • Exposure to a variety of ML startups, offering unparalleled learning and networking opportunities.

Apply now to embark on a rewarding journey in shaping the future of AI! If you are a motivated individual with a passion for machine learning and a desire to be part of a collaborative and forward-thinking team, we would love to hear from you.


At Baseten, we are committed to fostering a diverse and inclusive workplace. We provide equal employment opportunities to all employees and applicants without regard to race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, or veteran status.

Apply now Apply later
Job stats:  4  0  0
Category: Engineering Jobs

Tags: APIs Computer Science Engineering LoRA Machine Learning Mathematics ML models Model inference Research Security Spark

Perks/benefits: Career development Competitive pay Startup environment

Region: North America
Country: United States

More jobs like this