R&D AI Software Engineer / End-to-End Machine Learning Engineer / RAG and LLM
Warsaw, Masovian Voivodeship, Poland - Remote
Pathway
Python ETL framework for stream processing, real-time analytics, RAG, and LLM pipelines.About Pathway
Pathway is an enabler for Live AI, allowing organizations to run contextualized ML models connected to ever-changing enterprise data. In addition to being an infrastructure provider delivering an AI framework, we are working to advance the state-of-the-art.
Pathway is VC-funded, with some amazing BAs (such as Lukasz Kaiser, co-inventer of Transformers). Pathway's CTO has co-authored papers with Goeff Hinton and Yoshua Bengio. We have just raised a $10M+ seed, with exciting developments ahead. The management team includes growth leaders who have scaled companies with multiple exits, and who have built online communities reaching millions of users.
Out client portfolio is focused around mobility, IoT data, logs and transactions, and also includes larger actors such as NATO and national postal services. We have a vibrant community centered around our developer frameworks, with almost 10,000 stars on GitHub: https://github.com/pathwaycom/
Our European offices are in Paris, France and Wroclaw, Poland. Our HQ is in Menlo Park, CA.
The Opportunity
We are currently searching for 2 ML/AI Engineers with a solid software engineering backbone who are able to prototype, evaluate, improve, and productionize end-to-end Machine Learning projects with enterprise data.
For representative examples of the type of projects you would be expected to create or deliver, see our pipelines" rel="nofollow noreferrer noopener" target="_blank">AI pipelines templates.
If you would consider it would be fun to create a hybrid Vector/Graph index that beats the state of the art on a RAG benchmark, to deliver a working AI pipeline to a client in a critical industry, or to pre-process datasets in a way which would boost LLM accuracy in inference & training - this is the job for you!
You Will
- help design experimental end-to-end ML/AI pipelines
- contribute to addressing new use cases, beyond state of the art
- improve/adapt AI pipelines for production, working directly with client data (often live data streams)
- invent ways to pre-process data sources and perform tweaks (reranking, model paramater configuration...) for optimal performance of AI pipelines.
- design benchmark tasks and perform experiments.
- build unit tests and implement model monitoring.
- contribute high-quality production code to our developer frameworks, used by thousands of developers.
- help to pre-process data sets for LLM training.
The results of your work will play a crucial role in the success of both our developer offering and client delivery.
Requirements
Cover letter
It's always a pleasure to say hi! If you could leave us 2-3 lines, we'd really appreciate that.
You Are
- A graduate of a 4+-year university degree in Computer Science, where you have received A-grades in both foundational courses (Algorithms, Computational Complexity, Graph Theory,...) and Machine Learning courses.
- Passionate about delivering high-quality code and solutions that work.
- Good with data & engineering innovation in practice - you know how to put things together so that they don't blow up.
- Experienced at hands-on Machine Learning / Data Science work in the Python stack (notebooks, etc.).
- Experienced with model monitoring, git, build systems, and CI/CD.
- Respectful of others
- Curious of new technology - an avid reader of HN, arXiv feeds, AI digests, ...
- Fluent in English
Bonus Points
- Successful track-record in algorithms (ICPC / IOI), data science contests (Kaggle), or a HuggingFace leaderboard.
- Showing a code portfolio.
- PhD in Computer Science.
- Authoring a paper at major Machine Learning conference.
- You like playing/tinkering with new tools in the LLM stack.
- You are already a part of the Pathway community, or have been recognized for your work in one of our bootcamps.
Why You Should Apply
- Join an intellectually stimulating work environment.
- Be a technology innovator that makes a difference: your code gets delivered to a community of thousands of developers, and to clients processing billions of records of data.
- Be part of one of an early-stage AI startup that believes in impactful research and foundational changes.
Benefits
- Type of contract: Full-time, permanent
- Preferable joining date: January 2025. The positions are open until filled β please apply immediately.
- Compensation: competitive base salary (80th to 99th percentile) based on profile and location + Employee Stock Option Plan + possible bonuses if working on client projects. The stated lower band of EUR 72k/ USD 75k for the salary base concerns senior candidates; mid-senior or mid candidates may be considered with adapted salary bands.
- Location: Remote work. Possibility to work or meet with other team members in one of our offices: Paris, France, or Wroclaw, Poland. Possibility to visit our Menlo Park, CA headquarters for several months. As a general rule, permanent residence will be required in the EU, UK, US, or Canada.
(Note for candidates based in India: We are proud to be part of the current Inter-IIT as well as a partner of ICPC India. Top IIT/IIIT graduates are more than welcome to apply regardless of current location.)
If you meet our broad requirements but are missing some experience, donβt hesitate to reach out to us.
Tags: CI/CD Computer Science Engineering Git GitHub HuggingFace LLMs Machine Learning ML models PhD Pipelines Python R RAG R&D Research Transformers
Perks/benefits: Career development Competitive pay Equity / stock options Salary bonus Startup environment
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.