Data Scientist
Papago, United States
Caris Life Sciences
Caris fulfills the promise of precision oncology through advanced laboratory testing, including tumor profiling and blood-based cancer diagnostics. At Caris, we understand that cancer is an ugly word: a word no one wants to hear, but one that connects us all. That’s why we’re not just transforming cancer care; we’re changing lives.
We introduced precision medicine to the world and built an industry around the idea that every patient deserves answers as unique as their DNA. Backed by cutting-edge molecular science and AI, we ask ourselves every day: “What would I do if this patient were my mom?” That question drives everything we do.
But our mission doesn’t stop with cancer. We're pushing the frontiers of medicine and leading a revolution in healthcare—driven by innovation, compassion, and purpose.
Join us in our mission to improve the human condition across multiple diseases. If you're passionate about meaningful work and want to be part of something bigger than yourself, Caris is where your impact begins.
Position Summary
Caris Life Sciences is seeking a data scientist to expand, test, and validate a suite of molecular biomarkers aimed at improving the standard of care for patients undergoing treatment for cancer. This is a research role within the Caris signature development program. Responsibilities will center on statistical or machine-learning-derived predictions of phenotypic treatment response, built from the genotypic data types available on Caris molecular sequencing platforms. A successful candidate will have the analytical, code-oriented mindset to create reproducible data science pipelines and the communication skills to discuss the implications of the scientific results with our medical professionals.
Job Responsibilities
Work with disease experts to determine the cohort selection, model development, and validation steps that make up a project roadmap for a genetic signature.
Iteratively develop statistical or machine-learning-derived feature sets built from Caris genetic sequencing data.
Communicate the impact and interpretation of predicted clinical outcomes for the targeted disease type.
Structure queries and organize codebases in a streamlined and reproducible manner.
Compare novel signatures with baselines derived from the molecular health literature.
Interface with data engineering and bioinformatics teams to understand the intricacies of underlying datasets.
Required Qualifications
PhD in Data Science, Computational Biology, Bioinformatics, Engineering, or related scientific field.
1-5 years of experience in data science.
Proficiency in Python.
Proficiency in data visualization.
Familiarity with the Linux ecosystem, Git, and writing queries in SQL or related database families.
Experience with common machine-learning Python libraries such as scikit-learn, PyTorch, TensorFlow, and Keras.
Ability to communicate quantifiable results through tables, figures, and plots.
Proficiency in Microsoft Office Suite, specifically Word, Excel, and Outlook, and general working knowledge of the Internet for business use.
Preferred Qualifications
Experience with interpretation of clinical health records, including electronic health records, insurance claims data, or patient histories, OR with bioinformatics pipeline development and genetic file types such as VCF, BAM, and FASTQ.
Good code documentation practices and experience with workflow management packages.
Cloud programming experience, in particular within the AWS SageMaker ecosystem.
Physical Demands
Will work at a computer most of the time, with some time spent collaborating with subject matter experts and business group leaders either in person or through remote conferencing.
Visual acuity and analytical skill to distinguish fine detail.
Must possess the ability to sit and/or stand for long periods of time.
Training
All job-specific, safety, and compliance training is assigned based on the job functions associated with this employee.
Other
Job may require after-hours response to emergency issues.
This position may require periodic travel and some evenings, weekends, and/or holidays.
Conditions of Employment: Individual must successfully complete the pre-employment process, which includes a criminal background check, drug screening, credit check (applicable for certain positions), and reference verification.
This job description reflects management’s assignment of essential functions. Nothing in this job description restricts management’s right to assign or reassign duties and responsibilities to this job at any time.
Caris Life Sciences is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, religion, color, national origin, gender, gender identity, sexual orientation, age, status as a protected veteran, among other things, or status as a qualified individual with disability.
Tags: AWS Bioinformatics Biology Data visualization Engineering Excel FASTQ Git Keras Linux ML models PhD Pipelines Python PyTorch Research SageMaker Scikit-learn SQL Statistics TensorFlow
Perks/benefits: Career development Insurance