Applied Scientist, Document Understanding
USD 136K-253K Mid-level Full Time
Tasks
- Apply knowledge distillation for model compression
- Build document enrichment systems
- Design evaluation frameworks for document understanding
- Design semantic chunking models
- Develop knowledge graph construction pipelines
- Drive technical decisions for chunking classification and extraction
- Extract and link citations entities and legal concepts
- Generate synthetic data for model training
- Partner with engineering on delivery reliability and scale
Perks/Benefits
- Access to Headspace app
- Continuous learning and development
- Employee assistance program
- Fitness reimbursement
- Flexible work life balance policies
- Hybrid work model
- Paid volunteer days off
- Retirement savings
- Tuition reimbursement
- Two mental health days off
- Work from anywhere up to 8 weeks per year
Skills/Tech-stack
Annotation | Citation Parsing | Data Generation | DeepSpeed | Document Layout Analysis | Document Understanding | Document layout | Entity Linking | Entity recognition | Evaluation | Few-Shot Learning | Few-shot | Information Extraction | Knowledge Distillation | Knowledge graphs | LLM | Layout analysis | Model Compression | Multi-document Reasoning | Multi-hop reasoning | Multi-label classification | Multi-task Learning | NLP | Post-training | PyTorch | Python | RAG | Relation extraction | Semantic chunking | Synthetic Data Generation | Synthetic data | Taxonomy Classification | Transformers
Education
Roles
Regions
Countries
States
Cities
Related jobs
-
Sr. Staff Data Scientist- Eng USD 145K-209KAgent systems | Agentic AI | BigQuery | Classification | Data GovernanceSenior-level Full TimeLowell,MA,United States R18h ago
-
Data Scientist (Remote) USD 140K-215KContext Management | DPO | DeepSpeed | Experiment tracking | Experimental DesignEmployee networks | Great Place to Work certification | Paid adoption leave | Paid parental leave | Professional developmentMid-level Full TimeUSA VA Remote, United States R1d ago
-
Lead Data Scientist, Stars Population Health USD 142K-195KCloud Computing | Data Engineering | Data Modeling | Data segmentation | Healthcare Analytics401k retirement savings | Bi weekly internet expense stipend | Paid time off | Remote workSenior-level Full TimeRemote US, United States R1d ago
-
Data Scientist (Remote) USD 40K-50KDevOps | Git | Machine Learning | Python | R401k matching | Charitable Gift Matching | Dental benefits | Employee stock purchase plan | Health benefitsSenior-level Full TimeRemote - UT, United States R1d ago
-
Senior Actuarial Data Scientist (Hybrid) USD 113K-194KBig Data | Data Imputation | Data Visualization | GBM | Generalized Linear Models401k contribution | Non sponsorship | Paid Holidays | Paid family leave | Paid time offSenior-level Full TimeAF-WI Madison Natl HQ, United States R1d ago
-
Senior Data Scientist USD 97K-178KAWS | BERT | Bayesian statistics | Data Pipelines | Data Wrangling401k match | Company pension plan | Disability insurance | Education benefit | Employee stock purchase planSenior-level Full TimeWash, 213 Washington St., Newark, NJ, … R1d ago
-
Data Scientist - Inference, Community Support USD 151K-175KBayesian Modeling | Causal Inference | Experimental Design | Feature Engineering | Machine LearningEmployee travel credits | Inclusion and Belonging Culture | Remote eligibleSenior-level Full TimeRemote - USA R1d ago
-
Data Scientist - Algorithms, Community Support USD 151K-175KCausal Inference | Language Models | Language Processing | Large Language Models | Machine LearningSenior-level Full TimeRemote - USA R1d ago
-
Senior Applied Scientist - Search USD 200K-200KData Science | Fine Tuning | Information Retrieval | Knowledge graphs | Language Models401k retirement | Dental insurance | Equity package | Health insurance | Hybrid work scheduleSenior-level Full TimeNew York City R1d ago
-
Sr. Data Scientist, Performance Marketing USD 139K-287KCausal Inference | Dashboarding | ETL | Experimentation | ForecastingSenior-level Full TimeSan Francisco, CA, US; Remote, US R3d ago
-
Senior Applied AI/ML Scientist - Compass USD 196K-269KAgentic AI | Artificial Intelligence | Cost Optimization | Data Modeling | Data strategyHybrid work flexibility | Remote work optionSenior-level Full TimeNew York City, NY; San Francisco, … R3d ago
-
Administrative Data | Administrative data analysis | Claims data | Claims data analysis | Data AnalysisBereavement leave | Employee assistance program | Health insurance | Paid parental leave | Paid time offMid-level Full TimeD.C., Washington, DC, Remote, Remote R3d ago
-
Data Scientist, Product Analytics USD 178K-204KArtificial Intelligence | Data Mining | Experimentation | Forecasting | Key Performance IndicatorsMid-level Full TimeSunnyvale, CA | Menlo Park, CA … R3d ago
-
Product Data Scientist USD 74K-111KClustering | Deep learning | Feature Engineering | Generalized Linear Models | JavaScriptMid-level Full TimeSchaumburg - Hybrid, IL, United States R4d ago
-
Staff Data Scientist, Ads Delivery USD 164K-339KApache Spark | Causal Inference | Experimentation | Machine Learning | PredictionSenior-level Full TimeSeattle , WA, US; Remote, US R4d ago
-
Senior ML Data Scientist - Women’s Health USD 147K-203KData Validation | Deep learning | Machine Learning | Model Monitoring | PythonDental insurance | Employee discounts | Health insurance | Mental health resources | Paid HolidaysSenior-level Full TimeRemote - United States R4d ago
-
Assistant Vice President, Data Analytics USD 139K-140KA/B | A/B Testing | APIs integration | Adobe Analytics | B testingTelecommutingExecutive-level Full TimeJersey City NJ Plaza 2, United … R4d ago
-
Senior Data Scientist USD 137K-254KAmazon Web Services | Azure | C++ | CI/CD | Data AnalysisEmployee assistance program | Headspace access | Hybrid work model | Mental health days off | Paid volunteer days offSenior-level Full TimeUnited States of America, McLean, Virginia R4d ago
-
Sr. Data Scientist - Process Modeling USD 134K-181KAgile | Artificial Intelligence | Data Engineering | Data Pipelines | Data VisualizationAward-winning time-off plans | Career development opportunities | Dental insurance | Flexible spending accounts | Flexible work modelsSenior-level Full TimeUS - Massachusetts - Boston -Field/Remote, … R4d ago
-
Centralized Statistical Monitoring, Director USD 200K-270KAnomaly Detection | Artificial Intelligence | Bayesian Methods | Biostatistics | Business ValidationCareer development opportunities | Comprehensive health coverage | Dental coverage | Flexible spending accounts | Flexible work modelsExecutive-level Full TimeUS - California - Thousand Oaks … R4d ago
-
Anomaly Detection | Autoencoders | BigQuery | Classification | Cloud FunctionsSenior-level Full TimeWA, United States R4d ago
-
Data Scientist / ML Engineer USD 170K-210KAWS | Azure | Bias Evaluation | Cloud Computing | Cloud platformFlexible working hours | Remote workSenior-level Full TimeNew York, NY, US, Remote R4d ago
-
Senior Quantitative Scientist, Commercial-Facing USD 141K-212KBiostatistics | CPT | Data analytics | Electronic Health Records | Epidemiology401k match | Flexible vacation | Headspace meditation app access | Health insurance dental insurance vision insurance | Learning and wellness stipendSenior-level Full TimeNew York, United States, San Francisco, … R4d ago
-
A/B | A/B Testing | Apache Spark | B testing | CalibrationCommuter benefits | Dental insurance | Disability insurance | Healthcare | Hybrid work scheduleSenior-level Full TimeRedwood City, US R4d ago
-
IL0219 – Data Scientist. USD 128K-128KAWS Lambda | AWS QuickSight | AWS RDS | AWS Redshift | AWS S3401k match | Adoption Assistance | Dental insurance | Disability insurance | Employer DiscountsMid-level Full TimeWarrenville, IL, United States R4d ago