Lead Applied Scientist, Document Understanding
USD 140K-274K Senior-level Full Time
Tasks
- Build document enrichment systems using taxonomies
- Design and deploy semantic chunking models for legal documents
- Design component and end to end evaluation frameworks
- Develop LLM based knowledge graph construction pipelines
- Extract and link citations entities and legal concepts
- Generate synthetic data for evaluation
- Lead knowledge distillation to compress large models into SLMs
- Mentor applied scientists and ML practitioners
- Own architecture chunking and knowledge extraction decisions
- Partner with engineering for reliability and scale
- Provide technical input to AI strategy and roadmap
Perks/Benefits
- Employee assistance program
- Employee stock purchase plan
- Fitness reimbursement
- Flexible work arrangements
- Headspace app access
- Mental health days
- Paid volunteer days
- Retirement savings plan
- Tuition reimbursement
- Work from anywhere up to 8 weeks per year
Skills/Tech-stack
AWS SageMaker | AzureML | Citation Parsing | Data Generation | DeepSpeed | Document Layout Analysis | Document enrichment | Document layout | Entity Linking | Entity recognition | Evaluation Frameworks | Few-Shot Learning | Few-shot | Hugging Face | Hugging Face Transformers | Information Extraction | Knowledge Distillation | Knowledge graphs | LLM | Language Processing | Layout analysis | Model Compression | Multi-task Learning | Natural Language | Natural Language Processing | PyTorch | Python | RAG | Relation extraction | Retrieval-Augmented Generation | Semantic chunking | Synthetic Data Generation | Synthetic data | Taxonomies
Education
Roles
Applied Scientist | Lead | Lead Applied Scientist | Scientist
Regions
Countries
States
Cities
Related jobs
-
Sr. Data Scientist USD 120K-150KAWS | Agile | Bayesian Modeling | CI/CD | Experimental DesignDental insurance | Health care | Paid time off | Retirement plan | Sick leaveSenior-level Full TimeSt. Louis, Missouri, US R1d ago
-
Data Engineer Lead | $140k-$175k + Hybrid + Equity | Exciting High Growth AI Operational Intelligence Startup A USD 140K-175KApache Airflow | Apache Kafka | DBT | Dagster | Data LineageEquity | Health insurance | Hybrid work | Medical insurance | Paid HolidaysExecutive-level Full TimeWayne, PA, United States R1d ago
-
Manager, Data Analytics USD 158K-224KA/B | A/B Testing | Amazon S3 | Amplitude | Apache SparkGenerous parental leave | Headspace membership | Health care coverage | Monthly wellness stipend | Retirement savings matchMid-level Full TimeRemote - Los Angeles, CA R1d ago
-
Senior Data Scientist, Consumer USD 190K-267KA/B | A/B Testing | B testing | Causal Inference | Data Modeling401k match | Caregiving support | Coaching | Family planning support | Flexible vacationSenior-level Full TimeRemote - United States R1d ago
-
Lead Machine Learning Engineer I, Lifetime Value USD 164K-205KAWS | AWS SageMaker | Azure | Debugging | DeploymentMentorship | Remote friendly work locationSenior-level Full TimeRemote (United States) R1d ago
-
Senior-level Full TimeRemote or Washington, DC R1d ago
-
Machine Learning Scientist, Multimodal AI USD 124K-156KAWS | Attention Mechanisms | Cloud Computing | Convolutional Neural Networks | Deep learning401k | Baby bonding leave | Commuter benefits | Dental insurance | Disability insuranceMid-level Full TimeUS Remote R1d ago
-
Associate Data Scientist USD 65K-78KArtificial Intelligence | Azure | Classification | Clustering | Computer VisionHealth benefits | Paid time off | Retirement savings (401k) | Sick time off | Wellness benefitsMid-level Full TimeNorthbrook, IL, United States R1d ago
-
Senior Marketing Decision Scientist II USD 167K-212KA/B | A/B Testing | Airflow | Amazon Redshift | Audience targetingAnnual equity refresh grants | Equity grant | Flexible work arrangement | Remote workSenior-level Full TimeUnited States - Remote R1d ago
-
Senior Data Scientist, Innovation Lab - Remote USD 100K-150KApache Spark | Attention Models | Boosting | CNN | Cassandra401k match | Dental insurance | Flexible time off | Flexible work environment | Medical insuranceSenior-level Full TimeSan Diego, CA, United States R1d ago
-
Data Engineer Lead | $140k-$175k + Hybrid + Equity | Exciting High-Growth AI-Powered Operational Intelligence Startup A USD 140K-175KApache Airflow | Apache Kafka | DBT | Dagster | PostgreSQLEquipment provided | Equity | Health insurance | Medical insurance | Paid HolidaysExecutive-level Full TimeWayne, PA, United States R1d ago
-
Senior-level Full TimeRemote - US R1d ago
-
AWS | AWS Glue | AWS KMS | AWS Lambda | AWS SecretsEast Coast hours | Remote workSenior-level Full TimeAtlanta, Georgia, United States R2d ago
-
A/B | A/B Testing | B testing | Backtesting | Bayesian Hierarchical Models401k | Commuter benefits | Dental insurance | Educational assistance | Flexible spending accountSenior-level Full TimeNew York, NY, US, NY 10019 R2d ago
-
Lead Analytics Engineer - Data Modeling & Quality USD 146K-198KAWS Athena | Amazon Redshift | Apache Airflow | Apache Hudi | Argo WorkflowsAI adoption opportunity | Exposure to senior leaders | Flexible remote workSenior-level Full TimeRemote (USA) R2d ago
-
Business Intelligence Lead - CAHPS & HOS Analytics USD 117K-161KBig Data | Dashboard Development | Data Mining | Data Pipelines | Data VisualizationSenior-level Full TimeRemote US, United States R2d ago
-
Applied AI Scientist III USD 94K-164KAI Governance | Automated Evaluation | CI/CD | Data Analysis | Data VisualizationSenior-level Full TimeDayton WFH, United States R2d ago
-
Applied AI & Data Scientist USD 98K-166KAPI Integration | Bias/fairness | Calibration | Data Quality | Embeddings401k match | Certification reimbursement | Childbirth leave | Company provided short term disability insurance | Company provided term life insuranceMid-level Full TimeNew Haven, CT, US, 06510 R2d ago
-
Senior Applied AI & Data Scientist USD 122K-207KA/B | A/B Testing | Agentic Workflows | Audit controls | B testing401-k match | Certifications support | Childbirth leave | Company Funded Retirement Plan | Dental insuranceSenior-level Full TimeNew Haven, CT, US, 06510 R2d ago
-
Senior Data Scientist II USD 104K-174KAccurint Trax | Anaconda | Analyst’s Notebook | ArcGIS | ArcGIS ProHybrid schedule | Incentive bonus | Professional mentoringSenior-level Full TimeHome based-Virginia, United States R2d ago
-
AI Data Scientist Sr. USD 120K-160KData Visualization | Data Wrangling | ETL | Feature Engineering | Language Processing401k matching | Dental insurance | Disability insurance | Employee assistance program | Flexible spending accountSenior-level Full TimeTelecommuter TX, United States R2d ago
-
Data Scientist USD 120K-167KAWS | Azure | Classification | Cloud Computing | ClusteringHealth care plan | Life insurance | Paid time off | Retirement plan | Stock option planMid-level Full TimeChina Lake Acres, California, United States … R2d ago
-
Associate Director, AI and Machine Learning USD 158K-208KAI Agents | AWS | Access Control | Audit Trail | Azure401k | Dental insurance | FSA | HSA | Life insuranceMid-level Full TimeRemote (US), United States R2d ago
-
Staff Data Scientist USD 153K-220KAirflow | Artificial Intelligence | CI/CD | DBT | Data VisualizationMedical coverage | Pluralsight subscription | Professional development funds | Remote flexibility | Unlimited PTOSenior-level Full TimeRemote - USA, United States R2d ago
-
Senior Data Scientist - AI Services and Platforms USD 102K-164KAgentic AI | BigQuery | Clustering | Data Preparation | Data VisualizationRemote workSenior-level Full TimePA, Working at Home - Pennsylvania, … R2d ago