Lead Applied Scientist, Document Understanding
USD 140K-274K | Senior-level | Full Time
Tasks
- Build document enrichment systems using taxonomies
- Design and deploy semantic chunking models for legal documents
- Design component-level and end-to-end evaluation frameworks
- Develop LLM-based knowledge graph construction pipelines
- Extract and link citations, entities, and legal concepts
- Generate synthetic data for evaluation
- Lead knowledge distillation to compress large models into SLMs
- Mentor applied scientists and ML practitioners
- Own architecture, chunking, and knowledge extraction decisions
- Partner with engineering for reliability and scale
- Provide technical input to AI strategy and roadmap
Perks/Benefits
- Employee assistance program
- Employee stock purchase plan
- Fitness reimbursement
- Flexible work arrangements
- Headspace app access
- Mental health days
- Paid volunteer days
- Retirement savings plan
- Tuition reimbursement
- Work from anywhere up to 8 weeks per year
Skills/Tech-stack
AWS SageMaker | AzureML | Citation Parsing | Data Generation | DeepSpeed | Document Layout Analysis | Document enrichment | Document layout | Entity Linking | Entity recognition | Evaluation Frameworks | Few-Shot Learning | Few-shot | Hugging Face | Hugging Face Transformers | Information Extraction | Knowledge Distillation | Knowledge graphs | LLM | Language Processing | Layout analysis | Model Compression | Multi-task Learning | Natural Language | Natural Language Processing | PyTorch | Python | RAG | Relation extraction | Retrieval-Augmented Generation | Semantic chunking | Synthetic Data Generation | Synthetic data | Taxonomies
Roles
Applied Scientist | Lead | Lead Applied Scientist | Scientist