Pflichtpraktikum mit MA: Continued Pre-Training von Large Language Models zur KI-Domänenadaption
Tasks
- Analyze and extend tokenizer for domain vocabulary coverage
- Build domain specific text corpus
- Conduct literature research on continued pre training and domain adaptive pre training
- Develop production domain specialized AI language model
- Evaluate adapted model using perplexity and downstream performance
- Perform continued pre training on open source language model
Perks/Benefits
- Access to modern technologies
- Collaborative team
- Flexible working hours
- Opportunity to work on research projects
- Subsequent master thesis
Skills/Tech-stack
Continued Pretraining | DeepSpeed | Domain Adaptation | Hugging Face | Hugging Face Transformers | Language Models | Language Processing | Large Language Models | Natural Language | Natural Language Processing | Open Source | Python | Retrieval-Augmented Generation | Tokenization
Education
Bachelor of Science | Master of Engineering | Master of Science
Related jobs
-
Editorial AI Lead (w/m/d) EUR 47K-47KAPI Integration | JavaScript | LLM | N8n | Prompt engineeringConference attendance | Flexible work | Health benefits | Mobility support | Modern officeSenior-level Full TimeKarlsruhe, BW, Germany11h ago
-
API Contract | API contract design | Agent Builder | Agentic Orchestration | Automated testingSenior-level Full TimeFrankfurt am Main, Germany; Munich, Germany12h ago
-
Data Architecture | Data Engineering | Deep learning | Digital Twin | ForecastingContinuing education budget | Flexible work arrangements | Health and wellness support | Networking opportunities | Paid vacationEntry-level Full TimeKarlsruhe, Entgeltgruppe 1312h ago
-
Data Engineer (m/f/d) - Berlin EUR 55K-78KAPIs | Apache Airflow | DBT | Data Modeling | Data MonitoringExtra days off | Flexible bonus | Hybrid work | Office food via Wolt | Offsites and team eventsEntry-level Full TimeBerlin, Germany15h ago
-
Senior Consultant Databricks (m/w/d) EUR 57K-75KApache Spark | Cloud infrastructure | Databricks | Databricks SQL | Delta LakeBike leasing | Company car leasing | Company smartphone | Corporate volunteering | Family supportSenior-level Full Timemehrere Standorte, DE15h ago
-
Data Engineer (m/f/d) EUR 40K-52KApache Flink | Apache Kafka | Apache Spark | Azure Data | Azure Data ServicesEmployee discount | Mental health support | Onboarding program | Training and development | Work from homeMid-level Full TimeCologne, Germany16h ago
-
Senior ML Ops Engineer (m/f/d) EUR 55K-66KAWS | Azure | Bash | CI/CD | Data VersioningCorporate pension plan | Mental health support | Personal development | Teambuilding events | Work from homeSenior-level Full TimeCologne, Germany16h ago
-
Algorithms | C++ | Data Analysis | Digital maps | Geospatial DataAccessibility | Childcare | Coaching | Company doctor | Employee discountsEntry-level Full TimeSindelfingen, DE23h ago
-
Analytics Engineer (m/w/d) EUR 45K-54KApache Superset | Cloud Computing | Google BigQuery | Microsoft PowerPoint | Microsoft SQLCareer development | Global opportunitiesMid-level Full TimeWürzburg, Germany23h ago
-
Junior Analytics Engineer (all genders) EUR 15K-18KCode Reviews | Data Modeling | Data Monitoring | Data Pipelines | Data QualityCorporate discounts platform | E bike lease program | Flexible work hours | Gym discounts | Hybrid workEntry-level Full TimeBremen, Germany1d ago
-
R&D Engineer – AI Integration (m/f/d) EUR 58K-78KAWS | Azure | C++ | DVC | Data VisualizationAgile team collaboration | Opportunity to lead sub projectsSenior-level Full TimeMunich, Germany1d ago
-
Forward Deployed Engineer - Unum EUR 75K-96KAgentic Workflows | Cloud infrastructure | Deployment | Full Stack | Full-Stack DevelopmentCareer growth planning | Innovation support | Transparent performance based rewardsMid-level Full TimeMünchen, Germany1d ago
-
Data Platform Engineer (Mid-Level) (m/f/d) EUR 54K-67KAWS | CI/CD | DBT | Datadog | GitLab CIHybrid work model | Learning and development | Mentoring program | Travel perks | Wellbeing supportMid-level Full TimeBerlin1d ago
-
Senior ML Ops Engineer (m/f/d) EUR 55K-66KAWS | Azure | Bash | CI/CD | Cloud platformCorporate pension plan | Mental health support | Personal development | Team events | Work from homeSenior-level Full TimeCologne, Germany1d ago
-
Medior Software Engineer EUR 50K-76KAWS | Azure | CI/CD | Cloud Computing | GCPClient project exposure | End-to-end ownership | High autonomyMid-level Full TimeMünchen, Germany1d ago
-
AWS Aurora | Backup & Recovery | Capacity Planning | Database performance | Database performance tuningAnnual company retreats | Flexible working hours | High-quality equipment | Paid annual leave | Performance-based bonusesMid-level Full TimeGermany1d ago
-
AWS CDK | AWS Lambda | AWS SageMaker | Amazon S3 | Apache IcebergDog-friendly offices | Flexible working hours | Home-office allowance | Hybrid work setup | Learning daysEntry-level Part TimeBerlin, Germany; Hamburg, Germany R1d ago
-
AI / ML Lead Consultant (m/w/d) EUR 75K-85KAPIs | Advanced Analytics | CI/CD | Computer Vision | DBTCareer development | Continuous learning | International team | Support for certificationsSenior-level Full TimeMünchen, BY, Germany1d ago
-
API | Automation | Code platforms | Language Models | Large Language ModelsFully remote | High autonomy | Learning opportunities | Location flexibilityMid-level Full TimeGermany R1d ago
-
Consultant SAP Data & Analytics EUR 60K-75KBusiness Technology Platform | Data Flows | Data Intelligence | Data Modeling | Data VisualizationCompany pension | EGYM Wellpass | Flexible work hours | Health programs | JobradMid-level Full TimeHamburg, München, Mannheim, Remote, Dortmund R1d ago
-
CI/CD | Container Technologies | DBT | Dagster | Data ModelingMid-level Full TimeFrankfurt, DE1d ago
-
AWS | Azure | Data Warehousing | Databricks | ELTSenior-level Full TimeGarching bei München, DE, 857481d ago
-
Computational Fluid Dynamics | Finite element | Finite element method | Fluid Dynamics | High PerformanceFlexible working hours | International research environmentEntry-level Full TimeKiel, Schleswig-Holstein, DE, 241051d ago
-
AI Services | Anthropic API | Automated testing | Azure AI | Azure AI ServicesCertification opportunities | Collaborative team | Continuous learning | Cross-industry projects | Flexible work arrangementsSenior-level Full TimeBerlin, Germany R2d ago
-
Agile | Data Science | Machine Learning | Python | SQLEntry-level Full TimeIsmaning, Germany2d ago