Member of technical staff (Inference) - Paris
Tasks
- Collaborate with research teams on inference efficiency
- Develop GPU kernels for attention and matrix operations
- Develop inference pipelines
- Implement model compression
- Implement quantization
- Optimize model memory usage
- Optimize throughput and latency
- Research and apply state of the art inference techniques
Perks/Benefits
Skills/Tech-stack
C++ | CUDA | Caching | Continuous batching | Distributed Computing | Flash Attention | GPU Programming | Ggml | Llama.cpp | Model Compression | NCCL | ONNX | ONNX Runtime | Paged Attention | PyTorch | Python | Quantization | Rust | SGLang | TensorRT-LLM | Triton | VLLM
Education
Roles
AI | AI Engineer | Engineer | Learning Engineer | Machine Learning Engineer
Related jobs
-
Alternance - Data Platform Engineer (F/H) EUR 23K-24KAWS | Amazon S3 | Ansible | Apache Airflow | Apache FlinkFlexible working hours | Health insurance | Paid time off | Professional training catalog | Public transport subsidyEntry-level ApprenticeshipPuteaux, J, FR R2h ago
-
Alternance - Data Platform Engineer (F/H) EUR 23K-24KAWS | Airflow | Amazon S3 | Ansible | Apache FlinkFlexible working hours | Learning and development access | Paid time off | Public transport coverage | Telework policyEntry-level ApprenticeshipPuteaux, J, FR R2h ago
-
Data Engineer BI & Visualisation H/F EUR 52K-55KDBT | Python | SQL | Snowflake | StreamlitMeetups | Phone allowance | Professional communities | Sustainable mobility allowance | Training opportunitiesMid-level Contract Full TimeParis, IDF, France17h ago
-
Claude | Dagster | Dashboarding | Data Catalog | Data GovernanceEntry-level ApprenticeshipParis, Paris, France19h ago
-
Data Engineer Senior (H/F) EUR 45K-50KAmazon Web Services | Apache Airflow | Apache Hadoop | Apache Kafka | Apache SparkSenior-level Full TimeParis, IDF, France19h ago
-
Head of AI Engineering (M/F/D) EUR 90K-120KArtificial Intelligence | Change Management | Code review | Deployment | Fine TuningFamily care policy | Flexible work environment | Health insurance | Home office equipment support | Lunch vouchersExecutive-level Full TimeParis, IDF, France R19h ago
-
Algorithms | Apache Beam | Apache Hadoop | Apache Hive | Apache PigIn region travelMid-level Full TimeParis, France22h ago
-
Consultant(e) Data Science & IA EUR 28K-28KAWS | Azure | Big Data | Cloud platform | Computer VisionEntry-level Full TimeParis, IDF, France23h ago
-
API Development | Amazon Web Services | Azure | Cloud platform | Code VersioningEntry-level InternshipParis, IDF, France1d ago
-
Stagiaire Data Science EUR 17K-20KApache Spark | Data Visualization | Databricks | Distributed Computing | Generative AIEntry-level InternshipLa Garenne Colombes, FR, 922501d ago
-
Data Analytics Engineer EUR 14K-19KCI/CD | DBT | Data Modeling | Data Visualization | DatabricksCSE discounts | Game nights | Gym pass access | Meal vouchers | Modern office locationEntry-level Full TimeParis, France1d ago
-
Lead Data Engineer (H/F) EUR 45K-55KAWS | Apache Spark | Cloud Computing | Cloud platform | Continuous integrationCareer coaching | Community events participation | Proximity managementSenior-level Full TimeParis1d ago
-
Senior AI Engineer - Agentic EUR 70K-88KAmazon Web Services | Apache Airflow | Apache Spark | Data Engineering | Databricks1 day remote per week | 4 days onsite per week | Permanent contractSenior-level Full TimeParis, France1d ago
-
AWS | Ansible | Azure | CI/CD | Cloud platformGym membership contribution | Health insurance | Insurance | Meal vouchers | Parental leaveSenior-level Full TimeParis1d ago
-
Applied Scientist / Research Engineer - EMEA EUR 54K-81KAgents | CI/CD | Data Curation | Deep learning | GPUGym membership contribution | Health insurance | Insurance | Meal vouchers | Parental leaveMid-level Full TimeParis1d ago
-
APIs | Checkpointing | Cloud Platforms | Containerization | Data ProcessingAdditional paid time off | Annual hackathons | Equipment budget | Equity options | Flexible working conditionsSenior-level Full TimeFrance1d ago
-
Alternant(e) AI/Data Engineer (H/F) EUR 28K-28KBigQuery | Cloud Run | Cloud platform | Gemini API | Google ADKEntry-level Full Time6 rue Fructidor, Saint-Ouen, France1d ago
-
Senior MLOps Engineer (H/F) EUR 42K-50KAs-a-Service | Authentication | CI/CD | Cloud Computing | Distributed SystemsLearning culture | On call compensation benefits N/ASenior-level Full TimeFRA, Antony, France1d ago
-
Alternant(e) Data Engineer EUR 28K-28KApache Spark | Azure | Cloud | GCP | HadoopContinuous learning | Technical certifications | Training resourcesEntry-level Full TimeToulouse, FR1d ago
-
Alternant(e) Data Engineer EUR 28K-28KApache Hive | Apache Spark | Cloud platform | Data Pipelines | Data TransformationContinuous learning | Technical certifications | Training platform accessEntry-level Full TimeIssy-les-Moulineaux, FR1d ago
-
Data Engineer (H/F) EUR 50K-55KAKS | Apache Airflow | Azure | Azure Data | Azure Data FactoryTeleworkMid-level Full TimeParis, France R1d ago
-
CI/CD | Code review | Data Pipelines | Generative AI | MLOpsCSE Services | Locker rooms | Meal tickets | Mentorship | Restaurant benefitsEntry-level Apprenticeship InternshipCluses, Auvergne-Rhône-Alpes, France1d ago
-
Senior AI Engineer - Bits AI Security Analyst EUR 58K-88KBackend Development | Go | Golden Set | Human-in-the-loop | JavaCommunity guilds | Employee stock purchase plan | Inclusion talks | Mental health benefits | Mentor/Buddy programSenior-level Full TimeParis, France1d ago
-
Apache Hadoop | Apache Hive | Apache Kafka | Apache Spark | Azure HDInsightCertification preparation platforms | Community contributions | Conference opportunities | Cooptation bonus | Employee profit-sharingSenior-level Full TimeNantes, Pays de la Loire, France R1d ago
-
Consultant(e) Senior Data Science & IA EUR 50K-60KAPI Development | AWS | Azure | CI/CD | Cloud ComputingRemote work | Training opportunities | Work with international teamsSenior-level Full TimeParis, IDF, France R1d ago