Data Science & AI Support Specialist
The Hague, Netherlands
Full Time | Mid-level / Intermediate | Clearance required | EUR 34K - 79K * est.
Spektrum have a wide range of exciting opportunities in several global locations.
We are always looking to add great new talent to our team and look forward to hearing from you.
Spektrum supports apex purchasers (NATO, UN, EU, and National Government and Defence) and their Tier 1 supplier ecosystem with a wide range of specialist services. We provide our clients with professional services, specialised aerospace and defence sales, delivery, and operational subject matter expertise. We are looking for personnel to join our team and support key client projects.
Who we are supporting
The NATO Communications and Information Agency (NCIA) is responsible for providing secure and effective communications and information technology (IT) services to NATO's member countries and its partners. The agency was established in 2012 and is headquartered in Brussels, Belgium.
The NCIA provides a wide range of services, including:
- Cyber Security: The NCIA provides advanced cybersecurity solutions to protect NATO's communication networks and information systems against cyber threats.
- Command and Control Systems: The NCIA develops and maintains the systems used by NATO's military commanders to plan and execute operations.
- Satellite Communications: The NCIA provides satellite communications services to enable secure and reliable communications between NATO forces.
- Electronic Warfare: The NCIA provides electronic warfare services to support NATO's mission to detect, deny, and defeat threats to its communication networks.
- Information Management: The NCIA manages NATO's information technology infrastructure, including its databases, applications, and servers.
Overall, the NCIA plays a critical role in ensuring the security and effectiveness of NATO's communication and information technology capabilities.
The program
Assistance and Advisory Service (AAS)
The NATO Communications and Information Agency (NCI Agency) is NATO’s principal C3 capability deliverer and CIS service provider. It provides, maintains and defends the NATO enterprise-wide information technology infrastructure to enable Allies to consult together under Article IV, and, when required, stand together in the face of attack under Article V.
To provide these critical services in a modern, evolving and dynamic environment, the NCI Agency needs to build and maintain a high-performing, engaged workforce. The NCI Agency workforce consists of three major categories: NATO International Civilians (NICs), Military (Mil), and Interim Workforce Consultants (IWCs). The IWCs are a critical part of the overall NCI Agency workforce and make up approximately 15 percent of the total workforce.
Role ID – 2025-0155
Role Background
The NATO Communications and Information Agency (NCIA), located in The Hague, Netherlands, is currently processing vast amounts of highly varied data coming from theatre for the purpose of efficient archiving. As part of these activities, within the NCIA Chief Technology Office, the Exploiting Data Science and Artificial Intelligence (EDS&AI) team is tasked with applying Big Data and AI technology to prepare, run and adjust pipelines that process various source data into archiving formats and metadata, and prepare it for (semantic) search. NATO has an obligation to support national investigations into situations that occurred in theatre. To support the different teams involved as effectively as possible, the EDS&AI team brings the expertise to extract and exploit the vast and varied data on the table, using the Agency’s high performance computing classified sandbox. The EDS&AI team provides the core data science skills and technology needed for big data analysis and AI. The EDS&AI team applies innovative technology to data whenever it is not possible to extract value with conventional approaches.
Role Duties and Responsibilities
The services described below will be provided to the NCIA CTO/EDS&AI team, as they deliver specialised Data Science and AI results to their stakeholders in NATO Headquarters and NATO Allied Command Operations. Overarching objectives:
- Make required documents from theatre accessible and searchable by archivists during execution
- Capture document contents into long term preservation formats
- Capture Functional Area System (FAS; back-up) contents into long term preservation formats
- Identify (and remove) duplicate documents, records of temporary value and non-records that are not required for archiving
- Provide (interim/final) data reports describing actions and results
- Setting up / improving pipelines that process all required documents and that uniquely identify and trace decisions and processing steps. This is to be conducted on the provided classified sandbox environment, with provided performance hardware and toolsets.
- Implementing / improving (missing) pipeline steps for marking duplicate files, based on file attributes, path (structure) and content (similarity), and rules for considering a file or structure a duplicate.
- Extracting document-format records from Functional Area Systems (FAS) databases and back-ups performed otherwise. Archiving SMEs and system SMEs are available for guidance on target formats, source system structure and data interpretation. Each FAS is processed separately; not all sprints touch upon this item.
- Processing / monitoring progress of various office, image and video file types to the accepted archiving formats, including extraction of metadata and preparing semantic search indexes.
- Automating the registration of all processed documents with semantic indexes in the sandbox natural language search tool.
- Automating the final copy of all non-duplicate and extracted archive documents with content and metadata to the NATO archiving system.
- Reporting status, progress and statistics of the (raw) files being processed to archive formats, metadata and search indexes.
- Delivering full reporting of results, trace of pipeline steps taken and (stakeholder) accepted failures.
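As an illustration of the duplicate-marking step described above, a minimal Python sketch might look like the following. It is not the team's actual pipeline: the function names (`sha256_of`, `mark_exact_duplicates`, `jaccard_similarity`) are hypothetical, the "first file seen is the original" rule is an assumption, and word-shingle Jaccard similarity is only a simple stand-in for whatever content similarity measure the real pipeline uses.

```python
import hashlib
from collections import defaultdict
from pathlib import Path


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Hash file content in chunks so large files are not loaded whole."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def mark_exact_duplicates(root: Path) -> dict[Path, Path]:
    """Map each duplicate file to the first file seen with identical content.

    Files are grouped by size first (a cheap attribute check); only groups
    with more than one member are hashed. The "first seen wins" rule is an
    assumption made for this sketch.
    """
    by_size: dict[int, list[Path]] = defaultdict(list)
    for path in sorted(p for p in root.rglob("*") if p.is_file()):
        by_size[path.stat().st_size].append(path)

    duplicates: dict[Path, Path] = {}
    for candidates in by_size.values():
        if len(candidates) < 2:
            continue
        first_seen: dict[str, Path] = {}
        for path in candidates:
            original = first_seen.setdefault(sha256_of(path), path)
            if original != path:
                duplicates[path] = original
    return duplicates


def jaccard_similarity(text_a: str, text_b: str, shingle_size: int = 3) -> float:
    """Word-shingle Jaccard similarity between two extracted texts."""
    def shingles(text: str) -> set[tuple[str, ...]]:
        words = text.lower().split()
        return {tuple(words[i:i + shingle_size])
                for i in range(max(len(words) - shingle_size + 1, 1))}

    a, b = shingles(text_a), shingles(text_b)
    return len(a & b) / len(a | b)
```

In such a sketch, exact duplicates are marked rather than deleted, so the decision can still be traced and reported; near-duplicates would be flagged when `jaccard_similarity` exceeds a threshold agreed with the archiving SMEs.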
Essential Skills and Experience
- At least 3 years of practical experience in the field of data science and/or data analytics;
- Experience using data processing/visualization/analytics software packages and development environments, preferably KNIME, VS Code, GitLab, Power BI, Jupyter Lab, and Docker-based APIs;
- Experience with Big Data processing, creating and using containerized building blocks, and running containers (APIs) on Kubernetes clusters;
- Experience with programming/scripting in languages like Python, R, SQL and working with data formats like CSV, XML, JSON;
- Experience performing content extraction from files/databases/systems, and with (LLM-based) embedding models, entity extraction, keyword extraction and content similarity measures;
- Creative, flexible and proactive in overcoming obstacles;
- Good drafting, communication and presentation skills in English, at both technical and non-technical levels;
- High attention to detail and accuracy.
Education
- Master's degree in Computer Science, Engineering or a relevant field.
- A higher degree in Data Science is preferred.
Working Location
- The Hague, Netherlands
Working Policy
- On-Site
Travel
- Some travel to other NATO sites may be required
Security Clearance
- Valid National or NATO Secret personal security clearance
We never know what new opportunities might be just over the horizon. If this opportunity isn't for you please feel free to send us your resume anyway and be the first to know if something suitable for your skills and experience comes up.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Apex APIs Big Data Computer Science CSV Data analysis Data Analytics Docker Engineering GitLab HPC JSON Jupyter KNIME Kubernetes LLMs Pipelines Power BI Python R Security SQL Statistics XML
Perks/benefits: Career development Flex hours Team events