Senior Data Engineer (Palantir Foundry platform)

Colombia

⚠️ We'll shut down after Aug 1st - try foo🦍 for all jobs in tech ⚠️

Lean Tech

Lean Solutions Group is a top workforce optimization company. Explore our offshore and nearshore staffing solutions to transform your business operations.

View all jobs at Lean Tech

Apply now Apply later

Company Overview: Lean Tech is a rapidly expanding organization situated in Medellín, Colombia. We pride ourselves on possessing one of the most influential networks within software development and IT services for the entertainment, financial, and logistics sectors. Our corporate projections offer a multitude of opportunities for professionals to elevate their careers and experience substantial growth. Joining our team means engaging with expansive engineering teams across Latin America and the United States, contributing to cutting-edge developments in multiple industries. Currently, we are seeking a Senior Data Engineer (Palantir Foundry platform)  to join our team. Here are the challenges that our next warrior will face and the requirements we look for:  Position Title: Senior Data Engineer (Palantir Foundry platform) Location: Remote - LATAM What you will be doing: The primary responsibility of this role is to lead and execute large-scale data engineering projects using the Palantir Foundry platform. This includes the end-to-end development of robust and scalable ETL/ELT workflows by leveraging Spark-based Transforms, the Pipeline Canvas, and Foundry’s low-code/no-code tools to deliver timely and reliable insights from complex data ecosystems. Design, implement, and optimize Foundry Transforms (Code Workbooks) using Apache Spark (Python/Scala) to cleanse, validate, join, and expose datasets via Datasets or Views.
Use Foundry’s Pipeline Canvas to wire together data sources, transformations, and outputs in a visual, low-code environment.
Ingest and normalize large, disparate datasets from sources such as ERP, CRM, and IoT streams, operating on daily or near real-time cadences.
Define and manage Ontologies (business-friendly data models) so analytics teams can self-serve data reliably and intuitively.
Leverage Dataset Builder and Object Library to manage schemas, support full or incremental loads, and standardize dataset governance.
Integrate data through standard and custom APIs and connectors (e.g., JDBC, S3, Kafka, Snowflake, Foundry’s Dataset Writer/Reader).
Implement and maintain robust data quality checks and automated alerts to ensure early detection of anomalies or breaches of defined thresholds.
Monitor and tune pipeline performance and resource usage (partitioning strategies, caching, and load balancing) for production environments.
Automate deployment pipelines using CI/CD practices to promote Foundry Transforms into production safely and efficiently.
Collaborate with cross-functional teams to ensure alignment between data engineering solutions and key business objectives.
Provide technical mentorship, ensuring best practices in code quality, version control, testing, and documentation.
Bring domain expertise to support aviation-related data challenges (if applicable), but open to broader industry applications. Required Skills & Experience: Strong hands-on experience developing data pipelines and workflows in Palantir Foundry, including Transforms, Ontology modeling, Workspaces, Actions, and Pipeline Canvas.
Deep understanding of Apache Spark APIs, including batch and streaming data processing.
Advanced programming proficiency in Python; experience in Scala or Java is a plus.
Strong command of SQL and working with structured, semi-structured, and unstructured data.
Familiarity with key Python libraries and tools: PySpark, Pandas, NumPy, Great Expectations, Pytest/Unittest.
Proven track record with CI/CD practices, preferably deploying to Foundry or cloud-based platforms (AWS, Azure).
Understanding of data architecture, performance optimization, and modern ELT principles.
Experience with data integration tools and connectors (e.g., JDBC, Kafka, S3, Snowflake).
Applied experience with ontology management, dataset structuring, and self-serve enablement.
Familiarity with agile methodologies (Scrum, Kanban) and managing operational tickets.
Strong documentation, version control, and testing habits for data workflows. Good to Have: Industry experience in aviation, especially in maintenance or operational analytics.
Familiarity with data visualization tools like Tableau or Power BI.
Certifications in AWS, Azure, or other cloud platforms.
Knowledge of machine learning pipelines or data science workflows.
Experience with data governance, compliance standards, and metadata management.
Background in DevOps or infrastructure-as-code for pipeline orchestration. Soft Skills: Strong analytical and problem-solving mindset with attention to detail and data accuracy.
Ability to explain technical concepts clearly to both technical and non-technical stakeholders.
Proactive collaborator and effective communicator across multidisciplinary teams.
Leadership and mentoring skills to guide junior engineers and contribute to a culture of learning.
High adaptability to new technologies, including low-code/no-code tooling environments.
Excellent time management and organizational skills, capable of juggling multiple priorities effectively. Why you will love Lean Tech:
Join a powerful tech workforce and help us change the world through technologyProfessional development opportunities with international customers
Collaborative work environment
Career path and mentorship programs that will lead to new levels. Join Lean Tech and contribute to shaping the data landscape within a dynamic and growing organization. Your skills will be honed, and your contributions will play a vital role in our continued success. Lean Tech is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees.
Apply now Apply later

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Job stats:  0  0  0
Category: Engineering Jobs

Tags: Agile APIs Architecture AWS Azure CI/CD Data governance Data pipelines Data quality Data visualization DevOps ELT Engineering ETL Java Kafka Kanban Machine Learning NumPy Pandas Pipelines Power BI PySpark Python Scala Scrum Snowflake Spark SQL Streaming Tableau Testing Unstructured data

Perks/benefits: Career development Startup environment

Region: South America
Country: Colombia

More jobs like this