Introduction to Apache Arrow

Introduction to Apache Arrow

In this article we are going to see what this tool is and how it works. Apache Arrow is present in most of the frameworks in our environment, such as Spark, Dask, Drill, Tensorflow, Kudu, etc. All of them are…

New features in DBT 1.9

New features in DBT 1.9

The developers of DBT have set themselves the goal of releasing a new major release every four months, and so far they are delivering. Last October, version 1.9 of this popular framework was released, and has been undergoing corrections. In…

Tutorial DataHub 4: API

Tutorial DataHub 4 – API

In this new part of the DataHub tutorial saga, we are going to work on the connectivity with the platform through the API. As data engineers, the goal is to incorporate DataHub as a Data Governance tool in our ecosystem.…

Trigger rules in Apache Airflow

Trigger rules in Apache Airflow

Airflow is an open source framework for developing, orchestrating and monitoring batch workflows. In case you are not familiar with it, and to better understand the content of this article, it is recommended to review the basics of Apache Airflow…

Team building: Training and team work

Team building: Training and team work

At Damavis we continue to add moments and experiences together through team building activities and events. With the aim of promoting the integration of all members, we organise these actions that allow us to get to know our colleagues better…