Introduction to Apache Arrow

In this article we are going to see what this tool is and how it works. Apache Arrow is present in most of the frameworks in our environment, such as Spark, Dask, Drill, Tensorflow, Kudu, etc. All of them are…
In this article we are going to see what this tool is and how it works. Apache Arrow is present in most of the frameworks in our environment, such as Spark, Dask, Drill, Tensorflow, Kudu, etc. All of them are…
In Introduction to Apache APISIX we studied the different components of this platform and the advantages of using it. In addition, we saw a way to dockerise it and we configured a simple route with the openid-connect plugin, integrating authentication…
Airflow is an open source framework for developing, orchestrating and monitoring batch workflows. In case you are not familiar with it, and to better understand the content of this article, it is recommended to review the basics of Apache Airflow…
The growing demand for applications and services has created a great need to more efficiently manage the APIs that communicate them. To meet this need, Apache APISIX is a high-performance, scalable, open source API management platform that acts as a…
Git is a version control system that has become an essential tool for software development. It allows teams of developers to work collaboratively on the same code and efficiently manage change history. In this post we will introduce Git at…
The integration of technological tools and solutions based on Big Data and AI, such as Revenue Management Systems (RMS), are essential allies to maximise profits and optimise an organisation’s revenue. More and more companies are opting for the incorporation of…
As a BI tool, Looker Studio is a visualizer as well as a data management platform which allows extracting information while maintaining its governance, security, accessibility and agility in use. The main advantage of its use lies in the fact…
Today we are going to talk about two ways of testing in Apache Airflow. Historically, testing in Airflow has been something that has been a headache for all users of the famous framework. The coupling of the code with the…
Apache Airflow is an open source tool designed for workflow orchestration especially useful in the field of data engineering. DAGs are defined in Python files and set the relationship and dependencies between the tasks to be executed. You can take…
Usually, when we start working on a new integration where it is necessary to connect to AWS services in the early stages of development, it is easier and faster to work only locally. For this, we can make use of…