First steps to program in Pyspark and Pycharm

First steps with Pyspark and Pycharm

Definitive guide to configure the Pyspark development environment in Pycharm; one of the most complete options. Spark has become the Big Data tool par excellence, helping us to process large volumes of data in a simplified, clustered and fault-tolerant way.…

Applying graph theory to find the optimal route

Graphs – Finding optimal routes

An example of how the use of graphs can help us find optimal routes to solve various problems. A graph system can be used for multiple purposes, being in some cases very useful to solve complex problems. In this article…

Pentaho PDI Plugin for Airflow

Schedule, orchestrate and monitor your Kettle tasks with Airflow with this Pentaho plugin. At Damavis we know the importance of data processing. Extracting, cleaning, transforming, aggregating, loading or cross-referencing multiple data sources allows our clients to have Insights or Predictive…