Tag code

The use of Window in Apache Spark

The use of Window in Apache Spark

When processing data we often find ourselves in a situation where we want to calculatevariables over certain subset of observations. For example, we might be interested in theaverage value per group or the maximum value for each group. groupBy and…

Aggregation Pipelines on MongoDB

Aggregation pipelines on MongoDB

Why use MongoDB? MongoDB is a document oriented NOSQL open source database, which means that data does not necessarily have to follow a certain schema. This makes MongoDB an ideal candidate as a database for big data workloads as it…

Clean Code with Alpakka Kafka

Clean code with Alpakka Kafka

At Damavis we are very aware of the importance for our clients to have access to their data in real time. For this reason, one of our strengths is the development of tools and technologies that can move, transform and…