Introduction to MongoDB
What is MongoDB? MongoDB is an open source NOSQL database. This means that data does not necessarily have to follow a schema. All data is stored in a JSON similar document format known as BSON or Binary JSON and will…
What is MongoDB? MongoDB is an open source NOSQL database. This means that data does not necessarily have to follow a schema. All data is stored in a JSON similar document format known as BSON or Binary JSON and will…
Big Data analytics can be defined as the process by which large volumes of data are examined in order to find hidden patterns, correlations, market trends, customer behaviours, etc. in them. The aim is to obtain information that can help…
Since our Pentaho PDI plugin for Apache Airflow release, we have seen an industry shift towards the usage of Apache Hop for data processing. What is Apache Hop? Apache Hop started (late 2019) as a fork of Kettle PDI, is…
In general, a company has several types of data: some from CRM, others from Google Analytics, from the web or e-commerce, etc. This data comes from different sources and, in addition, each one has a different structure and format, so…
One way we at Damavis help our clients is through the implementation of Record Linkage solutions. This type of process makes it possible to find records that refer to the same entity but are found in different data sources. Thanks…
In the technological sector, it is increasingly common to find job offers looking for profiles related to the management of large volumes of data. Big Data is considered one of the big bets for the future, both for companies that…
At Damavis we work hand in hand with our clients to design Big Data and Artificial Intelligence technological solutions. And not only we do it by developing the project that the client proposes us from the beginning, but we study…
We all know that Artificial Intelligence is advancing at a significant speed in recent years. One of the aspects that is currently gaining most visibility is the ability to generate images with the unique source of origin of a descriptive…
In a previous post, we talked about the basics of linear programming and the Simplex method and, in this article, we are going to give an introduction to an emerging branch of mathematics: Fuzzy Logic. In a later article, we…
Apache Spark is an open source framework that allows us to process large volumes of data in a distributed way. How? By dividing the large volumes of data, impossible to process in one machine, and distributing them among the different…