Why a Data Scientist should know Data Engineering
Today, we would like to share how useful it can be for a data scientist at Damavis to have a good understanding of data engineering. For those of you who have a background in statistics and mathematics and…
Damavis Summary of week 15, 2021
Application of Survival Analysis to price changes
Damavis Summary of week 14, 2021
First steps with Apache YARN customization
Basic Single Node Configuration. Note: the code in this post has been tested using Apache Hadoop 2.10.1. Please check out our previous post, Introduction to Apache Hadoop, to configure this version of Hadoop, in case you have not done it…
Damavis Summary of week 13, 2021
Simple perceptron: Mathematical definition and properties
One of the most common problems in data science is predicting the value of certain variables from other variables. For example, we need to know whether it is advisable to grant a loan to a…
Damavis Summary of week 12, 2021
Aggregation Pipelines on MongoDB
Why use MongoDB? MongoDB is an open-source, document-oriented NoSQL database, which means that data does not necessarily have to follow a fixed schema. This makes MongoDB an ideal candidate as a database for big data workloads as it…