Carlos Rodriguez

Carlos Rodriguez

Introduction to Pycharm Templates

Introduction to Pycharm Templates

PyCharm is an IDE developed by JetBrains for Python. As an IDE, its goal is to provide the programmer with a set of features that make code development in this language more efficient. For example, it automatically writes import lines…

Introduction to Logistic Regression

Introduction to Logistic Regression

Logistic regression is a statistical methodology that allows modeling the relationships between a binary categorical variable and a set of explanatory variables. Specifically, it models the probability that an observation belongs to one of the categories of that binary variable.…

The use of Window in Apache Spark

The use of Window in Apache Spark

When processing data we often find ourselves in a situation where we want to calculatevariables over certain subset of observations. For example, we might be interested in theaverage value per group or the maximum value for each group. The groupBy…