# Category Data Science

## RAG implementations and extensions

In a previous article on our blog, we explained what RAG (Retrieval Augmented Generation) is and how to use embedding models to extend the knowledge of an LLM with our own document base. In this post, we will…

## Heteroscedasticity: Impact on Linear Regression

The linear regression model is one of the most useful tools in every data scientist’s kit bag. Although this post is geared towards people with first-hand knowledge of this statistical model, it never hurts to remember that linear regression aims…

## Retrieval Augmented Generation: What is RAG?

RAG stands for “retrieval augmented generation”. RAG is intended to allow a “large language model” (LLM) to use additional information provided by a user. LLMs are trained on large amounts of text, usually obtained from the Internet. However, it would…

## Price elasticity of demand

Modeling and explaining consumer behavior toward a product is essential for understanding which factors influence it, price being one of the most important. Let’s take as an example a company X that sells clothing. It is a…

## Linear regression with elastic net: implementations in Python

A linear regression is a model used to approximate the linear relationship between a dependent variable Y and a set of independent variables X. In matrix form it can be expressed as Y = Xβ + ε, where ε is a vector of…
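As a rough illustration of approximating a linear relationship, here is a minimal sketch of ordinary least squares for a single predictor, using only the Python standard library. It is plain least squares, not the elastic net of the article's title, and the helper name `fit_simple_ols` and the toy data are assumptions made for this example.

```python
def fit_simple_ols(xs, ys):
    """Fit y = intercept + slope * x by ordinary least squares (hypothetical helper)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sample covariance of x and y, and variance of x (both unnormalised).
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope


# Toy, noise-free data generated from y = 1 + 2x (an assumption for the example).
xs = [0.0, 1.0, 2.0, 3.0, 4.0]
ys = [1.0 + 2.0 * x for x in xs]

intercept, slope = fit_simple_ols(xs, ys)
print(f"intercept={intercept:.3f}, slope={slope:.3f}")
```

Because the toy data has no noise, the fitted coefficients should match the generating line up to floating-point error.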

## Text Embeddings: The basis of modern NLP

All graphics in this article are interactive due to the nature of trying to represent text graphically. To see which sentence each point represents, simply hover the mouse over it. The results obtained have been generated on the “STS…

## Digital twins with Artificial Intelligence

What is a digital twin? When we talk about digital twins, we are mainly referring to a virtual representation of a physical system, where a copy (twin) of the real environment is reproduced using the information (Big Data) that affects…

## Kernel Trick in Machine Learning

The kernel trick is a typical method in machine learning to transform data from an original space to an arbitrary Hilbert space, usually of higher dimensions, where they are more easily separable (ideally, linearly separable). This technique is the basis…
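To make the idea concrete, the following minimal sketch compares an explicit mapping into a higher-dimensional space with the equivalent kernel evaluation in the original space. The degree-2 polynomial kernel, the feature map `phi`, and the sample points are assumptions chosen for illustration.

```python
import math


def phi(x):
    """Explicit degree-2 feature map for a 2-D point (hypothetical helper)."""
    x1, x2 = x
    return [x1 ** 2, math.sqrt(2) * x1 * x2, x2 ** 2]


def dot(a, b):
    """Standard inner product of two equal-length sequences."""
    return sum(ai * bi for ai, bi in zip(a, b))


def poly_kernel(x, y):
    """Homogeneous polynomial kernel of degree 2: k(x, y) = (x . y)^2."""
    return dot(x, y) ** 2


x = [1.0, 2.0]
y = [3.0, -1.0]

explicit = dot(phi(x), phi(y))  # inner product computed in the 3-D feature space
implicit = poly_kernel(x, y)    # same quantity computed in the original 2-D space

print(explicit, implicit)
```

Both values agree up to floating-point error, which is the point of the trick: the kernel gives the inner product in the mapped space without ever building the mapped vectors.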

## Object recognition with Deep Learning

Object detection is a branch of computer vision that focuses on identifying the presence and location of objects in images. It is used in a wide variety of applications, from surveillance systems and medical image analysis to autonomous driving. This technique…

## Machine Learning with Python: Practical examples

In a previous blog article, we reviewed from a theoretical point of view the main data science and machine learning libraries that data scientists use to process data efficiently and extract its full potential. In this post,…