Skip to content
No results
  • Home
  • About us
  • Services
  • English
    • Español
    • English
Damavis Logo
  • Home
  • About us
  • Services
  • English
    • Español
    • English
Damavis Logo
Theoretical introduction to Spark Structured Streaming
  • Data Engineering

Theoretical introduction to Spark Structured Streaming

In recent years, data processing with low latency, practically in real time, is becoming a requirement increasingly demanded by companies in their big data processes. It is in this context where the concept of stream processing is introduced, which refers…

  • Agustín Mora
  • 2024-07-17
Pricing with rule system using Linear Programming
  • Algorithms

Pricing with rule system using Linear Programming

The pricing systems currently used in companies have a substantial difference with respect to those of 20 years ago. We have moved from static pricing systems, that is, constant prices at different times of the year, to dynamic systems, where…

  • Daniel Bestard
  • 2024-07-11
  • Data Engineering

Watermarks in Apache Spark Structured Streaming

Apache Spark’s Structured Streaming API is a powerful tool for processing real-time data streams. In this context, there are certain use cases where ensuring the accuracy of the processed data is not trivial due to the time dimension that inherently…

  • Jordi Vanrell
  • 2024-07-05
Data relationship models in a Data Warehouse
  • Data Engineering

Data Relationship Models in a Data Warehouse

In the field of Data Engineering, efficient database design is essential to handle large volumes of data and provide effective analysis. Throughout my experience as a Data Engineer, I have worked with the main data relationship systems and have observed…

  • Óscar García
  • 2024-06-28
Team Building in Damavis: Paintball and Warriors
  • Damavis

Team Building in Damavis: Paintball and Warriors

Being a hard working team that develops great projects is not incompatible with the possibility of organizing different and fun activities to promote a good working environment.  At Damavis we put this into practice by having the whole team, which…

  • Laura Rodríguez
  • 2024-06-20
Vector database: What it is and how it works
  • Data Engineering

Vector database: What it is and how it works

This article assumes that there is a basic knowledge about embeddings of objects, either text or images. In case you don’t have any notions on the subject, the post on Text Embeddings: the basis of modern NLP explains this concept.…

  • Antoni Casas
  • 2024-05-30
RAG implementations and extensions
  • Data Science

RAG implementations and extensions

In a previous article of our blog we detailed what is RAG (Retrieval Augmented Generation) and how to take advantage of embedding models to extend the knowledge of an LLM with our own document base. In this post, we will…

  • Jesús Aguado
  • 2024-05-23
Differences between Looker Studio and Looker Studio Pro
  • Software

Differences between Looker Studio and Looker Studio Pro

As a BI tool, Looker Studio is a visualizer as well as a data management platform which allows extracting information while maintaining its governance, security, accessibility and agility in use. The main advantage of its use lies in the fact…

  • Vanessa Pradas
  • 2024-05-10
  • Data Analytics

Python libraries for interactive map visualization

When there is a need to explore data or generate visualizations related to geographic entities (the polygon of a country or geographic coordinate points) it is quite reasonable to think about maps. Sometimes it may be valid to use simpler…

  • Jordi Vanrell
  • 2024-05-03
Heteroscedasticity: Impact on Linear Regression
  • Data Science

A Practical Guide to Heteroscedasticity in Linear Regression

The linear regression model is one of the most useful tools in every data scientist’s kit bag. Although this post is geared towards people with first-hand knowledge of this statistical model, it never hurts to remember that linear regression aims…

  • Agustín Mora
  • 2024-04-26
Prev
1 2 3 4 5 6 7 8 … 19
Next
  • Español
  • English

Recent Posts

  • Regression, Machine Learning and Deep Learning to predict cancellations
  • Git branching: GitHub Flow, GitFlow and Trunk-Based Development
  • The most important new features in Airflow 3.0
  • Getting started with Apache NiFi
  • Monitoring with Prometheus and Grafana

Archives

  • June 2025 (3)
  • May 2025 (3)
  • April 2025 (2)
  • March 2025 (2)
  • February 2025 (4)
  • January 2025 (4)
  • December 2024 (3)
  • November 2024 (4)
  • October 2024 (5)
  • September 2024 (4)
  • August 2024 (5)
  • July 2024 (4)
  • June 2024 (2)
  • May 2024 (4)
  • April 2024 (4)
  • March 2024 (3)
  • February 2024 (5)
  • January 2024 (1)
  • December 2023 (4)
  • November 2023 (3)
  • October 2023 (1)
  • September 2023 (1)
  • June 2023 (2)
  • May 2023 (2)
  • April 2023 (1)
  • March 2023 (3)
  • February 2023 (4)
  • January 2023 (1)
  • December 2022 (4)
  • November 2022 (3)
  • October 2022 (3)
  • September 2022 (6)
  • August 2022 (4)
  • July 2022 (5)
  • June 2022 (2)
  • May 2022 (3)
  • April 2022 (3)
  • March 2022 (2)
  • February 2022 (4)
  • January 2022 (4)
  • December 2021 (9)
  • November 2021 (8)
  • October 2021 (8)
  • September 2021 (2)
  • June 2021 (3)
  • May 2021 (8)
  • April 2021 (8)
  • March 2021 (7)
  • February 2021 (9)

Categories

  • Algorithms (30)
  • Damavis (46)
  • Damavis team (3)
  • Data Analytics (9)
  • Data Engineering (41)
  • Data Science (22)
  • Software (39)
Copyright © 2025 - WordPress Theme by CreativeThemes