Category Data Engineering

OData v4 protocol: Metadata and basic queries

OData v4 protocol: Metadata and basic queries

Data processing and consumption are essential elements of the contemporary business world. Therefore, there are mysterious pieces of software, commonly abbreviated as APIs, whose role is fundamental in this traffic of information.  APIs (Application Programming Interface) are mechanisms for integration…

Custom Data Source in Spark 3

Custom Data Source in Spark 3

In 2020 Apache Spark released version 3.0.0.0 which introduced some changes to the API for defining custom data sources, known within the Spark environment as Custom Data Source. These were previously used through DatasourceV2, which generated confusion and an unintuitive…