Weaver

| Summary | Features | Links | Configuration | Documentation | Extra Details & Sponsors |

Implementations

Weaver (the nest-builder)

Weaver birds build exquisite and elaborate nest structures that are a rival to any human feat of engineering. Some of these nests are the largest structures to be built by birds. [Eden].

Although weavers are named for their elaborately woven nests, some are notable for their selective parasitic nesting habits instead. [Wikipedia]

Weaver is an OGC-API flavored Execution Management Service that allows the execution of workflows chaining various applications and Web Processing Services inputs and outputs. Remote execution is deferred by the EMS to one or many Application, Deployment and Execution Service or remote service providers, and employs Common Workflow Language (CWL) configurations to define an OGC Application Package deployed for each process.

dependencies

Requires Python 3.8+ Libraries.io Dependencies Status PyUp Dependencies Status

license

GitHub License FOSSA Status

build status

ReadTheDocs Build Status (master branch) Docker Build Mode (latest version) Docker Build Status (latest version)

tests status

Github Actions CI Build Status (master branch) Github Actions CI Build Status (latest tag) Code Coverage Codacy Badge

releases

Latest Tagged Version Commits since latest release Docker Image

Summary

Weaver is primarily an Execution Management Service that allows the execution of workflows chaining various applications and Web Processing Services inputs and outputs. Remote execution of each process in a workflow chain is dispatched by the EMS to one or many registered Application, Deployment and Execution Service by ensuring the transfer of files accordingly between instances when located across multiple remote locations.

Weaver can also accomplish the ADES role in order to perform application deployment at the data source using the application definition provided by Common Workflow Language (CWL) configuration. It can then directly execute a registered process OGC Application Package with received inputs from a WPS request to expose output results for a following ADES in a EMS workflow execution chain.

Weaver extends OGC API - Processes by providing additional functionalities such as more detailed job logs endpoints, adding more process management and search request options than required by the standard, and supporting remote providers registration for dynamic process definitions, to name a few. Because of this, not all features offered in Weaver are guaranteed to be applicable on other similarly behaving ADES and/or EMS instances. The reference specification is tracked to preserve the minimal conformance requirements and provide feedback to Open Geospatial Consortium (OGC) in this effect.

Weaver can be launched either as an EMS, an ADES or an HYBRID of both according to its configuration. For more details, see Configuration and Documentation sections.

Features

Following videos present some of the features and potential capabilities of servicing and executing processes offered by Application, Deployment and Execution Service and Execution Management Service instances like Weaver.

Keywords: Big Data, software architecture, Earth Observation, satellite data, processing, climate change, machine learning, climate services.

Applications

The video shares the fundamental ideas behind the architecture, illustrates how application stores for Earth Observation data processing can evolve, and illustrates the advantages with applications based on machine learning.

Watch the Application video on YouTube
Watch the Application video: http://www.youtube.com/watch?v=v=no3REyoxE3

Platform

The video shares the fundamental ideas behind the architecture, illustrates how platform managers can benefit from application stores, and shows the potential for multidisciplinary workflows in thematic platforms.

Watch the Platform video on YouTube
Watch the Platform video: http://www.youtube.com/watch?v=v=QkdDFGEfIAY

Links

Docker image repositories:

Name

Reference

Access

DockerHub

pavics/weaver

public

CRIM registry

ogc/weaver

restricted

CRIM OGC Processes

ogc-public

restricted

For a prebuilt image, pull as follows:

docker pull pavics/weaver:5.8.0

For convenience, following tags are also available:

  • weaver:5.8.0-manager: Weaver image that will run the API for WPS process and job management.

  • weaver:5.8.0-worker: Weaver image that will run the process job runner application.

Following links correspond to existing servers with Weaver configured as EMS or ADES instances respectively.

Institution & Partners

Project & Description

Version

Entrypoint

CRIM

DACCS / OGC - Hirondelle Development Instance

crim-hirondelle-weaver-version

https://hirondelle.crim.ca/weaver

CRIM

Demonstration Services Portal

crim-services-weaver-version

https://services.crim.ca/weaver

Ouranos

PAVICS Server

ouranos-pavics-weaver-version

https://pavics.ouranos.ca/weaver/

University of Toronto

Marble Climate - RedOak Instance

UofT-RedOak-weaver-version

https://redoak.cs.toronto.edu/weaver/

CRIM, ECCC, CLIMAtlantic, Ouranos, PCIC, PCC

ClimateData.ca / DonneesClimatiques.ca Portal

climate-data-weaver-version

https://pavics.climatedata.ca/

Note

The test servers will not necessarily be up-to-date with the latest version.

Configuration

All configuration settings can be overridden using a weaver.ini file that will be picked during instantiation of the application. An example of such file is provided here: weaver.ini.example.

Setting the operational mode of Weaver (EMS/ADES/HYBRID) is accomplished using the weaver.configuration field of weaver.ini. For more configuration details, please refer to Documentation.

Documentation

The REST API documentation is auto-generated and served under any running Weaver application on route {WEAVER_URL}/api/. This documentation will correspond to the version of the executed Weaver application. For the latest documentation, you can refer to the OpenAPI Specification served directly on ReadTheDocs.

More ample details about installation, configuration and usage are also provided on ReadTheDocs. These are generated from corresponding information provided in docs source directory.

Extra Details & Sponsors

The project was initially developed upon OGC Testbed-14 – ESA Sponsored Threads – Exploitation Platform findings and improvements following from previous OGC Testbed-13 - Cloud Engineering Report architecture designs. It was also built upon sponsorship from the U.S. Department of Energy to support common API of the Earth System Grid Federation (ESGF). The findings are reported on the OGC Testbed-14 thread, and more explicitly in the ADES & EMS Results and Best Practices Engineering Report.

The project has been employed for OGC Testbed-15 - Machine Learning Thread to demonstrate the use of Machine Learning interactions with OGC web standards in the context of natural resources applications. The advancements are reported through the OGC Testbed-15: Machine Learning Engineering Report.

Developments are continued in OGC Testbed-16 to improve methodologies in order to provide better interoperable geospatial data processing in the areas of Earth Observation Application Packages. Findings and recommendations are presented in the OGC Testbed-16: Data Access and Processing Engineering Report.

Todo

deploy from ipynb, add OGC Testbed-16: Earth Observation Application Packages with Jupyter Notebooks Engineering Report (https://github.com/crim-ca/weaver/issues/63)

Videos and more functionalities were introduced in Weaver following OGC Earth Observation Applications Pilot. Corresponding developments are reported in the OGC Earth Observation Applications Pilot: CRIM Engineering Report.

Weaver has been used to participate in interoperability testing effort that lead to OGC Best Practice for Earth Observation Application Package technical report. This resulted, along with previous efforts, in the definition of OGC API - Processes - Part 2: Deploy, Replace, Undeploy (DRU) backed by validated test cases using Common Workflow Language (CWL) as the representation method for the deployment and execution of OGC Application Package close to the data.

Weaver is employed in the OGC Open Science Persistent Demonstrator initiative to demonstrate reusability, portability, and transparency in the context of open science in Earth Observation, using OGC Application Package encoded as CWL for interoperability and distributed processing workflows. Its related developments and demonstrations were presented at the OGC 129th Member’s Meeting (2024, Montréal) and the Earth Science Information Partners (ESIP) 2024 Meeting.

Weaver is employed in OGC Testbed-20 - GeoDataCubes to improve and work on the alignment of multiple community standards involved in workflow design, such as CWL, openEO and OGC API - Processes - Part 3: Workflows and Chaining, for processing of multidimensional data involved through GeoDataCube interactions.

The project is furthermore developed through the Data Analytics for Canadian Climate Services (DACCS) initiative and is employed by the ClimateData.ca / DonneesClimatiques.ca portal.

Weaver is implemented in Python with the Pyramid web framework. It is part of PAVICS and Birdhouse ecosystems and is available within the birdhouse-deploy server stack.

Table of Contents