weaver.formats
Module Contents
- class weaver.formats.ContentType[source]
Supported
Content-Type
values.Media-Type nomenclature:
<type> "/" [x- | <tree> "."] <subtype> ["+" suffix] *[";" parameter=value]
- class weaver.formats.ContentEncoding[source]
Supported
Content-Encoding
values.Note
Value
binary
is kept for convenience and backward compatibility with older definitions. It will default to the same encoding strategy as ifbase64
was specified explicitly. Valuebinary
is not part of RFC 4648, but remains a common occurrence that dates from whenformat: binary
was the approach employed to represent binary (JSON-schema Draft-04 and prior) instead of what is now recommended usingcontentEncoding: base64
(JSON-schema Draft-07).See also
- static is_text(encoding: Any) bool [source]
Indicates if the
Content-Encoding
value can be categorized as textual data.
- static is_binary(encoding: Any) bool [source]
Indicates if the
Content-Encoding
value can be categorized as binary data.
- static open_parameters(encoding: Any, mode: FileModeSteamType = 'r') Tuple[FileModeEncoding, typing_extensions.Literal[UTF-8, None]] [source]
Obtains relevant
mode
andencoding
parameters foropen()
using the specifiedContent-Encoding
.
- static encode(data: AnyStr, encoding: AnyContentEncoding = BASE64, binary: typing_extensions.Literal[True] = True) bytes [source]
- static encode(data: AnyStr, encoding: AnyContentEncoding = BASE64, binary: typing_extensions.Literal[False] = False) str
- static encode(data: DataStrT, encoding: AnyContentEncoding = BASE64, binary: typing_extensions.Literal[None] = None) DataStrT
Encodes the data to the requested encoding and convert it to the string-like data type representation.
- static decode(data: AnyStr, encoding: AnyContentEncoding = BASE64, binary: typing_extensions.Literal[True] = True) bytes [source]
- static decode(data: AnyStr, encoding: AnyContentEncoding = BASE64, binary: typing_extensions.Literal[False] = False) str
- static decode(data: DataStrT, encoding: AnyContentEncoding = BASE64, binary: typing_extensions.Literal[None] = None) DataStrT
Decodes the data from the specified encoding and convert it to the string-like data type representation.
- class weaver.formats.OutputFormat[source]
Renderer output formats for CLI, OpenAPI and HTTP response content generation.
- classmethod get(format_or_version: str | AnyOutputFormat | AnyContentType | weaver.base.PropertyDataTypeT, default: AnyOutputFormat | None = None, allow_version: bool = True) AnyOutputFormat | weaver.base.PropertyDataTypeT [source]
Resolve the applicable output format.
- Parameters:
format_or_version – Either a WPS version, a known value for a
f
/format
query parameter, or anAccept
header that can be mapped to one of the supported output formats.default – Default output format if none could be resolved. If no explicit default is specified as default in case of unresolved format,
JSON
is used by default.allow_version – Enable WPS version specifiers to infer the corresponding output representation.
- Returns:
Resolved output format.
- classmethod convert(data: weaver.typedefs.JSON, to: AnyOutputFormat | AnyContentType | None, item_root: str = 'item') str | weaver.typedefs.JSON [source]
Converts the input data from JSON to another known format.
- Parameters:
data – Input data to convert. Must be a literal JSON object, not a JSON-like string.
to – Target format representation. If the output format is not JSON, it is ALWAYS converted to the formatted string of the requested format to ensure the contents are properly represented as intended. In the case of JSON as target format or unknown format, the original object is returned directly.
item_root – When using XML or HTML representations, defines the top-most item name. Unused for other representations.
- Returns:
Formatted output.
- class weaver.formats.SchemaRole[source]
Constants container that provides similar functionalities to
ExtendedEnum
without explicit Enum membership.
- weaver.formats.get_allowed_extensions() List[str] [source]
Obtain the complete list of extensions that are permitted for processing by the application.
Note
This is employed for security reasons. Files can still be specified with another allowed extension, but it will not automatically inherit properties applicable to scripts and executables. If a specific file type is refused due to its extension, a PR can be submitted to add it explicitly.
- weaver.formats.get_format(media_type: str, default: str | None = None) pywps.inout.formats.Format | None [source]
Obtains a
Format
with predefined extension and encoding details from known media-types.
- weaver.formats.get_extension(media_type: str, dot: bool = True) str [source]
Retrieves the extension corresponding to
media_type
if explicitly defined, or by parsing it.
- weaver.formats.get_content_type(extension: str, charset: str | None = None, default: str | None = None) str | None [source]
Retrieves the Content-Type corresponding to the specified extension if it can be matched.
- Parameters:
extension – Extension for which to attempt finding a known Content-Type.
charset – Charset to apply to the Content-Type as needed if extension was matched.
default – Default Content-Type to return if no extension is matched.
- Returns:
Matched or default Content-Type.
- weaver.formats.add_content_type_charset(content_type: str | ContentType, charset: str | None) str [source]
Apply the specific charset to the content-type with some validation in case of conflicting definitions.
- Parameters:
content_type – Desired Content-Type.
charset – Desired charset parameter.
- Returns:
updated content-type with charset.
- weaver.formats.get_cwl_file_format(media_type: str) Tuple[weaver.typedefs.JSON | None, str | None] [source]
- weaver.formats.get_cwl_file_format(media_type: str, make_reference: typing_extensions.Literal[False] = False, **__: bool) Tuple[weaver.typedefs.JSON | None, str | None]
- weaver.formats.get_cwl_file_format(media_type: str, make_reference: typing_extensions.Literal[True] = False, **__: bool) str | None
Obtains the extended schema reference from the media-type identifier.
Obtains the corresponding IANA/EDAM/etc.
format
value to be applied under a CWL I/OFile
from themedia_type
(Content-Type
header) using the first matched one.Lookup procedure is as follows:
- If
make_reference=False
: - If there is a match, returns
tuple({<namespace-name: namespace-url>}, <format>)
with: corresponding namespace mapping to be applied under
$namespaces
in the CWL.value of
format
adjusted according to the namespace to be applied toFile
in the CWL.
- If there is a match, returns
If there is no match but
must_exist=False
, returns a literal and non-existing definition astuple({"iana": <iana-url>}, <format>)
.If there is no match but
must_exist=True
ANDallow_synonym=True
, retry the call with the synonym if available, or move to next step. Skip this step ifallow_synonym=False
.Otherwise, returns
(None, None)
- If
- If
make_reference=True
: If there is a match, returns the explicit format reference as
<namespace-url>/<format>
.If there is no match but
must_exist=False
, returns the literal reference as<iana-url>/<format>
(N.B.: literal non-official media-type reference will be returned even if an official synonym exists).If there is no match but
must_exist=True
ANDallow_synonym=True
, retry the call with the synonym if available, or move to next step. Skip this step ifallow_synonym=False
.Returns a single
None
as there is no match (directly or synonym).
- If
- Note:
In situations where
must_exist=False
is used and that the namespace and/or full format URL cannot be resolved to an existing reference, CWL will raise a validation error as it cannot confirm theformat
. You must therefore make sure that the returned reference (or a synonym format) really exists when usingmust_exist=False
before providing it to the CWL I/O definition. Settingmust_exist=False
should be used only for literal string comparison or pre-processing steps to evaluate formats.
- Parameters:
media_type – Some reference, namespace’d or literal (possibly extended) media-type string.
make_reference – Construct the full URL reference to the resolved media-type. Otherwise, return tuple details.
must_exist – Return result only if it can be resolved to an official media-type (or synonym if enabled), otherwise
None
. Non-official media-type can be enforced if disabled, in which case IANA namespace/URL is used as it preserves the original<type>/<subtype>
format.allow_synonym – Allow resolution of non-official media-type to an official media-type synonym if available. Types defined as synonym have semantically the same format validation/resolution for CWL. Requires
must_exist=True
, otherwise the non-official media-type is employed directly as result.
- Returns:
Resolved media-type format for CWL usage, accordingly to specified arguments (see description details).
- weaver.formats.map_cwl_media_type(cwl_format: str | None) str | None [source]
Obtains the Media-Type that corresponds to the specified CWL
format
.- Parameters:
cwl_format – Long form URL or namespaced variant of a CWL format referring to an ontology Media-Type.
- Returns:
Resolved Media-Type.
- weaver.formats.clean_media_type_format(media_type: str, suffix_subtype: bool = False, strip_parameters: bool = False) str | None [source]
Obtains a generic media-type identifier by cleaning up any additional parameters.
Removes any additional namespace key or URL from
media_type
so that it corresponds to the generic representation (e.g.:application/json
) instead of the<namespace-name>:<format>
mapping variant used in CWL->inputs/outputs->File->format or the complete URL reference.Removes any leading temporary local file prefix inserted by CWL when resolving namespace mapping. This transforms
file:///tmp/dir/path/package#application/json
to plainapplication/json
.According to provided arguments, it also cleans up additional parameters or extracts sub-type suffixes.
- Parameters:
media_type – Media-Type, full URL to media-type or namespace-formatted string that must be cleaned up.
suffix_subtype – Remove additional sub-type specializations details separated by
+
symbol such that an explicit format likeapplication/vnd.api+json
returns only its most basic suffix format defined as``application/json``.strip_parameters – Removes additional media-type parameters such that only the leading part defining the
type/subtype
are returned. For example, this will get rid of; charset=UTF-8
or; version=4.0
parameters.
Note
Parameters
suffix_subtype
andstrip_parameters
are not necessarily exclusive.
- weaver.formats.guess_target_format(request: weaver.typedefs.AnyRequestType) ContentType [source]
- weaver.formats.guess_target_format(request: weaver.typedefs.AnyRequestType, default: ContentType | str | None) ContentType
- weaver.formats.guess_target_format(request: weaver.typedefs.AnyRequestType, return_source: typing_extensions.Literal[True], override_user_agent: bool) Tuple[ContentType, FormatSource]
- weaver.formats.guess_target_format(request: weaver.typedefs.AnyRequestType, default: ContentType | str | None, return_source: typing_extensions.Literal[True], override_user_agent: bool) Tuple[ContentType, FormatSource]
Guess the best applicable response
Content-Type
header from the request.Considers the request
Accept
header,format
query and alternativelyf
query to parse possible formats. Full Media-Type are expected in the header. Query parameters can use both the full Media-Type, or only the sub-type (i.e.: JSON, XML, etc.), with case-insensitive names.Defaults to
ContentType.APP_JSON
if none was specified asdefault
explicitly and that noAccept` header or ``format
/f
queries were provided. Otherwise, applies the specifieddefault
format specifiers were not provided in the request.Can apply
User-Agent
specific logic to override automatically addedAccept
headers by many browsers such that sending requests to the API using them will not automatically default back to typical XML or HTML representations. If browsers are used to send requests, but thatformat
/f
queries are used directly in the URL, those will be applied since this is a very intuitive (and easier) approach to request different formats when using browsers. Optionoverride_user_agent
must be enabled to apply this behavior.When
User-Agent
clients are identified as another source, such as sending requests from a server or from code, both headers and query parameters are applied directly without question.- Returns:
Matched media-type or default, and optionally, the source of resolution.
- weaver.formats.find_supported_media_types(io_definition: weaver.typedefs.ProcessInputOutputItem) List[str] | None [source]
Finds all supported media-types indicated by an I/O.
Note
Assumes that media-types are indicated under
formats
, which should have been obtained either by direct submission when using WPS deployment, generated fromschema
using OGC deployment, or using the nestedformat
ofFile
types from CWL deployment.- Parameters:
io_definition
- Returns:
supported media-types
- weaver.formats.repr_json(data: Any, force_string: bool = True, ensure_ascii: bool = False, indent: int | None = 2, **kwargs: Any) weaver.typedefs.JSON | str | None [source]
Ensure that the input data can be serialized as JSON to return it formatted representation as such.
If formatting as JSON fails, returns the data as string representation or
None
accordingly.