weaver.formats
Module Contents
- class weaver.formats.ContentType[source]
Supported
Content-Typevalues.Media-Type nomenclature:
<type> "/" [x- | <tree> "."] <subtype> ["+" suffix] *[";" parameter=value]
- class weaver.formats.ContentEncoding[source]
Supported
Content-Encodingvalues.Note
Value
binaryis kept for convenience and backward compatibility with older definitions. It will default to the same encoding strategy as ifbase64was specified explicitly. Valuebinaryis not part of RFC 4648, but remains a common occurrence that dates from whenformat: binarywas the approach employed to represent binary (JSON-schema Draft-04 and prior) instead of what is now recommended usingcontentEncoding: base64(JSON-schema Draft-07).See also
- static is_text(encoding: Any) bool[source]
Indicates if the
Content-Encodingvalue can be categorized as textual data.
- static is_binary(encoding: Any) bool[source]
Indicates if the
Content-Encodingvalue can be categorized as binary data.
- static open_parameters(encoding: Any, mode: FileModeSteamType = 'r') Tuple[FileModeEncoding, typing_extensions.Literal[UTF-8, None]][source]
Obtains relevant
modeandencodingparameters foropen()using the specifiedContent-Encoding.
- static encode(data: AnyStr, encoding: ContentEncoding = BASE64, binary: typing_extensions.Literal[True] = True) bytes[source]
- static encode(data: AnyStr, encoding: ContentEncoding = BASE64, binary: typing_extensions.Literal[False] = False) str
- static encode(data: DataStrT, encoding: ContentEncoding = BASE64, binary: typing_extensions.Literal[None] = None) DataStrT
Encodes the data to the requested encoding and convert it to the string-like data type representation.
- static decode(data: AnyStr, encoding: ContentEncoding = BASE64, binary: typing_extensions.Literal[True] = True) bytes[source]
- static decode(data: AnyStr, encoding: ContentEncoding = BASE64, binary: typing_extensions.Literal[False] = False) str
- static decode(data: DataStrT, encoding: ContentEncoding = BASE64, binary: typing_extensions.Literal[None] = None) DataStrT
Decodes the data from the specified encoding and convert it to the string-like data type representation.
- class weaver.formats.OutputFormat[source]
Renderer output formats for CLI, OpenAPI and HTTP response content generation.
- classmethod get(format_or_version: str | AnyOutputFormat | weaver.base.PropertyDataTypeT, default: AnyOutputFormat | None = JSON, allow_version: bool = True) AnyOutputFormat | weaver.base.PropertyDataTypeT[source]
Resolve the applicable output format.
- Parameters:
format_or_version – Either a WPS version, a known value for a
f/formatquery parameter, or anAcceptheader that can be mapped to one of the supported output formats.default – Default output format if none could be resolved.
allow_version – Enable WPS version specifiers to infer the corresponding output representation.
- Returns:
Resolved output format.
- classmethod convert(data: weaver.typedefs.JSON, to: AnyOutputFormat | str, item_root: str = 'item') str | weaver.typedefs.JSON[source]
Converts the input data from JSON to another known format.
- Parameters:
data – Input data to convert. Must be a literal JSON object, not a JSON-like string.
to – Target format representation. If the output format is not JSON, it is ALWAYS converted to the formatted string of the requested format to ensure the contents are properly represented as intended. In the case of JSON as target format or unknown format, the original object is returned directly.
item_root – When using XML representations, defines the top-most item name. Unused for other representations.
- Returns:
Formatted output.
- class weaver.formats.SchemaRole[source]
Constants container that provides similar functionalities to
ExtendedEnumwithout explicit Enum membership.
- weaver.formats.get_allowed_extensions() List[str][source]
Obtain the complete list of extensions that are permitted for processing by the application.
Note
This is employed for security reasons. Files can still be specified with another allowed extension, but it will not automatically inherit properties applicable to scripts and executables. If a specific file type is refused due to its extension, a PR can be submitted to add it explicitly.
- weaver.formats.get_format(media_type: str, default: str | None = None) pywps.inout.formats.Format | None[source]
Obtains a
Formatwith predefined extension and encoding details from known media-types.
- weaver.formats.get_extension(media_type: str, dot: bool = True) str[source]
Retrieves the extension corresponding to
media_typeif explicitly defined, or by parsing it.
- weaver.formats.get_content_type(extension: str, charset: str | None = None, default: str | None = None) str | None[source]
Retrieves the Content-Type corresponding to the specified extension if it can be matched.
- Parameters:
extension – Extension for which to attempt finding a known Content-Type.
charset – Charset to apply to the Content-Type as needed if extension was matched.
default – Default Content-Type to return if no extension is matched.
- Returns:
Matched or default Content-Type.
- weaver.formats.add_content_type_charset(content_type: str | ContentType, charset: str | None) str[source]
Apply the specific charset to the content-type with some validation in case of conflicting definitions.
- Parameters:
content_type – Desired Content-Type.
charset – Desired charset parameter.
- Returns:
updated content-type with charset.
- weaver.formats.get_cwl_file_format(media_type: str, make_reference: bool = False, must_exist: bool = True, allow_synonym: bool = True) Tuple[weaver.typedefs.JSON | None, str | None] | str | None[source]
Obtains the extended schema reference from the media-type identifier.
Obtains the corresponding IANA/EDAM/etc.
formatvalue to be applied under a CWL I/OFilefrom themedia_type(Content-Typeheader) using the first matched one.Lookup procedure is as follows:
- If
make_reference=False: - If there is a match, returns
tuple({<namespace-name: namespace-url>}, <format>)with: corresponding namespace mapping to be applied under
$namespacesin the CWL.value of
formatadjusted according to the namespace to be applied toFilein the CWL.
- If there is a match, returns
If there is no match but
must_exist=False, returns a literal and non-existing definition astuple({"iana": <iana-url>}, <format>).If there is no match but
must_exist=TrueANDallow_synonym=True, retry the call with the synonym if available, or move to next step. Skip this step ifallow_synonym=False.Otherwise, returns
(None, None)
- If
- If
make_reference=True: If there is a match, returns the explicit format reference as
<namespace-url>/<format>.If there is no match but
must_exist=False, returns the literal reference as<iana-url>/<format>(N.B.: literal non-official media-type reference will be returned even if an official synonym exists).If there is no match but
must_exist=TrueANDallow_synonym=True, retry the call with the synonym if available, or move to next step. Skip this step ifallow_synonym=False.Returns a single
Noneas there is no match (directly or synonym).
- If
- Note:
In situations where
must_exist=Falseis used and that the namespace and/or full format URL cannot be resolved to an existing reference, CWL will raise a validation error as it cannot confirm theformat. You must therefore make sure that the returned reference (or a synonym format) really exists when usingmust_exist=Falsebefore providing it to the CWL I/O definition. Settingmust_exist=Falseshould be used only for literal string comparison or pre-processing steps to evaluate formats.
- Parameters:
media_type – Some reference, namespace’d or literal (possibly extended) media-type string.
make_reference – Construct the full URL reference to the resolved media-type. Otherwise, return tuple details.
must_exist – Return result only if it can be resolved to an official media-type (or synonym if enabled), otherwise
None. Non-official media-type can be enforced if disabled, in which case IANA namespace/URL is used as it preserves the original<type>/<subtype>format.allow_synonym – Allow resolution of non-official media-type to an official media-type synonym if available. Types defined as synonym have semantically the same format validation/resolution for CWL. Requires
must_exist=True, otherwise the non-official media-type is employed directly as result.
- Returns:
Resolved media-type format for CWL usage, accordingly to specified arguments (see description details).
- weaver.formats.map_cwl_media_type(cwl_format: str | None) str | None[source]
Obtains the Media-Type that corresponds to the specified CWL
format.- Parameters:
cwl_format – Long form URL or namespaced variant of a CWL format referring to an ontology Media-Type.
- Returns:
Resolved Media-Type.
- weaver.formats.clean_media_type_format(media_type: str, suffix_subtype: bool = False, strip_parameters: bool = False) str | None[source]
Obtains a generic media-type identifier by cleaning up any additional parameters.
Removes any additional namespace key or URL from
media_typeso that it corresponds to the generic representation (e.g.:application/json) instead of the<namespace-name>:<format>mapping variant used in CWL->inputs/outputs->File->format or the complete URL reference.Removes any leading temporary local file prefix inserted by CWL when resolving namespace mapping. This transforms
file:///tmp/dir/path/package#application/jsonto plainapplication/json.According to provided arguments, it also cleans up additional parameters or extracts sub-type suffixes.
- Parameters:
media_type – Media-Type, full URL to media-type or namespace-formatted string that must be cleaned up.
suffix_subtype – Remove additional sub-type specializations details separated by
+symbol such that an explicit format likeapplication/vnd.api+jsonreturns only its most basic suffix format defined as``application/json``.strip_parameters – Removes additional media-type parameters such that only the leading part defining the
type/subtypeare returned. For example, this will get rid of; charset=UTF-8or; version=4.0parameters.
Note
Parameters
suffix_subtypeandstrip_parametersare not necessarily exclusive.
- weaver.formats.guess_target_format(request: weaver.typedefs.AnyRequestType, default: ContentType | str | None = ContentType.APP_JSON) ContentType | str[source]
Guess the best applicable response
Content-Typeheader from the request.Considers the request
Acceptheader,formatquery and alternativelyfquery to parse possible formats. Full Media-Type are expected in the header. Query parameters can use both the full Media-Type, or only the sub-type (i.e.: JSON, XML, etc.), with case-insensitive names.Defaults to
ContentType.APP_JSONif none was specified asdefaultexplicitly and that noAccept` header or ``format/fqueries were provided. Otherwise, applies the specifieddefaultformat specifiers were not provided in the request.Applies some specific logic to handle automatically added
Acceptheaders by many browsers such that sending requests to the API using them will not automatically default back to XML or similar HTML representations. If browsers are used to send requests, but thatformat/fqueries are used directly in the URL, those will be applied since this is a very intuitive (and easier) approach to request different formats when using browsers.When
User-Agentclients are identified as another source, such as sending requests from a server or from code, both headers and query parameters are applied directly without question.- Returns:
Matched media-type or default.
- weaver.formats.repr_json(data: Any, force_string: bool = True, ensure_ascii: bool = False, indent: int | None = 2, **kwargs: Any) weaver.typedefs.JSON | str | None[source]
Ensure that the input data can be serialized as JSON to return it formatted representation as such.
If formatting as JSON fails, returns the data as string representation or
Noneaccordingly.