METADATA - mozsearch

Metadata-Version: 2.1

Name: taskcluster

Version: 44.2.2

Summary: Python client for Taskcluster

Home-page: https://github.com/taskcluster/taskcluster

Author: Mozilla Taskcluster and Release Engineering

Author-email: release+python@mozilla.com

License: UNKNOWN

Platform: UNKNOWN

Classifier: Programming Language :: Python :: 2.7

Classifier: Programming Language :: Python :: 3.5

Classifier: Programming Language :: Python :: 3.6

Classifier: Programming Language :: Python :: 3.7

Description-Content-Type: text/markdown

License-File: LICENSE

Requires-Dist: requests (>=2.4.3)

Requires-Dist: mohawk (>=0.3.4)

Requires-Dist: slugid (>=2)

Requires-Dist: taskcluster-urls (>=12.1.0)

Requires-Dist: six (>=1.10.0)

Requires-Dist: aiohttp (>=3.7.4) ; python_version >= "3.6"

Requires-Dist: async-timeout (>=2.0.0) ; python_version >= "3.6"

Provides-Extra: test

Requires-Dist: pytest ; extra == 'test'

Requires-Dist: pytest-cov ; extra == 'test'

Requires-Dist: pytest-mock ; extra == 'test'

Requires-Dist: httmock ; extra == 'test'

Requires-Dist: mock ; extra == 'test'

Requires-Dist: setuptools-lint ; extra == 'test'

Requires-Dist: flake8 ; extra == 'test'

Requires-Dist: psutil ; extra == 'test'

Requires-Dist: hypothesis ; extra == 'test'

Requires-Dist: tox ; extra == 'test'

Requires-Dist: coverage ; extra == 'test'

Requires-Dist: python-dateutil ; extra == 'test'

Requires-Dist: subprocess32 ; (python_version == "2.7") and extra == 'test'

Requires-Dist: pytest-asyncio ; (python_version >= "3.6") and extra == 'test'

Requires-Dist: aiofiles ; (python_version >= "3.6") and extra == 'test'

Requires-Dist: httptest ; (python_version >= "3.6") and extra == 'test'

# Taskcluster Client for Python

[![Download](https://img.shields.io/badge/pypi-taskcluster-brightgreen)](https://pypi.python.org/pypi/taskcluster)

[![License](https://img.shields.io/badge/license-MPL%202.0-orange.svg)](http://mozilla.org/MPL/2.0)

**A Taskcluster client library for Python.**

This library is a complete interface to Taskcluster in Python.  It provides

both synchronous and asynchronous interfaces for all Taskcluster API methods,

in both Python-2 and Python-3 variants.

## Usage

For a general guide to using Taskcluster clients, see [Calling Taskcluster APIs](https://docs.taskcluster.net/docs/manual/using/api).

### Setup

Before calling an API end-point, you'll need to create a client instance.

There is a class for each service, e.g., `Queue` and `Auth`.  Each takes the

same options, described below.  Note that only `rootUrl` is

required, and it's unusual to configure any other options aside from

`credentials`.

For each service, there are sync and async variants.  The classes under

`taskcluster` (e.g., `taskcluster.Queue`) are Python-2 compatible and operate

synchronously.  The classes under `taskcluster.aio` (e.g.,

`taskcluster.aio.Queue`) require Python >= 3.6.

#### Authentication Options

Here is a simple set-up of an Index client:

```python

import taskcluster

index = taskcluster.Index({

  'rootUrl': 'https://tc.example.com',

  'credentials': {'clientId': 'id', 'accessToken': 'accessToken'},

})

```

The `rootUrl` option is required as it gives the Taskcluster deployment to

which API requests should be sent.  Credentials are only required if the

request is to be authenticated -- many Taskcluster API methods do not require

authentication.

In most cases, the root URL and Taskcluster credentials should be provided in [standard environment variables](https://docs.taskcluster.net/docs/manual/design/env-vars).  Use `taskcluster.optionsFromEnvironment()` to read these variables automatically:

```python

auth = taskcluster.Auth(taskcluster.optionsFromEnvironment())

```

Note that this function does not respect `TASKCLUSTER_PROXY_URL`.  To use the Taskcluster Proxy from within a task:

```python

auth = taskcluster.Auth({'rootUrl': os.environ['TASKCLUSTER_PROXY_URL']})

```

#### Authorized Scopes

If you wish to perform requests on behalf of a third-party that has small set

of scopes than you do. You can specify [which scopes your request should be

allowed to

use](https://docs.taskcluster.net/docs/manual/design/apis/hawk/authorized-scopes),

in the `authorizedScopes` option.

```python

opts = taskcluster.optionsFromEnvironment()

opts['authorizedScopes'] = ['queue:create-task:highest:my-provisioner/my-worker-type']

queue = taskcluster.Queue(opts)

```

#### Other Options

The following additional options are accepted when constructing a client object:

* `signedUrlExpiration` - default value for the `expiration` argument to `buildSignedUrl`

* `maxRetries` - maximum number of times to retry a failed request

### Calling API Methods

API methods are available as methods on the corresponding client object.  For

sync clients, these are sync methods, and for async clients they are async

methods; the calling convention is the same in either case.

There are four calling conventions for methods:

```python

client.method(v1, v1, payload)

client.method(payload, k1=v1, k2=v2)

client.method(payload=payload, query=query, params={k1: v1, k2: v2})

client.method(v1, v2, payload=payload, query=query)

```

Here, `v1` and `v2` are URL parameters (named `k1` and `k2`), `payload` is the

request payload, and `query` is a dictionary of query arguments.

For example, in order to call an API method with query-string arguments:

```python

await queue.listTaskGroup('JzTGxwxhQ76_Tt1dxkaG5g',

  query={'continuationToken': previousResponse.get('continuationToken')})

```

### Generating URLs

It is often necessary to generate the URL for an API method without actually calling the method.

To do so, use `buildUrl` or, for an API method that requires authentication, `buildSignedUrl`.

```python

import taskcluster

index = taskcluster.Index(taskcluster.optionsFromEnvironment())

print(index.buildUrl('findTask', 'builds.v1.latest'))

secrets = taskcluster.Secrets(taskcluster.optionsFromEnvironment())

print(secret.buildSignedUrl('get', 'my-secret'))

```

Note that signed URLs are time-limited; the expiration can be set with the `signedUrlExpiration` option to the client constructor, or with the `expiration` keyword arguement to `buildSignedUrl`, both given in seconds.

### Generating Temporary Credentials

If you have non-temporary taskcluster credentials you can generate a set of

[temporary credentials](https://docs.taskcluster.net/docs/manual/design/apis/hawk/temporary-credentials) as follows. Notice that the credentials cannot last more

than 31 days, and you can only revoke them by revoking the credentials that was

used to issue them (this takes up to one hour).

It is not the responsibility of the caller to apply any clock drift adjustment

to the start or expiry time - this is handled by the auth service directly.

```python

import datetime

start = datetime.datetime.now()

expiry = start + datetime.timedelta(0,60)

scopes = ['ScopeA', 'ScopeB']

name = 'foo'

credentials = taskcluster.createTemporaryCredentials(

    # issuing clientId

    clientId,

    # issuing accessToken

    accessToken,

    # Validity of temporary credentials starts here, in timestamp

    start,

    # Expiration of temporary credentials, in timestamp

    expiry,

    # Scopes to grant the temporary credentials

    scopes,

    # credential name (optional)

    name

```

You cannot use temporary credentials to issue new temporary credentials.  You

must have `auth:create-client:<name>` to create a named temporary credential,

but unnamed temporary credentials can be created regardless of your scopes.

### Handling Timestamps

Many taskcluster APIs require ISO 8601 time stamps offset into the future

as way of providing expiration, deadlines, etc. These can be easily created

using `datetime.datetime.isoformat()`, however, it can be rather error prone

and tedious to offset `datetime.datetime` objects into the future. Therefore

this library comes with two utility functions for this purposes.

```python

dateObject = taskcluster.fromNow("2 days 3 hours 1 minute")

  # -> datetime.datetime(2017, 1, 21, 17, 8, 1, 607929)

dateString = taskcluster.fromNowJSON("2 days 3 hours 1 minute")

  # -> '2017-01-21T17:09:23.240178Z'

```

By default it will offset the date time into the future, if the offset strings

are prefixed minus (`-`) the date object will be offset into the past. This is

useful in some corner cases.

```python

dateObject = taskcluster.fromNow("- 1 year 2 months 3 weeks 5 seconds");

  # -> datetime.datetime(2015, 10, 30, 18, 16, 50, 931161)

```

The offset string is ignorant of whitespace and case insensitive. It may also

optionally be prefixed plus `+` (if not prefixed minus), any `+` prefix will be

ignored. However, entries in the offset string must be given in order from

high to low, ie. `2 years 1 day`. Additionally, various shorthands may be

employed, as illustrated below.

```

  years,    year,   yr,   y

  months,   month,  mo

  weeks,    week,         w

  days,     day,          d

  hours,    hour,         h

  minutes,  minute, min

  seconds,  second, sec,  s

```

The `fromNow` method may also be given a date to be relative to as a second

argument. This is useful if offset the task expiration relative to the the task

deadline or doing something similar.  This argument can also be passed as the

kwarg `dateObj`

```python

dateObject1 = taskcluster.fromNow("2 days 3 hours");

dateObject2 = taskcluster.fromNow("1 year", dateObject1);

taskcluster.fromNow("1 year", dateObj=dateObject1);

  # -> datetime.datetime(2018, 1, 21, 17, 59, 0, 328934)

```

### Generating SlugIDs

To generate slugIds (Taskcluster's client-generated unique IDs), use

`taskcluster.slugId()`, which will return a unique slugId on each call.

In some cases it is useful to be able to create a mapping from names to

slugIds, with the ability to generate the same slugId multiple times.

The `taskcluster.stableSlugId()` function returns a callable that does

just this.

```python

gen = taskcluster.stableSlugId()

sometask = gen('sometask')

assert gen('sometask') == sometask  # same input generates same output

assert gen('sometask') != gen('othertask')

gen2 = taskcluster.stableSlugId()

sometask2 = gen('sometask')

assert sometask2 != sometask  # but different slugId generators produce

                              # different output

```

### Scope Analysis

The `scopeMatch(assumedScopes, requiredScopeSets)` function determines

whether one or more of a set of required scopes are satisfied by the assumed

scopes, taking *-expansion into account.  This is useful for making local

decisions on scope satisfaction, but note that `assumed_scopes` must be the

*expanded* scopes, as this function cannot perform expansion.

It takes a list of a assumed scopes, and a list of required scope sets on

disjunctive normal form, and checks if any of the required scope sets are

satisfied.

Example:

```python

requiredScopeSets = [

    ["scopeA", "scopeB"],

    ["scopeC:*"]

assert scopesMatch(['scopeA', 'scopeB'], requiredScopeSets)

assert scopesMatch(['scopeC:xyz'], requiredScopeSets)

assert not scopesMatch(['scopeA'], requiredScopeSets)

assert not scopesMatch(['scopeC'], requiredScopeSets)

```

### Pagination

Many Taskcluster API methods are paginated.  There are two ways to handle

pagination easily with the python client.  The first is to implement pagination

in your code:

```python

import taskcluster

queue = taskcluster.Queue({'rootUrl': 'https://tc.example.com'})

i = 0

tasks = 0

outcome = queue.listTaskGroup('JzTGxwxhQ76_Tt1dxkaG5g')

while outcome.get('continuationToken'):

    print('Response %d gave us %d more tasks' % (i, len(outcome['tasks'])))

    if outcome.get('continuationToken'):

        outcome = queue.listTaskGroup('JzTGxwxhQ76_Tt1dxkaG5g', query={'continuationToken': outcome.get('continuationToken')})

    i += 1

    tasks += len(outcome.get('tasks', []))

print('Task Group %s has %d tasks' % (outcome['taskGroupId'], tasks))

```

There's also an experimental feature to support built in automatic pagination

in the sync client.  This feature allows passing a callback as the

'paginationHandler' keyword-argument.  This function will be passed the

response body of the API method as its sole positional arugment.

This example of the built in pagination shows how a list of tasks could be

built and then counted:

```python

import taskcluster

queue = taskcluster.Queue({'rootUrl': 'https://tc.example.com'})

responses = []

def handle_page(y):

    print("%d tasks fetched" % len(y.get('tasks', [])))

    responses.append(y)

queue.listTaskGroup('JzTGxwxhQ76_Tt1dxkaG5g', paginationHandler=handle_page)

tasks = 0

for response in responses:

    tasks += len(response.get('tasks', []))

print("%d requests fetch %d tasks" % (len(responses), tasks))

```

### Pulse Events

This library can generate exchange patterns for Pulse messages based on the

Exchanges definitions provded by each service.  This is done by instantiating a

`<service>Events` class and calling a method with the name of the vent.

Options for the topic exchange methods can be in the form of either a single

dictionary argument or keyword arguments.  Only one form is allowed.

```python

from taskcluster import client

qEvt = client.QueueEvents({rootUrl: 'https://tc.example.com'})

# The following calls are equivalent

print(qEvt.taskCompleted({'taskId': 'atask'}))

print(qEvt.taskCompleted(taskId='atask'))

```

Note that the client library does *not* provide support for interfacing with a Pulse server.

### Logging

Logging is set up in `taskcluster/__init__.py`.  If the special

`DEBUG_TASKCLUSTER_CLIENT` environment variable is set, the `__init__.py`

module will set the `logging` module's level for its logger to `logging.DEBUG`

and if there are no existing handlers, add a `logging.StreamHandler()`

instance.  This is meant to assist those who do not wish to bother figuring out

how to configure the python logging module but do want debug messages

## Uploading and Downloading Objects

The Object service provides an API for reliable uploads and downloads of large objects.

This library provides convenience methods to implement the client portion of those APIs, providing well-tested, resilient upload and download functionality.

These methods will negotiate the appropriate method with the object service and perform the required steps to transfer the data.

All methods are available in both sync and async versions, with identical APIs except for the `async`/`await` keywords.

These methods are not available for Python-2.7.

In either case, you will need to provide a configured `Object` instance with appropriate credentials for the operation.

NOTE: There is an helper function to upload `s3` artifacts, `taskcluster.helper.upload_artifact`, but it is deprecated as it only supports the `s3` artifact type.

### Uploads

To upload, use any of the following:

* `await taskcluster.aio.upload.uploadFromBuf(projectId=.., name=.., contentType=.., contentLength=.., uploadId=.., expires=.., maxRetries=.., objectService=.., data=..)` - asynchronously upload data from a buffer full of bytes.

* `await taskcluster.aio.upload.uploadFromFile(projectId=.., name=.., contentType=.., contentLength=.., uploadId=.., expires=.., maxRetries=.., objectService=.., file=..)` - asynchronously upload data from a standard Python file.

  Note that this is [probably what you want](https://github.com/python/asyncio/wiki/ThirdParty#filesystem), even in an async context.

* `await taskcluster.aio.upload(projectId=.., name=.., contentType=.., contentLength=.., expires=.., uploadId=.., maxRetries=.., objectService=.., readerFactory=..)` - asynchronously upload data from an async reader factory.

* `taskcluster.upload.uploadFromBuf(projectId=.., name=.., contentType=.., contentLength=.., expires=.., uploadId=.., maxRetries=.., objectService=.., data=..)` - upload data from a buffer full of bytes.

* `taskcluster.upload.uploadFromFile(projectId=.., name=.., contentType=.., contentLength=.., expires=.., uploadId=.., maxRetries=.., objectService=.., file=..)` - upload data from a standard Python file.

* `taskcluster.upload(projectId=.., name=.., contentType=.., contentLength=.., expires=.., uploadId=.., maxRetries=.., objectService=.., readerFactory=..)` - upload data from a sync reader factory.

A "reader" is an object with a `read(max_size=-1)` method which reads and returns a chunk of 1 .. `max_size` bytes, or returns an empty string at EOF, async for the async functions and sync for the remainder.

A "reader factory" is an async callable which returns a fresh reader, ready to read the first byte of the object.

When uploads are retried, the reader factory may be called more than once.

The `uploadId` parameter may be omitted, in which case a new slugId will be generated.

### Downloads

To download, use any of the following:

* `await taskcluster.aio.download.downloadToBuf(name=.., maxRetries=.., objectService=..)` - asynchronously download an object to an in-memory buffer, returning a tuple (buffer, content-type).

  If the file is larger than available memory, this will crash.

* `await taskcluster.aio.download.downloadToFile(name=.., maxRetries=.., objectService=.., file=..)` - asynchronously download an object to a standard Python file, returning the content type.

* `await taskcluster.aio.download.download(name=.., maxRetries=.., objectService=.., writerFactory=..)` - asynchronously download an object to an async writer factory, returning the content type.

* `taskcluster.download.downloadToBuf(name=.., maxRetries=.., objectService=..)` - download an object to an in-memory buffer, returning a tuple (buffer, content-type).

  If the file is larger than available memory, this will crash.

* `taskcluster.download.downloadToFile(name=.., maxRetries=.., objectService=.., file=..)` - download an object to a standard Python file, returning the content type.

* `taskcluster.download.download(name=.., maxRetries=.., objectService=.., writerFactory=..)` - download an object to a sync writer factory, returning the content type.

A "writer" is an object with a `write(data)` method which writes the given data, async for the async functions and sync for the remainder.

A "writer factory" is a callable (again either async or sync) which returns a fresh writer, ready to write the first byte of the object.

When uploads are retried, the writer factory may be called more than once.

### Artifact Downloads

Artifacts can be downloaded from the queue service with similar functions to those above.

These functions support all of the queue's storage types, raising an error for `error` artifacts.

In each case, if `runId` is omitted then the most recent run will be used.

* `await taskcluster.aio.download.downloadArtifactToBuf(taskId=.., runId=.., name=.., maxRetries=.., queueService=..)` - asynchronously download an object to an in-memory buffer, returning a tuple (buffer, content-type).

  If the file is larger than available memory, this will crash.

* `await taskcluster.aio.download.downloadArtifactToFile(taskId=.., runId=.., name=.., maxRetries=.., queueService=.., file=..)` - asynchronously download an object to a standard Python file, returning the content type.

* `await taskcluster.aio.download.downloadArtifact(taskId=.., runId=.., name=.., maxRetries=.., queueService=.., writerFactory=..)` - asynchronously download an object to an async writer factory, returning the content type.

* `taskcluster.download.downloadArtifactToBuf(taskId=.., runId=.., name=.., maxRetries=.., queueService=..)` - download an object to an in-memory buffer, returning a tuple (buffer, content-type).

  If the file is larger than available memory, this will crash.

* `taskcluster.download.downloadArtifactToFile(taskId=.., runId=.., name=.., maxRetries=.., queueService=.., file=..)` - download an object to a standard Python file, returning the content type.

* `taskcluster.download.downloadArtifact(taskId=.., runId=.., name=.., maxRetries=.., queueService=.., writerFactory=..)` - download an object to a sync writer factory, returning the content type.

## Integration Helpers

The Python Taskcluster client has a module `taskcluster.helper` with utilities which allows you to easily share authentication options across multiple services in your project.

Generally a project using this library will face different use cases and authentication options:

* No authentication for a new contributor without Taskcluster access,

* Specific client credentials through environment variables on a developer's computer,

* Taskcluster Proxy when running inside a task.

### Shared authentication

The class `taskcluster.helper.TaskclusterConfig` is made to be instantiated once in your project, usually in a top level module. That singleton is then accessed by different parts of your projects, whenever a Taskcluster service is needed.

Here is a sample usage:

1. in `project/__init__.py`, no call to Taskcluster is made at that point:

```python

from taskcluster.helper import Taskcluster config

tc = TaskclusterConfig('https://community-tc.services.mozilla.com')

```

2. in `project/boot.py`, we authenticate on Taskcuster with provided credentials, or environment variables, or taskcluster proxy (in that order):

```python

from project import tc

tc.auth(client_id='XXX', access_token='YYY')

```

3. at that point, you can load any service using the authenticated wrapper from anywhere in your code:

```python

from project import tc

def sync_usage():

    queue = tc.get_service('queue')

    queue.ping()

async def async_usage():

    hooks = tc.get_service('hooks', use_async=True)  # Asynchronous service class

    await hooks.ping()

```

Supported environment variables are:

- `TASKCLUSTER_ROOT_URL` to specify your Taskcluster instance base url. You can either use that variable or instanciate `TaskclusterConfig` with the base url.

- `TASKCLUSTER_CLIENT_ID` & `TASKCLUSTER_ACCESS_TOKEN` to specify your client credentials instead of providing them to `TaskclusterConfig.auth`

- `TASKCLUSTER_PROXY_URL` to specify the proxy address used to reach Taskcluster in a task. It defaults to `http://taskcluster` when not specified.

For more details on Taskcluster environment variables, [here is the documentation](https://docs.taskcluster.net/docs/manual/design/env-vars).

### Loading secrets across multiple authentications

Another available utility is `taskcluster.helper.load_secrets` which allows you to retrieve a secret using an authenticated `taskcluster.Secrets` instance (using `TaskclusterConfig.get_service` or the synchronous class directly).

This utility loads a secret, but allows you to:

1. share a secret across multiple projects, by using key prefixes inside the secret,

2. check that some required keys are present in the secret,

3. provide some default values,

4. provide a local secret source instead of using the Taskcluster service (useful for local development or sharing _secrets_ with contributors)

Let's say you have a secret on a Taskcluster instance named `project/foo/prod-config`, which is needed by a backend and some tasks. Here is its content:

```yaml

common:

  environment: production

  remote_log: https://log.xx.com/payload

backend:

  bugzilla_token: XXXX

task:

  backend_url: https://backend.foo.mozilla.com

```

In your backend, you would do:

```python

from taskcluster import Secrets

from taskcluster.helper import load_secrets

prod_config = load_secrets(

  Secrets({...}),

  'project/foo/prod-config',

  # We only need the common & backend parts

  prefixes=['common', 'backend'],

  # We absolutely need a bugzilla token to run

  required=['bugzilla_token'],

  # Let's provide some default value for the environment

  existing={

    'environment': 'dev',

  # -> prod_config == {

  #     "environment": "production"

  #     "remote_log": "https://log.xx.com/payload",

  #     "bugzilla_token": "XXXX",

  #   }

```

In your task, you could do the following using `TaskclusterConfig` mentionned above (the class has a shortcut to use an authenticated `Secrets` service automatically):

```python

from project import tc

prod_config = tc.load_secrets(

  'project/foo/prod-config',

  # We only need the common & bot parts

  prefixes=['common', 'bot'],

  # Let's provide some default value for the environment and backend_url

  existing={

    'environment': 'dev',

    'backend_url': 'http://localhost:8000',

  # -> prod_config == {

  #     "environment": "production"

  #     "remote_log": "https://log.xx.com/payload",

  #     "backend_url": "https://backend.foo.mozilla.com",

  #   }

```

To provide local secrets value, you first need to load these values as a dictionary (usually by reading a local file in your format of choice : YAML, JSON, ...) and providing the dictionary to `load_secrets` by using the `local_secrets` parameter:

```python

import os

import yaml

from taskcluster import Secrets

from taskcluster.helper import load_secrets

local_path = 'path/to/file.yml'

prod_config = load_secrets(

  Secrets({...}),

  'project/foo/prod-config',

  # We support an optional local file to provide some configuration without reaching Taskcluster

  local_secrets=yaml.safe_load(open(local_path)) if os.path.exists(local_path) else None,

```

## Compatibility

This library is co-versioned with Taskcluster itself.

That is, a client with version x.y.z contains API methods corresponding to Taskcluster version x.y.z.

Taskcluster is careful to maintain API compatibility, and guarantees it within a major version.

That means that any client with version x.* will work against any Taskcluster services at version x.*, and is very likely to work for many other major versions of the Taskcluster services.

Any incompatibilities are noted in the [Changelog](https://github.com/taskcluster/taskcluster/blob/main/CHANGELOG.md).

Source code

Revision control

Copy as Markdown

Other Tools