Edit on GitHub

Download model

To download model binaries for models in the model registry, you can programmatically generate signed URLs using the DVC Studio get-download-uris API. This API requires the model to be stored with DVC with a S3, Azure, http or https remote. Note that you need to set up remote cloud credentials for DVC Studio to have rights to the signed URLs. The signed URLs expire after one hour.

Endpoint: api/model-registry/get-download-uris
HTTP Method: GET

Request

paramdesctyperequiredexample value
repoGit repo URLstringtrueiterative/demo-bank-customer-churn
nameModel namestringtruerandomforest-model
versionModel versionstringfalsev2.0.0
stageModel stagestringfalseprod

Only one of stage or version is allowed. If no version or stage is specified, the latest version is returned.

When your model is annotated in non-root dvc.yaml file (typical for monorepo case), model name will be constructed from two parts separated by colon: path/to/dvc/yaml:model_name.

headerdescexample value
AuthorizationHeader for authtoken abc123

Response

Response is a JSON-encoded dict. If the request was successful, keys will be paths to files inside the repo, and values will be signed URLs you can query to actually download the model.

Example

First, you need your DVC Studio client access token with Model Registry scope. For this example, we set it in the DVC_STUDIO_TOKEN environment variable:

$ export DVC_STUDIO_TOKEN=<TOKEN>
import json
import os
import requests


url = "https://studio.datachain.ai/api/model-registry/get-download-uris"
token = os.environ["DVC_STUDIO_TOKEN"]
headers = {"Authorization": f"token {token}"}
params = {
    "repo": "git@github.com:iterative/demo-bank-customer-churn.git",
    "name": "randomforest-model",
    "version": "v2.0.0"
}

response = requests.get(url, headers=headers, params=params)
for rel_path, obj_url in json.loads(response.content).items():
    obj = requests.get(obj_url)
    ...
$ curl "https://studio.datachain.ai/api/model-registry/get-download-uris?repo=git@github.com:iterative/demo-bank-customer-churn.git&name=randomforest-model&version=v2.0.0" --header "Authorization:token ${DVC_STUDIO_TOKEN}"

{
    ".mlem/model/clf-model": "https://sandbox-datasets-iterative.s3.amazonaws.com/bank-customer-churn/86/bd02376ac675568ba2fac566169ef9?X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Credential=AKIAU7UXIWDIQFPCO76Q%2F20230706%2Fus-east-1%2Fs3%2Faws4_request&X-Amz-Date=20230706T134619Z&X-Amz-Expires=3600&X-Amz-SignedHeaders=host&X-Amz-Signature=6807259ddd1f4448ed1e3c5d4503039884f7779381ee556175096b0a884ba1a6"
}

Running this example will fail because it requires that you have set up remote cloud credentials in DVC Studio to where the model is stored.

Content

๐Ÿ› Found an issue? Let us know! Or fix it:

Edit on GitHub

โ“ Have a question? Join our chat, we will help you:

Discord Chat