Healthcare / Life Sciencenvidia_nimapache-2.0/cc-by-4.0Added Apr 27, 2026

AlphaFold2

DeepMind AlphaFold2 NVIDIA NIM for protein structure prediction from amino acid sequence. This manifest remains onboarding-only until the pod-start failure observed on H200/eu-north2 is diagnosed and a successful Forge probe is recorded.

protein_sequence→structure, json

life-sciencebiologyproteinprotein-folding+2

Try in playground ↓Deploy Serverless ↓Open NVIDIA model card ↗API docs

Context window

4096

VRAM needed

33.8 GB

Observed working set on a supported GPU.

API route: POST /protein-structure/alphafold2/predict-structure-from-sequence

After the last request a backend stays warm on its GPU for about 15 minutes, then frees the GPU. The next request triggers a fresh cold start.

Status

cold

Not running.

API target

deepmind-alphafold2 version v1 using scheduler-selected GPU/region

POST to the native route with the shown model field; the playground below generates a full payload.

Open docs Try target

API route

/protein-structure/alphafold2/predict-structure-from-sequence

HTTP method

POST

Model field

deepmind-alphafold2

Version field

model_version: v1

1Verify targetRuns the auth guard and selected endpoint/model/routing check.
2Validate targetConfirm the selected GPU or region is still verified, or print copyable best-target exports.
3Estimate runValidate warm and first-cold request cost before prewarming or first traffic.
4Check runtimeConfirm whether the selected version is warm or starting.
5Prewarm targetStart the selected version on its pinned GPU or region before latency-sensitive traffic.
6Open docsUse the selected target snippets for the first request.Open docs

One-block API check

Terminal-ready smoke test for this selected target.

View command

set -euo pipefail
# Forge API smoke test
# Forge selected target: route=/v1/models/deepmind-alphafold2/inference-routes model=deepmind-alphafold2 version=v1
FORGE_API_BASE=${FORGE_API_BASE:-'https://YOUR_FORGE_HOST'}
case "${FORGE_API_KEY:-}" in
  ""|replace-with-your-forge-api-key)
    echo 'Set FORGE_API_KEY to a real Forge API key before running this snippet; browser SSO sessions are not sent to copied curl or SDK clients.' >&2
    exit 1
    ;;
esac
forge_api_url() {
  endpoint="$1"
  base="${FORGE_API_BASE%/}"
  case "$base:$endpoint" in
    */v1:/v1|*/v1:/v1/*|*/v1:/v1\?*) printf '%s%s\n' "$base" "${endpoint#/v1}" ;;
    *) printf '%s%s\n' "$base" "$endpoint" ;;
  esac
}
curl -sS --fail-with-body "$(forge_api_url '/v1/models/deepmind-alphafold2/inference-routes?model_version=v1')" \
  --max-time "${FORGE_REQUEST_TIMEOUT_SECONDS:-600}" \
  -H "Authorization: Bearer ${FORGE_API_KEY}" | \
python3 -m json.tool

Client fit

Native route check · Best for model-specific payloads; run the route check first, then copy the schema-specific request from the playground.

Routing auto-selected

Copied snippets omit gpu_type and region, so Forge chooses a compatible GPU and region at request time. Use GPU performance to pin a target.

Request URL

https://YOUR_FORGE_HOST/protein-structure/alphafold2/predict-structure-from-sequence

Authentication

Client auth: Set FORGE_API_KEY to a real Forge API key before running copied curl, fetch, or SDK snippets. Browser SSO only authenticates this web session.

Open Account

Authorization: Bearer $FORGE_API_KEY

Pinned setup

export FORGE_API_BASE='https://YOUR_FORGE_HOST'
export FORGE_API_KEY="${FORGE_API_KEY:-replace-with-your-forge-api-key}"
export FORGE_REQUEST_TIMEOUT_SECONDS="${FORGE_REQUEST_TIMEOUT_SECONDS:-600}"
export FORGE_API_ROUTE='/protein-structure/alphafold2/predict-structure-from-sequence'
export MODEL_OR_FAMILY_SLUG='deepmind-alphafold2'
export FORGE_MODEL_VERSION='v1'

Project .env

Copy these values into a local .env file when moving the selected target into an app or SDK client.

# Forge selected target: route=/protein-structure/alphafold2/predict-structure-from-sequence model=deepmind-alphafold2 version=v1
FORGE_API_BASE="https://YOUR_FORGE_HOST"
FORGE_API_ROUTE="/protein-structure/alphafold2/predict-structure-from-sequence"
FORGE_API_KEY="replace-with-your-forge-api-key"
FORGE_REQUEST_TIMEOUT_SECONDS="600"
MODEL_OR_FAMILY_SLUG="deepmind-alphafold2"
FORGE_MODEL_VERSION="v1"

Project .gitignore

Add these rules before replacing the placeholder API key so local Forge secrets stay out of commits while .env.example can remain tracked.

# Forge local API secrets
.env
.env.*
!.env.example

Preflight URLs and commands

Run estimate URL

https://YOUR_FORGE_HOST/v1/models/deepmind-alphafold2/run-estimate?model_version=v1

Best verified target

set -euo pipefail
# Forge selected target: route=/protein-structure/alphafold2/predict-structure-from-sequence model=deepmind-alphafold2 version=v1
FORGE_API_BASE=${FORGE_API_BASE:-'https://YOUR_FORGE_HOST'}
export MODEL_OR_FAMILY_SLUG=${MODEL_OR_FAMILY_SLUG:-'deepmind-alphafold2'}
export FORGE_MODEL_VERSION=${FORGE_MODEL_VERSION:-'v1'}
case "${FORGE_API_KEY:-}" in
  ""|replace-with-your-forge-api-key)
    echo 'Set FORGE_API_KEY to a real Forge API key before running this snippet; browser SSO sessions are not sent to copied curl or SDK clients.' >&2
    exit 1
    ;;
esac
forge_api_url() {
  endpoint="$1"
  base="${FORGE_API_BASE%/}"
  case "$base:$endpoint" in
    */v1:/v1|*/v1:/v1/*|*/v1:/v1\?*) printf '%s%s\n' "$base" "${endpoint#/v1}" ;;
    *) printf '%s%s\n' "$base" "$endpoint" ;;
  esac
}
reliability_path="$(python3 -c 'import os
from urllib.parse import quote, urlencode

model = os.environ.get("MODEL_OR_FAMILY_SLUG", "").strip()
if not model:
    raise SystemExit("Set MODEL_OR_FAMILY_SLUG from search or route finder output before checking reliability.")
params = {}
model_version = os.environ.get("FORGE_MODEL_VERSION", "").strip()
if model_version:
    params["model_version"] = model_version
gpu_type = os.environ.get("FORGE_GPU_TYPE", "").strip()
if gpu_type:
    params["gpu_type"] = gpu_type
region = os.environ.get("FORGE_REGION", "").strip()
if region:
    params["region"] = region
path = "/v1/models/" + quote(model, safe="") + "/reliability"
if params:
    path += "?" + urlencode(params)
print(path)')"
curl -sS --fail-with-body "$(forge_api_url "$reliability_path")" \
  --max-time "${FORGE_REQUEST_TIMEOUT_SECONDS:-600}" \
  -H "Authorization: Bearer ${FORGE_API_KEY}" | \
python3 -c 'import json, shlex, sys

payload = json.load(sys.stdin)
print(
    f"{payload.get('\''slug'\'')} reliability={payload.get('\''reliability_status'\'')} "
    f"supported={payload.get('\''supported_rows'\'', 0)}/{payload.get('\''total_rows'\'', 0)}"
)
filters = payload.get("filters") or {}
if filters:
    print("filters: " + ", ".join(f"{key}={value}" for key, value in filters.items()))

def describe_target(target):
    details = []
    request_ms = target.get("request_ms_p50") or target.get("request_ms")
    if request_ms is not None:
        details.append(f"p50={request_ms}ms")
    warm_cost = target.get("estimated_warm_request_cost_usd")
    if warm_cost is not None:
        details.append(f"warm_cost_usd={warm_cost}")
    elif target.get("cost_per_gpu_hour_usd") is not None:
        details.append(f"gpu_hour_usd={target['\''cost_per_gpu_hour_usd'\'']}")
    success_rate = target.get("observed_success_rate")
    if isinstance(success_rate, (int, float)):
        details.append(f"success={success_rate:.0%}")
    return ", ".join(details) or target.get("status") or "supported"

exports = {}
for label, key in (
    ("fastest supported", "fastest_supported_target"),
    ("lowest-cost supported", "lowest_cost_supported_target"),
):
    target = payload.get(key) or {}
    gpu_type = target.get("gpu_type")
    if not gpu_type:
        continue
    identity = (str(gpu_type), str(target.get("region") or ""))
    exports.setdefault(identity, {"labels": [], "target": target})["labels"].append(label)

if not exports:
    print("No supported GPU/region target returned.", file=sys.stderr)
    print(json.dumps({
        "status_counts": payload.get("status_counts", {}),
        "failure_reason_counts": payload.get("failure_reason_counts", {}),
    }, indent=2))
    raise SystemExit(1)

for (gpu_type, region), entry in exports.items():
    assignments = [f"FORGE_GPU_TYPE={shlex.quote(gpu_type)}"]
    if region:
        assignments.append(f"FORGE_REGION={shlex.quote(region)}")
    labels = " + ".join(entry["labels"])
    details = describe_target(entry["target"])
    print(f"export {'\'' '\''.join(assignments)}  # {labels}: {details}")'

Runtime status URL

https://YOUR_FORGE_HOST/v1/model-families/deepmind-alphafold2/status?version=v1

Runtime warmup command

set -euo pipefail
# Forge selected target: route=/protein-structure/alphafold2/predict-structure-from-sequence model=deepmind-alphafold2 version=v1
FORGE_API_BASE=${FORGE_API_BASE:-'https://YOUR_FORGE_HOST'}
export MODEL_OR_FAMILY_SLUG=${MODEL_OR_FAMILY_SLUG:-'deepmind-alphafold2'}
export FORGE_MODEL_VERSION=${FORGE_MODEL_VERSION:-'v1'}
export FORGE_KEEP_WARM=${FORGE_KEEP_WARM:-false}
case "${FORGE_API_KEY:-}" in
  ""|replace-with-your-forge-api-key)
    echo 'Set FORGE_API_KEY to a real Forge API key before running this snippet; browser SSO sessions are not sent to copied curl or SDK clients.' >&2
    exit 1
    ;;
esac
forge_api_url() {
  endpoint="$1"
  base="${FORGE_API_BASE%/}"
  case "$base:$endpoint" in
    */v1:/v1|*/v1:/v1/*|*/v1:/v1\?*) printf '%s%s\n' "$base" "${endpoint#/v1}" ;;
    *) printf '%s%s\n' "$base" "$endpoint" ;;
  esac
}
runtime_start_path="$(python3 -c 'import os
from urllib.parse import quote

model = os.environ.get("MODEL_OR_FAMILY_SLUG", "").strip()
if not model:
    raise SystemExit("Set MODEL_OR_FAMILY_SLUG from the model picker output")
print("/v1/model-families/" + quote(model, safe="") + "/start")')"
python3 -c 'import json, os

def env_value(name):
    value = os.environ.get(name, "").strip()
    return value or None

payload = {}
version = env_value("FORGE_MODEL_VERSION")
if version:
    payload["version"] = version
gpu_type = env_value("FORGE_GPU_TYPE")
if gpu_type:
    payload["gpu_type"] = gpu_type
region = env_value("FORGE_REGION")
if region:
    payload["region"] = region
keep_warm = env_value("FORGE_KEEP_WARM")
payload["run_until_stopped"] = (keep_warm or "").lower() in {"1", "true", "yes", "on"}
print(json.dumps(payload))' | \
curl -sS --fail-with-body "$(forge_api_url "$runtime_start_path")" \
  --max-time "${FORGE_REQUEST_TIMEOUT_SECONDS:-600}" \
  -X POST \
  -H "Authorization: Bearer ${FORGE_API_KEY}" \
  -H "Content-Type: application/json" \
  -d @- | \
python3 -c 'import json, sys

payload = json.load(sys.stdin)
slug = payload.get("slug") or "runtime"
gpu_type = payload.get("gpu_type") or "scheduler-selected GPU"
region = payload.get("region") or "scheduler-selected region"
startup_ms = payload.get("startup_ms")
state = "cold-started" if payload.get("was_cold_start") else "already warm"
suffix = f"; startup_ms={startup_ms}" if startup_ms is not None else ""
print(f"{slug} {state} on {gpu_type} in {region}{suffix}; keep_warm={payload.get('\''keep_warm'\'')}")'

GPU performance

Pick a verified target for repeatable runs. Failed or pending details appear on the status hover.

Try selected target

Runs on · v1

· floor 40 GB

Target readiness

No verified targets yet

0/6 verified1 awaiting probe5 unavailable

GPU	Region	Status	VRAM	Cold start	Model time	Relative	Est. $/GPU-hr
B200	us-central1	incompatible	134.4 GB	—	—	—	$7.15
B300	uk-south1	incompatible	—	—	—	—	$7.85
H100	—	not probed	—	—	—	—	—
H200	eu-north2	incompatible	—	—	7m 55sp50 warm model time 1 sample	100%	$4.50
L40S	eu-north1	incompatible	33.8 GB	—	9m 9slatest warm probe time	87% · -13%	$1.82
RTX6000	us-central1	incompatible	71.8 GB	—	—	—	$1.80

How we measure

Model time uses the p50 warm model-reported execution time when available, then falls back to the latest probe time; p95/p99 and sample count appear when there is enough probe history. Cold start excludes the first (uncached) run. VRAM is the peak GPU memory seen during the probe. Relative compares each row's model time to the highlighted baseline (fastest row by default; hover any row to re-root). The fastest chip marks only verified supported GPU-region rows. Estimated on-demand GPU price (Nebius pay-as-you-go); shown for performance/price comparison. Configured minimum GPU memory: 40 GB.

Try it out

cold·Healthcare / Life Science

Compare GPUsOpen Account

Leave GPU on “Any available GPU” to use a warm or verified backend automatically.API docs for this target

Inputs

Protein sequenceMSA algorithm · enumMSA databases · jsonPerform geometry refinement · booleanPerform geometry refinementStructure model preset · enumModels to relax · enum

E-value · number1

MSA iterations · number1

API examples

Use the API

API docs

Snippet target: deepmind-alphafold2 version v1 using scheduler-selected GPU/region.

Client auth: Set FORGE_API_KEY to a real Forge API key before running copied curl, fetch, or SDK snippets. Browser SSO only authenticates this web session.

Open Account

FORGE_API_BASE=${FORGE_API_BASE:-'https://YOUR_FORGE_HOST'}
case "${FORGE_API_KEY:-}" in
  ""|replace-with-your-forge-api-key)
    echo 'Set FORGE_API_KEY to a real Forge API key before running this snippet; browser SSO sessions are not sent to copied curl or SDK clients.' >&2
    exit 1
    ;;
esac
forge_api_url() {
  endpoint="$1"
  base="${FORGE_API_BASE%/}"
  case "$base:$endpoint" in
    */v1:/v1|*/v1:/v1/*|*/v1:/v1\?*) printf '%s%s\n' "$base" "${endpoint#/v1}" ;;
    *) printf '%s%s\n' "$base" "$endpoint" ;;
  esac
}
forge_print_response() {
  response_file="$1"
  if [ ! -s "$response_file" ]; then
    printf '%s\n' '(empty response)'
    return 0
  fi
  if command -v python3 >/dev/null 2>&1; then
    python3 -m json.tool "$response_file" 2>/dev/null || cat "$response_file"
  else
    cat "$response_file"
  fi
}
response_file="$(mktemp)"
if curl -sS --fail-with-body "$(forge_api_url '/protein-structure/alphafold2/predict-structure-from-sequence')" \
  --max-time "${FORGE_REQUEST_TIMEOUT_SECONDS:-600}" \
  -H "Authorization: Bearer ${FORGE_API_KEY}" \
  -H "Content-Type: application/json" \
  -d '{
  "e_value": 1,
  "sequence": "MKQHKAMIVALIVICITAVVAALVTRKDLCEVHIRTGQTEVAVF",
  "algorithm": "mmseqs2",
  "databases": [
    "uniref90",
    "mgnify",
    "small_bfd"
  ],
  "iterations": 1,
  "relax_prediction": false,
  "structure_model_preset": "monomer",
  "structure_models_to_relax": "none",
  "model": "deepmind-alphafold2",
  "model_version": "v1"
}' \
  -o "$response_file"; then
  forge_print_response "$response_file"
  status=$?
  rm -f "$response_file"
  (exit "$status")
else
  status=$?
  cat "$response_file" >&2
  rm -f "$response_file"
  (exit "$status")
fi

Setup & .env

Install for curl

Copy setup before the request when moving this snippet into a fresh shell. The default 600 second timeout is intentional for GPU cold starts and can be overridden with FORGE_REQUEST_TIMEOUT_SECONDS.

export FORGE_API_BASE='https://YOUR_FORGE_HOST'
export FORGE_API_KEY="${FORGE_API_KEY:-replace-with-your-forge-api-key}"
export FORGE_REQUEST_TIMEOUT_SECONDS="${FORGE_REQUEST_TIMEOUT_SECONDS:-600}"

Project .env

Copy these values into a local .env file when moving the selected target into an app or SDK client.

# Forge selected target: route=/protein-structure/alphafold2/predict-structure-from-sequence model=deepmind-alphafold2 version=v1
FORGE_API_BASE="https://YOUR_FORGE_HOST"
FORGE_API_ROUTE="/protein-structure/alphafold2/predict-structure-from-sequence"
FORGE_API_KEY="replace-with-your-forge-api-key"
FORGE_REQUEST_TIMEOUT_SECONDS="600"
MODEL_OR_FAMILY_SLUG="deepmind-alphafold2"
FORGE_MODEL_VERSION="v1"

Output

Run a request to see output here.

Deploy to Nebius Serverless

Run a dedicated, autoscaling endpoint in your own Nebius account. The endpoint runs under your account and billing — Forge just pre-fills the configuration for you.

Deploy in your Nebius account ↗

Opens the Nebius Console with the image pre-filled for AlphaFold2 (Forge version v1).

Need a throughput- and cost-optimized build tuned for specific Nebius GPUs? Nebius Token Factory is coming soon — contact your Nebius account team for early access.