NebiusForge
Model CatalogTrainingAgentsDocsStatusAccountSign in
← Back to agents
life-science·NVIDIA·v0.1.0

BioNeMo Research Assistant

This agent follows the Nebius Serverless AI cookbook BioNeMo recipe. It runs a CPU agent endpoint backed by TokenFactory and calls a GPU-backed BioNeMo-compatible service endpoint for protein embeddings, structure prediction, retrieval, MD metadata, genomics generation, and chat.

installableUser-owned ServerlessResearch only
Start install ↓Source recipe ↗
Service backend
1 GPU · B200 SXM A

GPU B200 SXM A · 1 GPU · 20 vCPU · 224 GB

Agent runtime
CPU · 4 vCPU · 16 GB

CPU D3 · 4 vCPU · 16 GB

GPU requirement
1 GPU required

GPU B200 SXM A · 1 GPU · 20 vCPU · 224 GB, CPU D3 · 4 vCPU · 16 GB

Included

BioNeMo-compatible service paths exposed to the agent.

Biomedical chat

/v1/chat/completions

Routes nonclinical biomedical text generation and educational summarization requests.

Protein embeddings

/v1/embeddings/protein

Routes public or synthetic protein sequences to an ESM-2-style embedding service.

Structure prediction

/v1/structure/boltz2

Prepares Boltz2-style nonclinical structure prediction calls.

Biomedical retrieval

/v1/retrieval/literature

Routes retrieval and literature-search embedding requests.

Molecular dynamics metadata

/v1/md/openmm

Returns OpenMM-style molecular dynamics timing and artifact metadata.

Genomics generation

Validated GPUs

Current Forge validation project exposed only gpu-b200-sxm-a for Serverless GPU jobs on 2026-06-18.

2 supported
PlatformPresetGPUStatusTested
gpu-b200-sxm-a1gpu-20vcpu-224gb1 × NVIDIA B200 · 179 GBsupported2026-06-18T14:40:27Z
gpu-b200-sxm-a8gpu-160vcpu-1792gb8 × NVIDIA B200 · 179 GBsupported2026-06-18T14:46:14Z

Matrix scope: service smoke only. It confirms the service image starts, sees the requested GPU shape, and exercises all advertised demo skills.

Install

Create the GPU service first, then create the CPU agent endpoint against it.

1 GPU required

Step 1

BioNeMo-compatible service

GPU-backed service endpoint that exposes the BioNeMo-compatible skills the agent calls.

Runtime
1 GPU · B200 SXM A
Auth
Bearer token
Port
8000
Open create form ↗

Step 2

Agent endpoint

CPU orchestration endpoint that serves the NeMo Agent Toolkit workflow and calls the service backend.

Runtime
CPU · 4 vCPU · 16 GB
/v1/genomics/carbon

Routes synthetic DNA/RNA generation demos.

Validation

Platform

gpu-b200-sxm-a / 1gpu-20vcpu-224gb for the BioNeMo-compatible service; cpu-d3 / 4vcpu-16gb for the agent

Scope

B200 service smoke exercised the six advertised deterministic BioNeMo-compatible demo skills.

Limit

Not validated as all real BioNeMo or NIM model containers loaded concurrently on one B200; production replacements must be validated per selected model/backend.

Auth
Bearer token
Port
8000
Open create form ↗

Required inputs

NEBIUS_API_KEYBIONEMO_BASE_URLBIONEMO_API_KEY