Deployment

Docker

Build image

make docker-build
# or: IMAGE_TAG=1.2.0 make docker-build

Run locally

make docker-run
# Uses PORT (or exported MCP_PORT) and defaults to 3000 if neither is set
# Example: PORT=6677 make docker-run
# Swagger UI: http://localhost:<port>/docs
# OpenAPI JSON: http://localhost:<port>/docs-api.json
# MCP endpoint: http://localhost:<port>/mcp
# Health: http://localhost:<port>/health
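
The port-resolution order described above (explicit PORT, then exported MCP_PORT, then the 3000 default) can be sketched in shell:

```shell
# Resolve the HTTP port the way the comment above describes:
# PORT wins, then MCP_PORT, then the 3000 fallback.
resolve_port() {
  echo "${PORT:-${MCP_PORT:-3000}}"
}

unset PORT MCP_PORT
resolve_port                          # → 3000
MCP_PORT=8080 resolve_port            # → 8080
PORT=6677 MCP_PORT=8080 resolve_port  # → 6677
```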

docker-compose (recommended for local/staging)

make compose-up     # build + start in background
make compose-down   # stop

# docker-compose.yml uses MCP_PORT from your shell or .env
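
As a sketch, the relevant compose service could look like the following; the service and image names here are assumptions based on the Makefile targets above, so treat the repo's docker-compose.yml as authoritative:

```yaml
services:
  workflow-os-mcp:
    image: workflow-os-mcp:latest          # assumed local image name
    ports:
      - "${MCP_PORT:-3000}:${MCP_PORT:-3000}"
    environment:
      MCP_TRANSPORT: http
      MCP_PORT: "${MCP_PORT:-3000}"
```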

Push to registry

make docker-push REGISTRY=your.registry.io
# Pushes: your.registry.io/workflow-os-mcp:1.0.0
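
How the pushed reference is composed, as a sketch (the 1.0.0 default tag is inferred from the example output above):

```shell
REGISTRY=your.registry.io
IMAGE_TAG=1.0.0                          # default tag, per the example above
IMAGE="$REGISTRY/workflow-os-mcp:$IMAGE_TAG"
echo "$IMAGE"                            # → your.registry.io/workflow-os-mcp:1.0.0
```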

Vercel

Vercel can serve the full HTTP surface of the server: the landing page, the generated docs, the health endpoint, and the MCP endpoint. The exact routes are listed below.

Build configuration

Use the repo-owned Vercel build script:

pnpm run build:vercel

This ensures both the TypeScript server build and generated HTML docs are current for deployment.

Routes exposed on Vercel

Path             Method  Behavior
/                GET     Serves landing-page/index.html
/docs            GET     Swagger UI from the shared Express HTTP app
/docs-api.json   GET     Raw OpenAPI JSON
/health          GET     Health response
/mcp             POST    Reliable MCP JSON-RPC path on Vercel
/mcp             GET     Best-effort SSE stream
/mcp             DELETE  Stateless teardown/no-op
/docs/*.html     GET     Generated static documentation pages from landing-page/docs/
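
A quick way to exercise the reliable POST path is a JSON-RPC request. This is a generic sketch (tools/list is a standard MCP method; substitute your deployment's hostname):

```shell
# Generic MCP JSON-RPC request payload.
PAYLOAD='{"jsonrpc":"2.0","id":1,"method":"tools/list","params":{}}'
echo "$PAYLOAD"

# Against a live deployment (not executed here):
# curl -s -X POST "https://<your-app>/mcp" \
#   -H "Content-Type: application/json" \
#   -H "Accept: application/json, text/event-stream" \
#   -d "$PAYLOAD"
```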

Vercel caveat for SSE

GET /mcp is still exposed on Vercel through the shared HTTP app, but long-lived SSE is inherently less reliable on serverless than on Docker or Kubernetes because function lifetime and streaming behavior are platform-constrained.

Treat POST /mcp as the reliable transport path on Vercel and GET /mcp SSE as best-effort only. If you need robust long-running SSE behavior, prefer Docker, Compose, or Kubernetes deployment.


Kubernetes

Prerequisites

A Kubernetes cluster reachable via kubectl, and the container image pushed to a registry the cluster can pull from (see "Push to registry" above).
Apply all manifests

make k8s-apply
# Example: PORT=6677 K8S_SERVICE_PORT=8080 make k8s-apply

This creates (in order):

  1. workflow-os namespace
  2. workflow-os-mcp-config ConfigMap
  3. workflow-os-mcp Deployment (2 replicas)
  4. workflow-os-mcp ClusterIP Service (default port 80 → current container port)
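
The port wiring in step 4 (Service port 80 forwarding to the container port) could look like this sketch; the selector label is an assumption, and the port values mirror the defaults listed in this document:

```yaml
apiVersion: v1
kind: Service
metadata:
  name: workflow-os-mcp
  namespace: workflow-os
spec:
  type: ClusterIP
  selector:
    app: workflow-os-mcp   # assumed pod label
  ports:
    - port: 80             # K8S_SERVICE_PORT default
      targetPort: 3000     # PORT default (container HTTP port)
```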

Useful commands

make k8s-status      # kubectl get all -n workflow-os
make k8s-logs        # tail pod logs
make k8s-restart     # rolling restart (zero-downtime)
make k8s-delete      # tear down all resources

Update image after a new build

# Push new image
make docker-push REGISTRY=your.registry.io IMAGE_TAG=1.1.0

# Update deployment.yaml image tag, then apply
make k8s-apply

# Or trigger a rolling restart if image tag hasn't changed (uses :latest)
make k8s-restart
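
Instead of editing deployment.yaml by hand, you could point the Deployment at the new tag directly with kubectl. The sketch below only prints the command; the container name workflow-os-mcp is an assumption:

```shell
REGISTRY=your.registry.io
IMAGE_TAG=1.1.0
# kubectl set image updates a Deployment's container image in place,
# triggering a rolling update.
CMD="kubectl -n workflow-os set image deployment/workflow-os-mcp workflow-os-mcp=$REGISTRY/workflow-os-mcp:$IMAGE_TAG"
echo "$CMD"   # run against your cluster once verified
```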

Resource sizing

Defaults in k8s/deployment.yaml:

Resource  Request  Limit
CPU       100m     500m
Memory    128Mi    256Mi

Adjust per your cluster capacity. The server is stateless — scale replicas freely.
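
These defaults correspond to a container resources block in k8s/deployment.yaml like the following:

```yaml
resources:
  requests:
    cpu: 100m
    memory: 128Mi
  limits:
    cpu: 500m
    memory: 256Mi
```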


Environment Variables

All env vars are set in k8s/configmap.yaml for Kubernetes or in docker-compose.yml for Compose.

Variable       Default             Description
MCP_TRANSPORT  http                Always http in Docker/K8s
MCP_PORT       3000                Current container HTTP port
WORKFLOW_ROOT  bundled workflows/  Override to mount external files
NODE_ENV       production          Node environment
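
A sketch of how these could appear in k8s/configmap.yaml (names and values come from the table above; the actual file in the repo is authoritative):

```yaml
apiVersion: v1
kind: ConfigMap
metadata:
  name: workflow-os-mcp-config
  namespace: workflow-os
data:
  MCP_TRANSPORT: "http"
  MCP_PORT: "3000"
  NODE_ENV: "production"
  # WORKFLOW_ROOT: "/data/workflows"  # only when mounting external workflow files
```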

Additional deployment-time variables:

Variable          Default  Description
PORT              3000     Docker/K8s container HTTP port used by Make and Kubernetes templating
K8S_SERVICE_PORT  80       Kubernetes Service port exposed inside the cluster

Health Check Endpoints

Path     Method  Description
/health  GET     Returns {"status":"ok"} (used by K8s liveness + readiness probes)
/mcp     POST    MCP JSON-RPC messages
/mcp     GET     MCP SSE stream
/mcp     DELETE  Session teardown (no-op in stateless mode)

Manual health check

# If you keep your current local Docker/K8s port mapping, use:
curl http://localhost:<port>/health
# {"status":"ok","service":"workflow-os-mcp","transport":"http"}
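
To script a readiness gate around this endpoint, check the status field. This sketch validates the documented response shape locally rather than hitting a live server:

```shell
# Documented health response shape; a live probe would be:
#   curl -fsS http://localhost:<port>/health
RESPONSE='{"status":"ok","service":"workflow-os-mcp","transport":"http"}'
case "$RESPONSE" in
  *'"status":"ok"'*) echo "healthy" ;;
  *) echo "unhealthy"; exit 1 ;;
esac
```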