Complete this checklist before deploying Lasso RPC to production. Each item ensures reliability, security, and observability.

Environment Configuration

Required Variables

1

SECRET_KEY_BASE

Purpose: Phoenix signing/encryption secret for session security.
Generate:
mix phx.gen.secret
Requirements:
  • Minimum 64 bytes
  • Keep secret (never commit to version control)
  • Use same value across all nodes in a cluster
Set:
export SECRET_KEY_BASE="your-64-byte-secret-here"
Validation:
echo -n "$SECRET_KEY_BASE" | wc -c
# Should output 64 or more (-n excludes the trailing newline from the count)
2

PHX_HOST

Purpose: Public hostname for URL generation and CORS.
Set:
export PHX_HOST="rpc.example.com"
Examples:
  • Single region: rpc.example.com
  • Multi-region: us-east-1.rpc.example.com, eu-west-1.rpc.example.com
Validation:
curl https://$PHX_HOST/api/health
3

PHX_SERVER

Purpose: Start the HTTP server (required for releases).
Set:
export PHX_SERVER="true"
Note: Automatically set in Dockerfile. Verify in non-Docker deployments.
4

LASSO_NODE_ID

Purpose: Unique, stable identifier for this node.
Set:
export LASSO_NODE_ID="us-east-1"
Requirements:
  • Unique per node
  • Stable (don’t change after deployment)
  • Convention: use region names for geo-distributed deployments
Examples:
  • us-east-1, eu-west-1, ap-southeast-1 (cloud regions)
  • iad, lhr, sin (datacenter codes)
  • production-1, staging-1 (environment-based)
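A quick format check can guard against accidental whitespace or uppercase in the id. This is a sketch using a hypothetical convention (lowercase alphanumerics and hyphens); adjust the pattern to your own naming scheme:

```shell
# node_id_valid ID
# Hypothetical convention: lowercase alphanumerics and hyphens only.
node_id_valid() {
  [[ "$1" =~ ^[a-z0-9][a-z0-9-]*$ ]]
}

node_id_valid "$LASSO_NODE_ID" || echo "suspicious LASSO_NODE_ID: $LASSO_NODE_ID"
```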
5

PORT (optional)

Purpose: HTTP listener port.
Default: 4000
Set custom port:
export PORT="8080"

Provider API Keys

1

Identify required providers

Review your profile YAML files for ${ENV_VAR} placeholders:
grep -r '${' config/profiles/
Example output:
url: "https://eth-mainnet.g.alchemy.com/v2/${ALCHEMY_API_KEY}"
url: "https://mainnet.infura.io/v3/${INFURA_API_KEY}"
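To catch unresolved placeholders before deploying, a small bash helper can list any referenced variable that is unset. This is a sketch; it assumes placeholders use uppercase `${VAR}` names, as in the output above:

```shell
# check_env_placeholders DIR
# Lists every ${VAR}-style placeholder in DIR whose environment
# variable is currently unset or empty. Returns non-zero if any are.
check_env_placeholders() {
  local dir="$1" var missing=0
  for var in $(grep -rhoE '\$\{[A-Z0-9_]+\}' "$dir" | tr -d '${}' | sort -u); do
    if [ -z "${!var}" ]; then
      echo "MISSING: $var"
      missing=1
    fi
  done
  return $missing
}

# Example:
# check_env_placeholders config/profiles/
```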
2

Set provider API keys

export ALCHEMY_API_KEY="your-alchemy-key"
export INFURA_API_KEY="your-infura-key"
export QUICKNODE_API_KEY="your-quicknode-key"
3

Validate startup

Lasso crashes at startup if ${ENV_VAR} placeholders are unresolved.
Test locally:
mix phx.server
# Should start without errors

Clustering (Optional)

1

Set clustering variables (if clustering)

export CLUSTER_DNS_QUERY="lasso.internal"
export CLUSTER_NODE_BASENAME="lasso"
Note: Both must be set for clustering to activate. Omit both for standalone mode.
2

Verify DNS resolution

dig $CLUSTER_DNS_QUERY
# Should return all node IPs
3

Test node connectivity

# Test EPMD port (nc -z exits immediately; telnet <other-node-ip> 4369 also works)
nc -zv <other-node-ip> 4369

Profile Configuration

Validate Profile YAML

1

Check profile syntax

# Validate YAML syntax
for f in config/profiles/*.yml; do
  echo "Validating $f"
  ruby -ryaml -e "YAML.load_file('$f')" || echo "ERROR in $f"
done
2

Verify required fields

Each profile must have:
  • name: Display name
  • slug: URL-safe identifier
  • chains: At least one chain with providers
name: "Production"
slug: "production"
chains:
  ethereum:
    chain_id: 1
    providers:
      - id: "alchemy_ethereum"
        url: "https://eth-mainnet.g.alchemy.com/v2/${ALCHEMY_API_KEY}"
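A rough grep-based check for these fields is sketched below; it assumes the keys sit at the top level (column 0) of each file. The YAML-aware validation in the previous step remains the more reliable option:

```shell
# check_profile_fields FILE
# Flags missing top-level keys; assumes keys start at column 0.
check_profile_fields() {
  local f="$1" key status=0
  for key in name slug chains; do
    grep -qE "^${key}:" "$f" || { echo "$f: missing '$key'"; status=1; }
  done
  return $status
}

# Example:
# for f in config/profiles/*.yml; do check_profile_fields "$f"; done
```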
3

Test profile loading

# Start server and check logs
mix phx.server

# Should see:
# [info] Loaded profile: production

Rate Limits

1

Configure rate limits

Set appropriate rate limits in profile frontmatter:
# config/profiles/production.yml
default_rps_limit: 100    # Requests per second per client IP
default_burst_limit: 500  # Burst allowance
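These two settings follow the usual token-bucket model: tokens refill at the per-second rate, the bucket holds at most the burst allowance, and an empty bucket maps to a 429. A minimal sketch of those semantics (illustrative only, not Lasso's actual implementation):

```shell
# Token-bucket sketch: tokens refill at RPS per second up to BURST;
# each request consumes one token; an empty bucket means a 429.
RPS=100
BURST=500
tokens=$BURST

take() {
  if [ "$tokens" -gt 0 ]; then
    tokens=$((tokens - 1))
    return 0    # request allowed
  fi
  return 1      # request would receive 429
}

refill() {      # call once per second
  tokens=$((tokens + RPS))
  if [ "$tokens" -gt "$BURST" ]; then tokens=$BURST; fi
}
```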
2

Test rate limiting

# Generate load (150 concurrent requests; print only the status codes)
for i in {1..150}; do
  curl -s -o /dev/null -w "%{http_code}\n" \
    -X POST http://localhost:4000/rpc/fastest/ethereum \
    -H "Content-Type: application/json" \
    -d '{"jsonrpc":"2.0","method":"eth_blockNumber","params":[],"id":1}' &
done
wait

# Should see 429 Too Many Requests after exceeding the limit

Health Checks

Configure Health Endpoint

1

Test health endpoint

curl http://localhost:4000/api/health
# Should return 200 OK with JSON response
2

Configure orchestrator health checks

Kubernetes:
livenessProbe:
  httpGet:
    path: /api/health
    port: 4000
  initialDelaySeconds: 30
  periodSeconds: 10
  timeoutSeconds: 5
  failureThreshold: 3

readinessProbe:
  httpGet:
    path: /api/health
    port: 4000
  initialDelaySeconds: 10
  periodSeconds: 5
  timeoutSeconds: 3
  failureThreshold: 2
Docker:
HEALTHCHECK --interval=30s --timeout=10s --retries=3 --start-period=40s \
  CMD curl -f http://localhost:4000/api/health || exit 1
Docker Compose:
healthcheck:
  test: ["CMD", "curl", "-f", "http://localhost:4000/api/health"]
  interval: 30s
  timeout: 10s
  retries: 3
  start_period: 40s
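Beyond orchestrator probes, a small wait-until-healthy helper is handy in deploy scripts. A sketch, assuming curl is available; the URL and retry count are parameters and the 2-second interval is arbitrary:

```shell
# wait_healthy URL [TRIES]
# Polls the health endpoint every 2 seconds until it returns success,
# giving up after TRIES attempts (default 30).
wait_healthy() {
  local url="$1" tries="${2:-30}" i
  for i in $(seq "$tries"); do
    if curl -fsS "$url" >/dev/null 2>&1; then
      return 0
    fi
    sleep 2
  done
  return 1
}

# Example:
# wait_healthy http://localhost:4000/api/health && echo "up"
```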

TLS/HTTPS

Terminate TLS at Load Balancer

1

Configure reverse proxy

Lasso serves HTTP. Terminate TLS at your reverse proxy or load balancer.
nginx:
server {
  listen 443 ssl http2;
  server_name rpc.example.com;
  
  ssl_certificate /path/to/cert.pem;
  ssl_certificate_key /path/to/key.pem;
  
  location / {
    proxy_pass http://lasso-backend:4000;
    proxy_http_version 1.1;
    proxy_set_header Upgrade $http_upgrade;
    proxy_set_header Connection "upgrade";
    proxy_set_header Host $host;
    proxy_set_header X-Real-IP $remote_addr;
    proxy_set_header X-Forwarded-For $proxy_add_x_forwarded_for;
    proxy_set_header X-Forwarded-Proto $scheme;
  }
}
Cloud Load Balancers:
  • AWS ALB: Configure HTTPS listener with ACM certificate
  • GCP Load Balancer: Use Google-managed certificates
  • Azure Application Gateway: Configure SSL termination
2

Set PHX_HOST to HTTPS domain

export PHX_HOST="rpc.example.com"
export PHX_SCHEME="https"  # Optional, defaults to https in prod
3

Test HTTPS

curl https://rpc.example.com/api/health

Logging

Structured JSON Logs

1

Verify JSON logging

Lasso emits structured JSON logs in production.
Example log:
{
  "event": "rpc.request.completed",
  "request_id": "abc123",
  "strategy": "fastest",
  "chain": "ethereum",
  "jsonrpc_method": "eth_blockNumber",
  "routing": {
    "selected_provider": {"id": "alchemy_ethereum"},
    "retries": 0
  },
  "timing": {
    "upstream_latency_ms": 120,
    "end_to_end_latency_ms": 125
  },
  "timestamp": "2026-03-03T10:30:45.123Z"
}
2

Configure log aggregation

Send logs to a centralized logging service.
Docker:
services:
  lasso:
    logging:
      driver: "json-file"
      options:
        max-size: "10m"
        max-file: "3"
Kubernetes:
# Logs automatically collected by cluster logging (Fluentd, etc.)
Systemd:
journalctl -u lasso -f --output=json
3

Set up log monitoring

Create alerts for:
  • High error rates
  • Circuit breaker openings
  • Provider failures
  • Slow response times (P95/P99)

Monitoring

Dashboard Access

1

Verify dashboard

curl http://localhost:4000/dashboard
# Should return HTML (200 OK)
2

Secure dashboard access (production)

The dashboard is public by default. Consider:
Option 1: Restrict via reverse proxy
location /dashboard {
  auth_basic "Lasso Dashboard";
  auth_basic_user_file /etc/nginx/.htpasswd;
  proxy_pass http://lasso-backend:4000;
}
Option 2: Firewall rules
Only allow dashboard access from internal IPs.
Option 3: VPN requirement
Require VPN connection for dashboard access.

Metrics Collection

1

Disable VM metrics (if not needed)

VM metrics (BEAM stats) are enabled by default.
Disable if not needed:
export LASSO_VM_METRICS_ENABLED="false"
2

Monitor key metrics

Track these metrics:
Request metrics:
  • Requests per second (total and per provider)
  • Latency (P50, P95, P99)
  • Error rate
  • Circuit breaker state
System metrics:
  • CPU usage
  • Memory usage
  • BEAM scheduler utilization
  • Process count
Provider metrics:
  • Provider availability
  • Provider latency
  • Circuit breaker trips
  • Failover rate

Network & Firewall

Port Configuration

1

Open required ports

Single node:
  • 4000 (HTTP, configurable via PORT)
Clustered nodes:
  • 4000 (HTTP)
  • 4369 (EPMD)
  • 9000-9999 (Erlang distribution, configurable)
2

Configure firewall rules

HTTP (4000):
  • Allow: Public (or application subnets)
EPMD (4369) and distribution ports:
  • Allow: Only cluster nodes (internal network)
  • Deny: Public internet

Capacity Planning

Resource Allocation

1

Estimate request volume

Calculate expected requests per second (RPS).
Example:
  • 1000 users
  • 10 requests per user per minute
  • = 167 RPS
Add 2x headroom: 334 RPS
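The arithmetic above can be scripted for other inputs; a sketch that rounds up with integer math:

```shell
# Capacity estimate matching the example above
users=1000
req_per_user_per_min=10
total_per_min=$(( users * req_per_user_per_min ))   # 10000 requests/minute
rps=$(( (total_per_min + 59) / 60 ))                # round up -> 167 RPS
with_headroom=$(( rps * 2 ))                        # 334 RPS
echo "Plan for ${with_headroom} RPS"
```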
2

Size compute resources

CPU:
  • 1 vCPU per 500 RPS (approximate)
  • Example: 334 RPS → 2 vCPUs
Memory:
  • Base: 512MB
  • Add 256MB per 1000 RPS
  • Example: 334 RPS → 1GB
Disk:
  • Minimal (logs only, unless persisting metrics)
  • 10GB should suffice
3

Test under load

Use load testing tools:
# Example with Apache Bench
ab -n 10000 -c 100 -p request.json -T application/json \
  http://localhost:4000/rpc/fastest/ethereum
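ab reads the POST body from the file given to -p, so create request.json first:

```shell
# POST body for ab's -p flag (filename must match the command above)
cat > request.json <<'EOF'
{"jsonrpc":"2.0","method":"eth_blockNumber","params":[],"id":1}
EOF
```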
Monitor:
  • Response time (P95, P99)
  • Error rate
  • CPU/memory usage

Disaster Recovery

Backup Configuration

1

Backup profile files

# Backup profiles
tar -czf lasso-profiles-backup.tar.gz config/profiles/
2

Document environment variables

Store environment configuration in a secure secrets manager:
  • AWS Secrets Manager
  • HashiCorp Vault
  • Kubernetes Secrets

Failover Testing

1

Test provider failover

# Simulate provider failure by blocking traffic
iptables -A OUTPUT -d <provider-ip> -j DROP

# Verify requests failover to next provider
curl -X POST "http://localhost:4000/rpc/fastest/ethereum?include_meta=body" \
  -H "Content-Type: application/json" \
  -d '{"jsonrpc":"2.0","method":"eth_blockNumber","params":[],"id":1}'

# Check lasso_meta.selected_provider.id in response
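With include_meta=body, the response embeds lasso_meta as described above; a small helper (a sketch using Python's stdlib json module) prints the provider that served the request:

```shell
# Pipe a response body through this to print the serving provider id.
extract_provider() {
  python3 -c 'import json, sys; print(json.load(sys.stdin)["lasso_meta"]["selected_provider"]["id"])'
}
```

Usage: append `| extract_provider` to the curl command above and watch the id change as providers fail over.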
2

Test node failure (if clustered)

# Stop one node
docker stop lasso-us-east-1

# Verify:
# - Dashboard shows node as :disconnected
# - Applications route to remaining nodes
# - Metrics exclude failed node

Pre-Launch Checklist

Complete verification before production launch:

Environment

  • SECRET_KEY_BASE set (64+ bytes)
  • PHX_HOST set to public hostname
  • PHX_SERVER=true set
  • LASSO_NODE_ID set to unique, stable value
  • PORT configured (if not using 4000)
  • Provider API keys set for all ${ENV_VAR} in profiles
  • If clustering: CLUSTER_DNS_QUERY and CLUSTER_NODE_BASENAME set
  • If clustering: DNS resolution verified
  • If clustering: Erlang distribution ports open between nodes

Configuration

  • Profile YAML validated (no syntax errors)
  • Profile YAML startup tested (no unresolved ${ENV_VAR})
  • Rate limits configured in profile frontmatter
  • Provider capabilities defined (or defaults accepted)
  • Circuit breaker thresholds reviewed (or defaults accepted)

Infrastructure

  • Health check (GET /api/health) monitored by orchestrator/LB
  • TLS terminated at reverse proxy or load balancer
  • Firewall rules configured (4000 public, 4369/9000+ internal only)
  • Load balancer configured (if multiple nodes per region)
  • GeoDNS/anycast configured (if multi-region)

Observability

  • Structured JSON logs verified
  • Log aggregation configured (Fluentd, CloudWatch, etc.)
  • Dashboard access secured (auth, VPN, or internal-only)
  • Metrics monitoring configured (CPU, memory, RPS, latency)
  • Alerts configured (high error rate, circuit breaker, slow response)

Testing

  • Load testing completed (verify RPS capacity)
  • Provider failover tested (circuit breaker triggers correctly)
  • Rate limiting tested (429 responses after exceeding limit)
  • If clustered: Node failure tested (dashboard shows :disconnected)
  • If multi-region: Regional failover tested (GeoDNS/LB routes correctly)

Documentation

  • Runbook created (restart procedures, troubleshooting)
  • Environment variables documented in secrets manager
  • Profile configuration backed up
  • On-call team trained (dashboard usage, common issues)

Post-Launch Monitoring

Monitor these metrics in the first 24-48 hours:
  1. Request volume: Verify within expected range
  2. Error rate: Should be <1%
  3. Latency: P95/P99 within acceptable thresholds
  4. Circuit breaker trips: Investigate any provider issues
  5. Memory/CPU: Verify resources are adequate
  6. Log errors: Review for unexpected issues

Next Steps