Status

buildr-plannr system status

Public service health for agent planning, authentication, billing, data, and background work. The same signals drive the internal on-call path and customer incident decisions.

Current state

All customer-facing buildr-plannr systems are operational.

Operational

Updated 23 May 2026, 02:55 UTC

Components

Customer-facing service map

Status summary API

Web app

Marketing pages, login, secured app shell, and workspace navigation.

Operational
Public impact
Customers can open public pages and authenticated workspaces.
Owner
Product engineering

Planner API

Workspace, project, issue, evidence, import, export, and agent routes.

Operational
Public impact
Planning APIs are available for human and agent workflows.
Owner
API owner

Cognito auth

Custom login, signup, verification, reset, logout, and hosted UI callback.

Operational
Public impact
Users can authenticate and reach secured workspaces.
Owner
Auth owner

Agent work queue

Ready-work discovery, agent claims, run quotas, task contracts, and evidence.

Operational
Public impact
Agents can claim scoped work when workspace policy allows it.
Owner
Agent platform owner

Billing and entitlements

Stripe checkout, portal, webhooks, plan limits, and usage enforcement.

Operational
Public impact
Billing operations and plan entitlements are available.
Owner
Billing owner

Data layer and backups

DynamoDB storage, backup/export controls, retention, and restore workflows.

Operational
Public impact
Workspace data reads and writes are healthy.
Owner
Data owner

Uptime alerts

Alerts route from service symptoms to accountable owners

Synthetic health and readiness

Scheduled checks for /health and /ready from outside AWS.

CRITICAL
Threshold
Two consecutive failures or 60 seconds above the response SLO.
First response
5 minutes
Customer visible
Yes

Web app 5xx rate

CloudFront and origin 5xx rates for public and app routes.

CRITICAL
Threshold
5xx rate above 2% for 5 minutes or any sustained origin outage.
First response
5 minutes
Customer visible
Yes

Planner API failure rate

Route-level API errors for projects, issues, imports, exports, and agents.

HIGH
Threshold
5xx rate above 1% for 10 minutes or one critical write path down.
First response
10 minutes
Customer visible
Yes

Cognito auth failure spike

Login, signup, verification, reset, callback, and token validation errors.

HIGH
Threshold
Failure rate doubles the 24 hour baseline for 10 minutes.
First response
10 minutes
Customer visible
Yes

Stripe webhook failures

Stripe delivery failures, replay failures, and entitlement drift.

HIGH
Threshold
Any production webhook outage or three failed deliveries in 15 minutes.
First response
10 minutes
Customer visible
Yes

Agent queue age

Oldest agent, import, export, or evidence job age.

MEDIUM
Threshold
Oldest customer-visible job exceeds 10 minutes.
First response
30 minutes
Customer visible
Internal first

DynamoDB health

Throttles, system errors, PITR status, and backup/export job failures.

CRITICAL
Threshold
Any table unavailable, PITR disabled, or sustained throttling.
First response
5 minutes
Customer visible
Yes