Real agentic workflows.
Every token on-prem.
Two regulated-industry agents running live against a Cascadia mesh: open-weight 8B-class models spread across a room of Intel AI PCs, one of them pipeline-parallel across two machines. Each step shows its serving node, its latency, and a signed receipt. Open the live routing dashboard side by side and watch the requests land.
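To make the per-step record concrete, here is a minimal sketch of how a step could be wrapped so that it reports its serving node, its latency, and a signed receipt. The page does not describe the actual receipt scheme, so the HMAC-SHA256 signature, the `SECRET` key, the field names, and the `run_step` / `verify` helpers are all assumptions made for illustration.

```python
import hashlib
import hmac
import json
import time

# Hypothetical demo key; the real signing scheme and key handling are not
# described on this page.
SECRET = b"demo-only-key"


def sign(body):
    """Return an HMAC-SHA256 hex digest over the canonical JSON of `body`."""
    msg = json.dumps(body, sort_keys=True).encode()
    return hmac.new(SECRET, msg, hashlib.sha256).hexdigest()


def run_step(name, node, fn, payload):
    """Run one agent step and return (output, record).

    The record carries the step name, serving node, wall-clock latency,
    a hash of the output, and a receipt signed over those fields.
    """
    start = time.perf_counter()
    output = fn(payload)
    latency_ms = (time.perf_counter() - start) * 1000.0
    body = {
        "step": name,
        "node": node,
        "latency_ms": round(latency_ms, 3),
        "output_sha256": hashlib.sha256(
            json.dumps(output, sort_keys=True).encode()
        ).hexdigest(),
    }
    return output, {**body, "receipt": sign(body)}


def verify(record):
    """Recompute the signature over every field except the receipt itself."""
    body = {k: v for k, v in record.items() if k != "receipt"}
    return hmac.compare_digest(sign(body), record["receipt"])
```

In this sketch, changing any field of a record after the fact (for example the node name) makes `verify` return `False`, which is the property a receipt needs for audit purposes.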
Healthcare
Clinical referral triage
extract → triage criteria → urgency → ICD-10 coding assist → schedule → SBAR + letters → safety gate
Why on-prem: PHI never leaves the premises
Run the demo →
Finance
KYC onboarding + AML screening
extract → watchlist screen → adjudicate hits → adverse media → risk rules → MLRO memo → policy gate
Why on-prem: BSA/AML · SAR-adjacent confidentiality
Run the demo →
The fleet
| Model | Placement | Role |
| --- | --- | --- |
| qwen3-8b | single node | extraction · classification · adjudication · QA gates |
| llama-8b-2stage | pipeline-parallel × 2 AI PCs | long-form synthesis, streamed live off the chain |
| phi-3.5-mini | single node | JSON repair rung · gate fallback |
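The division of labor in the fleet table can be sketched as a small routing map. The model names come from the table; the task labels, the `pick_model` and `call_json` helpers, and the `call_model(model, prompt)` client are hypothetical, and the mesh's real routing logic may well differ.

```python
import json

# Task-to-model routing read off the fleet table. The task keys are
# illustrative labels, not identifiers used by the mesh itself.
ROUTES = {
    "extraction": "qwen3-8b",
    "classification": "qwen3-8b",
    "adjudication": "qwen3-8b",
    "qa_gate": "qwen3-8b",
    "synthesis": "llama-8b-2stage",
    "json_repair": "phi-3.5-mini",
}
GATE_FALLBACK = "phi-3.5-mini"


def pick_model(task, unavailable=frozenset()):
    """Choose a model for a task, falling back for gates if the primary is down."""
    model = ROUTES[task]
    if task == "qa_gate" and model in unavailable:
        return GATE_FALLBACK
    return model


def call_json(task, prompt, call_model):
    """Call the routed model and parse its reply as JSON.

    `call_model(model, prompt) -> str` is a hypothetical client. If the
    reply is not valid JSON, the raw text is handed to the JSON repair
    model once and the repaired reply is parsed instead.
    """
    raw = call_model(pick_model(task), prompt)
    try:
        return json.loads(raw)
    except json.JSONDecodeError:
        repaired = call_model(ROUTES["json_repair"], raw)
        return json.loads(repaired)
```

For example, `pick_model("synthesis")` returns `"llama-8b-2stage"`, and `pick_model("qa_gate", unavailable={"qwen3-8b"})` returns `"phi-3.5-mini"`. The single repair attempt is a design choice of this sketch; a second failure raises `json.JSONDecodeError` rather than looping.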