Question 1

Why move AI on-premise in 2026?

Accepted Answer

Regulation, model capability, and hardware economics have converged. The EU AI Act and ISO 42001 have raised the audit threshold for cloud AI in regulated environments. Open-weight models have closed the capability gap with proprietary cloud offerings. A single on-premise node now serves a 70B-parameter model to a working team without per-seat subscription costs. Retaining data inside the perimeter is now a tractable architectural decision rather than a cost trade-off.

Question 2

Is the environment genuinely private?

Accepted Answer

Yes. Inference and retrieval execute on client hardware, inside the client perimeter. No outbound call is made to a vendor API at inference time. Network egress is constrained at the firewall by default, and the environment operates under enforced network isolation.

Question 3

Why not adopt an enterprise cloud-AI product?

Accepted Answer

Enterprise cloud AI is a capable product. The structural constraint is unchanged: client documents continue to traverse a third-party environment. For attorney-client privilege, patient data, or trade secrets, this is an architectural question, not a contractual one. An on-premise deployment removes the question by removing the external system from the data path.

Question 4

What is the typical engagement duration?

Accepted Answer

5 to 14 working days from agreement to handover. The majority of the engagement is spent on document corpus integration and connection to existing internal systems, not on the inference layer itself. Standard commercial terms are 50% on engagement, 50% on handover.

Question 5

Is a dedicated internal owner required to operate the environment?

Accepted Answer

No. The architecture is deliberately conservative and open-source throughout. Operational ownership transfers to client IT under a written runbook. Engagement experience indicates most clients require no further intervention for six months or more following handover.

Question 6

What if the consultancy becomes unavailable?

Accepted Answer

Every component delivered is open-source and documented. A written runbook is provided at handover. A competent internal administrator can assume operations directly, and a small network of independent consultants operating the same architecture is available as an alternative. No proprietary lock-in is introduced at any layer of the architecture.

Four engagements. One principle: AI stays inside your perimeter.

Select the entry point for your environment.

Architectural Assessment

Initiate discussion

Sovereign Deployment

Initiate discussion

Managed Operations

Initiate discussion

Bespoke Capability Build

Initiate discussion

From first engagement to production environment - typically inside three weeks.

Qualification

Assessment

Deployment

Operations

An open-architecture stack, owned and operated inside your perimeter.

Inference layer

Retrieval layer

Workflow layer

Isolation perimeter

Compute substrate

Questions reviewed before engagement.

Scope your engagement.