zedbyl.tech/blog

Notes from the field.

Long-form writing on private AI deployment, on-premises infrastructure, compliance, and the engineering decisions behind every system I ship.

Filter by tag

8 / 8 posts

FEATURED · FIELD NOTE 001

2026 · 04

ARCHITECTURE9 min read

Why on-premises is not "cloud without internet"

The engineering trade-offs behind real isolation, GDPR data residency, and where most "private ChatGPT" pitches fall apart under audit.

Apr 12, 2026Nikita Chetverikov

Read article →

04· All field notes

FN / 002

2026 · 05

INFRAMay 7, 202616 min

Apple Silicon as an inference node: M4 Max & M3 Ultra, honest digits

Benchmarks for 70B models on M4 Max and M3 Ultra. Why Apple is betting on local inference - and what the token economics tell us about the future.

FN / 003

2026 · 05

COMPLIANCEMay 4, 202611 min

Public AI assistants in higher education: the GDPR exposure most institutions have not assessed

When staff paste student work into a public AI assistant - ChatGPT, Claude, Gemini, whichever - the institution becomes the controller for a processor it never contracted. A walk through GDPR Articles 5, 28, 32 and 35, the rulings already issued, and the architectural fix that does not require banning AI.

FN / 004

2026 · 04

ARCHITECTUREApr 28, 202614 min

A private LLM for a research lab: notes from a 14-day rollout

Field notes from a 14-day on-premises private LLM deployment for a 28-person genomics lab: Mac Studio M3 Ultra running Llama 3.3 70B via Ollama, AnythingLLM RAG over 2,400 unpublished documents, pfSense deny-by-default egress, and zero outbound bytes after handover - with the hardware trade-offs, GDPR and grant-compliance framing, and the three things that broke.

FN / 005

2026 · 05

CASE STUDYMay 18, 202612 min

Private RAG for contract review: a law firm case study (Dubai/London)

How a Dubai/London law firm cut contract review time by 73% with a private Llama 3.1 70B RAG running on-prem over 12k binding documents - audit-grade citations, NDA-safe, no cloud.

FN / 006

2026 · 05

CASE STUDYMay 19, 202611 min

Hand-written referrals to structured EHR records: a clinic NER case study (UAE)

A UAE clinic group turned hand-written referral letters into structured EHR records in 4 seconds with 99.2% NER precision - on a single on-prem A6000, never touching the public internet.

FN / 007

2026 · 06

OPINIONJun 16, 202615 min

Why 40% of AI Projects Get Canceled - And the Five Decisions That Separate the Rest

Gartner says more than 40% of AI agent projects will be canceled by the end of 2027. The reason is almost never the technology. Here is what actually goes wrong - and a practical framework for not ending up in that pile.

FN / 008

2026 · 06

CASE STUDYJun 28, 202615 min

How I Built an Autonomous AI Content Engine for a Crypto Media Company

A crypto media company needed to cover a 24/7 market with a finite editorial team. Here is how I built an autonomous pipeline that went from detecting events to publishing finished articles - and why the hardest part had nothing to do with AI generation.

One field note a month. No marketing.

If you run sensitive data and want a heads-up when I publish, drop your email. No tracking, no upsells, no AI roundup of someone else's blog.