News — Varda

June 2026

SILA: AI Safety at the Layer Where Jailbreaks Actually Live — Without Touching Free Speech

Varda's representation-layer safety control makes misuse measurably harder to land, keeps the model helpful on legitimate requests, and enforces only the line the law already draws.

The headlines keep repeating a hard truth: today's AI safety lives on the surface. Guardrails are bolted onto a model's output — a filter reading the words after the thinking is done — and a determined attacker with the right phrasing can talk straight past them. A growing chorus of experts has gone further, arguing that steering a model's behavior reliably "can't be done."

Varda built SILA to answer that — not with a bigger filter, but by working at a deeper layer.

SILA is a runtime, representation-layer safety control. Instead of policing words after the fact, it reads intent as it forms during inference and gently steers the model's internal state toward a chosen safety constitution — the layer where jailbreaks actually operate, beneath the reach of surface guardrails. It doesn't replace the safety stack a company already trusts; it strengthens that stack from underneath.

In rigorous internal benchmarking across standard adversarial suites, SILA produces a measurable, reproducible reduction in successful attacks — verified not at a single lucky setting but across a complete dose-response sweep, so the effect is mapped, repeatable, and honest. And it does so while keeping helpful answers helpful: legitimate requests stay answered, because the goal is a model that is both safer and still genuinely useful. Varda is deliberate about the claim it makes here. SILA makes misuse measurably harder — not impossible. That precision is the point; it's what separates a real control from a marketing promise.

A line drawn at the Constitution, not at an opinion.

SILA's most important design decision is what it refuses to do. The system is constitution-agnostic — it enforces whatever rulebook it's pointed at — and Varda ships it set to the legal line for free speech, grounded in U.S. Supreme Court precedent. SILA does not adjudicate political "disinformation," a label that has shifted with the seasons. It is built to stop the serious harms that people across every political perspective already agree on — the kind no one should be able to extract from an AI — while leaving the realm of opinion untouched.

It is a steering wheel, not a destination. The dial is visible, auditable, and reversible — full on, full off, or anywhere between — and Varda's choice is to hold it at the line free people have already drawn together.

That stance arrives at a fitting moment. The White House's March 2026 National Policy Framework for Artificial Intelligence names free speech among its core pillars. SILA was designed, from the start, to make safety and free expression the same mechanism rather than opposing forces.

Built to be trusted — and tested.

Varda offers SILA for private technical review, where the full benchmark results can be walked through line by line. The conviction behind the product is simple: safety you can verify, helpfulness you can feel, and a principled refusal to become a tool for anything other than preventing real harm.

"We designed SILA around the line the law already draws — no further. It can do more; we won't. Preventing genuine harm should never require silencing legitimate speech, and we built SILA so it doesn't have to."

— James Peterson, Chief Technology Officer, Varda

About Varda — Varda builds the infrastructure layer for trustworthy autonomous AI: persistent memory, continuous identity, and principled, verifiable safety. To arrange a technical review of SILA, contact varda@vardasila.com.

News & Updates

SILA: AI Safety at the Layer Where Jailbreaks Actually Live — Without Touching Free Speech

A line drawn at the Constitution, not at an opinion.

Built to be trusted — and tested.