Safety
Safety is not a feature. It is the foundation.
We believe powerful AI demands careful development and responsible deployment. These principles govern every decision we make at Merkium.
Safety by default
Every Kili model undergoes rigorous safety evaluation — including adversarial testing and bias audits — before any public deployment.
Honest by design
We train Kili to be truthful, to surface uncertainty explicitly, and to decline requests rather than produce misleading outputs.
Interpretable
We invest in mechanistic understanding of our models' internal representations, so that we can explain model outputs rather than only evaluate them.
Human-centered
Kili is designed to augment human judgment and creativity. Our systems are tools, not replacements — and we build them accordingly.
Our commitments are public, measurable, and enforced.
Responsible Scaling Policy
We tie model capability thresholds to concrete safety requirements. If our safeguards cannot keep pace with a model's capabilities, we pause deployment until they can.
The Kili Constitution
A transparent, publicly documented set of behavioral principles that govern how Kili responds — open to external scrutiny and community feedback.
Independent red-teaming
External domain experts systematically probe every frontier release for misuse vectors, bias patterns, and failure modes before we ship.