Provisional Patent Pending

We broke the scaling laws.

15M parameters. One epoch on a laptop. 2M Zulu sentences. 61.7% zero-shot intent on Korean. 56.5% on Japanese. 40+ languages. Runs on a $50 phone. No data centers. No H100s. No parallel data.

Frontier models serve roughly 100 of Earth's 6,000 languages and demand GPUs to run. (Stanford HAI, 2024) Bhala is the inversion — small, local, governable — and the substrate for every other language and every other stack.

The three inversions

A new class of AI.

Three bets the industry got wrong. Three results that change the valuation of every model trained under the old assumptions.

Large → Small

Capability isn't scale.

15M parameters beats InkubaLM-422M on Swahili intent and ties GPT-4o — from a model trained on one language in one epoch on a laptop.

See Swahili head-to-head
One → Many

Train structure once. Every language inherits it.

Zulu → Korean (61.7%). Zulu → Japanese (56.5%). Zulu → Hindi (60.3%). Zulu → Amharic (60.9%). No parallel data. No target-language training.

See cross-family matrix
Opaque → Composable

Prompts are blunt. Operators are algebra.

Name them. Compose them. Sign them. Reverse them. Nine independent claims in the provisional filing protect the controllable-embedding API as a product.

See the API spec

Business Impact

Measurable outcomes for the enterprise.

We focus on the results that move the needle—reducing risk, ensuring compliance, and slashing operational costs through verifiable AI.

Fewer wrong answers

Your AI stops making things up when the question is sensitive.

When a customer asks a compliance-sensitive question, your AI pulls the right documents the first time — and you get a signed record of exactly why it chose them. No guessing. No arguing with auditors.

Buyers: Banks · Insurers · Health Systems
EU AI Act ready

Show the regulator what the AI did, before they ask.

Every decision the model makes comes with a plain-language receipt: what it saw, what it chose, why. Remove bias on a specific prompt without retraining the model. Prove it was removed.

Buyers: Regulated Enterprise · Government · Legal
Runs on a $50 phone

Your product keeps working when the internet doesn't.

The model fits on a cheap smartphone and answers in under a blink. No cloud call, no monthly bill per user, no data leaves the device. Ideal for emerging markets, field work, and anything offline.

Buyers: Mobile OEMs · Defense · Field Ops

Pick the product

Which one fits your situation?

Every use case below is a ready-made product. Pick the one closest to what you need — you can always compose your own from the same building blocks.

Governable Embeddings

Precisely steer AI behavior with named operators for sentiment, intent, and bias. Ensure 100% predictable outcomes with an auditable trail for every single inference.

Composes

Governable EmbeddingsSozisi ManifoldMorpheme-Aware Tokenization

RAG Platforms, Regulated Industries, Compliance Teams

Interpretable AI

Turn opaque AI into an inspectable system. Every decision leaves a mathematical trace, making your deployment compliant with the EU AI Act and other strict regulatory frameworks.

Composes

Governable EmbeddingsSozisi ManifoldMorpheme-Aware Tokenization

Financial Services, Healthcare, Regulators, Legal

Sovereign AI

Complete intelligence independence. Deploy the entire stack on your own infrastructure to ensure total control over your data, weights, and security perimeter.

Composes

Sozisi ManifoldGovernable EmbeddingsMorpheme-Aware TokenizationEdge-Class BackboneSelf-Healing Inference

Governments, Central Banks, Defense, Regulated Enterprise

Edge AI

High-performance intelligence on any device, offline. Bring powerful NLU to smartphones and IoT hardware with zero cloud dependency and sub-50ms latency.

Composes

Edge-Class BackboneMorpheme-Aware Tokenization

Mobile OEMs, Defense, Rural Tech, Offline Markets

Translation

Enter new markets instantly. Our structural approach enables high-fidelity translation into local languages without the need for massive, expensive parallel datasets.

Composes

Sozisi ManifoldGovernable Embeddings

Localization, Customer Ops, Public Sector

Language Understanding

Teach a model vocabulary and it knows one language. Teach it structure and it knows thousands. Works zero-shot across 40+ languages today — because scaffolding is what they share.

Composes

Sozisi ManifoldMorpheme-Aware TokenizationGovernable Embeddings

Fintech, Healthtech, Global SaaS

Developer preview · v1Responsible AI · EU AI Act ready

One call. A signed receipt.

Apply a named action — change sentiment, redirect intent, remove bias — to any query at inference. Every call returns the result and a signed record your compliance team can read.

  • • One HTTP call. No retraining. No re-encoding.
  • • Works across 40+ languages with the same action.
  • • Every intervention logged, reversible, auditable.
POST /v1/embeddings/shift
curl https://api.bhala.ai/v1/embeddings/shift \
  -H "Authorization: Bearer $BHALA_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "text": "I want to cancel my subscription",
    "lang": "en",
    "operators": [
      { "id": "sentiment_positive", "alpha": 1.0 }
    ]
  }'

# → {
#   "embedding": [ ... 128 floats ... ],
#   "operators_applied": [
#     { "id": "sentiment_positive", "alpha": 1.0,
#       "shift_norm": 0.431, "latency_ms": 23 }
#   ],
#   "model": "sci-v3",
#   "audit_id": "aud_01HXX..."
# }

Roadmap

From one language to the substrate of every stack.

We proved the hardest case first. Next we ship the API, then we become the interpretability layer for every foundation model.

Phase 1Shipped
2025 — Shipped

Proof on the hardest case

Pretrain on a single morphologically-rich language (isiZulu). Demonstrate zero-shot transfer to 40+ more across 10 families. Core patents filed on the architecture and the operators.

Phase 2Active
Q2 2026

Governable Embeddings API

Ship the operator library, audit log, and per-operator billing. First enterprise pilots in regulated verticals — banking, healthcare, compliance.

Phase 3Planned
Q3–Q4 2026

Composition layer for every foundation model

A thin layer that wraps OpenAI, Cohere, or any customer's in-house embedding model — and makes it governable. Our operators, their backbone.

Phase 4Planned
2027

The interpretability layer of the AI stack

Every regulator, bank, and health system requires auditable interventions. We are the substrate that makes their AI legal to ship.

Compose the intelligence you need.

Start with one core unit. Add more as you grow. Own the whole stack when you're ready for sovereign deployment.

Backed by

Techstars