About

Frontier AI
doesn't need a
datacenter.

Ternative is an open-source AI project from Colombia, building competitive language models — and the inference engine to run them — on hardware ordinary people already own.

The thesis

Constraints as
a method.

The dominant story of modern AI is one of scale: more parameters, more GPUs, more capital. Orchid 1.0 is an argument against that being the only path.

It's a 2-billion-parameter ternary-weight model, fine-tuned from Microsoft's BitNet b1.58 and aligned with ORPO across two preference-tuning rounds — the first competitive LLM trained and aligned in Colombia. Every stage ran on a single RTX 3050 laptop with 4 GB of VRAM. No cloud, no cluster.

Getting there meant solving problems the big labs never hit: how to fine-tune in 4 GB, and how to serve a ternary model with a LoRA adapter when no existing engine could. So we built ternative — and released all of it under Apache 2.0, with weights, code, a technical paper, and an archived DOI.

What we stand for

Reproducible, local, open.

−10+1

Reproducible

A documented recipe with published failure modes — not a black box. Reproduce the benchmarks with the scripts in the repo.

local-first

Local-first

Inference runs on your machine — CPU-only if needed. No account, no telemetry, no cloud dependency.

Apache 2.0

Open

Weights, engine, paper and data recipe — all public, free for research and commercial use alike.

Sponsor the work

Keep open AI
independent.

Ternative is built and maintained in the open, outside any large lab. Funding goes directly to continued development — better models, a faster engine, and the friendly Orchid Desktop app. If your organization relies on open, reproducible AI, consider supporting it through FLOSS/fund.

Support on FLOSS/fund ↗ Star on GitHub ↗

Apache 2.0

Permissive license — yours to build on

100% open

Weights · engine · paper · data recipe

Everything, in the open

Project links

Hugging Face
Model card, GGUF weights & LoRA adapter huggingface.co ↗ ternative engine
C++17 / CUDA inference engine — source & releases github.com ↗ orchid-1.0 recipe
Training recipe, eval harness & reproduction scripts github.com ↗ Zenodo · Technical paper
DOI 10.5281/zenodo.20452163 — archived research record zenodo.org ↗ FLOSS/fund
Fund continued open development floss.fund ↗

Cite the work

Used Orchid or ternative
in research?

Both have a citable record. The model is archived on Zenodo with a permanent DOI.

citation.bib

@software{romerochisco2026ternative,
  title  = {ternative: Inference Engine for
            Ternary-Weight LLMs with Runtime LoRA},
  author = {Romero Chisco, Michelangelo},
  year   = {2026},
  license = {Apache-2.0}
}

Frontier AIdoesn't need adatacenter.

Constraints asa method.