Question 1

Can I keep using llama.cpp underneath?

Accepted Answer

Yes, but you don’t need to: osFoundry has its own inference runtime. If you’re committed to a custom runtime, the BYO-VPC / BYO-server path lets you point Maestro at your own endpoint.
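As a rough illustration of what "your own endpoint" could look like, the sketch below builds a chat request for a self-hosted llama.cpp server, assuming the server is running with its OpenAI-compatible HTTP API enabled. The names `buildChatRequest`, `EndpointRequest`, the `local-model` identifier, and the localhost URL are illustrative assumptions. How Maestro itself is configured to use such an endpoint is not covered in this FAQ.

```typescript
// Hypothetical sketch: none of these names come from osFoundry's API.
interface ChatMessage {
  role: "system" | "user" | "assistant";
  content: string;
}

interface EndpointRequest {
  url: string;
  method: "POST";
  headers: Record<string, string>;
  body: string;
}

// Build a chat request for a self-hosted, OpenAI-compatible endpoint.
function buildChatRequest(
  baseUrl: string,
  model: string,
  messages: ChatMessage[],
): EndpointRequest {
  // Strip trailing slashes so the path joins cleanly.
  const root = baseUrl.replace(/\/+$/, "");
  return {
    url: `${root}/v1/chat/completions`,
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify({ model, messages }),
  };
}

// Assumed local address and model name; replace with your own.
const request = buildChatRequest("http://localhost:8080/", "local-model", [
  { role: "user", content: "Hello" },
]);
console.log(request.url);
```

The function only describes the request, so it can be checked without a running server. The resulting object can be passed to whatever HTTP client you already use.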

Question 2

Is osFoundry as customisable as a DIY stack?

Accepted Answer

For the integration points (prompts, retrieval, routing, post-hooks, tools), yes, via osStudio plugins. For the runtime internals (KV-cache management, attention kernels), no: that layer is opinionated.

Question 3

Do I still control my data?

Accepted Answer

Yes. Local-first mode keeps everything on-device. BYO-VPC is available for enterprise. Open-weight models mean no proprietary lock-in.

Question 4

What about cost?

Accepted Answer

For local-only usage, osFoundry is free. For team / cloud features, you pay per-second compute and per-GB storage, typically 60–90% less than running the equivalent DIY infrastructure at the same uptime, once you factor in ops time.
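To check that comparison against your own numbers, a small calculation along these lines may help. Every figure below (rates, usage, and the DIY total) is a made-up placeholder, not osFoundry pricing, and the function names are illustrative. Substitute your real rates and your own estimate of DIY cost, including ops time.

```typescript
// All rates and usage figures are placeholders, not osFoundry pricing.
interface Usage {
  computeSeconds: number;
  storageGb: number;
}

interface Rates {
  perComputeSecond: number; // currency units per second of compute
  perGb: number;            // currency units per GB stored
}

// Usage-based bill: compute billed per second plus storage billed per GB.
function usageCost(usage: Usage, rates: Rates): number {
  return (
    usage.computeSeconds * rates.perComputeSecond +
    usage.storageGb * rates.perGb
  );
}

// Percentage saved relative to a DIY baseline (hardware, hosting, ops time).
function savingsPercent(managedCost: number, diyCost: number): number {
  if (diyCost <= 0) {
    throw new Error("diyCost must be positive");
  }
  return ((diyCost - managedCost) / diyCost) * 100;
}

// Placeholder inputs for illustration only.
const exampleUsage: Usage = { computeSeconds: 200_000, storageGb: 50 };
const exampleRates: Rates = { perComputeSecond: 0.0005, perGb: 0.1 };
const exampleDiyCost = 400;

const managed = usageCost(exampleUsage, exampleRates);
console.log(managed.toFixed(2), savingsPercent(managed, exampleDiyCost).toFixed(1));
```

The DIY baseline is the number most likely to be underestimated, since it has to include the engineering time spent keeping the stack running.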

Question 5

Can osFoundry plugins replace my custom code?

Accepted Answer

For most patterns, yes. Retrieval stages, post-hooks, routing rules, custom commands, tool UIs, and workspace guards all have a plugin slot. Write the same TypeScript you’d write in a custom integration, ship it as a plugin, share it.
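A minimal sketch of what moving custom code into a post-hook might look like is shown below. The `PostHook` and `PluginSketch` types and the `runPostHooks` helper are hypothetical stand-ins invented for illustration. The actual osStudio plugin API is not described in this FAQ and may look quite different.

```typescript
// Hypothetical plugin shape: the real osStudio plugin API may differ.
type PostHook = (output: string) => string;

interface PluginSketch {
  name: string;
  postHooks: PostHook[];
}

// Example post-hook: mask anything that looks like an email address.
const redactEmails: PostHook = (output) =>
  output.replace(/[\w.+-]+@[\w-]+(\.[\w-]+)+/g, "[redacted]");

// Example post-hook: strip leading and trailing whitespace.
const trimWhitespace: PostHook = (output) => output.trim();

const plugin: PluginSketch = {
  name: "redact-and-trim",
  postHooks: [redactEmails, trimWhitespace],
};

// Apply each hook in order, feeding each result into the next.
function runPostHooks(p: PluginSketch, output: string): string {
  return p.postHooks.reduce((text, hook) => hook(text), output);
}

console.log(runPostHooks(plugin, "  Contact alice@example.com for access.  "));
```

Each hook here is a plain function, so logic you already have in a custom integration can be tested in isolation before you wrap it in whatever the plugin slot requires.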

Question 6

Is the community catalogue actually useful?

Accepted Answer

Increasingly, yes: apps, agents, MCP servers, prompts, and retrieval pipelines are already shareable. Quality varies, so the usual workflow is to install an entry and fork it.