Name: Mistral Small 3
Author: Mistral AI

Question 1

Is Mistral Small 3 free to use?

Accepted Answer

Mistral Small 3 is free to run locally on your own hardware. Hosted access through osFoundry is metered (input Free (local) / $ 0.10 /1M, output Free (local) / $ 0.30 /1M). You can switch between local and hosted at any time.

Question 2

Can I use Mistral Small 3 commercially?

Accepted Answer

Yes — commercial use is allowed. Permits commercial use, modification, distribution, and patent grants without royalties. Attribution required (preserve copyright + licence notices).

Question 3

What is the context window of Mistral Small 3?

Accepted Answer

Mistral Small 3 supports a 32K token context window.

Question 4

How much VRAM does Mistral Small 3 need?

Accepted Answer

Approximately 15 GB at Q4 quantisation, or 58 GB at full FP16 precision. Fits on a single 24GB consumer GPU.

Question 5

Can I run Mistral Small 3 locally?

Accepted Answer

Yes. Mistral Small 3 is open-weights and runs locally on a workstation GPU. osFoundry's local runtime handles model loading, quantisation, and routing.

Question 6

What is Mistral Small 3 best at?

Accepted Answer

Mistral Small 3 is well-suited to low-latency chat and routing, tool calling and function use, edge deployment on consumer GPUs.

Question 7

How do I use Mistral Small 3 in osFoundry?

Accepted Answer

Paste your Mistral AI API key in the key dialog (or deploy the open weights for self-hostable models), assign Mistral Small 3 to a Maestro role in the Pipeline tab, then use it in chat, Room Apps via invokeAI, or your own apps.