Controlled public beta

Known limits

InferGrade helps technical users choose and compare local LLM setups with evidence. It does not certify that a model is safe, reliable, legally suitable, or ready for operational use.

Supported path

Apple Silicon with the Desktop Runner is the reference beta path.
Managed llama.cpp Metal runtime setup is supported after explicit user action.
Hub queued runs require a paired, listening Runner whose release matches the Hub pinned Runner version.
Uploaded results are private by default. Public result URLs resolve only after the owner publishes a result.

Preview and deferred paths

Windows with NVIDIA CUDA is a technical beta target, not a controlled-public-beta feature yet.
Linux CUDA remains a best-effort CLI path until there is a separate support plan.
ROCm, Vulkan, non-Mac CPU-only paths, and build-from-source runtime setup are experimental unless a result says otherwise.
Cloud execution is deferred unless an internal run is explicitly marked experimental.

Evidence shapes

Demo, sample, thin, failed, and informational evidence stay visibly distinct from decision-grade evidence.
Result pages describe what a run proves, what it does not prove, and what benchmark would improve the answer.
Scores are scoped to the use case, benchmark lane, hardware, runtime, and artifact that produced the evidence.
InferGrade does not publish raw private outputs, runner tokens, pairing codes, signed URLs, or private artifacts.

Before relying on an answer

Check whether the Result page is exact, nearby, sample, or informational evidence.
Compare only within the stated use case, hardware, runtime, and benchmark scope.
Run the suggested follow-up benchmark when the answer asks for confirmation.

Read methodology Install quickstart Open Hub