WACS.WASI.NN 0.3.4

There is a newer version of this package available.
See the version list below for details.

dotnet add package WACS.WASI.NN --version 0.3.4

NuGet\Install-Package WACS.WASI.NN -Version 0.3.4

This command is intended to be used within the Package Manager Console in Visual Studio, as it uses the NuGet module's version of Install-Package.

<PackageReference Include="WACS.WASI.NN" Version="0.3.4" />

For projects that support PackageReference, copy this XML node into the project file to reference the package.

<PackageVersion Include="WACS.WASI.NN" Version="0.3.4" />
 

 Directory.Packages.props

<PackageReference Include="WACS.WASI.NN" />
 

 Project file

For projects that support Central Package Management (CPM), copy this XML node into the solution Directory.Packages.props file to version the package.

paket add WACS.WASI.NN --version 0.3.4

The NuGet Team does not provide support for this client. Please contact its maintainers for support.

#r "nuget: WACS.WASI.NN, 0.3.4"

#r directive can be used in F# Interactive and Polyglot Notebooks. Copy this into the interactive tool or source code of the script to reference the package.

#:package WACS.WASI.NN@0.3.4

#:package directive can be used in C# file-based apps starting in .NET 10 preview 4. Copy this into a .cs file before any lines of code to reference the package.

#addin nuget:?package=WACS.WASI.NN&version=0.3.4
 

 Install as a Cake Addin

#tool nuget:?package=WACS.WASI.NN&version=0.3.4
 

 Install as a Cake Tool

The NuGet Team does not provide support for this client. Please contact its maintainers for support.

WACS.WASI.NN

Host bindings for wasi-nn — the WebAssembly System Interface for neural-network inference. Both the component-model WIT (wasi:nn@0.2.0-rc-2024-10-28) and the legacy WITX (wasi_ephemeral_nn) ABIs are wired against a single backend SPI; embedders configure which backend handles which graph encoding and the host orchestrator routes accordingly.

Packages

Package	Role
`WACS.WASI.NN`	Core: `WasiNNConfiguration`, `WasiNNHost` (`IBindable`), `IBackend` SPI, WIT + WITX bindings, source-gen `[WitSource]` interfaces under `Wacs.WASI.NN.Nn.{Tensor, Errors, Graph, Inference}`, `IdentityBackend`, `runtime.UseWasiNN(...)` extension
`WACS.WASI.NN.DependencyInjection`	DI bundle for the transpiler-direct-link path: `WasiNNBundle`, `WasiPreview2NNBundle` composite, concrete resource impls (`Tensor`, `Graph`, `GraphExecutionContext`, `Error`), `services.AddWasiNN(...)` registration
`WACS.WASI.NN.OnnxRuntime`	Direct ONNX Runtime backend (`graph-encoding.onnx`). Ships `WasiNNOnnxBindable` parameterless adapter for `--bind`
`WACS.WASI.NN.MLNet`	Microsoft.ML-flavored backend wrapping ONNX Runtime under an `MLContext` lifecycle. Ships `WasiNNMLNetBindable` for `--bind`
`WACS.WASI.NN.LlamaSharp`	LlamaSharp / llama.cpp backend (`graph-encoding.ggml`). Ships `WasiNNLlamaSharpBindable` with `WACS_WASINN_GGUF_DIR`-driven name registry

The packages are siblings — consumers wiring only one backend skip the others' NuGet transitives (ORT native binaries, Microsoft.ML, LlamaSharp's llama.cpp runtime).

Quick start

CLI (zero code):

wacs run my.component.wasm --wasip2 --wasi-nn -d ./models
# --wasi-nn loads Wacs.WASI.NN.OnnxRuntime; bundled with the CLI.
# Component auto-dispatches wasi:cli/run@<version>#run.

Embedder one-liner (interpreter path):

using Wacs.Core.Runtime;
using Wacs.WASI.NN;
using Wacs.WASI.NN.OnnxRuntime;
using Wacs.WASI.NN.Types;

var runtime = new WasmRuntime();
using var host = runtime.UseWasiNN(b =>
 b.AddBackend(GraphEncoding.ONNX, new OnnxBackend()));
// runtime now satisfies wasi-nn imports for both ABIs.

Verbose path (still supported):

var cfg = WasiNNConfiguration.DefaultConfiguration();
cfg.Backends[GraphEncoding.ONNX] = new OnnxBackend();
using var host = new WasiNNHost(cfg);
host.BindToRuntime(runtime);

Transpiler-direct-link (component-model perf path):

using Wacs.WASI.Preview2.DependencyInjection;
using Wacs.WASI.NN.DependencyInjection;

services
 .AddWasiPreview2()
 .AddWasiNN(b => b.AddBackend(GraphEncoding.ONNX, new OnnxBackend()))
 .AddWasiPreview2NNBundle(); // composite for the single hostBundle slot

HostPackageResolver finds the WasiPreview2NNBundle composite when both packages are loaded — its forwarding properties cover both Preview2 + WASI.NN [WitSource] interfaces through one CLR object, no transpiler emit changes required.

For LLM workloads on the WasmEdge GGUF convention:

var llama = LlamaSharpBackend.FromPaths(new Dictionary<string, string>
{
 ["llama-7b"] = "/models/llama-7b-q4.gguf",
});

var cfg = WasiNNConfiguration.DefaultConfiguration();
cfg.LoadByNameBackend = llama; // takes precedence over NamedModelResolver
// Optional: cfg.Backends[GraphEncoding.ONNX] = new OnnxBackend();

using var host = new WasiNNHost(cfg);
host.BindToRuntime(runtime);

Guest pseudocode (any wasi-nn binding library targeting either ABI):

let graph = wasi_nn::load_by_name("llama-7b")?;
let mut ctx = graph.init_execution_context()?;
let prompt = "What is 2 + 2?";
let input = Tensor::new(&[prompt.len() as u32], TensorType::U8, prompt.as_bytes());
let outputs = ctx.compute(&[("0", &input)])?;
let response = std::str::from_utf8(outputs[0].1.data())?;

Architecture

Guest (compiled against either WIT or WITX)
 │
 │ imports
 ▼
WasmRuntime — wasi:nn/{tensor,graph,inference,errors}@0.2.0-rc-2024-10-28
 + wasi_ephemeral_nn (legacy)
 │
 ▼
WitBindings.cs / WitxBindings.cs — canonical-ABI lift/lower per import
 │
 ▼
WasiNNHost — resource tables + LoadGraphByNameDispatch + ResolveBackend
 │
 ▼
IBackend (encoding-keyed in WasiNNConfiguration.Backends, plus the
 optional LoadByNameBackend slot for backend-internal registries)
 │
 ▼
IBackendGraph → IBackendContext → Compute(NamedTensor[]) → NamedTensor[]

Dual ABIs, one SPI

Both ABIs converge on the same IBackend surface. The WITX side synthesizes input names by index ("0", "1", …) so the backend never needs to know whether the guest came in through the legacy or component-model path. Resource handles (graph / context / tensor / error) live in shared ResourceTables on WasiNNHost, so the two ABIs are interchangeable — a guest could in principle mint a graph through WITX and call its WIT methods, though no real guest does that.

Zero-copy graph-load

graph.load(builders, ...) lifts the guest's list<list<u8>> directly as ReadOnlyMemory<byte> views over the linear-memory array — no host-side copy. Backends MUST consume the bytes before LoadGraph returns; ORT / LlamaSharp / Microsoft.ML all naturally satisfy this (their load APIs copy/pin model bytes into native memory at session/ weights construction). For multi-MB ONNX models or multi-GB GGUFs, this saves one full copy on every load. See IBackend.LoadGraph for the ownership contract.

Encoding routing

Encoding	Default backend route
`onnx`	`OnnxBackend` (or `MLNetBackend` if both registered, last-write-wins)
`tensorflow` / `tensorflowlite`	unwired in v0 — embedder provides their own `IBackend`
`ggml`	`LlamaSharpBackend` via `LoadByNameBackend` slot
`openvino` / `pytorch`	unwired — embedder-supplied `IBackend`
`autodetect`	whichever backend the embedder registered for it

Encodings without a registered backend reject graph.load with error-code.invalid-encoding; the host never silently routes between encodings.

Type support (v0)

FP32, FP64, U8, I32, I64 round-trip through every backend. FP16 and BF16 throw error-code.unsupported-operation — wiring them needs the .NET 8 Half type or ORT's Float16 struct exposed explicitly. Tracked for v1.

Testing

Wacs.WASI.NN.Test (and the per-backend test sibling projects) run 24 unit tests covering the SPI surface + every error path. Real-model end-to-end tests need a wasm-component fixture imports wired to wasi-nn, plus actual ONNX / GGUF model files; those land in a follow-up under Spec.Test/components/fixtures/wasi-nn-* once the fixture build pipeline catches up. CI doesn't currently provision GB-scale GGUF files; LlamaSharp generation tests will be gated behind a WACS_GGUF_MODEL_PATH env var.

Pinning

wit/wasi-nn.wit and wit/wasi-nn.witx are vendored verbatim from upstream WebAssembly/wasi-nn at commit 71320d9 (2024-10-28). The WIT package version is wasi:nn@0.2.0-rc-2024-10-28. Re-fetch via the commands in wit/deps.lock.

The legacy WITX is retained "for consistency only" per the upstream header — but real wasi-nn guests today still target it (notably most Rust crates predating the WIT cut). Both ABIs are first-class here.

Product	Versions Compatible and additional computed target framework versions.
.NET	net5.0 net5.0 was computed. net5.0-windows net5.0-windows was computed. net6.0 net6.0 was computed. net6.0-android net6.0-android was computed. net6.0-ios net6.0-ios was computed. net6.0-maccatalyst net6.0-maccatalyst was computed. net6.0-macos net6.0-macos was computed. net6.0-tvos net6.0-tvos was computed. net6.0-windows net6.0-windows was computed. net7.0 net7.0 was computed. net7.0-android net7.0-android was computed. net7.0-ios net7.0-ios was computed. net7.0-maccatalyst net7.0-maccatalyst was computed. net7.0-macos net7.0-macos was computed. net7.0-tvos net7.0-tvos was computed. net7.0-windows net7.0-windows was computed. net8.0 net8.0 is compatible. net8.0-android net8.0-android was computed. net8.0-browser net8.0-browser was computed. net8.0-ios net8.0-ios was computed. net8.0-maccatalyst net8.0-maccatalyst was computed. net8.0-macos net8.0-macos was computed. net8.0-tvos net8.0-tvos was computed. net8.0-windows net8.0-windows was computed. net9.0 net9.0 was computed. net9.0-android net9.0-android was computed. net9.0-browser net9.0-browser was computed. net9.0-ios net9.0-ios was computed. net9.0-maccatalyst net9.0-maccatalyst was computed. net9.0-macos net9.0-macos was computed. net9.0-tvos net9.0-tvos was computed. net9.0-windows net9.0-windows was computed. net10.0 net10.0 was computed. net10.0-android net10.0-android was computed. net10.0-browser net10.0-browser was computed. net10.0-ios net10.0-ios was computed. net10.0-maccatalyst net10.0-maccatalyst was computed. net10.0-macos net10.0-macos was computed. net10.0-tvos net10.0-tvos was computed. net10.0-windows net10.0-windows was computed.
.NET Core	netcoreapp3.0 netcoreapp3.0 was computed. netcoreapp3.1 netcoreapp3.1 was computed.
.NET Standard	netstandard2.1 netstandard2.1 is compatible.
MonoAndroid	monoandroid monoandroid was computed.
MonoMac	monomac monomac was computed.
MonoTouch	monotouch monotouch was computed.
Tizen	tizen60 tizen60 was computed.
Xamarin.iOS	xamarinios xamarinios was computed.
Xamarin.Mac	xamarinmac xamarinmac was computed.
Xamarin.TVOS	xamarintvos xamarintvos was computed.
Xamarin.WatchOS	xamarinwatchos xamarinwatchos was computed.

Compatible target framework(s)

Included target framework(s) (in package)

Learn more about Target Frameworks and .NET Standard.

.NETStandard 2.1
- WACS (>= 0.13.8)
- WACS.ComponentModel (>= 0.3.5)
- WACS.HostBindings.Abstractions (>= 0.3.0)
net8.0
- WACS (>= 0.13.8)
- WACS.ComponentModel (>= 0.3.5)
- WACS.HostBindings.Abstractions (>= 0.3.0)

NuGet packages (7)

Showing the top 5 NuGet packages that depend on WACS.WASI.NN:

Package	Downloads
WACS.WASI.NN.OpenVino OpenVINO backend for WACS.WASI.NN. Implements IBackend for graph-encoding.openvino via the OpenVINO.CSharp.API NuGet. Loads OpenVINO IR (xml + bin) from the wasi-nn multi-builder shape. Ships as a sibling NuGet so consumers wiring only one backend don't pull the OpenVINO native runtimes. Tracks the wasi-nn proposal at WASI Phase 2 — the WIT surface may change as the proposal evolves.
WACS.WASI.NN.OnnxRuntime ONNX Runtime backend for WACS.WASI.NN. Implements IBackend for graph-encoding.onnx via Microsoft.ML.OnnxRuntime. Ships as a sibling NuGet so consumers wiring only one backend don't pull the OnnxRuntime native binaries.
WACS.WASI.NN.MLNet ML.NET backend for WACS.WASI.NN. Hosts ONNX inference under an MLContext lifecycle so embedders can compose wasi-nn inference with the rest of their ML.NET pipeline (preprocessing transformers, custom predictors, etc.). For raw tensor inference with no pipeline integration, prefer Wacs.WASI.NN.OnnxRuntime — it avoids the Microsoft.ML transitive surface.
WACS.WASI.NN.DependencyInjection Microsoft.Extensions.DependencyInjection extension methods for WACS.WASI.NN. Registers the WasiNNBundle that the transpiler's HostPackageResolver direct-links wasi:nn imports against — symmetric with WACS.WASI.Preview2.DependencyInjection. Tracks the wasi-nn proposal at WASI Phase 2 — the WIT surface may change as the proposal evolves.
WACS.WASI.NN.LlamaSharp LlamaSharp / llama.cpp backend for WACS.WASI.NN. Implements GraphEncoding.ggml on the WasmEdge convention (load-by-name plus U8 tensors carrying UTF-8 prompt and response text). Models are resolved by name through an embedder-supplied registry — model files are GB-scale and not embedded in the wasm.

GitHub repositories

This package is not used by any popular GitHub repositories.

Version	Downloads	Last Updated
0.4.1	154	5/16/2026
0.4.0	281	5/12/2026
0.3.5	121	5/12/2026
0.3.4	250	5/11/2026
0.3.3	143	5/11/2026
0.3.1	222	5/10/2026
0.3.0	202	5/10/2026

URL: https://www.nuget.org/packages/WACS.WASI.NN/0.3.4

⇱ NuGet Gallery | WACS.WASI.NN 0.3.4