Zero-Copy GPU Inference from WebAssembly on Apple Silicon
A WebAssembly module's linear memory can be shared directly with the Apple Silicon GPU: no copies, no serialization, no intermediate buffers. Here's how the zero-copy chain works, what we measured, and what it enables for stateful AI inference.…