WIO: Upload-Enabled Computational Storage on CXL SSDs

arXiv:2604.02442v1 Announce Type: new
Abstract: The widening gap between processor speed and storage latency has made data movement a dominant bottleneck in modern systems. Two lines of storage-layer innovation attempted to close this gap: persistent memory shortened the latency hierarchy, while computational storage devices pushed processing toward the data. Neither has displaced conventional NVMe SSDs at scale, largely due to programming complexity, ecosystem fragmentation, and thermal/power cliffs under sustained load. We argue that storage-side compute should be emph{reversible}: computation should migrate dynamically between host and device based on runtime conditions. We present sys, which realizes this principle on CXL SSDs by decomposing I/O-path logic into migratable emph{storage actors} compiled to WebAssembly. Actors share state through coherent CXL.mem regions; an agility-aware scheduler migrates them via a zero-copy drain-and-switch protocol when thermal or power constraints arise. Our evaluation on an FPGA-based CXL SSD prototype and two production CSDs shows that sys turns hard thermal cliffs into elastic trade-offs, achieving up to 2$times$ throughput improvement and 3.75$times$ write latency reduction without application modification.

Liked Liked