Async ML Inference on Apple Silicon

⚓ Rust 📅 2026-03-30 👤 surdeus 👁️ 8

Info

This post is auto-generated from RSS feed The Rust Programming Language Forum - Latest topics. Source: Async ML Inference on Apple Silicon

I've just open-sourced batch_forge, a specialized inference runner for the JAX/Equinox \ ecosystem. The project is written in pure Rust and leverages metal-rs for custom compute kernels. Current features include a zero-copy Safetensors loader, an async tokio request manager, and stateful attention state management. Feedback on the memory-mapped loader and Metal buffer synchronization would be greatly appreciated.

1 post - 1 participant

Read full topic

🏷️ Rust_feed

👍 󠁮󠁮󠁮󠁮 👎 󠁮󠁮󠁮󠁮