The Performance Gap
Benchmarks conducted on Llama-3-8B (~15GB).
| Orchestrator | Data Path | Ready-to-Infer Time |
|---|---|---|
| HF (Xet + Rust) | CDN → Disk → VRAM | 32.04s |
| Vajra | CDN → VRAM Direct | 16.87s |
Throughput: 7.62 Gbps with 128MB parallel chunks.
Bypassing the Disk Tax for Sovereign AI Infrastructure
Benchmarks conducted on Llama-3-8B (~15GB).
| Orchestrator | Data Path | Ready-to-Infer Time |
|---|---|---|
| HF (Xet + Rust) | CDN → Disk → VRAM | 32.04s |
| Vajra | CDN → VRAM Direct | 16.87s |
Throughput: 7.62 Gbps with 128MB parallel chunks.