> IOTA (subnet 9) ran a 100B parameter model using 48 single A100-80GB GPUs that were distributed (non-colocated, across multiple providers and multiple datacenters) and connected only over the internet (no datacenter fabric) > achieved 30.8% average MFU, which is impressive because frontier labs ru
@kelxyz_·Jun 1, 2026
Read article
