How hard is it to add a replica of even a subset of the #InternetArchive, or to get one's own "Internet in a box" deployment going? Like, price-wise and data-access-wise.
Do I need to budget for a petabyte or two of storage? Does it necessitate an entire datacenter?