I found this interesting set of papers while trying to figure out what happened to PubSub. The papers cover advanced operations tools that Amazon operators use to run the site along with some research by the author. It’s interesting stuff.