// blameless.com
SRE can help ensure that teams are customer-focused, even if the best way forward breaks the rules or requires you to re-write them. Two ways SRE accomplishes this are by fostering a culture of blamelessness and using SLOs to glean insights into the customers’ experience.
// engineering.linkedin.com
A curated curriculum for aspiring site reliability engineers recently open sourced and made available on GitHub. Developed by LinkedIn engineers to equip their new SREs with the knowledge and skills to flourish before integrating with a specific team.
// research.fb.com
This paper shows how Facebook leverage differents networking infrastructure to prevent or mask any disruptions during releases. Zero Downtime Release is a collection of mechanisms used at Facebook to shield the end-users from any disruptions, preserve the cluster capacity and robustness of the infrastructure when updates are released globally.
// brennerm.github.io
At first sight AWS launch configurations and templates may seem very similar. Both allow you to define a blueprint for EC2 instances. This post takes a look at their differences and see which one we should prefer.
// github.com
A curated collection of publicly available resources on how technology and tech-savvy organizations around the world practice Site Reliability Engineering (SRE).
// blog.alexgleason.me
Cloud hosting has made it easy for anyone to run a website, but it's time to cut out the middleman. You can run a real server anywhere there's power and an internet connection. You can run a server from your home. You can run it from a datacenter in an undisclosed location.
// devblogs.microsoft.com
This post describes Microsoft's approach to helping you stay safe and up-to-date with containers — largely via their container image publishing system — and with associated guidance of the images they publish.
// trendmicro.com
Malicious actors are targeting popular DevOps technologies and finding new ways to attack containers and cloud environments. They are currently implementing security checks to try to exploit a bad implementation and escape from the container to the host or deploy a cryptocurrency miner and profit from their victims’ resources.
// suade.org
In reality most of the "super-fast" benchmarks mean very little except for a few niche use-cases. If you look at the code in detail, you'll see that they're simple "Hello, World!". As soon as you introduce any actual Python work into the code you'll see those numbers plunge.