SRE NEWSLETTER

Issue #12 // January 29, 2021

Improving How we Deploy GitHub
// github.blog
Over the last year GitHub has doubled the number of developers contributing to the main GitHub.com application. While this seems like a solely positive thing on the surface, the 2x increase in folks contributing to the core software exposed some problems in terms of tooling. Tooling that worked for them a year ago no longer functioned in the same capacity.
How We Improved SmashingMag Performance
// smashingmagazine.com
Smashing Magazine takes a deep look at some of the changes they've made on their site — running on JAMStack with React — to optimize the web performance and improve metrics. They includes some of the mistakes they've made, and some of the unexpected changes that helped boost all the metrics across the board.
Software Development Topics I've Changed my Mind on After 6 Years in the Industry
// chriskiehl.com
Chris lists opinions about programming, architecure, databases, and project management that 6 years have reinforced, picked up, or changed his mind.
Cloud Native Predictions for 2021 and Beyond
// aniszczyk.org
Chris Aniszczyk has a unique and privileged vantage point of cloud native trends given to all the member companies and developers he works with, so he figured he'd share his thoughts of where things will be going in 2021 and beyond.
GitOps-based Policy Management: How to Scale in a Multi-Node, Multicloud World
// weave.works
This post walks you through the common challenges faced in multi-cluster environments, and discuss how GitOps and effective policy management simplifies large-scale Kubernetes deployments anywhere.
Scaling Kubernetes to 7,500 Nodes
// openai.com
OpenAI scaled Kubernetes clusters to 7,500 nodes. Scaling a single Kubernetes cluster to this size is rarely done and requires some special care, but the upside is a simple infrastructure that allows our machine learning research teams to move faster and scale up without changing their code.
A Deeper Dive into our May 2019 Security Incident
// stackoverflow.blog
Back in May 2019, Stack Overflow had a security incident. They’re now in a position to give more detail about what happened, how it happened, and what they did to address the underlying issues that allowed the incident to occur.
After Three Decades, You Can Finally Have A Distributed SQL Database
// nextplatform.com
The Next Platform interviews Timothy Prickett Morgan, the CEO of Coachroach Labs to check in and see what the plan is for the years ahead and to take the pulse of the emerging database set.
Rust in Production: 1Password
// serokell.io
Rust has taken the programming language world by storm. In an interview with Michael Fey, VP of Engineering at 1Password, find out why they chose Rust for their product, the benefits of Rust for security-centered applications, and what cool libraries you should look into if you’re developing something similar in Rust.
Cloud DevOps With OpenShift and Jfrog
// openshift.com
How do you manage Kubernetes across the cloud providers? OpenShift, OpenShift Pipelines, and the JFrog platform achieve this goal of providing universal infrastructure and a development platform across cloud and on-premise.