@eldersnake@we.loveprivacy.club This was an interesting read for sure! š I donāt think it had anything I hadnāt already considered in terms of the ethical/moral points of view. Iām not sure where I stand myself either to be honest. Iāve forced myself to get familiar with the ecosystem and tooling, because in my line of work as a tech lead (staff engineer in sre) you donāt want to be that one guy that ya know š Ethically/Morally though, Iām definitely with the sentiment of this post š Much like the whole Crypto hype yaers back (if yāall remember?!) this is also one of the most energy hungry pieces of ātechā (if you can call it that?) in a while. Then thereās these other issues āstealing peopleās workā, āreliance is causing humans to become cognitively weak and neural connections to shrinkā, to name a fewā¦
Move beyond basic threshold alerts! Define clear Service Level Objectives (SLOs) and measure Service Level Indicators (SLIs) to track real user impact. Use Prometheus to alert when your SLOs are at risk, ensuring you focus on what truly matters to your users. #Monitoring #SRE #Prometheus
@bender@twtxt.net Bahahah š¤£š mate, me and one of my SRE colleagues actually came up with the terminology ourselves! š
This is an example of what I believe every SRE should master and whatever Post Incident Review (PIR) should focus on. Where did the system fail. What are the missing or incomplete Safety Controls.