Searching txt.sour.is

Twts matching #SRE
Sort by: Newest, Oldest, Most Relevant

@eldersnake@we.loveprivacy.club This was an interesting read for sure! šŸ‘ I don’t think it had anything I hadn’t already considered in terms of the ethical/moral points of view. I’m not sure where I stand myself either to be honest. I’ve forced myself to get familiar with the ecosystem and tooling, because in my line of work as a tech lead (staff engineer in sre) you don’t want to be that one guy that ya know šŸ˜‰ Ethically/Morally though, I’m definitely with the sentiment of this post šŸ˜… Much like the whole Crypto hype yaers back (if y’all remember?!) this is also one of the most energy hungry pieces of ā€œtechā€ (if you can call it that?) in a while. Then there’s these other issues ā€œstealing people’s workā€, ā€œreliance is causing humans to become cognitively weak and neural connections to shrinkā€, to name a few…

⤋ Read More
In-reply-to » grafana is confusing af i deployed it again for my job (that is so wild to say...) and i'm like HOW DO THESE ALERTS WORK

Move beyond basic threshold alerts! Define clear Service Level Objectives (SLOs) and measure Service Level Indicators (SLIs) to track real user impact. Use Prometheus to alert when your SLOs are at risk, ensuring you focus on what truly matters to your users. #Monitoring #SRE #Prometheus

⤋ Read More
In-reply-to » I am sure it wasn’t your intention (not even remotely), but it sounds a lot like corporate bullshit. Hahahaha! Are you sure you haven’t been institutionalised?

@bender@twtxt.net Bahahah šŸ¤£šŸ˜‚ mate, me and one of my SRE colleagues actually came up with the terminology ourselves! šŸ˜›

⤋ Read More
In-reply-to » This weekend (as some of you may now) I accidently nuke this Pod's entire data volume šŸ¤¦ā€ā™‚ļø What a disastrous incident 🤣 I decided instead of trying to restore from a 4-month old backup (we'll get into why I hadn't been taking backups consistently later), that we'd start a fresh! šŸ˜… Spring clean! 🧼 -- Anyway... One of the things I realised was I was missing a very critical Safety Controls in my own ways of working... I've now rectified this...

This is an example of what I believe every SRE should master and whatever Post Incident Review (PIR) should focus on. Where did the system fail. What are the missing or incomplete Safety Controls.

⤋ Read More