txt.sour.is External profile for @<prologic https://twtxt.net/user/prologic/twtxt.txt>

Ever wondered what it would cost to self-host vs. use the cloud? Well, I often doubt myself every time I look at hardware prices, and I know I have to do some hardware refresh soon™ for the Mills DC (something I don’t have a regular plan or budget for), so here’s a rough ballpark:

The Mills DC has cost me around ~$15k to build and maintain over the last ~10 years or so. Roughly speaking. I’ve never actually taken a Bill of Materials or anything, but I could if anyone is interested in more specifics.

The equivalent resources, if run in the “Cloud”, would cost around:

~$1,000 for virtual machines
~$1,000 for storage

So around ~$2,000/month to run.

Keep this in mind anytime anyone ever tries to con you into believing “Cloud is cheaper”. It’s not.
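
For anyone who wants to see the comparison spelled out, here is a back-of-the-envelope sketch in Python. It uses only the rough figures quoted above (~$15k over ~10 years, ~$2,000/month for the cloud equivalent) and assumes nothing else, so power, bandwidth and labour are not counted. Treat it as a ballpark, not a Bill of Materials:

```python
# Back-of-the-envelope comparison using only the rough figures quoted above.
SELF_HOSTED_TOTAL = 15_000   # ~$15k to build and maintain
YEARS = 10                   # over ~10 years
CLOUD_PER_MONTH = 2_000      # ~$2,000/month cloud estimate

months = YEARS * 12
self_hosted_per_month = SELF_HOSTED_TOTAL / months
cloud_total = CLOUD_PER_MONTH * months

print(f"self-hosted: ~${self_hosted_per_month:,.0f}/month")
print(f"cloud:       ~${cloud_total:,.0f} over {YEARS} years")
print(f"ratio:       ~{cloud_total / SELF_HOSTED_TOTAL:.0f}x")
```

With these inputs that works out to roughly $125/month self-hosted against $2,000/month in the cloud, i.e. about 16x.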

prologic

twtxt.net

Sat, Sep 21 05:50 (14w ago)

↳ In-reply-to » @movq @falsifian @prologic Maybe I don't know what I'm talking about and You've probably already read this: Everything you need to know about the “Right to be forgotten” coming straight out of the EU's GDPR Website itself. It outlines the specific circumstances under which the right to be forgotten applies as well as reasons that trump the one's right to erasure ...etc.

@aelaraji@aelaraji.com This is one of the reasons why yarnd has a couple of settings with some sensible/sane defaults:

I could already imagine a couple of extreme cases where, somewhere, in this peaceful world one’s exercise of freedom of speech could get them in Real trouble (if not danger) if found out, it wouldn’t necessarily have to involve something to do with Law or legal authorities. So, If someone asks, and maybe fearing for… let’s just say ‘Their well being’, would it hurt if a pod just purged their content if it’s serving it publicly (maybe relay the info to other pods) and call it a day? It doesn’t have to be about some law/convention somewhere … 🤷 I know! Too extreme, but I’ve seen news of people who’d gone to jail or got their lives ruined for as little as a silly joke. And it doesn’t even have to be about any of this.

There are two settings:

$ ./yarnd --help 2>&1 | grep max-cache
      --max-cache-fetchers int        set maximum numnber of fetchers to use for feed cache updates (default 10)
  -I, --max-cache-items int           maximum cache items (per feed source) of cached twts in memory (default 150)
  -C, --max-cache-ttl duration        maximum cache ttl (time-to-live) of cached twts in memory (default 336h0m0s)

So yarnd pods are designed by default to keep Twts publicly visible (on the anonymous Frontpage, the Discover View, your Timeline or the feed’s Timeline) for up to 2 weeks, with a maximum of 150 items per feed, whichever is exceeded first. Any Twts beyond this are considered “old” and drop off the active cache.
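
A minimal sketch of that pruning rule in Python, for illustration only. This is not yarnd's actual code (yarnd is not written in Python); the `prune` function and the `(created, text)` tuple shape are hypothetical, and only the two defaults are taken from the help output above:

```python
from datetime import datetime, timedelta, timezone

MAX_CACHE_ITEMS = 150                  # mirrors the --max-cache-items default
MAX_CACHE_TTL = timedelta(hours=336)   # mirrors the --max-cache-ttl default (2 weeks)


def prune(twts, now, max_items=MAX_CACHE_ITEMS, max_ttl=MAX_CACHE_TTL):
    """Return the twts of one feed that stay in the active cache.

    `twts` is a list of (created, text) tuples; this shape is hypothetical.
    """
    # Drop anything older than the TTL.
    fresh = [t for t in twts if now - t[0] <= max_ttl]
    # Keep only the newest `max_items` of what is left.
    fresh.sort(key=lambda t: t[0], reverse=True)
    return fresh[:max_items]


now = datetime(2024, 9, 21, tzinfo=timezone.utc)
# 200 twts, one every 2 hours going back ~16.5 days.
feed = [(now - timedelta(hours=h), f"twt {h}") for h in range(0, 400, 2)]
kept = prune(feed, now)
print(len(kept), kept[-1][1])
```

In this example both limits bite: the TTL removes everything older than 336 hours, and the item cap then trims what remains down to the newest 150.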

It’s a feature that my old man @off_grid_living@twtxt.net was very strongly in support of, as was I back in the day of yarnd’s design (nothing particularly to do with Twtxt per se) that I’ve to this day stuck by – Even though there are some 😉 that have different views on this 🤣

prologic

twtxt.net

Sat, Sep 21 05:46 (14w ago)

↳ In-reply-to » @falsifian Do you have specifics about the GDPR law about this?

@aelaraji@aelaraji.com Thanks for this! 🙏

prologic

twtxt.net

Sat, Sep 21 01:46 (14w ago)

Bahahahaha very clever @lyse@lyse.isobeef.org I look forward to reading your report! 🤣 However…

$ yarnc debug https://twtxt.net/user/prologic/twtxt.txt | grep -E '^pqst4ea' | tee | wc -l
0

I very quickly proved that Twt was never from me 🤣

prologic

twtxt.net

Sat, Sep 21 01:19 (14w ago)

@yarn_police@twtxt.net Cool cool 🙇‍♂️

prologic

twtxt.net

Sat, Sep 21 00:39 (14w ago)

↳ In-reply-to » Heads up, @prologic! We're seeing increased spate of burglaries in your neighbourhood. Please stay alert, while we keep you safe out there.

@yarn_police@twtxt.net What’s going on?

prologic

twtxt.net

Fri, Sep 20 22:30 (14w ago)

↳ In-reply-to » Another thing: At the moment, anyone could claim that some feed contained a certain message which was then removed again by just creating the hash over the fake message in said feed and invented timestamp themselves. Nobody can ever verify that this was never the case in the first place and completely made up. So, our twt hashes have to be taken with a grain of salt.

@movq@www.uninformativ.de Yes, that’s true, they are only integrity checks. But beyond a malicious pod (ignore yarnd’s gossiping protocol for now) how does what @lyse@lyse.isobeef.org presented work exactly? 😅

prologic

twtxt.net

Fri, Sep 20 22:02 (14w ago)

↳ In-reply-to » @falsifian Do you have specifics about the GDPR law about this?

But this is no different to how jenny does things with storing every Twt in a Maildir I suppose? 🤔

prologic

twtxt.net

Fri, Sep 20 22:02 (14w ago)

↳ In-reply-to » @falsifian Do you have specifics about the GDPR law about this?

This has specifically come up before in the form of “informal complaints” against yarnd because of the way it permanently stores and archives Twts, so even if you decide you changed your mind, or deleted that line out of your feed, if my pod or @xuu or @abucci@anthony.buc.ci or @eldersnake@we.loveprivacy.club (or any other handful of pods still around?) saw the Twt, it’d be permanently archived.

prologic

twtxt.net

Fri, Sep 20 22:01 (14w ago)

↳ In-reply-to » @falsifian Do you have specifics about the GDPR law about this?

Yeah I’m curious to find out too beyond just “hearsay”. But regardless of whether we should or shouldn’t care about this, or should or shouldn’t comply: we should, IMO. I’d hate to build something that horrendously violates someone’s rights in another country.

prologic

twtxt.net

Fri, Sep 20 21:59 (14w ago)

↳ In-reply-to » Another thing: At the moment, anyone could claim that some feed contained a certain message which was then removed again by just creating the hash over the fake message in said feed and invented timestamp themselves. Nobody can ever verify that this was never the case in the first place and completely made up. So, our twt hashes have to be taken with a grain of salt.

@movq@www.uninformativ.de Care to explain how this exploit/attack works for me? 🤣

prologic

twtxt.net

Fri, Sep 20 21:58 (14w ago)

↳ In-reply-to » I've also put up this PR Add compatible methods for Index to behave as the Archiver (transition) #1177 that will act as a transition from the old naive archiver to the new bluge-based search/index. I will switch my pod over to this soon to test it before anyone else does.

Well that was bloody awful. This PR broke my pod for some strange reason, and I can’t figure out why or how 😱 The process just kept getting terminated by something, somewhere (no panic). Weird. I’ve reverted this PR for now @xuu

prologic

twtxt.net

Fri, Sep 20 20:42 (14w ago)

↳ In-reply-to » And we're back. Sorry about that 😅

Really though I only managed to save a few GB, but it’s enough for now.

prologic

twtxt.net

Fri, Sep 20 20:39 (14w ago)

↳ In-reply-to » @prologic woot, woot! Glad everything went well. I feel it faster already!

@bender@twtxt.net Haha 😛 Faster? Maybe 🤔 But yeah it’s good to have backups! (that work)

prologic

twtxt.net

Fri, Sep 20 20:38 (14w ago)

↳ In-reply-to » And we're back. Sorry about that 😅

I’ve also put up this PR Add compatible methods for Index to behave as the Archiver (transition) #1177 that will act as a transition from the old naive archiver to the new bluge-based search/index. I will switch my pod over to this soon to test it before anyone else does.

prologic

twtxt.net

Fri, Sep 20 20:37 (14w ago)

↳ In-reply-to » And we're back. Sorry about that 😅

For those curious, the archive on this pod had reached around ~22GB in size. I had to suck it down to my more powerful Mac Studio to clean it up and remove a bunch of junk. Then copy all the data back. This is what my local network traffic looked like for the last few hours 😱

prologic

twtxt.net

Fri, Sep 20 20:35 (14w ago)

And we’re back. Sorry about that 😅

prologic

twtxt.net

Fri, Sep 20 17:49 (14w ago)

↳ In-reply-to » Another thing: At the moment, anyone could claim that some feed contained a certain message which was then removed again by just creating the hash over the fake message in said feed and invented timestamp themselves. Nobody can ever verify that this was never the case in the first place and completely made up. So, our twt hashes have to be taken with a grain of salt.

@lyse@lyse.isobeef.org Hmmm I’m not sure I get what you’re getting at here. In order for this to be true, yarnd would have to be maliciously fabricating a Twt with the Hash D.

prologic

twtxt.net

Fri, Sep 20 17:12 (14w ago)

↳ In-reply-to » @movq Thanks for the summary!

i.e. there must be two versions of the Twt in the feed.

prologic

twtxt.net

Fri, Sep 20 17:12 (14w ago)

↳ In-reply-to » @movq Thanks for the summary!

@lyse@lyse.isobeef.org This is true. But the client MUST supply the original too! Or this doesn’t work 😢

prologic

twtxt.net

Fri, Sep 20 14:30 (14w ago)

↳ In-reply-to » @movq Thanks for the summary!

If OTOH your client doesn’t store individual Twts in a cache/archive or some kind of database, then verification becomes quite hard and tedious. However I think of this as an implementation detail. The spec should just call out that clients must validate/verify the edit request and that the matching hash actually exists in that feed, not how the client should implement that.

prologic

twtxt.net

Fri, Sep 20 14:29 (14w ago)

↳ In-reply-to » @movq Thanks for the summary!

@lyse@lyse.isobeef.org Yes you do. You keep both versions in your cache. They have different hashes. So you have Twt A, and a client indicates Twt B is an edit of A. Your client has already seen A and cached and archived it; now your client fetches B, which is indicated as editing A. You cache/archive B as well, but now indicate in your display that B replaces A (maybe display or link both), or just display B, or whatever. Essentially you now have both, plus an indicator of one being an edit of the other.

The right thing to do here of course is to keep A in the “thread” but display B. Why? So the thread/chain doesn’t actually break or fork (forking is a natural consequence of editing, or is it the other way around? 🤔).
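
A rough sketch of that idea in Python, just to make it concrete. The `Cache` class, its field names and the placeholder hashes are all made up for illustration; this is not how yarnd or jenny store things, and it deliberately ignores edits of edits:

```python
from dataclasses import dataclass, field


@dataclass
class Cache:
    twts: dict = field(default_factory=dict)         # hash -> text, never overwritten
    replaced_by: dict = field(default_factory=dict)  # hash of A -> hash of B

    def add(self, twt_hash, text, edits=None):
        self.twts[twt_hash] = text
        # Only record the edit if the original has actually been seen.
        if edits is not None and edits in self.twts:
            self.replaced_by[edits] = twt_hash

    def display(self, twt_hash):
        """Text to show for a twt in a thread: the edit, if there is one."""
        return self.twts[self.replaced_by.get(twt_hash, twt_hash)]


cache = Cache()
cache.add("aaaaaaa", "Helo world")                    # Twt A (placeholder hash)
cache.add("bbbbbbb", "Hello world", edits="aaaaaaa")  # Twt B, marked as an edit of A
print(cache.display("aaaaaaa"))                       # prints "Hello world"
```

Replies that reference A's hash still resolve, because A stays in the cache; only what gets displayed changes.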

prologic

twtxt.net

Fri, Sep 20 14:26 (14w ago)

↳ In-reply-to » It just occurs to me we're now building some kind of control structures or commands with (edit:…) and (delete:…) into feeds. It's not just a simple "add this to your cache" or "replace the cache with this set of messages" anymore. Hmm. We might need to think about the consequences of that, can this be exploited somehow, etc.

@lyse@lyse.isobeef.org I’m all for dropping delete btw, or at least not making it mandatory, as in “clients should” rather than “clients must”. But yes I agree, let’s explore all the possible ways this can be exploited (if at all).

prologic

twtxt.net

Fri, Sep 20 14:25 (14w ago)

↳ In-reply-to » Trying to sum up the current proposal (keeping hashes):

@movq@www.uninformativ.de I think not.

What about edits of edits? Do we want to “chain” edits or does the latest edit simply win?

This gets too complicated if we start to support this kind of nonsense 🤣

prologic

twtxt.net

Fri, Sep 20 14:24 (14w ago)

↳ In-reply-to » @falsifian I think we’re talking about different ideas here. 🤔

@movq@www.uninformativ.de Thank you! 🙏

prologic

twtxt.net

Fri, Sep 20 14:22 (14w ago)

↳ In-reply-to » Another thing: At the moment, anyone could claim that some feed contained a certain message which was then removed again by just creating the hash over the fake message in said feed and invented timestamp themselves. Nobody can ever verify that this was never the case in the first place and completely made up. So, our twt hashes have to be taken with a grain of salt.

@lyse@lyse.isobeef.org Walk me through this? 🤔 I get what you’re saying, but I’m too stupid to be a “hacker” 🤣

prologic

twtxt.net

Fri, Sep 20 14:20 (14w ago)

↳ In-reply-to » @lyse

But yes, at the end of the day if the edit request is invalid or cannot be verified, it should be ignored and treated as “malicious”.

prologic

twtxt.net

Fri, Sep 20 14:19 (14w ago)

↳ In-reply-to » @lyse

@lyse@lyse.isobeef.org @movq@www.uninformativ.de So a client that has the idea of a cache/archive wouldn’t necessarily have to re-check that the Twt being marked as “edited” belongs to that feed or not, the client would already know that for sure. At least this is how yarnd works and I’m sure jenny can make similar assertions too.

prologic

twtxt.net

Fri, Sep 20 14:17 (14w ago)

↳ In-reply-to » @david Thanks, that's good feedback to have. I wonder to what extent this already exists in registry servers and yarn pods. I haven't really tried digging into the past in either one.

@lyse@lyse.isobeef.org @falsifian@www.falsifian.org Contributions to search.twtxt.net, which runs yarns (not to be confused with yarnd) are always welcome 🤗 – I don’t have as much “spare time” as I used to due to the nature of my job (Staff Engineer); but I try to make improvements every now and again 💪

prologic

twtxt.net

Fri, Sep 20 14:15 (14w ago)

↳ In-reply-to » @prologic Do you have a link to some past discussion?

@falsifian@www.falsifian.org You make good points though, I made similar arguments about this too back in the day. Twtxt v2 / Yarn.social being at least ~4 years old now 😅

prologic

twtxt.net

Fri, Sep 20 14:14 (14w ago)

↳ In-reply-to » @prologic Do you have a link to some past discussion?

@falsifian@www.falsifian.org Do you have specifics about the GDPR law about this?

Would the GDPR apply to a one-person client like jenny? I seriously hope not. If someone asks me to delete an email they sent me, I don’t think I have to honour that request, no matter how European they are.

I’m not sure myself now. So let’s find out whether parts of the GDPR actually apply to a truly decentralised system? 🤔

prologic

twtxt.net

Fri, Sep 20 14:11 (14w ago)

↳ In-reply-to » @lyse I don't think this is true.

LOL 😂 This:

anyone could claim that some feed contained a certain message which was then removed again by just creating the hash over the fake message in said feed and invented timestamp themselves

I’d like to see a step-by-step reproduction of this. I don’t buy it 🤣

Admittedly yarnd had a few implementation security bugs, but I’m not sure this is actually possible, unless I’m missing something? 🤔

prologic

twtxt.net

Fri, Sep 20 14:08 (14w ago)

↳ In-reply-to » And they have arrived (well, they did around 3 hours ago, LOL). Buttery smooth, my 16 Pro (one with dark cover). It took a bit over an hour to transfer all my data.

@david@collantes.us Very nice! 👍

prologic

twtxt.net

Fri, Sep 20 08:12 (14w ago)

↳ In-reply-to » @movq Thanks for the summary!

@movq@www.uninformativ.de Ok 😅

prologic

twtxt.net

Fri, Sep 20 07:58 (14w ago)

↳ In-reply-to » Okay, the recently implemented --fetch-context, which asks a Yarn pod for a twt, wouldn’t break, but jenny would not be able anymore to verify that it actually got the correct twt. That’s a concrete example where we would lose functionality.

@movq@www.uninformativ.de Hmmm not sure what I was thinking, sorry 🤦‍♂️ been a long day 😂

prologic

twtxt.net

Fri, Sep 20 07:46 (14w ago)

↳ In-reply-to » @movq Thanks for the summary!

@movq@www.uninformativ.de Am I missing something? 😅

prologic

twtxt.net

Fri, Sep 20 07:19 (14w ago)

↳ In-reply-to » (replyto http://darch.dk/twtxt.txt 2024-09-15T12:50:17Z) @sorenpeter I like this idea. Just for fun, I'm using a variant in this twt. (Also because I'm curious how it non-hash subjects appear in jenny and yarn.)

@movq@www.uninformativ.de Precisely 👌

prologic

twtxt.net

Fri, Sep 20 07:18 (14w ago)

↳ In-reply-to » @movq Thanks for the summary!

@movq@www.uninformativ.de Isn’t it? You read each Twt and compute its hash. It’s a simple O(1) lookup of the hash in that feed or your cache/archive, right?
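
Roughly like this, as a sketch in Python. It assumes the Twt Hash scheme is blake2b-256 over the feed URL, timestamp and text joined by newlines, base32-encoded (lowercase, no padding), keeping the last 7 characters; check the extension spec before relying on that. The feed URL and twts below are made up:

```python
import base64
import hashlib


def twt_hash(feed_url, timestamp, text):
    # Assumed scheme: blake2b-256 of "url LF timestamp LF text",
    # base32 (lowercase, padding stripped), last 7 characters.
    payload = f"{feed_url}\n{timestamp}\n{text}".encode("utf-8")
    digest = hashlib.blake2b(payload, digest_size=32).digest()
    encoded = base64.b32encode(digest).decode("ascii").lower().rstrip("=")
    return encoded[-7:]


FEED = "https://example.com/twtxt.txt"   # hypothetical feed URL
lines = [
    ("2024-09-20T07:18:00Z", "Hello"),
    ("2024-09-20T07:19:00Z", "World"),
]

# One pass over the feed builds the index (O(n), done once per fetch).
index = {twt_hash(FEED, ts, text): text for ts, text in lines}

# After that, checking whether a hash belongs to the feed is a dict lookup,
# which is O(1) on average.
wanted = twt_hash(FEED, "2024-09-20T07:18:00Z", "Hello")
print(wanted in index)
```

The lookup itself is cheap; the cost is in reading the feed (or archive) once to build the index.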

prologic

twtxt.net

Fri, Sep 20 02:52 (14w ago)

👋 Reminder that next Saturday 28th September will be our monthly online meetup! Hope to see some/all of you there 👌

prologic

twtxt.net

Thu, Sep 19 22:35 (14w ago)

↳ In-reply-to » @prologic Hi. i have noticed sometimes when i hit the back button i lose all the surrounding layout and just have a list of twts.

I’ll try to reproduce locally later tonight

prologic

twtxt.net

Thu, Sep 19 21:16 (14w ago)

↳ In-reply-to » Location Addressing is fine in smaller or single systems. But when you're talking about large decentralised systems with no single point of control (kind of the point) things like independently verifiable integrity become quite important.

@lyse@lyse.isobeef.org I don’t think this is true.

prologic

twtxt.net

Thu, Sep 19 21:13 (14w ago)

↳ In-reply-to » @prologic I get where you're coming from. But is it really that bad in practice? If you follow any link somewhere in the web, you also don't know if its contents has been changed in the meantime. Is that a problem? Almost never in my experience.

@lyse@lyse.isobeef.org No, that’s never a problem because we really only want to “navigate” the web anyway, not form threads of conversation 🤣

prologic

twtxt.net

Thu, Sep 19 21:09 (14w ago)

↳ In-reply-to » Okay, the recently implemented --fetch-context, which asks a Yarn pod for a twt, wouldn’t break, but jenny would not be able anymore to verify that it actually got the correct twt. That’s a concrete example where we would lose functionality.

@movq@www.uninformativ.de This approach also wouldn’t work when that feed gets archived, so you’d be forced to crawl archived feeds at that point.

prologic

twtxt.net

Thu, Sep 19 20:55 (14w ago)

↳ In-reply-to » Trying to sum up the current proposal (keeping hashes):

The important bits missing from this summary (devil is in the details) are three requirements:

Clients should order Twts by their timestamp.
Clients must validate, for all edit and delete requests, that the hash being indicated belongs to and came from that feed.
Clients should honour delete requests and delete Twts from their cache/archive.
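
The validation requirement could look something like the sketch below (Python, for illustration only). The function names, the `archive` shape (feed URL mapped to the set of Twt hashes seen from that feed) and the example URLs are hypothetical, not taken from any existing client:

```python
def validate_request(kind, target_hash, requesting_feed, archive):
    """Accept an edit/delete only if `target_hash` is known to have come
    from `requesting_feed`. `archive` maps feed URL -> set of hashes seen."""
    if kind not in ("edit", "delete"):
        return False
    return target_hash in archive.get(requesting_feed, set())


def apply_delete(target_hash, requesting_feed, archive):
    """Honour a delete request, but only once it has been validated."""
    if validate_request("delete", target_hash, requesting_feed, archive):
        archive[requesting_feed].discard(target_hash)
        return True
    return False  # invalid or unverifiable requests are ignored


archive = {
    "https://example.com/alice.txt": {"aaaaaaa", "bbbbbbb"},
    "https://example.com/bob.txt": {"ccccccc"},
}

# Bob's feed cannot delete a Twt that came from Alice's feed.
print(apply_delete("aaaaaaa", "https://example.com/bob.txt", archive))    # False
# Alice's own feed can.
print(apply_delete("aaaaaaa", "https://example.com/alice.txt", archive))  # True
```

The key design point is that the check is keyed on the feed the request came from, so a request naming a hash from some other feed simply fails validation.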

prologic

twtxt.net

Thu, Sep 19 20:52 (14w ago)

↳ In-reply-to » @movq Thanks for the summary!

@lyse@lyse.isobeef.org This is why hashes provide that level of integrity. The hash can be verified in the cache or archive as belonging to said feed.
