For those curious, the archive on this pod had reached ~22GB in size. I had to suck it down to my more powerful Mac Studio to clean it up and remove a bunch of junk, then copy all the data back. This is what my local network traffic looked like for the last few hours 😱
And we’re back. Sorry about that 😅
@lyse@lyse.isobeef.org Hmmm I’m not sure I get what you’re getting at here. In order for this to be true, yarnd would have to be maliciously fabricating a Twt with the Hash D.
i.e: there must be two versions of the Twt in the feed.
@lyse@lyse.isobeef.org This is true. But the client MUST supply the original too! Or this doesn’t work 😢
If OTOH your client doesn’t store individual Twts in a cache/archive or some kind of database, then verification becomes quite hard and tedious. However, I think of this as an implementation detail. The spec should just call out that clients must validate/verify that the hash an edit request indicates actually exists in that feed, not how the client should implement that.
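To illustrate, a minimal sketch of that validation in Python (the cache layout and names here are hypothetical, purely for illustration):

cache = {}  # feed URL -> {twt hash: twt content}

def verify_request(feed_url, target_hash):
    # An edit/delete request is only valid if the hash it points at
    # actually exists in our cached/archived copy of that same feed.
    return target_hash in cache.get(feed_url, {})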
@lyse@lyse.isobeef.org Yes you do. You keep both versions in your cache. They have different hashes. So you have Twt A, a client indicates Twt B is an edit of A, your client has already seen A and cached and archived it, and now your client fetches B, which is indicated as an edit of A. You cache/archive B as well, but now indicate in your display that B replaces A (maybe display/link both) or just display B or whatever. But essentially you now have both, plus an indicator of one being an edit of the other.
The right thing to do here of course is to keep A in the “thread” but display B. Why? So the thread/chain doesn’t actually break or fork (forking is a natural consequence of editing, or is it the other way around? 🤔).
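A rough sketch of that bookkeeping (all names illustrative, not from any actual client):

archive = {}      # twt hash -> twt content (nothing is ever thrown away)
replaced_by = {}  # hash of A -> hash of B

def apply_edit(old_hash, new_hash, new_content):
    archive[new_hash] = new_content
    replaced_by[old_hash] = new_hash

def display(twt_hash):
    # Follow the edit chain so the thread keeps A's position but shows B.
    while twt_hash in replaced_by:
        twt_hash = replaced_by[twt_hash]
    return archive.get(twt_hash)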
(edit:…) and (delete:…) into feeds. It's not just a simple "add this to your cache" or "replace the cache with this set of messages" anymore. Hmm. We might need to think about the consequences of that, can this be exploited somehow, etc.
@lyse@lyse.isobeef.org I’m all for dropping delete btw, or at least not making it mandatory, as in “clients should” rather than “clients must”. But yes I agree, let’s explore all the possible ways this can be exploited (if at all).
@movq@www.uninformativ.de I think not.
What about edits of edits? Do we want to “chain” edits or does the latest edit simply win?
This gets too complicated if we start to support this kind of nonsense 🤣
@movq@www.uninformativ.de Thank you! 🙏
@lyse@lyse.isobeef.org Walk me through this? 🤔 I get what you’re saying, but I’m too stupid to be a “hacker” 🤣
But yes, at the end of the day, if the edit request is invalid or cannot be verified, it should be ignored and treated as “malicious”.
@lyse@lyse.isobeef.org @movq@www.uninformativ.de So a client that has the idea of a cache/archive wouldn’t necessarily have to re-check whether the Twt being marked as “edited” belongs to that feed or not; the client would already know that for sure. At least this is how yarnd works, and I’m sure jenny can make similar assertions too.
@lyse@lyse.isobeef.org @falsifian@www.falsifian.org Contributions to search.twtxt.net, which runs yarns (not to be confused with yarnd), are always welcome 🤗 – I don’t have as much “spare time” as I used to due to the nature of my job (Staff Engineer); but I try to make improvements every now and again 💪
@falsifian@www.falsifian.org You make good points though, I made similar arguments about this too back in the day. Twtxt v2 / Yarn.social being at least ~4 years old now 😅
@falsifian@www.falsifian.org Do you have specifics about the GDPR law on this?
Would the GDPR apply to a one-person client like jenny? I seriously hope not. If someone asks me to delete an email they sent me, I don’t think I have to honour that request, no matter how European they are.
I’m not sure myself now. So let’s find out whether parts of the GDPR actually apply to a truly decentralised system? 🤔
LOL 😂 This:
anyone could claim that some feed contained a certain message which was then removed again by just creating the hash over the fake message in said feed and invented timestamp themselves
I’d like to see a step-by-step reproduction of this. I don’t buy it 🤣
Admittedly yarnd had a few implementation security bugs, but I’m not sure this is actually possible, unless I’m missing something? 🤔
@david@collantes.us Very nice! 👍
--fetch-context, which asks a Yarn pod for a twt, wouldn’t break, but jenny would no longer be able to verify that it actually got the correct twt. That’s a concrete example where we would lose functionality.
@movq@www.uninformativ.de Hmmm not sure what I was thinking, sorry 🤦♂️ been a long day 😂
@movq@www.uninformativ.de Am I missing something? 😅
@movq@www.uninformativ.de Precisely 👌
@movq@www.uninformativ.de Isn’t it? You read each Twt and compute its hash. It’s a simple O(1) lookup of the hash in that feed or your cache/archive, right?
👋 Reminder that next Saturday 28th September will be our monthly online meetup! Hope to see some/all of you there 👌
I’ll try to reproduce locally later tonight
@lyse@lyse.isobeef.org I don’t think this is true.
@lyse@lyse.isobeef.org No, that’s never a problem, because we really only want to “navigate” the web anyway, not form threads of conversation 🤣
--fetch-context, which asks a Yarn pod for a twt, wouldn’t break, but jenny would no longer be able to verify that it actually got the correct twt. That’s a concrete example where we would lose functionality.
@movq@www.uninformativ.de This approach also wouldn’t work once that feed gets archived; you’d be forced to crawl archived feeds at that point.
The important bits missing from this summary (devil is in the details) are these requirements:
- Clients should order Twts by their timestamp.
- Clients must validate all edit and delete requests, checking that the indicated hash actually belongs to and came from that feed.
- Clients should honour delete requests and delete Twts from their cache/archive.
@lyse@lyse.isobeef.org This is why hashes provide that level of integrity. The hash can be verified in the cache or archive as belonging to said feed.
@movq@www.uninformativ.de I think the order of the lines in a feed doesn’t matter as long as we can guarantee the order of Twts. Clients should already be ordering by timestamp anyway.
@movq@www.uninformativ.de Pretty much 👌
@lyse@lyse.isobeef.org Sorry, could you explain this differently?
Do you know what you clicked on before going back?
@eldersnake@we.loveprivacy.club Sweet thank you! 🙇♂️ I’ll merge this PR tonight I think.
@david@collantes.us I think we can!
e.g: Shutdown yarnd and cp -a yarn.db yarn.db.bak before testing this PR/branch.
Can I get someone like maybe @xuu@txt.sour.is or @abucci@anthony.buc.ci or even @eldersnake@we.loveprivacy.club – if you have some spare time – to test this yarnd PR that upgrades the Bitcask dependency for its internal database to v2? 🙏
VERY IMPORTANT: If you do, Please Please Please backup your yarn.db database first! 😅 Heaven knows I don’t want to be responsible for fucking up a production database here or there 🤣
nevermind; I think this might be some changes internally in Go 1.23 and a dependency I needed to update 🤞
Can someone much smarter than me help me figure out a couple of newly discovered deadlocks in yarnd that I think have always been there, but were only recently uncovered by the Go 1.23 compiler?
Location Addressing is fine in smaller or single systems. But when you’re talking about large decentralised systems with no single point of control (kind of the point), things like independently verifiable integrity become quite important.
What is being proposed as a counter to content-addressing is called location-addressing. Two very different approaches, both with pros/cons of course. But a location cannot be verified; the content cannot be guaranteed to be authentic in any way, you just have to implicitly trust that the location points to the right thing.
For example, without content-addressing, you’d never have been able to find, let alone pull up, that ~3yr old Twt of mine (my very first). Hell, I even thought I’d lost my first feed file, or that it became corrupted or something 🤣 – If that were the case, it would actually be possible to reconstruct the feed and verify every single Twt against the caches of all of you 🤣
@david@collantes.us I really think articles like this explain the benefits far better than I can.
@david@collantes.us Oh ! 🤦♂️
@david@collantes.us Without including the content, it’s no longer really “content addressing” now, is it? You’re essentially only addressing, say, nick+timestamp or url+timestamp.
Speaking of AI tech (sorry!); just came across this really cool tool built by some engineers at Google™ (currently completely free to use without any signup) called NotebookLM 👌 Looks really good for summarizing and talking to documents 📃
@eldersnake@we.loveprivacy.club Yeah I’m looking forward to that myself 🤣 It’ll be great to see the technology grow to a level of maturity and efficiency where you can run the tools on your own PC or device and use them for what, so far, I’ve found them to be somewhat decent at: Auto-Complete, Search and Q&A.
@sorenpeter@darch.dk I really don’t think we can ignore the last ~3 years and a bit of this threading model working quite well for us as a community across a very diverse set of clients and platforms. We cannot just drop something that “mostly works just fine” for the sake of “simplicity”. We have to weigh up all the options. There are very real benefits to using content addressing here that IMO really shouldn’t be disregarded so lightly, and that provide a lot of implicit value which users of various clients just don’t get to see. I’d recommend reading up on the ideas behind content addressing before simply dismissing the Twt Hash spec entirely; it wasn’t even written or formalised by me, but I understand how it works quite well 😅 The guy that wrote the spec was (is?) way smarter than I was back then, probably still is now 🤣
@falsifian@www.falsifian.org Right I see. Yeah maybe we want to avoid that 🤣 I do kind of tend to agree with @xuu@txt.sour.is in another thread that there isn’t actually anything wrong with our use of Blake2 at all really, but we may want to consider all our options.
@xuu@txt.sour.is I don’t think this is a lextwt problem tbh. Just the Markdown parser that yarnd currently uses. twtxt2html uses Goldmark and appears to behave better 🤣
@xuu@txt.sour.is Long while back, I experimented with using similarity algorithms to detect if two Twts were similar enough to be considered an “Edit”.
Right, I see what you mean @xuu@txt.sour.is – can you maybe come up with a fully fleshed out proposal for this? 🤔 This will help solve the problem of hash collisions that result from the Twt/hash space growing larger over time, without us having to change anything about the way we construct hashes in the first place. We just assume spec-compliant clients will dynamically handle this as the space grows.
abcdef0123456789... – any substring of that hash after the first 6 characters will match, so abcdef, abcdef012 and abcdef0123456 all match the same Twt. In the case of a collision, I think we decided on matching the newest, since we archive off older threads anyway. The third rule was about growing the minimum hash size after some threshold of collisions were detected.
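If I understand the idea, the matching rule might look roughly like this (hypothetical helper; assumes the cache maps each full hash to its Twt’s timestamp):

MIN_PREFIX = 6

def resolve(prefix, cache):
    # cache: full hash -> RFC3339 timestamp of that Twt
    if len(prefix) < MIN_PREFIX:
        return None
    matches = [h for h in cache if h.startswith(prefix)]
    if not matches:
        return None
    # On collision, prefer the newest Twt (older threads get archived off).
    # RFC3339 timestamps with a consistent offset sort lexicographically.
    return max(matches, key=lambda h: cache[h])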
@xuu@txt.sour.is I think we never progressed this idea further because we weren’t sure how to tell if a hash collision would occur in the first place right? In other words, how does Client A know to expand a hash vs. Client B in a 100% decentralised way? 🤔
Plus these so-called “LLM”(s) have a pretty good grasp of the “shape” of language, so they appear to be quite intelligent or produce intelligible responses (when they’re actually quite stupid really).
@eldersnake@we.loveprivacy.club You don’t get left behind at all 🤣 It’s hyped up so much, it’s not even funny anymore. Basically at this point (so far at least) I’ve concluded that all this GenAI / LLM stuff is just a fancy auto-complete and indexing + search reinvented 🤣
@bender@twtxt.net This is the different Markdown parsers being used. Goldmark vs. gomarkdown. We need to switch to Goldmark 😅
@quark@ferengi.one I’m guessing the quoted text should’ve been emphasized?
@slashdot@feeds.twtxt.net NahahahahHa 🤣 So glad I don’t use LinkedIn 🤦♂️
@falsifian@www.falsifian.org No you don’t, sorry. But I tend to agree with you, and I think if we continue to use hashes we should keep the remainder in mind as we choose truncation values of N.
@falsifian@www.falsifian.org Mostly because Git uses it 🤣 Known attacks that would affect our use? 🤔
@xuu@txt.sour.is I don’t recall where that discussion ended up being though?
@bender@twtxt.net wut da fuq?! 🤣
@xuu@txt.sour.is you mean my original idea of basically just automatically detecting Twt edits from the client side?
(delete: 5vbi2ea) .. would it delete someone else’s twt?
@xuu@txt.sour.is This is where you would need to prove that the edit or delete request actually came from that feed author. Hence why integrity is much more important here.
@falsifian@www.falsifian.org Without supporting deletes properly though, you’re running into GDPR issues and the right to be forgotten 🤣 We’ve had pretty lengthy discussions about this in past years as well, but we never came to a conclusion we’re all happy with.
@movq@www.uninformativ.de It would work, you are right. However, it has drawbacks, and I think in the long term it would create a new set of problems that we would also then have to solve.
@david@collantes.us Hah 🤣
@david@collantes.us We’ll get there soon™ 🔜
@david@collantes.us Hah Welcome back! 😅
Finally @lyse@lyse.isobeef.org’s idea of updating metadata changes in a feed “inline” where the change happened (with respect to other Twts in whatever order the file is written in) is used to drive things like “Oh this feed now has a new URI, let’s use that from now on as the feed’s identity for the purposes of computing Twt hashes”. This could extend to # nick = as a preferential indicator to clients, as well as even other updates such as # description = – not just # url =.
Likewise we could also support delete:229d24612a2, which would indicate to clients that fetch the feed to delete any cached Twt matching the hash 229d24612a2, if the author wishes to “unpublish” that Twt permanently, rather than just deleting the line from the feed (which does nothing for clients really).
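In client terms, honouring such a request might look roughly like this (a sketch only; the cache layout is hypothetical):

def handle_delete(feed_url, target_hash, cache):
    # cache: feed URL -> {twt hash: twt content}
    feed = cache.get(feed_url, {})
    if target_hash not in feed:
        return False  # hash never belonged to this feed: ignore as malicious
    del feed[target_hash]  # "unpublish": drop it from the cache/archive too
    return True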
An alternate idea for supporting (properly) Twt Edits is to denote them as such and extend the meaning of a Twt Subject (which would need to be called something better?); for example, let’s say I produced the following Twt:
2024-09-18T23:08:00+10:00 Hllo World
And my feed’s URI is https://example.com/twtxt.txt. The hash for this Twt is therefore 229d24612a2:
$ echo -n "https://example.com/twtxt.txt\n2024-09-18T23:08:00+10:00\nHllo World" | sha1sum | head -c 11
229d24612a2
You wish to correct your mistake, so you make an amendment to that Twt like so:
2024-09-18T23:10:43+10:00 (edit:#229d24612a2) Hello World
Which would then have a new Twt hash value of 026d77e03fa:
$ echo -n "https://example.com/twtxt.txt\n2024-09-18T23:10:43+10:00\nHello World" | sha1sum | head -c 11
026d77e03fa
Clients would then take this edit:#229d24612a2 to mean: this Twt is an edit of 229d24612a2 and should be replaced in the client’s cache, or indicated as such to the user as the intended content.
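The same worked example in Python (a sketch; this assumes the \n in the shell commands above are interpreted as real newlines):

import hashlib

def proposed_hash(url, timestamp, content):
    payload = f"{url}\n{timestamp}\n{content}".encode("utf-8")
    return hashlib.sha1(payload).hexdigest()[:11]

old = proposed_hash("https://example.com/twtxt.txt",
                    "2024-09-18T23:08:00+10:00", "Hllo World")
new = proposed_hash("https://example.com/twtxt.txt",
                    "2024-09-18T23:10:43+10:00", "Hello World")
print(old, new)  # expected: 229d24612a2 026d77e03fa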
@bender@twtxt.net Just replace the echo with something like pbpaste or similar. You’d just need to shell-escape things like " and such. That’s all. Alternatively you can shove the 3 lines into a small file and cat file.txt | ...
With a SHA1 encoding the probability of a hash collision becomes, at various k (number of twts):
>>> import math
>>>
>>> def collision_probability(k, bits):
... n = 2 ** bits # Total unique hash values based on the number of bits
... probability = 1 - math.exp(- (k ** 2) / (2 * n))
... return probability * 100 # Return as percentage
...
>>> # Example usage:
>>> k_values = [100000, 1000000, 10000000]
>>> bits = 44 # Number of bits for the hash
>>>
>>> for k in k_values:
... print(f"Probability of collision for {k} hashes with {bits} bits: {collision_probability(k, bits):.4f}%")
...
Probability of collision for 100000 hashes with 44 bits: 0.0284%
Probability of collision for 1000000 hashes with 44 bits: 2.8022%
Probability of collision for 10000000 hashes with 44 bits: 94.1701%
>>> bits = 48
>>> for k in k_values:
... print(f"Probability of collision for {k} hashes with {bits} bits: {collision_probability(k, bits):.4f}%")
...
Probability of collision for 100000 hashes with 48 bits: 0.0018%
Probability of collision for 1000000 hashes with 48 bits: 0.1775%
Probability of collision for 10000000 hashes with 48 bits: 16.2753%
>>> bits = 52
>>> for k in k_values:
... print(f"Probability of collision for {k} hashes with {bits} bits: {collision_probability(k, bits):.4f}%")
...
Probability of collision for 100000 hashes with 52 bits: 0.0001%
Probability of collision for 1000000 hashes with 52 bits: 0.0111%
Probability of collision for 10000000 hashes with 52 bits: 1.1041%
>>>
If we adopted this scheme, we would have to increase the no. of characters (first N) from 11 to 12 and finally 13 as we approach a large enough number of Twts across the space. I think at the last full crawl/scrape it was around ~500k (maybe)? https://search.twtxt.net/ says only ~99k.
@quark@ferengi.one My money is on a SHA1SUM hash encoding to keep things much simpler:
$ echo -n "https://twtxt.net/user/prologic/twtxt.txt\n2020-07-18T12:39:52Z\nHello World! 😊" | sha1sum | head -c 11
87fd9b0ae4e
I think it was a mistake to take the last n base32 encoded characters of the blake2b 256bit encoded hash value. It should have been the first n, where n >= 7.
Taking the last n characters of a base32 encoded hash instead of the first n can be problematic for several reasons:
- Hash Structure: Hashes are typically designed so that their outputs have specific statistical properties. The first few characters often have more entropy or variability, meaning they are less likely to have patterns. The last characters may not maintain this randomness, especially if the encoding method has a tendency to produce less varied endings.
- Collision Resistance: When using hashes, the goal is to minimize the risk of collisions (different inputs producing the same output). By using the first few characters, you leverage the full distribution of the hash. The last characters may not distribute in the same way, potentially increasing the likelihood of collisions.
- Encoding Characteristics: Base32 encoding has a specific structure and padding that might influence the last characters more than the first. If the data being hashed is similar, the last characters may be more similar across different hashes.
- Use Cases: In many applications (like generating unique identifiers), the beginning of the hash is often the most informative and varied. Relying on the end might reduce the uniqueness of generated identifiers, especially if a prefix has a specific context or meaning.
In summary, using the first n characters generally preserves the intended randomness and collision resistance of the hash, making it a safer choice in most cases.
@quark@ferengi.one Bloody good question 🙋 God only knows 🤣
@movq@www.uninformativ.de Haha 😝
What I was referring to in the OP: Sometimes I check the work phone simply out of curiosity. 😂
@movq@www.uninformativ.de Fair 👌
Current Twt Hash spec and probability of hash collision:
The probability of a Twt Hash collision depends on the size of the hash and the number of possible values it can take. For the Twt Hash, which uses a Blake2b 256-bit hash, Base32 encoding, and takes the last 7 characters, the space of possible hash values is significantly reduced.
Breakdown:
- Base32 encoding: Each character in the Base32 encoding represents 5 bits of information (since ( 2^5 = 32 )).
- 7 characters: With 7 characters, the total number of possible hashes is:
[ 32^7 = 34,359,738,368 ]
This gives about 34.4 billion possible hash values.
The probability of a hash collision depends on the number of hashes generated and can be estimated using the Birthday Paradox. The paradox tells us that collisions are more likely than expected when hashing a large number of items.
The approximate formula for the probability of at least one collision after generating n hashes is:
[ P(\text{collision}) \approx 1 - e^{-\frac{n^2}{2M}} ]
Where:
- (n) is the number of generated Twt Hashes.
- (M = 32^7 = 34,359,738,368) is the total number of possible hash values.
For practical purposes, here are some example probabilities for different numbers of hashes (n):
- For 1,000 hashes:
[ P(\text{collision}) \approx 1 - e^{-\frac{1000^2}{2 \cdot 34,359,738,368}} \approx 0.0000146 \, \text{(0.0015%)} ]
- For 10,000 hashes:
[ P(\text{collision}) \approx 1 - e^{-\frac{10000^2}{2 \cdot 34,359,738,368}} \approx 0.00145 \, \text{(0.15%)} ]
- For 100,000 hashes:
[ P(\text{collision}) \approx 1 - e^{-\frac{100000^2}{2 \cdot 34,359,738,368}} \approx 0.135 \, \text{(13.5%)} ]
- For small to moderate numbers of hashes (up to around 10,000), the collision probability is quite low.
- However, as the number of Twts grows into the hundreds of thousands and beyond, the likelihood of a collision increases significantly due to the relatively small hash space (about 34.4 billion values).
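Plugging the corrected 35-bit space into the same collision_probability() helper from the earlier Python session confirms these figures:

import math

def collision_probability(k, bits):
    n = 2 ** bits  # total unique hash values
    return (1 - math.exp(-(k ** 2) / (2 * n))) * 100

# 7 Base32 characters = 35 bits of hash space (32^7 = 2^35 values).
for k in (1_000, 10_000, 100_000):
    print(f"{k}: {collision_probability(k, 35):.4f}%")
# Prints roughly 0.0015%, 0.15% and 13.5% respectively.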
@quark@ferengi.one Add here:
* a0826a65 - Add debug sub-command to yarnc (7 weeks ago) <James Mills>
I’d recommend a git pull && make build
$ echo -n "https://twtxt.net/user/prologic/twtxt.txt\n2020-07-18T12:39:52Z\nHello World! 😊" | sha1sum | head -c 11
87fd9b0ae4e
$ echo -n "https://twtxt.net/user/prologic/twtxt.txt\n2020-07-18T12:39:52Z\nHello World! 😊" | sha256sum | base32 | tr -d '=' | tr 'A-Z' 'a-z' | tail -c 12
tdqmjaeawqu
Just experimenting…
$ echo -n "https://twtxt.net/user/prologic/twtxt.txt\n2020-07-18T12:39:52Z\nHello World! 😊" | sha256sum | base64 | tr -d '=' | tail -c 12
NWY4MSAgLQo
It would appear that the blake2b 256bit digest algorithm is no longer supported by the openssl tool, however blake2s256 is; I’m not sure why 🤔
$ echo -n "https://twtxt.net/user/prologic/twtxt.txt\n2020-07-18T12:39:52Z\nHello World! 😊" | openssl dgst -blake2s256 -binary | base32 | tr -d '=' | tr 'A-Z' 'a-z' | tail -c 7
zq4fgq
This obviously produces the wrong hash, which should be o6dsrga as indicated by the yarnc hash utility:
$ yarnc hash -u https://twtxt.net/user/prologic/twtxt.txt -t 2020-07-18T12:39:52Z "Hello World! 😊"
o6dsrga
But at least the shell pipeline is “correct”.
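For comparison, a Python sketch that should reproduce it, assuming I’m reading the Twt Hash spec right (Blake2b 256-bit digest, Base32 without padding, lowercased, last 7 characters):

import base64
import hashlib

def twt_hash(url, timestamp, content):
    payload = f"{url}\n{timestamp}\n{content}".encode("utf-8")
    digest = hashlib.blake2b(payload, digest_size=32).digest()
    b32 = base64.b32encode(digest).decode("ascii").rstrip("=").lower()
    return b32[-7:]

print(twt_hash("https://twtxt.net/user/prologic/twtxt.txt",
               "2020-07-18T12:39:52Z", "Hello World! 😊"))
# expected: o6dsrga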
FWIW the standard UNIX tools for Blake2b are openssl and b2sum – just trying to figure out how to make a shell pipeline again (if you really want that); as tools keep changing god damnit 🤣
@quark@ferengi.one Do you mean something like this?
$ ./yarnc debug ~/Public/twtxt.txt | tail -n 1
kp4zitq 2024-09-08T02:08:45Z (#wsdbfna) @<aelaraji https://aelaraji.com/twtxt.txt> My work has this thing called "compressed work", where you can **buy** extra time off (_as much as 4 additional weeks_) per year. It comes out of your pay though, so it's not exactly a 4-day work week but it could be useful, just haven't tired it yet as I'm not entirely sure how it'll affect my net pay
@quark@ferengi.one Watching TV at the moment, so unsure about “out of the box tools”, but there are 3 implementations on the spec page https://dev.twtxt.net/doc/twthashextension.html
@aelaraji@aelaraji.com Sometimes that’s the most valuable though! Feedback!
I’ve been using Codeium too the last week or so! It’s pretty good and, like @xuu@txt.sour.is said, is a pretty decent junior assistant; it helps me write good docs and the tab completion is amazing!
It of course completely sucks at doing anything “intelligent” or complex, but if you just use it as a fancier auto complete it’s actually half way decent 👌
@quark@ferengi.one I admit I find the whole “click here to share blah” thing generally wasteful, useless and unengaging really. Not just Wordle.
Admittedly however, I’ve been guilty of doing this sometimes myself 🤦♂️ Sometimes though I think it’s OK to show your achievements 👌
@quark@ferengi.one LOL 😂 No worries! 😉
@quark@ferengi.one All fixed! 🥳 pleasure doing business with y’all. 🤣
@lyse@lyse.isobeef.org Just as an aside, shouldn’t you assume utf-8 anyway these days if not specified? 🤔 I mean, basically everything almost always uses utf-8 encoding, right? 😅
Neither jenny nor yarnd support it very well. Only at a very basic level.
@quark@ferengi.one Okay fair 👌
@movq@www.uninformativ.de So basically on-call, but you don’t get paid for it? 🤔
@quark@ferengi.one aye aye captain 🤣