txt.sour.is External profile for @<prologic https://twtxt.net/user/prologic/twtxt.txt/1>

twtxt.net

Don’t forget about the upcoming Yarn.social online meetup coming up this Saturday! 😅 See #jjbnvgq for details! – Hope to see y’all there 💪

⤋ Read More

prologic

twtxt.net

Tue, Sep 24 12:59 (39w ago)

👋 Don’t forget to take the Twtxt v2 poll 🙏 if you haven’t done so already (sorry about the confusing question at the end!)

⤋ Read More

prologic

twtxt.net

Tue, Sep 24 12:55 (39w ago)

↳ In-reply-to » (#knryyga) And finally the legibility of feeds when viewing them in their raw form are worsened as you go from a Twt Subject of (#abcdefg12345) to something like (https://twtxt.net/user/prologic/twtxt.txt 2024-09-22T07:51:16Z).

@doesnm@doesnm.p.psf.lt I don’t even advocate for reading Twtxt in its raw form in the first place, which is why I’m in favor of continuing to use content-based addressing (hashes) and incremental improve what we already have. IMO the only reason to read a Twtxt file in it’s raw form is a) if you’re a developer b) new feed author or c) debugging a client issue.

⤋ Read More

prologic

twtxt.net

Tue, Sep 24 12:53 (39w ago)

↳ In-reply-to » Some more arguments for a local-based treading model over a content-based one:

⤋ Read More

prologic

twtxt.net

Tue, Sep 24 12:45 (39w ago)

↳ In-reply-to » Some more arguments for a local-based treading model over a content-based one:

And finally the legibility of feeds when viewing them in their raw form are worsened as you go from a Twt Subject of (#abcdefg12345) to something like (https://twtxt.net/user/prologic/twtxt.txt 2024-09-22T07:51:16Z).

⤋ Read More

prologic

twtxt.net

Tue, Sep 24 12:44 (39w ago)

↳ In-reply-to » Some more arguments for a local-based treading model over a content-based one:

There is also a ~5x increase cost in memory utilization for any implementations or implementors that use or wish to use in-memory storage (yarnd does for example) and equally a 5x increase in on-disk storage as well. This is based on the Twt Hash going from a 13 bytes (content-addressing) to 63 bytes (on average for location-based addressing). There is roughly a ~20-150% increase in the size of individual feeds as well that needs to be taken into consideration (on the average case).

⤋ Read More

prologic

twtxt.net

Tue, Sep 24 12:42 (39w ago)

↳ In-reply-to » Some more arguments for a local-based treading model over a content-based one:

With Location-based addressing there is no way to verify that a single Twt actaully came from that feed without actually fetching the feed and checking. That has the effect of always having to rely on fetching the feed and storing a copy of feeds you fetch (which is okay), but you’re force to do this. You cannot really share individual Twts anymore really like yarnd does (as peering) because there is no “integrity” to the Twt identified by it’s <url> <timestamp>. The identify is meaningless and is only valid as long as you can trust the location and that the location at that point hasn’t changed its content.

⤋ Read More

prologic

twtxt.net

Tue, Sep 24 12:40 (39w ago)

↳ In-reply-to » Some more arguments for a local-based treading model over a content-based one:

Location-based addressing is vulnerable to the content changing. If the content changes the “location” is no longer valid. This is a problem if you build systems that rely on this.

⤋ Read More

prologic

twtxt.net

Tue, Sep 24 12:39 (39w ago)

↳ In-reply-to » Some more arguments for a local-based treading model over a content-based one:

So really your argument is just that switching to a location-based addressing “just makes sense”. Why? Without concrete pros/cons of each approach this isn’t really a strong argument I’m afraid. In fact I probably need to just sit down and detail the properties of both approaches and the pros/cons of both.

I also don’t really buy the argument of simplicity either personally, because I don’t technically see it much more difficult to take a echo -e "<url>\t<timestamp>\t<content>" | sha256sum | base64 as the Twt Subject or concatenating the <url> <timestamp> – The “effort” is the same. If we’re going to argue that SHA256 or cryptographic hashes are “too complicated” then I’m not really sure how to support that argument.

⤋ Read More

prologic

twtxt.net

Tue, Sep 24 12:34 (39w ago)

↳ In-reply-to » Some more arguments for a local-based treading model over a content-based one:

@sorenpeter@darch.dk Points 2 & 3 aren’t really applicable here in the discussion of the threading model really I’m afraid. WebMentions is completely orthogonal to the discussion. Further, no-one that uses Twtxt really uses WebMentions, whilst yarnd supports the use of WebMentions, it’s very rarely used in practise (if ever) – In fact I should just drop the feature entirely.

The use of WebSub OTOH is far more useful and is used by every single yarnd pod everywhere (no that there’s that many around these days) to subscribe to feed updates in ~near real-time without having the poll constantly.

⤋ Read More

prologic

twtxt.net

Tue, Sep 24 12:01 (39w ago)

↳ In-reply-to » Finally pubnix is alive! That's im missing? Im only reading twtxt.net timeline because twtxt-v2.sh works slowly for displaying timeline...

@doesnm@doesnm.p.psf.lt Welcome back 😅

⤋ Read More

prologic

twtxt.net

Mon, Sep 23 15:38 (40w ago)

↳ In-reply-to » I’m not writing on 'twtxt' as much as I did in 2021-2022. While it has many advantages, I couldn't get my close circle to join.

@eapl.me@eapl.me Sad to see you go, disappointed in your choice of X, but respect your decision and choice. I will never cave in myself, even if it means my “circle of friends” remains low. I guess we call ‘em internet friends right? 😅

⤋ Read More

prologic

twtxt.net

Mon, Sep 23 15:11 (40w ago)

↳ In-reply-to » We're now having a thunderstorm with rain, lightning and thunder and the severe weather map shows all green. I'd expect it to be violet.

@lyse@lyse.isobeef.org How violent is the thunderstorm? 🤔

⤋ Read More

prologic

twtxt.net

Mon, Sep 23 13:41 (40w ago)

↳ In-reply-to » LMAO 🤣 ... I've been scrolling through mutt(1) man page and found this:

@aelaraji@aelaraji.com LOl 😂

⤋ Read More

prologic

twtxt.net

Mon, Sep 23 13:03 (40w ago)

A new thing LLM(s) can’t do well. Write patches 🤣

⤋ Read More

prologic

twtxt.net

Mon, Sep 23 13:00 (40w ago)

↳ In-reply-to » I'm experimenting with SQLite and trees. It's going good so far with only my own 439 messages long main feed from a few days ago in the cache. Fetching these 632 rows took 20ms:

@lyse@lyse.isobeef.org Yeah I think it’s one of the reasons why yarnd’s cache became so complicated really. I mean it’s a bunch of maps and lists that is recalculated every ~5m. I don’t know of any better way to do this right now, but maybe one day I’ll figure out a better way to represent the same information that is displayed today that works reasonably well.

⤋ Read More

prologic

twtxt.net

Mon, Sep 23 12:26 (40w ago)

↳ In-reply-to » Another interesting side effect of changing from content-based addressing to location-based addressing is that switching from 7-byte keys to 2025-character keys for 3.5 million entries would expand the database size from 24.5 MB to about 7.09 GB—an increase of roughly 7.06 GB!

My point is, this is not a small trade-off to make for the sake of simplicity 😅

⤋ Read More

prologic

twtxt.net

Mon, Sep 23 12:26 (40w ago)

↳ In-reply-to » Another interesting side effect of changing from content-based addressing to location-based addressing is that switching from 7-byte keys to 2025-character keys for 3.5 million entries would expand the database size from 24.5 MB to about 7.09 GB—an increase of roughly 7.06 GB!

@movq@www.uninformativ.de Maybe I misspoke. It’s a factor of 5 in the size of the keyspace required. The impact is significantly less for on-disk storage of raw feeds and such, around ~1-1.5x depending on how many replies there are I suppose.

I wasn’t very clear; my apologies. If we update the current hash truncation length from 7 to 11. But then still decide anyway to go down this location-based twt identity and threading model then yes, we’re talking about twt subjects having a ~5x increase in size on average. Going from 14 characters (11 for the has, 2 for the parens, 1 for the #) to ~63 bytes (average I’ve worked out of length of URL + Timestamp) + 3 byte overhead for parents and space.

⤋ Read More

prologic

twtxt.net

Mon, Sep 23 11:49 (40w ago)

↳ In-reply-to » (#22263za) Comparing a few feeds:

@lyse@lyse.isobeef.org Yes I think so.

⤋ Read More

prologic

twtxt.net

Mon, Sep 23 11:20 (40w ago)

Don’t forget about the upcoming Yarn.social meetup coming up this Saturday! See #jjbnvgq for details! Hope to see some/all of y’all there 💪

⤋ Read More

prologic

twtxt.net

Mon, Sep 23 11:18 (40w ago)

↳ In-reply-to » I'm experimenting with SQLite and trees. It's going good so far with only my own 439 messages long main feed from a few days ago in the cache. Fetching these 632 rows took 20ms:

@lyse@lyse.isobeef.org And your query to construct a tree? Can you share the full query (screenshot looks scary 🤣) – On another note, SQL and relational databases aren’t really that conduces to tree-like structures are they? 🤣

⤋ Read More

prologic

twtxt.net

Mon, Sep 23 11:10 (40w ago)

↳ In-reply-to » One of the reasons we wanted to originally use Contant based addressing and short hashes as our threading model was to keep individual Twts short so that they were still readable if you viewed the manually by hand.

In fact it depends on how many Twts there are that form part of a thread, if you take a much larger sample size of my own feed for example, it starts to approximate ~1.5x increase in size:

$ ./compare.sh https://twtxt.net/user/prologic/twtxt.txt 500
Original file size: 126842 bytes
Modified file size: 317029 bytes
Percentage increase in file size: 149.94%
...

⤋ Read More

prologic

twtxt.net

Mon, Sep 23 11:04 (40w ago)

↳ In-reply-to » (#xgz5bga) Can someone make the edit?

In fact @falsifian@www.falsifian.org you had quite a lot of good feedback, do you mind collecting them in a task list on the doc somewhere so I can get to em? 🤔

⤋ Read More

prologic

twtxt.net

Mon, Sep 23 11:00 (40w ago)

↳ In-reply-to » (#5sdepuq) @lyse I'd suggest making the whole content-type thing a SHOULD, to accommodate people just using some hosting service they don't have much control over. (The same situation could make detecting followers hard, but IMO "please email me if you follow me" is still legit twtxt, even if inconvenient.)

Can someone make the edit?

⤋ Read More

prologic

twtxt.net

Mon, Sep 23 10:57 (40w ago)

↳ In-reply-to » One of the reasons we wanted to originally use Contant based addressing and short hashes as our threading model was to keep individual Twts short so that they were still readable if you viewed the manually by hand.

@movq@www.uninformativ.de Tbis was just a representative sample. The real concrete cost here is a ~5x increase in memory consumption for yarnd and/or ~5x increase in disk storage.

⤋ Read More

prologic

twtxt.net

Mon, Sep 23 10:51 (40w ago)

↳ In-reply-to » I'm experimenting with SQLite and trees. It's going good so far with only my own 439 messages long main feed from a few days ago in the cache. Fetching these 632 rows took 20ms:

@lyse@lyse.isobeef.org Mind sharing your schema?

⤋ Read More

prologic

twtxt.net

Mon, Sep 23 10:50 (40w ago)

↳ In-reply-to » Reminder to take the Twtxt (anonymous) Poll: http://polljunkie.com/poll/xdgjib/twtxt-v2

@lyse@lyse.isobeef.org Not sure I’ll check

⤋ Read More

prologic

twtxt.net

Mon, Sep 23 10:49 (40w ago)

↳ In-reply-to » (#22263za) Comparing a few feeds:

@lyse@lyse.isobeef.org My proposal is three steps:

increase the hash length from 7 to 11

Then:

Add support for changing your feed’s location without breaking g threads

Then much later:

Add formal support for edits

⤋ Read More

prologic

twtxt.net

Mon, Sep 23 10:45 (40w ago)

↳ In-reply-to » (#rnf234a) With the original specification of 140 character Twt length recommendation. There's only leaves you with about 78 characters worth of anything remotely useful to say in response.

@lyse@lyse.isobeef.org No I don’t either just say’n 😅

⤋ Read More

prologic

twtxt.net

Mon, Sep 23 10:43 (40w ago)

↳ In-reply-to » (#w6f7hpa) @xuu I think it is more tricky than that.

@movq@www.uninformativ.de That’s what I want to know 🤣

⤋ Read More

prologic

twtxt.net

Mon, Sep 23 07:58 (40w ago)

↳ In-reply-to » Another interesting side effect of changing from content-based addressing to location-based addressing is that switching from 7-byte keys to 2025-character keys for 3.5 million entries would expand the database size from 24.5 MB to about 7.09 GB—an increase of roughly 7.06 GB!

So just to be clear, it’s not as bad as the OP in this thread, this is just a worst case scenario. With some additional analysis I did today, its closer to around ~5x the memory requirements of my pod, which would roughly go from ~22MB to ~120MB or so, probably a bit more in practise. But this is still a significant increase in memory. The on-disk requirements would also increase by around ~5x as well on average going from ~12GB to about ~60GB at current archive size.

⤋ Read More

prologic

twtxt.net

Mon, Sep 23 06:46 (40w ago)

Just out of curiosity, I inspected the yarns database (the search engine//cralwer) to find the average length of a Twtxt URI:

$ inspect-db yarns.db | jq -r '.Value.URL' | awk '{ total += length; count++ } END { if (count > 0) print total / count }'
40.3387

Given an RFC3339 UTC timestamp has a length of 20 characters with seconds precision. We’re talking about Twt Subject taking up ~63 characters/bytes on average.

⤋ Read More

prologic

twtxt.net

Mon, Sep 23 06:30 (40w ago)

↳ In-reply-to » So I whipped up a quick shell script to demonstrate what I mean by the increase in feed size on average as well as the expected increase in storage and retrieval requirements.

Comparing a few feeds:

@xuu@txt.sour.is would see an increase of ~20%
@falsifian@www.falsifian.org would see an increase of ~8%
@bender@twtxt.net would see an increase of ~20%
@lyse@lyse.isobeef.org would see an increase of ~15%
@aelaraji@aelaraji.com would see an increase of ~13%
@sorenpeter@darch.dk would see an increase of ~8%
@movq@www.uninformativ.de would see an increase of ~9%

Just from a scalability standpoint along I’m not seeing a switch to location-based Twt ids to support threading a good idea here. This is what I meant when I said to @david@collantes.us in a recent call that we open up a new can of worms (or new set of problems) by drastically changing the approach, rather than incrementally improving the existing approach we have today (_which has served us well for the past 4 years already_0.

⤋ Read More

prologic

twtxt.net

Mon, Sep 23 06:23 (40w ago)

Reminder to take the Twtxt (anonymous) Poll: http://polljunkie.com/poll/xdgjib/twtxt-v2

Apologies, I can’t edit the poll once it’s live, so the suggestion on feedback for supporting Markdown will have to be discussed at another time.

⤋ Read More

prologic

twtxt.net

Mon, Sep 23 06:16 (40w ago)

↳ In-reply-to » (#zqpkfla) @prologic Thanks for writing that up!

@xuu@txt.sour.is correct

⤋ Read More

prologic

twtxt.net

Mon, Sep 23 06:16 (40w ago)

↳ In-reply-to » So I'm a location based system, how exactly do I reply to one of these two Twts from @Yarns ? 🤔

@xuu@txt.sour.is 🤣🤣🤣

⤋ Read More

prologic

twtxt.net

Mon, Sep 23 04:57 (40w ago)

So I whipped up a quick shell script to demonstrate what I mean by the increase in feed size on average as well as the expected increase in storage and retrieval requirements.

$ ./compare.sh
Original file size: 28145 bytes
Modified file size: 70672 bytes
Percentage increase in file size: 151.10%
...

⤋ Read More

prologic

twtxt.net

Mon, Sep 23 04:12 (40w ago)

↳ In-reply-to » One of the reasons we wanted to originally use Contant based addressing and short hashes as our threading model was to keep individual Twts short so that they were still readable if you viewed the manually by hand.

Thank goodness we relaxed that limit and I’ve stopped being so Puritan about it but my overall point is we would be significantly increasing the human size as well as the machine size of the identity of threads as well as twts

⤋ Read More

prologic

twtxt.net

Mon, Sep 23 04:12 (40w ago)

↳ In-reply-to » One of the reasons we wanted to originally use Contant based addressing and short hashes as our threading model was to keep individual Twts short so that they were still readable if you viewed the manually by hand.

With the original specification of 140 character Twt length recommendation. There’s only leaves you with about 78 characters worth of anything remotely useful to say in response.

⤋ Read More

prologic

twtxt.net

Mon, Sep 23 04:10 (40w ago)

↳ In-reply-to » One of the reasons we wanted to originally use Contant based addressing and short hashes as our threading model was to keep individual Twts short so that they were still readable if you viewed the manually by hand.

Let’s say the overhead is always three bytes two parentheses under space.

⤋ Read More

prologic

twtxt.net

Mon, Sep 23 04:10 (40w ago)

↳ In-reply-to » One of the reasons we wanted to originally use Contant based addressing and short hashes as our threading model was to keep individual Twts short so that they were still readable if you viewed the manually by hand.

So for example, if we would use @movq@www.uninformativ.de ’s feed as an example thread ID here, his feed with a particular timestamp, were already looking at a subject length of 59 bytes +/- a couple of bytes to denote the subject in the Twt itself/

⤋ Read More

prologic

twtxt.net

Mon, Sep 23 04:05 (40w ago)

One of the reasons we wanted to originally use Contant based addressing and short hashes as our threading model was to keep individual Twts short so that they were still readable if you viewed the manually by hand.

With the proposal to switch to location based addressing using a pointer to a feed and a timestamp in that feed you’re looking at roughly 2025 characters long because both the HTTP and HTML and even URI specifications do not specify maximum length for URI(s) AFAIK only recommendations.

⤋ Read More

prologic

twtxt.net

Mon, Sep 23 03:59 (40w ago)

↳ In-reply-to » Another interesting side effect of changing from content-based addressing to location-based addressing is that switching from 7-byte keys to 2025-character keys for 3.5 million entries would expand the database size from 24.5 MB to about 7.09 GB—an increase of roughly 7.06 GB!

@bender@twtxt.net I can’t see myself personally, increasing the infrastructure and costs to run this pod to support this as we switch over potentially and as things continue to grow in scale. You would never get your infinite search and infinite timeline features that you’ve always wanted for example and I would have to drastically reduce what is visible or even searchable at any given point in time to much less than what it is today.

⤋ Read More

prologic

twtxt.net

Mon, Sep 23 03:57 (40w ago)

Another interesting side effect of changing from content-based addressing to location-based addressing is that switching from 7-byte keys to 2025-character keys for 3.5 million entries would expand the database size from 24.5 MB to about 7.09 GB—an increase of roughly 7.06 GB!

⤋ Read More

prologic

twtxt.net

Mon, Sep 23 00:56 (40w ago)

↳ In-reply-to » Okay folks, I've spent all day on this today, and I think its in "good enough"™ shape to share:

@falsifian@www.falsifian.org No worries! Fell few to contribute to the doc directly I’d you wish 👌

⤋ Read More

prologic

twtxt.net

Mon, Sep 23 00:55 (40w ago)

↳ In-reply-to » (#266jaka) Yeah I'm curious to find out too beyond just "here say". But regardless of whether we should or shouldn't care about this or should or shouldn't comply. We should IMO. I'd have to build something that horrendously violates someone's rights in another country.

@falsifian@www.falsifian.org Hmmm not sure sorry 🤔

⤋ Read More

prologic

twtxt.net

Mon, Sep 23 00:45 (40w ago)

↳ In-reply-to » (#6y53k7q) @prologic Do you have a link to some past discussion?

@xuu@txt.sour.is Goos to know! 👌 So as long as we remain decentralized and non-commercial (I assume non/profit works too?) we’re good?

⤋ Read More

prologic

twtxt.net

Sun, Sep 22 12:33 (40w ago)

↳ In-reply-to » Something’s broken.

@lyse@lyse.isobeef.org Nice ! 🙏

⤋ Read More

prologic

twtxt.net

Sun, Sep 22 11:54 (40w ago)

↳ In-reply-to » Hello!

@doesnm@doesnm.p.psf.lt Hello! 👋

⤋ Read More

prologic

twtxt.net

Sun, Sep 22 10:20 (40w ago)

↳ In-reply-to » LOl 😂 Not only have a tried to write up a full Twtxt v2 specification, I've also written a Bash shell script that implements the new spec 😅

@lyse@lyse.isobeef.org Yes let’s make UTF-8 mandatory 👌

⤋ Read More

prologic

twtxt.net

Sun, Sep 22 10:19 (40w ago)

↳ In-reply-to » LOl 😂 Not only have a tried to write up a full Twtxt v2 specification, I've also written a Bash shell script that implements the new spec 😅

@lyse@lyse.isobeef.org Agreed

⤋ Read More

prologic

twtxt.net

Sun, Sep 22 10:13 (40w ago)

Let’s try this pill for Twtxt v2 (no account required)

http://polljunkie.com/poll/xdgjib/twtxt-v2

⤋ Read More

prologic

twtxt.net

Sun, Sep 22 09:35 (40w ago)

↳ In-reply-to » LOl 😂 Not only have a tried to write up a full Twtxt v2 specification, I've also written a Bash shell script that implements the new spec 😅

@lyse@lyse.isobeef.org I’m a bit indifferent whether it’s at the beginning or end tbh.

⤋ Read More

prologic

twtxt.net

Sun, Sep 22 09:21 (40w ago)

↳ In-reply-to » LOl 😂 Not only have a tried to write up a full Twtxt v2 specification, I've also written a Bash shell script that implements the new spec 😅

This is still a draft! Feel free to edit it 👌

⤋ Read More

prologic

twtxt.net

Sun, Sep 22 09:19 (40w ago)

↳ In-reply-to » So I'm a location based system, how exactly do I reply to one of these two Twts from @Yarns ? 🤔

@movq@www.uninformativ.de That’s what I was afraid of 🤣

⤋ Read More

prologic

twtxt.net

Sun, Sep 22 09:18 (40w ago)

↳ In-reply-to » I finally decided to do a few experiments with yarnd to see how many things would break and how many assumptions there are around the idea of "Content Addressing"; here's where I'm at so far:

@movq@www.uninformativ.de Makes sense 👌 I think it’s fair to implement any spec changes incrementaly for sure 👌

And yea since yarnd has a store it’s a bit easier to support edit / delete actions 😅

⤋ Read More

prologic

twtxt.net

Sun, Sep 22 08:50 (40w ago)

So I’m a location based system, how exactly do I reply to one of these two Twts from @Yarns@search.twtxt.net ? 🤔

2024-09-07T12:55:56Z	🥳 NEW FEED: @<twtxt http://edsu.github.io/twtxt/twtxt.txt>
2024-09-07T12:55:56Z	🥳 NEW FEED: @<kdy https://twtxt.kdy.ch/twtxt.txt>

⤋ Read More

prologic

twtxt.net

Sun, Sep 22 08:18 (40w ago)

↳ In-reply-to » Something’s broken.

@lyse@lyse.isobeef.org Yup, this is why you started seeing if you could improve the “trust” of peers right? 😅

⤋ Read More

prologic

twtxt.net

Sun, Sep 22 08:10 (40w ago)

↳ In-reply-to » I finally decided to do a few experiments with yarnd to see how many things would break and how many assumptions there are around the idea of "Content Addressing"; here's where I'm at so far:

@movq@www.uninformativ.de Yeah I think what I’m proposing here is a more pragmatic approach to improvements that will last much longer than our first interaction (~4 years and going strong, but running into minor issues with edit/identify and some collssions_). This scope of changes is much easier to implement for yarnd and I suspect jenny too. and as indicated in here quite easy to have a reference implementation written in Bash with standard UNIX tools.

⤋ Read More

prologic

twtxt.net

Sun, Sep 22 07:53 (40w ago)

↳ In-reply-to » Okay folks, I've spent all day on this today, and I think its in "good enough"™ shape to share:

It’s even sorta/somewhat compatible with our existing feeds (kind of) 🤣 – Bit too stupid to figure out how to write enough correct Bash to make threads display inline nicely in an indented/tree-like fashion, but oh well 😅

⤋ Read More

prologic

twtxt.net

Sun, Sep 22 07:52 (40w ago)

↳ In-reply-to » Okay folks, I've spent all day on this today, and I think its in "good enough"™ shape to share:

Example:

$ ./twtxt-v2.sh reply 242561ce02d "Cool! 👌"
Posted twt with hash: b2c938f9838
...
$ ./twtxt-v2.sh timeline
...
prologic@twtxt.net [2024-09-22T07:26:37Z] <242561ce02d> Okay folks, I've spent all day on this today, and I _think_ its in "good enough"™ shape to share:

**Twtxt v2**:

- Specification: https://docs.mills.io/uJXuisaYTRWYDrl8A2jADg?both
- implementation: https://gist.mills.io/prologic/afdec15443da4d7aa898f383f171ec1b

 ![](https://twtxt.net/media/Wb9MtAiQyEkzNQB5dyVvUR.png)
prologic@localhost [2024-09-22T07:51:16Z] <b2c938f9838> Cool! 👌 (reply-to:242561ce02d)

⤋ Read More

prologic

twtxt.net

Sun, Sep 22 07:26 (40w ago)

Okay folks, I’ve spent all day on this today, and I think its in “good enough”™ shape to share:

Twtxt v2:

Specification: https://docs.mills.io/uJXuisaYTRWYDrl8A2jADg?both
implementation: https://gist.mills.io/prologic/afdec15443da4d7aa898f383f171ec1b

⤋ Read More

prologic

twtxt.net

Sun, Sep 22 06:38 (40w ago)

↳ In-reply-to » Bahahahaha very clever @lyse I look forward to reading your report ! 🤣 However...

@aelaraji@aelaraji.com No that is absolutely correct. Without cryptographic identities and signatures there is no way to verify authenticity. That is correct. And I don’t think we need to necessarily. What I was just showing and proving was that I didn’t write that spoofed Twt in the first place, which was only provable at the time of @lyse@lyse.isobeef.org short-lived attack 🤣 He essentially forked yarnd, hosted it temporarily (I think locally) and used it to poison the caches of a few production pods.

Thankfully the gossip protocol used by yarnd as part of its “peering” between pods isn’t fully trusted, twts are not archived for example into permanent storage. So the moment my pod re-fetched my own feed, the spoofed Twt was obliterated 😅

Eventual consistency 🤣

⤋ Read More

prologic

twtxt.net

Sun, Sep 22 06:26 (40w ago)

LOl 😂 Not only have a tried to write up a full Twtxt v2 specification, I’ve also written a Bash shell script that implements the new spec 😅

⤋ Read More

prologic

twtxt.net

Sun, Sep 22 05:40 (40w ago)

↳ In-reply-to » Had to build a list of all feeds (that I follow) and all twts in them and there are two collisions already:

@movq@www.uninformativ.de Haha 😝 Nice one! And yes I’m also aware of some collisions too!

⤋ Read More

prologic

twtxt.net

Sun, Sep 22 02:53 (40w ago)

↳ In-reply-to » (#v3qx3la) @bender Just desktop notifications at the moment, but you could easily throw in a Ntfy server and get notified about anything you want, wherever you want. 🤣

@aelaraji@aelaraji.com I like Nttfy 👌 I’ve wanted to replace my use of the Pushover service with this for a while now 🤔

⤋ Read More

prologic

twtxt.net

Sun, Sep 22 01:15 (40w ago)

↳ In-reply-to » I finally decided to do a few experiments with yarnd to see how many things would break and how many assumptions there are around the idea of "Content Addressing"; here's where I'm at so far:

@bender@twtxt.net 👌

⤋ Read More

prologic

twtxt.net

Sun, Sep 22 01:11 (40w ago)

👋 Reminder folks of the upcoming Yarn.social monthly online meetup:

I hope to see @david@collantes.us @movq@www.uninformativ.de @lyse@lyse.isobeef.org @xuu@txt.sour.is @sorenpeter@darch.dk and hopefully others too @aelaraji@aelaraji.com @falsifian@www.falsifian.org and anyone else that sees this! 🙏 We’re hopefully going to primarily discuss the future of Twtxt and the last few weeks of discussions 🤣

Event: Yarn.social Online Meetup
When: 28th September 2024 at 12:00pm UTC (midday)
Where: Mills Meet : Yarn.social
Cadence: 4th Saturday of every Month

Agenda:

Let’s talk about the upcoming changes to the Twtxt spec(s)
- See #xgghhnq

#Yarn.social #Meetup

⤋ Read More

prologic

twtxt.net

Sun, Sep 22 01:05 (40w ago)

My Position on the last few weeks of Twtxt spec discussions:

We increase the Hash length from 7 to 11.
We formalise the Update Commands extension.
We amend the Twt Hash and Metadata extension to state:

Feed authors that wish to change the location of their feed (once Twts have been published) must append a new # url = comment to their feed to indicate the new location and thus change the “Hashing URI” used for Twts from that point onward.

This has implications of the “order” of a feed, and we should either do one of two things, either:

Mandate that feeds are append-only.
Or amend the Metadata spec with a new field that denotes the order of the feed so clients can make sense of “inline” comments in the feed. – This would also imply that the default order is (of course) append-only. Suggestion: # direction = [append|prepend]

⤋ Read More

prologic

twtxt.net

Sun, Sep 22 00:22 (40w ago)

I finally decided to do a few experiments with yarnd to see how many things would break and how many assumptions there are around the idea of “Content Addressing”; here’s where I’m at so far:

What breaks

Basically I’m at a point where spending time on this is going to provide very little value, there are assumptions made in the lextwt parser, assumptions made in yarnd, assumptions in the way storage is done and the way threading works and things are looked up. There are far reaching implications to changing the way Twts are identified here to be “location addressed” that I’m quite worried about the amount of effort would be required to change yarnd here.

⤋ Read More

prologic

twtxt.net

Sat, Sep 21 23:34 (40w ago)

↳ In-reply-to » Ever wondered what it would cost to self-hosted vs. use the cloud? Well I often doubt myself every time I look at hardware prices, and I know I have to do some hardware refresh soon™ for the Mills DC (something I don't have a regular plan or budget for), here's a rough ball park:

@mckinley@twtxt.net Yes I have, however I’m not counting that because even using “Cloud” is not labor free.

⤋ Read More

prologic

twtxt.net

Sat, Sep 21 23:32 (40w ago)

↳ In-reply-to » Something’s broken.

@aelaraji@aelaraji.com We digits it out 🤣 @lyse@lyse.isobeef.org ’s little hack was good but only temporary 🤣

⤋ Read More

prologic

twtxt.net

Sat, Sep 21 23:27 (40w ago)

↳ In-reply-to » (#crmwgxq) I’m still more in favor of (replyto:…). It’s easier to implement and the whole edits-breaking-threads thing resolves itself in a “natural” way without the need to add stuff to the protocol.

@sorenpeter@darch.dk Lins of agree with dealing with this kind of social nonsense which we’ve all done in the past 🤣

⤋ Read More

prologic

twtxt.net

Sat, Sep 21 23:26 (40w ago)

↳ In-reply-to » (#crmwgxq) I’m still more in favor of (replyto:…). It’s easier to implement and the whole edits-breaking-threads thing resolves itself in a “natural” way without the need to add stuff to the protocol.

@movq@www.uninformativ.de I think your scenario doesn’t account for clients and their storage. The scenario described only really affects clients that come along later. Even then they would also be able to re-fetch mossing Twts from peers or even a search engine to fill in the gaps.

⤋ Read More

prologic

twtxt.net

Sat, Sep 21 15:45 (40w ago)

↳ In-reply-to » (#crmwgxq) I’m still more in favor of (replyto:…). It’s easier to implement and the whole edits-breaking-threads thing resolves itself in a “natural” way without the need to add stuff to the protocol.

@movq@www.uninformativ.de That’s kind a problem though right?

⤋ Read More

prologic

twtxt.net

Sat, Sep 21 15:42 (40w ago)

↳ In-reply-to » (#br5oska) @aelaraji This is one of the reasons why yarnd has a couple of settings with some sensible/sane defaults:

@david@collantes.us 🤣🤣🤣

⤋ Read More

prologic

twtxt.net

Sat, Sep 21 15:35 (40w ago)

↳ In-reply-to » (#crmwgxq) I’m still more in favor of (replyto:…). It’s easier to implement and the whole edits-breaking-threads thing resolves itself in a “natural” way without the need to add stuff to the protocol.

I just realized the other big property you lose is:

What if someone completely changes the content of the root of the thread?

Does the Subject reference the feed and timestamp only or the intent too?

⤋ Read More

prologic

twtxt.net

Sat, Sep 21 15:22 (40w ago)

↳ In-reply-to » (#crmwgxq) I’m still more in favor of (replyto:…). It’s easier to implement and the whole edits-breaking-threads thing resolves itself in a “natural” way without the need to add stuff to the protocol.

@bender@twtxt.net Yeah I’ll be honest here; I’m not going to be very happy if we go down this “location addressing” route;

Twt Subjects lose their meaning.
Twt Subjects cannot be verified without looking up the feed.
- Which may or may not exist anymore or may change.
Two persons cannot reply to a Twt independently of each other anymore.

and probably some other properties we’d stand to lose that I’m forgetting about…

⤋ Read More

prologic

twtxt.net

Sat, Sep 21 15:04 (40w ago)

↳ In-reply-to » (#crmwgxq) I’m still more in favor of (replyto:…). It’s easier to implement and the whole edits-breaking-threads thing resolves itself in a “natural” way without the need to add stuff to the protocol.

@movq@www.uninformativ.de One of the biggest reasons I don’t like the (replyto:…) proposal (location addressing vs. content addressing) is that you just introduce a similar problem down the track, albeit rarer where if a feed changes its location, your thread’s “identifiers” are no longer valid, unless those feed authors maintain strict URL redirects, etc. This potentially has the long-term effect of being rather fragile, as opposed to what we have now where an Edit just really causes a natural fork in the thread, which is how “forking” works in the first place.

I realise this is a bit pret here, and it probably doesn’t matter a whole lot at our size. But I’m trying to think way ahead, to a point where Twtxt as a “thing” can continue to work and function decades from now, even with the extensions we’ve built. We’ve already proven for example that Twts and threads from ~4 years ago still work and are easily looked up haha 😝

⤋ Read More

prologic

twtxt.net

Sat, Sep 21 14:06 (40w ago)

↳ In-reply-to » Alright, before I go and watch Formula 1 😅, I made two PRs regarding the two “competing” ideas:

I just read the primary spec I’m strongly in support of and it’s pretty rock solid for me 👌 💯

⤋ Read More

prologic

twtxt.net

Sat, Sep 21 13:22 (40w ago)

↳ In-reply-to » Something’s broken.

Do you recall what it was? I blame my maintenance window 🪟

⤋ Read More

prologic

twtxt.net

Sat, Sep 21 13:22 (40w ago)

↳ In-reply-to » Something’s broken.

@bender@twtxt.net Hmm what you replied to appears to be non-existent: https://twtxt.net/twt/pqst4ea

⤋ Read More

prologic

twtxt.net

Sat, Sep 21 13:20 (40w ago)

↳ In-reply-to » Alright, before I go and watch Formula 1 😅, I made two PRs regarding the two “competing” ideas:

@movq@www.uninformativ.de I just saw thes come through! 🙏 Thank you very much, I’ll definitely have a read tomorrow! 👌

⤋ Read More

prologic

twtxt.net

Sat, Sep 21 12:52 (40w ago)

@bender@twtxt.net Which reply was that? 🤔

⤋ Read More

prologic

twtxt.net

Sat, Sep 21 12:51 (40w ago)

↳ In-reply-to » Bahahahaha very clever @lyse I look forward to reading your report ! 🤣 However...

@bender@twtxt.net Bahahahahaha 🤣

⤋ Read More

prologic

twtxt.net

Sat, Sep 21 12:28 (40w ago)

Ever wondered what it would cost to self-hosted vs. use the cloud? Well I often doubt myself every time I look at hardware prices, and I know I have to do some hardware refresh soon™ for the Mills DC (something I don’t have a regular plan or budget for), here’s a rough ball park:

The Mills DC has cost me around ~$15k to build and maintain over the last ~10 years or so. Roughly speaking. I’ve never actually taken a Bill of Materials or anything, but I could if anyone is interested in more specifics.

The equivalent of resources if run in the “Cloud” would cost around:

~$1,000 for virtual machines
~$12000 for storage

So around ~$2,000/month to run.

Keep this in mind anytime anyone ever tries to con you into believing “Cloud is cheaper”. It’s not.

⤋ Read More

prologic

twtxt.net

Sat, Sep 21 11:50 (40w ago)

↳ In-reply-to » (#266jaka) @movq @falsifian @prologic Maybe I don't know what I'm talking about and You've probably already read this: Everything you need to know about the “Right to be forgotten” coming straight out of the EU's GDPR Website itself. It outlines the specific circumstances under which the right to be forgotten applies as well as reasons that trump the one's right to erasure ...etc.

@aelaraji@aelaraji.com This is one of the reasons why yarnd has a couple of settings with some sensible/sane defaults:

I could already imagine a couple of extreme cases where, somewhere, in this peaceful world one’s exercise of freedom of speech could get them in Real trouble (if not danger) if found out, it wouldn’t necessarily have to involve something to do with Law or legal authorities. So, If someone asks, and maybe fearing fearing for… let’s just say ‘Their well being’, would it heart if a pod just purged their content if it’s serving it publicly (maybe relay the info to other pods) and call it a day? It doesn’t have to be about some law/convention somewhere … 🤷 I know! Too extreme, but I’ve seen news of people who’d gone to jail or got their lives ruined for as little as a silly joke. And it doesn’t even have to be about any of this.

There are two settings:

$ ./yarnd --help 2>&1 | grep max-cache
      --max-cache-fetchers int        set maximum numnber of fetchers to use for feed cache updates (default 10)
  -I, --max-cache-items int           maximum cache items (per feed source) of cached twts in memory (default 150)
  -C, --max-cache-ttl duration        maximum cache ttl (time-to-live) of cached twts in memory (default 336h0m0s)

So yarnd pods by default are designed to only keep Twts around publicly visible on either the anonymous Frontpage or Discover View or your Timeline or the feed’s Timeline for up to 2 weeks with a maximum of 150 items, whichever get exceeded first. Any Twts over this are considered “old” and drop off the active cache.

It’s a feature that my old man @off_grid_living@twtxt.net was very strongly in support of, as was I back in the day of yarnd’s design (nothing particularly to do with Twtxt per se) that I’ve to this day stuck by – Even though there are some 😉 that have different views on this 🤣

⤋ Read More

prologic

twtxt.net

Sat, Sep 21 11:46 (40w ago)

↳ In-reply-to » (#w6f7hpa) @falsifian Do you have specifics about the GRPD law about this?

@aelaraji@aelaraji.com Thanks for this! 🙏

⤋ Read More

prologic

twtxt.net

Sat, Sep 21 07:46 (40w ago)

Bahahahaha very clever @lyse@lyse.isobeef.org I look forward to reading your report ! 🤣 However…

$ yarnc debug https://twtxt.net/user/prologic/twtxt.txt | grep -E '^pqst4ea' | tee | wc -l
0

I very quickly proved that Twt was never from me 🤣

⤋ Read More

prologic

twtxt.net

Sat, Sep 21 07:19 (40w ago)

@yarn_police@twtxt.net Cool cool 🙇‍♂️

⤋ Read More

prologic

twtxt.net

Sat, Sep 21 06:39 (40w ago)

↳ In-reply-to » Heads up, @prologic! We're seeing increased spate of burglaries in your neighbourhood. Please stay alert, while we keep you safe out there.

@yarn_police@twtxt.net What’s going on?

⤋ Read More

prologic

twtxt.net

Sat, Sep 21 04:30 (40w ago)

↳ In-reply-to » (#l5452vq) Another thing: At the moment, anyone could claim that some feed contained a certain message which was then removed again by just creating the hash over the fake message in said feed and invented timestamp themselves. Nobody can ever verify that this was never the case in the first place and completely made up. So, our twt hashes have to be taken with a grain of salt.

@movq@www.uninformativ.de Yes that’s true they are only integrity checks. But beyond a malicious pod (ignore yarnd’a gossiping protocol for now) how does what @lyse@lyse.isobeef.org presented work exactly? 😅

⤋ Read More

prologic

twtxt.net

Sat, Sep 21 04:02 (40w ago)

↳ In-reply-to » (#w6f7hpa) @falsifian Do you have specifics about the GRPD law about this?

But this is no different to how jenny does things with storing every Twt in a Maildir I suppose? 🤔

⤋ Read More

prologic

twtxt.net

Sat, Sep 21 04:02 (40w ago)

↳ In-reply-to » (#w6f7hpa) @falsifian Do you have specifics about the GRPD law about this?

This has specifically come up before in the form of “informal complaints” against yarnd because of the way it permanently stores and archives Twts, so even if you decide you changed your mind, or deleted that line out of your feed, if my pod or @xuu@txt.sour.is or @abucci@anthony.buc.ci or @eldersnake@we.loveprivacy.club (or any other handful of pods still around?) saw the Twt, it’d be permanently archived.

⤋ Read More

prologic

twtxt.net

Sat, Sep 21 04:01 (40w ago)

↳ In-reply-to » (#w6f7hpa) @falsifian Do you have specifics about the GRPD law about this?

Yeah I’m curious to find out too beyond just “here say”. But regardless of whether we should or shouldn’t care about this or should or shouldn’t comply. We should IMO. I’d have to build something that horrendously violates someone’s rights in another country.

⤋ Read More

prologic

twtxt.net

Sat, Sep 21 03:59 (40w ago)

↳ In-reply-to » (#l5452vq) Another thing: At the moment, anyone could claim that some feed contained a certain message which was then removed again by just creating the hash over the fake message in said feed and invented timestamp themselves. Nobody can ever verify that this was never the case in the first place and completely made up. So, our twt hashes have to be taken with a grain of salt.

@movq@www.uninformativ.de Care to explain how this explicit/attack works for me? 🤣

⤋ Read More

prologic

twtxt.net

Sat, Sep 21 03:58 (40w ago)

↳ In-reply-to » (#ezhsc5a) I've also put up this PR Add compatible methods for Index to behave as the Archiver (transition) #1177 that will act as a transition from the old naive archiver to the new bluge-based search/index. I will switch my pod over to this soon to test it before anyone else does.

Well that was bloody awful. This PR bokr my pod for some strange reason I can’t figure out why or how 😱 The process just kept getting terminated from something, somewhere (no panic). weird. I’ve reverted this PR for now @xuu@txt.sour.is

⤋ Read More

prologic

twtxt.net

Sat, Sep 21 02:42 (40w ago)

↳ In-reply-to » And we're back. Sorry about that 😅

Really though I only managed to save a few GB, but it’s enough for now.

⤋ Read More

prologic

twtxt.net

Sat, Sep 21 02:39 (40w ago)

↳ In-reply-to » (#ezhsc5a) @prologic woot, woot! Glad everything went well. I feel it faster already!

@bender@twtxt.net Haha 😛 Faster? Maybe 🤔 But yeah it’s good to have backups! (that work)

⤋ Read More

prologic

twtxt.net

Sat, Sep 21 02:38 (40w ago)

↳ In-reply-to » And we're back. Sorry about that 😅

I’ve also put up this PR Add compatible methods for Index to behave as the Archiver (transition) #1177
that will act as a transition from the old naive archiver to the new bluge-based search/index. I will switch my pod over to this soon to test it before anyone else does.

⤋ Read More