On how the Threadiverse should work

Die4Ever@programming.dev · 11 months ago

On how the Threadiverse should work

Nightwatch Admin@feddit.nl · 11 months ago

That’s not a great idea. I understand where OP is coming from, but this could make the admins of the user account servers very powerful , as they would be in control of who would get access to what.

Die4Ever@programming.dev · 11 months ago

I’m not sure what you mean, don’t they already have that power?

Nightwatch Admin@feddit.nl · edit-2 11 months ago

No, because I can now create an account at - as a fictitious example - Lemmypoop, and see their content and everyone else’s, something that could be prevented when I only can create an account on poop-disliking Lemmy.clean. At the very best, it would end up in a huge amount of little bubble islands, where accounts and content are bound together.

Nightwatch Admin@feddit.nl · 11 months ago

Why am making such a gross example btw

Die4Ever@programming.dev · 11 months ago

so basically you’re saying that it might be difficult to find an instance for your account that federates with the community you want instead of just signup up on the same instance as the community you want? that does make sense

although you could look at the posts on the front page of the instances community and see what instances those users are from, or the instance sidebar could have some suggested instances for creating an account

but yea it could be a little awkward

just another dev@lemmy.my-box.dev · 11 months ago

How would they be more powerful than in the current situation?

sugar_in_your_tea@sh.itjust.works · edit-2 11 months ago

I think this solution misses both the point of the current solution and doesn’t actually solve many of the problems the current design has. Others have made some good points, do I’ll leave that to other threads for discussion.

I’m actually working on a project that I think will solve more of the problems (though it has a few of its own), but I think it’s incredibly unlikely that I’ll finish it, let alone have it be as popular as Lemmy. So I’ll put my notes here in case someone else wants to carry the torch (if not, I’ll post a repo once it’s ready enough).

Architecture

The service will be a distributed, P2P network based on distributed hash tables.

In other words, there are no instances, and all data is stored by users.

Communities

Communities are a namespace, not a centralized location, so creating content is local first and eventually consistent as people pull it in (DHT will ensure high availability). This has interesting implications for moderation, which I’ll discuss later.

In practice, there will probably be storage instances to help out with availability, at least in the early days as the software is tuned. Compute costs would be incredibly low and can run off a simple key/value lookup, so hosting should be relatively cheap.

Authentication

User authentication is based on a blockchain. Basically, any time you make an account or change a password, it’s verified by others. The churn here should be low, so that processing probably doesn’t need to be rewarded and users’ devices would just do it in the background.

All posts are signed with the key that’s used on the blockchain, so you can always verify authorship of a post. And that brings us to:

Moderation

Moderation is also distributed, so there are no mods as such, only a web of trust. Basically, you pick certain users that you think are trustworthy, and their actions on the network will determine whether you see a post or not. For example:

if a post is reported as CSAM by enough of your trusted users, you never see the post
upvotes and downvotes of people you trust have greater weight than votes by everyone else (may not even see votes by other users)
you never see posts by users that enough people you trust have blocked

And so on. This whole process will be transparent, so you can always audit what a user has reported, blocked, or voted on.

Client

The client will be cross platform day one, written in Rust and React using Tauri. It’ll start desktop only, though Tauri is working on mobile support (currently in beta) so that could change before it’s released.

The reasoning here is that the networking layer i intend to use (Iroh) is written in Rust, and it’s usually pretty easy to find React devs. I also like that Tauri generally has a smaller install package, which can make the reserved storage space requirement an easier pull to swallow.

Limitations/Problems

Searching

Searching is complicated in a distributed environment since you never know how many nodes you need to hit to get a satisfactory answer. This will require lots of tuning and there may need to be a full text search instance/cluster set up to help.

Lemmy doesn’t have this, so I’ll probably put it off for later as well.

Availability

A lot of users will likely use phones as their primary or perhaps only interface, and phones are very sensitive to data usage. I think users may be okay with it if the app can track and limit data usage while on data, which adds complexity to the app.

Web app

Web apps need a server to make requests to, and that just doesn’t exist, so there would need to be some kind of bridge into the network.

Latency

Theoretically, searching and fetching data from a distributed network is fast, but that may not be true in practice given the target demographic for this app (i.e. lots of mobile users).

Persistence

Data only lives as long as someone has it on their device, so less popular content could just disappear. I think this is maybe desirable in some cases (e.g. CSAM), but losing data makes me feel a little uncomfortable, so I’ll probably build an archive insurance to store stuff encrypted (to avoid legal liability for stuff like CSAM).

Illegal content

There’s no way for an admin to delete something, so there’s a lot of reliance on the web of trust to prevent illegal content from getting to your device. I’m hoping that this will be a non-issue, provided people set up their WoT properly.

Content

It’s not designed to be an ActivityPub service, so it doesn’t get that data for free. I may look into building a bridge, but it’s not going to be designed in.

Problems it solves

This design solves a bunch of issues with Lemmy, enough that I think the above limitations (and others I didn’t mention) are worth it. For example:

communities - no more instances, so there’s only ever one community for a given name
costs - hosting costs are minimal (just need a few STUN servers), and even caching servers should be much cheaper than larger Lemmy instances
durability - since it’s distributed by nature, there’s no possibility of a a large instance going down, so no worries about communities disappearing
bad mods/admins - you pick your moderation, and you only saw content that people you trust are okay with
illegal content - illegal content should never appear in the first place, if your Web of trust is robust enough; there could even be AI users that identify CSAM and whatnot automatically so humans don’t need to see it
performance - instances can get busy, a P2P network is much harder to grind to a halt; there will need to be protections against DDOS attacks though

Conclusion

I’m still in the early days of figuring everything out, but I think a distributed design is the way forward here. I’ll continue to use and support lemmy until I find a viable alternative though, since it’s at least good enough for now. I just worry about long term viability as instance hosts get tired of hosting.

Anyway, I’d love to hear your thoughts.

On how the Threadiverse should work

On how the Threadiverse should work

The Problems

Why can’t we merge / sync all the communities with the same name?

The admins of my instance are doing bad things!

I want to host an instance, but the storage & network requirements are too high

The Solution

Communities

Users

How does this address the problems

Storage & Network Requirements

Architecture

Communities

Authentication

Moderation

Client

Limitations/Problems

Searching

Availability

Web app

Latency

Persistence

Illegal content

Content

Problems it solves

Conclusion