this post was submitted on 05 Nov 2024
407 points (99.8% liked)

Fediverse

28339 readers
213 users here now

A community to talk about the Fediverse and all it's related services using ActivityPub (Mastodon, Lemmy, KBin, etc).

If you wanted to get help with moderating your own community then head over to !moderators@lemmy.world!

Rules

Learn more at these websites: Join The Fediverse Wiki, Fediverse.info, Wikipedia Page, The Federation Info (Stats), FediDB (Stats), Sub Rehab (Reddit Migration), Search Lemmy

founded 1 year ago
MODERATORS
 

Disclaimer: I wrote this article and made this website.

There was some talk of this issue in the recent fediverse inefficiencies thread. I'm hopeful that in the future we'll have a decentralized solution for file hosting but for now I deeply believe that users should pay for their own file hosting.

top 50 comments
sorted by: hot top controversial new old
[–] M0oP0o@mander.xyz 22 points 6 days ago (1 children)

Too wet for server racks in the forest.

[–] sosodev@lemmy.world 10 points 6 days ago (1 children)
[–] M0oP0o@mander.xyz 5 points 6 days ago

Look I know there called "farms" but like I told the last forest gnome, the dank woods is no place to host data.

[–] C126@sh.itjust.works 17 points 6 days ago (3 children)

anyway to use torrent protocol somehow? Like popcorn time did?

[–] tiddy@sh.itjust.works 8 points 6 days ago (1 children)

Ipfs would be similar but more purpose based

[–] C126@sh.itjust.works 1 points 6 days ago

Never heard of that for, it looks exactly built for this problem, better than torrent. Good call.

[–] nutsack@lemmy.world 4 points 6 days ago* (last edited 6 days ago)

this is actually a super interesting idea of which i have never seen proof of concept. other than maybe freenet or something.

[–] TriflingToad@sh.itjust.works 3 points 6 days ago (1 children)

if it was easy to set up id definitely host a terabyte of Lemmy. As a pet.

[–] moseschrute@lemmy.world 2 points 5 days ago

What would you name your pet

[–] mlg@lemmy.world 20 points 6 days ago (2 children)
[–] Ludrol@szmer.info 11 points 6 days ago (1 children)

as I stated in this comment it's not really feasible as to ~5s delay that was tested some time ago.

[–] Lemmchen@feddit.org 0 points 4 days ago

That's the wrong comment.

[–] sosodev@lemmy.world 1 points 6 days ago (1 children)

What would an IPFS solution look like here? That's a genuine question. I don't have much experience with IPFS. It seems like it isn't really used outside of blockchain applications.

[–] taanegl@lemmy.ml -5 points 6 days ago* (last edited 6 days ago) (1 children)

The sustainability of it is questionable. If I'm not mistaken, IPFS is based on Ethereum, which has gone over to proof of stake rather than proof of work, but it's still a pretty cumbersome system.

We're talking about something that needs to compete with Quic and CloudFlare. I'm not sure that Ethereum or even crypto itself is efficient enough as a content delivery method, that IPFS - though a nice idea - is unrealistic.

But that's just speculation from someone who has zero knowledge behind IPFS as a technology and protocol, so take it with a grain of salt.

EDIT: honestly, why qualify with "I'm not sure" when besserwissers and their alts roam the fediverse instead of going to therapy. Smh. Give the people a Tl;Dr at least. I'm not here for long form content.

[–] Scio@lemmy.world 10 points 6 days ago* (last edited 6 days ago) (1 children)

IPFS has absolutely nothing whatsoever to do with Ethereum, or indeed any blockchain. It is a protocol for storing distributing and addressing data by hashes of the content over a peer to peer network.

There is however an initiative to create a commercial market for "pinning*", which is blockchain based. It still has nothing to do with Ethereum, and is a distinct project that uses IPFS rather than being part of the protocol, thankfully. It is also not a "proof of work" sort of waste, but built around proving content that was promised to be stored is actually stored.

Pinning in IPFS is effectively "hosting" data permanently. IPFS is inherently peer to peer: content you access gets added to your local cache and gets served to any peer near you asking for it—like BitTorrent—until it that cache is cleared to make space for new content you access. If nobody keeps a copy of some data you want others to access when your machines are offline, IPFS wouldn't be particularly useful as a CDN. So peers on the network can choose to pin some data, making them exempt from being cleared with cache. It is perfectly possible to offer pinning services that have nothing to do with Filecoin or the blockchain, and those exist already. But the organization developing IPFS wanted an independent blockchain based solution simply because they felt it would scale better and give them a potential way to sustain themselves.

Frankly, it was a bad idea then, as crypto grift was already becoming obvious. And it didn't really take off. But since Filecoin has always been a completely separate thing to IPFS, it doesn't affect how IPFS works in any way, which it continues to do so.

There are many aspects of IPFS the actual protocol that could stand to be improved. But in a lot of ways, it does do many of the things a Fediverse "CDN" should. But that's just the storage layer. Getting even the popular AP servers to agree to implement IPFS is going to be almost as realistic an expectation as getting federated identity working on AP. A personal pessimistic view.

[–] taanegl@lemmy.ml 2 points 6 days ago* (last edited 6 days ago) (1 children)

TL;Dr. From Wikipedia

IPFS allows users to host and receive content in a manner similar to BitTorrent. As opposed to a centrally located server, IPFS is built around a decentralized system of user-operators who hold a portion of the overall data. Any user in the network can serve a file by its content address, and other peers in the network can find and request that content from any node who has it using a distributed hash table (DHT).

So it's BitTorrent in the web browser... thanks. How is that to be competitive with CloudFlare and Quic again? It has the same network issues that the blockchain has, in that it will be cumbersome and slow - for anyone else that doesn't have millions to throw into infrastructure. Welcome to the same problem again, but in a different way.

[–] Scio@lemmy.world 3 points 6 days ago

Ironically, because there's no UDP in browsers, we can't actually get proper p2p on the web. WebRTC through centralized coordination servers at best. Protocol Labs has all but given up on this use-case in favor of using some bootstrapped selection of remote helper nodes.

[–] moon@lemmy.cafe 6 points 6 days ago* (last edited 6 days ago) (2 children)

Personally I'm in the camp that I want history to be lost. That's part of the appeal to me. In fact my favorite feature in the fedi is Mastodon's option to enable auto-deleting posts of a certain age.

Only content that is explicitly pinned or reaches a certain amount of interactions should be saved imo. Since that's the stuff you'd actually want to preserve rather than the 99% of forgettable content, and it would also drastically cut down on file hosting.

Another thing is that a federation should only act as the exchange between users on ActivityPub. It should only cache relevant information and not be expected to store everything, like I wrote before. The user should be a portable account that is stored on a device. The federation server would sync your account between your devices, but not store it. You send your content to the federation, and then the federation sends it out into the world where they choose to do what they want with it. The federation shouldn't hoard it indefinitely.

Also this makes sense from a privacy perspective. If you care about privacy, why would you also want all your data indefinitely stored? Unless certain things are relevant and explicitly kept, it should be expected to expire and be lost by default. Where did we get this expectation that data should be stored forever? Also you expect it to be stored forever and not be trained on by AI?

This comment for example, after about a week or two most of the visibility and interaction of it will drop to zero. At that point, this comment should expire and no longer exist. I wrote this comment, it reached some people, and served it's purpose and should expire. I'm not going to pretend like this comment is some kind of historic document that should be indefinitely preserved, nor do I expect or want it to be.

[–] Ludrol@szmer.info 5 points 6 days ago (1 children)

Can you judge a work of art by it's virality? Should you judge by virality?

A lot of times in history artists got recognition they deserved only after their death. When they ware alive they lived in poverty struggling to make ends meet.

There is a lot of internet 1.0 preserved by internet archive that I didn't get to experience. There are flash games that I would love to preserve and show the next generation.

We wouldn't have known how Scotts Cawthon games have looked like before he made FNAF if not for the preservation efforts.

[–] MrMakabar@slrpnk.net 2 points 6 days ago

Usually those artist did get some recognition during their life, but never got into the main stream. That changed due to the main stream changing and the people who did like the art showing it again. That is actually rather easy to do with something like the Fediverse. It just requires a download option. Especially when everybody is aware, that the content will be deleted, that would be a decent option.

Also a lot of content on social media in general is very short term. Stuff like politcal discussions are fairly useless after a few months in most cases. So that can be deleted without much care and again, if somebody wants to preserve it, they easily can just download it.

[–] Lemmchen@feddit.org 1 points 4 days ago (1 children)

This comment for example, after about a week or two most of the visibility and interaction of it will drop to zero. At that point, this comment should expire and no longer exist.

That's an incredible naive and egoistic take. Think about all the knowledge that is getting lost by applying this approach. How many times have you searched for some obscure thing and found the answer only on some five years old reddit post? That information would be lost for ever if you had your way.

[–] moon@lemmy.cafe 1 points 4 days ago

I think the massive privacy benefits outweigh things like that, which should be documented properly anyways

[–] TORFdot0@lemmy.world 6 points 1 week ago (1 children)

Is a p2p system for media with the instances just hosting magnet links too slow for fediverse purposes? To me this seems like the most resilient way to handle media in a decentralized system

[–] ayyy@sh.itjust.works 3 points 1 week ago (2 children)

If a social network is to take off, it must be accessible from mobile devices behind CGNAT (carrier grade network address translation).

[–] 56_@lemmy.ml 1 points 6 days ago

p2p from behind a CGNAT works just fine as long as a single server is accessible and can mediate connections between other peers. Most non-servers are behind some sort of NAT these days.

[–] pinkystew@reddthat.com 1 points 1 week ago (1 children)
[–] ayyy@sh.itjust.works 6 points 1 week ago

Because that’s where all the users are. The “social” aspect of a “social” network. Anyone can host a forum but it’s useless without users.

[–] glowing_hans@sopuli.xyz 0 points 4 days ago* (last edited 4 days ago)

am I the only one who would advocate for text only storage: no. This comment also gets my point? There should be text only lemmy instances which only save text and do not allow any kind of image posting or storage.

  • text is less offensive to consume and moderate than evil images or video
  • text is way more information dense and can be even compressed more! Truly the green biosphere friendly data format. I would be willing to save text only data of strangers on my hard-drive, but not images or video. Could even be valuable llm analysis training data.

Yes people could post base64 encoded images, but that is a larger technical barrier and can be detected. If image storage is really need, images should be heavily compressed (webp 90% quality loss), provided as links to external sites, and whenever possible svg / vector graphics should be preferred.

[–] NuXCOM_90Percent@lemmy.zip 126 points 1 week ago (5 children)

We totally need sustainable file hosting. Freedom!

Wait... the fuck did you just upload? Oh god. Oh god no. Do I have to call the cops on you? Oh no. Wait, does this count as possession? FUCK!!!!!!

We need someone else to handle the totally sustainable file hosting. Freedom!

[–] Kalkaline@leminal.space 47 points 1 week ago (4 children)

Yep, there needs to be moderation tools that can be quickly deployed to stop the illegal/immoral/evil stuff from spreading and taking over self-hosted servers.

[–] NuXCOM_90Percent@lemmy.zip 41 points 1 week ago (3 children)

And moderation of this kind of content almost always sounds like torture when you hear about what facebook and the like are outsourcing.

Theoretically, this is a good problem for computer vision/machine learning. But there are a LOT of false positives (I think it was Aftermath who did an article on a study of when a nipple becomes female?). And.. what ethical responsibility do you have to report on the fiftieth time that SheIsReallyAnEightThousandYearOldDragon_6969 uploaded CSAM? And how quick do you think people are going to lose faith in you and start wondering if you'll also report on the rampant piracy?

And... there are also false negatives. At which point you find out you have been hosting something truly heinous for the past few months... possibly when local law enforcement tells you.

Like a lot of things: it sounds great. But nobody in their right mind is going to host this for free. And once you start accepting money you start opening yourself up to a LOT of regulations.

load more comments (3 replies)
load more comments (3 replies)
load more comments (4 replies)
[–] hperrin@lemmy.world 48 points 1 week ago* (last edited 1 week ago) (5 children)

Ok, hear me out.

We find the users with the slowest internet and start sending them all the data. They don’t have to keep anything on disk. Then they send it all back and forth between each other. Any time a user makes a request, we just wait for one of the slow nodes to come across the data and send it out.

We use the slowest wires for all the storage. It’s fool proof.

[–] sosodev@lemmy.world 50 points 1 week ago (3 children)

Somebody actually did make this as a joke years ago haha https://github.com/yarrick/pingfs

load more comments (3 replies)
load more comments (4 replies)
load more comments
view more: next ›