[-] admin@lemmit.online 3 points 11 months ago

Can't blame you for that. Personally, I still think it excels at content where communication with OP is irrelevant, like !itookapicture@lemmit.online, !todayilearned@lemmit.online or !dataisbeautiful@lemmit.online. And by far best example of this, if you look at the subscriber count, is nsfw content.

[-] admin@lemmit.online 2 points 11 months ago

Nope. That would be very hard to implement, and probably very confusing and disliked by other lemmy users.

[-] admin@lemmit.online 4 points 11 months ago

I don’t know how the karma thresholds work behind the scenes, but might I suggest for the bot to do a “top for” sort instead? Like it will only repost top content for the past 6 hours only. This will also help get more quality content as well and avoid reposting low effort/quality posts.

This is effectively already kinda how it works. For each subreddit it periodically (anywhere between every 30 minutes to every 12 hours, based on subscriber count and posts per day) requests the "hot" content feed. It then checks each post if it has at least 20 upvotes, and a 80% upvote to downvote ratio. Those numbers are configurable, but that's what they're currently set to - I believe they're a good mix between filtering out the complete garbage while still making sure it doesn't miss good content is.

55
submitted 11 months ago* (last edited 11 months ago) by admin@lemmit.online to c/lemmy@lemmy.ml

A few months ago, I launched the Lemmit instance and bot (@bot@lemmit.online). Primarily, this was to help me stay up to date with some of the content I'd leave behind on Reddi. Additionally, I wanted to give back to the community, so I made it possible for anyone to request the archiving of subreddits to the Lemmit instance.

However, this came with some unintended consequences. Notably, the most subscribed community on the instance has been !AmItheAsshole@lemmit.online. Even though it should have been obvious that there is no way to communicate with the Original Poster, given they're on Reddit.

The pushback against the bot and the instance has increased over time. A recent post, This bot is bad for Lemmy, highlighted these concerns. I've also received similar feedback from admins of major Lemmy Instances and through direct PMs.

As a response, last week I stopped accepting requests for archiving new subreddits. This weekend, I went a step further by discontinuing the archiving of a large amount of "interactive subreddits"—communities primarily centered around Q&A or communication with the Original Poster. This includes subs like !AskReddit@lemmit.online and !dating_advice@lemmit.online, as well as niche and support communities. Such discussions are better hosted on Reddit or Lemmy's equivalent spaces.

I've also adjusted the post karma thresholds to curb spam posts. While this probably won't appease everyone, it should reduce the bot's posting frequency.

Perhaps this might prompt some admins to rethink their choice to defederate from the Lemmit instance, or the banning of the bot. I'm not expecting anyone to, and won't take it personally if you don't, but I wanted to give the community this update nonetheless.

In !about@lemmit.online there's a sticky post of all the Actively archived communities on the server (including NSFW ones, since that is not public without logging in), as well as the list of communities for which archiving is now disabled.

Cheers!

1
testie mctestes (old.reddit.com)
0
submitted 1 year ago* (last edited 11 months ago) by admin@lemmit.online to c/about@lemmit.online

In the short time since this instance and bot launched, I've been seeing the same questions resurface multiple times. This is totally understandable, since the concept of a Fediverse is still new to most (myself included), and this server is not like the others.

Q: What is Lemmit?

A: Lemmit is a Lemmy instance specifically designed for archiving Reddit content. Users can request new subreddits to be included in the archiving process by posting in the !requests@lemmit.online community. It is powered by an open source python bot, which periodically checks the request list, adds new requests to the queue, and continuously monitors the Hot feed of those subs for new posts to cross-post here.

Q: Does it synchronize comments?

A: No, that would be impossible. Considering there are thousands of posts already on Lemmit, many of them having at least several hundred comments on Reddit, often buried in deep layers, it simply wouldn't be feasible to index those for more than a few posts, let alone keep them up to date.

Unfortunately, this means that archiving certain subreddits, such as Ask Historians/Men/Women/Hyperintelligentshadesofthecolourblue-type subs, is going to be rather pointless.

Q: Can it send comments back to Reddit?

A: No, it cannot. The purpose is to help bootstrap the Lemmy platform, not to serve as a bridge between the two networks. Also, see the answer about synchronizing comments.

Q: Can I request any subreddit?

A: ~~Technically, yes. However, as the list of subs grows, the time it takes to update all of them will also increase. I do not have strict guidelines in place for this, so I'm relying on your common sense (hoooo boy). At some point, I will probably have to either stop accepting new requests or disable scraping for very low-traffic communities.~~ Nope. Automatic requests for new subreddits are closed. You can send me a PM if you think a specific subreddit would be a valuable addition to the Lemmyverse, but you'll have to make a strong case.

Q: Does this use the API? Will it keep working after July 1st?

A: Nope, it uses a combination of the public feed and scraping old.reddit.com. So, as long as those are still available, it will continue working. And even if they close those sources, there will probably be new ways to achieve the same effect. "Content, eh, finds a way."

Q: This is spam, can you stop?

A: First of all, I apologise for the inconvenience. All you have to do is block @bot@lemmit.online, and none of its posts will ever show up on your instance. If you you don't want anyone else on your server to be exposed to this bot/instance, you should convince your admin to defederate from lemmit.online. Since there are no other users on here, there will be no harm done.

Obviously I could stop, because running this server and software is only ever going to cost me time and money. But for the reasons listed above, I still think this server is a useful addition to the lemmyverse at this time. But I'm looking forward to the day where I can turn the bot off because it's no longer needed.

Q: What started this?

A: Okay, nobody asked this, but I'm going to tell you anyway. After Reddit made it clear that they are effectively killing third-party apps and implementing plenty of other anti-end user decisions, I realized that I would either have to accept not being able to access my time-wasting content or have to do so in a rather uncomfortable way (either through the official app or old.reddit.com for as long as they'll allow it to exist).

Being a stubborn developer, naturally, I chose option C: Have my own Reddit. With blackjack, and hookers. This way, I would still be able to access my beloved content without being beholden to Reddit's mood swings and abusive relationship tendencies.

Besides that, I also know that Content is King. So I'm order to counter the network effect (No users because no content, No content because no users), I figured it would be better to have some inorganic content to bootstrap the adoption of Lemmy.

Q: Are NSFW subreddits allowed?

A: Absolutely. Like I said: Blackjack and hookers.

Q: My request isn't picked up by the bot!

A: That isn't a question. ~~But yeah, the process isn't flawless yet. I'm trying to iron out all the bugs as I encounter them. In the meantime, feel free to re-request the subreddit by making a second post. No harm done.~~ New subreddit requests are closed for the time being.

Q: No new posts are showing up at all on Lemmit

A: If no posts are appearing on the Lemmit Frontpage (sorted by NEW), it's possible that the bot has crashed or is stuck on something. Since no software is flawless, this sometimes happens. I usually fix this as soon as I'm aware, and I'm happy to say that these kinds of fatal errors are becoming less and less frequent. However, they may still occur, and as a human with needs of sleep and other responsibilities, I'm not always able to fix them immediately.

Q: Posts aren't showing up on my instance, what's up?

A: Due to the spammy nature of the bot, some server admins choose to block this server, and that is completely understandable. So first of all, make sure to check the instances link in the footer of your home server. If Lemmit is the Blocked Instances list, you're out of luck.

When you have verified that Lemmit is not blocked on your instance, try unsubscribing, waiting a little, and then re-subscribing. That tends to fix things.

[-] admin@lemmit.online 1 points 1 year ago

This doesn't have anything to do with lemmit.online?

[-] admin@lemmit.online 1 points 1 year ago

Why so? It shows up here, and on beehaw?

[-] admin@lemmit.online 0 points 1 year ago* (last edited 1 year ago)

Thanks, I was gonna do this tonight, but are you sure about that format? (I'm still relative newbie to the Fediverse)

Just to test: /c/about.

edit: Looks good, I'll merge it :)

edit2: Deployed :)

[-] admin@lemmit.online 1 points 1 year ago

I think @tubbadu@lemmy.one wrote something to that effect (I'm still a mess with making proper links on here :/)

And I also found something else that was written in java (not javascript).

The downside from using the RSS feed is that it doesn't contain the whole body, which my scraper does fetch.

[-] admin@lemmit.online 1 points 1 year ago

I'm absolutely with you on that on. If anything, part of the reason I wanted this was to have some fresh content that I knew I was going to miss once after July 1st. Stuff like !itookapicture@lemmit.online for example. There's a bit too much reddit-circlejerk going on right now, even here imho.

[-] admin@lemmit.online 2 points 1 year ago

If that's what happens, that's what happens. ¯\_(ツ)_/¯

I'm just here to offer a service for people who Do like it.

[-] admin@lemmit.online 1 points 1 year ago

Interesting idea! I have some thoughts if you’re open to feedback:

Always!

Have you considered moderation? These mirrored communities on lemmit.online will still be getting comments from all over the federated network, and if you’re the only user and sole moderator of every community, then it might get quite overwhelming!

I have, and I hope it won't be a problem ;) I'm a software engineer, as mentioned above, have little interest in managing people outside of work :P If anyone wants to become a moderator, they're free to request it.

A small VPS might not be able to handle that

We'll see how well it does. I don't mind spending a little money on this (few dozen €/$ per month), if it takes off. In the end though, it's more meant as a kickstart for Lemmy content than anything else.

How are you planning to deal with API limits from Reddit?

HA! By not using the API. For starters, because someone-who-isnt-me would like to browse NSFW content. I do a bit of client-side throttling between requests, which I hope will keep me under the radar. But it's mostly based on rss for the subreddit overview, and scraping for the individual posts.

In the end... we'll just have to see how it goes.

[-] admin@lemmit.online 1 points 1 year ago

Yeah, the bot only operates on its own instance. So you can have !bestof@lemmit.online and !bestof@lemmy.ml.

[-] admin@lemmit.online 2 points 1 year ago

Yups. It's all done by one bot though, so you'll just have to block that to get rid of them.

7
submitted 1 year ago by admin@lemmit.online to c/lemmy@lemmy.ml

I have created some software that is capable of synchronising posts from Reddit to Lemmy. It's still a little rough around the edges, but it works as a such:

People can request new subreddits to be mirrored on !requests@lemmit.online. A bot (open source) will monitor the threads there, and if it finds a new request for a subreddit, it will make a new community on the Lemmit server, and add it to its monitored list. It will then make periodic checks to see if any new posts (it doesn't copy any comments) have been posted on reddit, and copy those over.

Users can then subscribe to those communities from their own lemmy instance, and from there federation will pick it up. Or at least, that's the theory. At the moment, federation is not working awesomely, and that is where my lack of fediverse knowledge comes in. Maybe it needs more time, or something is not so properly - I don't know.

Furthermore: registrations on this server are closed. The point of this service is not to become a community on its own, but to deliver, ehh, "original" content to all the rest of the Fediverse while it's going through a ramp-up phase. Besides, the instance is running on a pretty small vps, and I rather have this thing manage itself. There is a !about@lemmit.online community for further questions about the project itself though, in case people want to discuss it further.

So ehm... Let me know what you think :)

view more: next ›

admin

joined 1 year ago
MODERATOR OF