Heyas, I'm wondering if there’s an open-source piece of software, or the like, that could scrape media platforms for a specific topic. Platforms like YT, X, Lemmy, news media, etc., perhaps using RSS? Ideally it's a program I can host on my server that only I have access to, via webpage, CLI, whatever…
Thanks for any info…
FreshRSS has been working great for me! It can even do web scraping if you need it.
Everyone is suggesting readers. I think you are looking for something like https://docs.rsshub.app. It’s capable of generating RSS feeds from pretty much everything.
This is it. Exactly what I was looking for. Thanks much!
You might also be interested in https://wiki.archlinux.org/title/Web_feed#Obtaining_web_feeds
I use Miniflux. It’s a lightweight RSS reader.
And they just added Omnivore integration, which I’m so excited for.
Oh, this looks nice! I need to try this!
I use rss-bridge for scraping sites that don’t offer RSS feeds: https://rss-bridge.github.io/rss-bridge/index.html
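For anyone curious what a bridge like this boils down to, here is a rough sketch of the general idea: take items you scraped yourself and emit an RSS 2.0 document a reader can subscribe to. This is not how rss-bridge or RSSHub are implemented; `build_rss` and the example.org items are hypothetical names made up for illustration, and the scraping step itself is left out.

```python
import xml.etree.ElementTree as ET


def build_rss(title, link, description, items):
    """Build a minimal RSS 2.0 document from a list of {'title', 'link'} dicts."""
    rss = ET.Element("rss", version="2.0")
    channel = ET.SubElement(rss, "channel")
    # title, link and description are the required channel elements in RSS 2.0
    ET.SubElement(channel, "title").text = title
    ET.SubElement(channel, "link").text = link
    ET.SubElement(channel, "description").text = description
    for it in items:
        item = ET.SubElement(channel, "item")
        ET.SubElement(item, "title").text = it["title"]
        ET.SubElement(item, "link").text = it["link"]
        # Assumption: the link doubles as the unique id for each item
        ET.SubElement(item, "guid").text = it["link"]
    return ET.tostring(rss, encoding="unicode")


# Hypothetical scraped items; a real bridge would extract these from a page
scraped = [
    {"title": "First post", "link": "https://example.org/posts/1"},
    {"title": "Second post", "link": "https://example.org/posts/2"},
]
feed_xml = build_rss("Example site", "https://example.org", "Generated feed", scraped)
```

Serve `feed_xml` from any URL your reader can reach and it can be subscribed to like a normal feed.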
I use tt-rss.
Check out https://awesome-selfhosted.net/tags/feed-readers.html
I would recommend Miniflux, a “minimalist and opinionated feed reader”. It is great on mobile and desktop, and dead simple to set up and use.
New Lemmy Post: Self Hosting an RSS feed for news/media/etc? (https://lemmy.world/post/9805996)
Tagging: #SelfHosted
(Replying in the OP of this thread (NOT THIS BOT!) will appear as a comment in the lemmy discussion.)
I am a FOSS bot. Check my README: https://github.com/db0/lemmy-tagginator/blob/main/README.md
YouTube has RSS feeds you can access without scraping, but they’re per channel, so if you follow a lot of channels you’ll be following a lot of RSS feeds.
Lemmy also has RSS feeds for each community.
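In case it helps, here is a small sketch of how those feed URLs can be put together. The URL patterns are as I understand them at the time of writing and could change; the helper names and the channel ID `UC123` are hypothetical placeholders, not real values.

```python
from urllib.parse import urlencode


def youtube_channel_feed(channel_id: str) -> str:
    """Feed URL for one YouTube channel (one feed per channel)."""
    return "https://www.youtube.com/feeds/videos.xml?" + urlencode(
        {"channel_id": channel_id}
    )


def lemmy_community_feed(instance: str, community: str, sort: str = "New") -> str:
    """Feed URL for one Lemmy community on a given instance."""
    return f"https://{instance}/feeds/c/{community}.xml?" + urlencode({"sort": sort})


# Placeholder channel ID; substitute the real ID of a channel you follow
yt_url = youtube_channel_feed("UC123")
lemmy_url = lemmy_community_feed("lemmy.world", "selfhosted")
```

Paste the resulting URLs into whichever reader you end up using.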
Are you looking for a reader instead? A reader aggregates the feeds and displays them. Usually it keeps track of which items you’ve already read.
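To make the "aggregate, filter, and remember what you've read" part concrete, here is a bare-bones sketch using only the Python standard library. It is nowhere near a real reader like the ones mentioned in this thread: `parse_items`, `unread_matching`, and the sample feed are hypothetical, the keyword match only looks at titles, and the seen-set lives in memory rather than in a database.

```python
import xml.etree.ElementTree as ET

ATOM = "{http://www.w3.org/2005/Atom}"

# Tiny made-up RSS 2.0 feed so the sketch runs without network access
SAMPLE_RSS = """<rss version="2.0"><channel>
  <title>Sample</title><link>https://example.org</link><description>d</description>
  <item><title>Self-hosting a feed reader</title><link>https://example.org/a</link><guid>a</guid></item>
  <item><title>Unrelated news</title><link>https://example.org/b</link><guid>b</guid></item>
</channel></rss>"""


def parse_items(xml_text):
    """Return a list of {'id', 'title', 'link'} dicts from RSS 2.0 or Atom text."""
    root = ET.fromstring(xml_text)
    items = []
    for item in root.iter("item"):  # RSS 2.0 entries
        link = item.findtext("link", default="")
        items.append({
            "id": item.findtext("guid") or link,
            "title": item.findtext("title", default=""),
            "link": link,
        })
    for entry in root.iter(ATOM + "entry"):  # Atom entries
        link_el = entry.find(ATOM + "link")
        link = link_el.get("href", "") if link_el is not None else ""
        items.append({
            "id": entry.findtext(ATOM + "id") or link,
            "title": entry.findtext(ATOM + "title", default=""),
            "link": link,
        })
    return items


def unread_matching(items, keyword, seen):
    """Return unseen items whose title contains keyword; mark them as seen."""
    kw = keyword.lower()
    fresh = [it for it in items if it["id"] not in seen and kw in it["title"].lower()]
    seen.update(it["id"] for it in fresh)
    return fresh


seen_ids = set()
first_pass = unread_matching(parse_items(SAMPLE_RSS), "self-host", seen_ids)
second_pass = unread_matching(parse_items(SAMPLE_RSS), "self-host", seen_ids)
```

The second pass comes back empty because the matching item was already marked as seen, which is the bookkeeping a reader does for you.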