- cross-posted to:
- technology@lemmy.world
- cross-posted to:
- technology@lemmy.world
cross-posted from: https://sh.itjust.works/post/61139432
I seriously can’t believe how much progress he’s made for the FOSS community. He actually might take a bite out of the big 3’s profits with this
Why is everyone so excited for Yet Another Slop Project?
As per his video, he states that he wishes that he could magically make AI go away and that he “hates everything in his project”, but since that’s not possible, he would prefer if people using AI did so using their own hardware and not some company’s cloud servers.
Make of that what you will. I think it’s pretty neat, not for me, but I’m sure someone will find it useful.
Funnily enough, you can use Vibe, Claude and Codex with local models if you configure them so. OpenCode supports it explicitly (while still retaining a cloud option because dev wants to make money)
So this isn’t anything too new, but eh, at least it’s more attention to the self-hosting capability of LLMs for code agents.
Having a large internet personality like Pewdiepie advocate for privacy, self-hosting, and open-source is always good!
Probably because they can call it YASP and give it a cute mascot
What the actual fuck is this timeline that we are living in?
I kinda loved his “you should self host to decentralize from big tech” and “run graphene and Linux to avoid data collection” content, but idk what the local ai stuff is any good for
It’s good for the same things machine learning has always been good for. Language synthesis and analysis. Selfhosting something like Paperless for document management. It actually has a very rudimentary learning engine for document classification for a long time but feeding document content to a local AI model for organization tagging is very useful.
Quite a lot, actually.
Coding, document analysis, STT, home assistant, shopping assistant, gaming, journalling, image and video generation, OCR, language translation, recipe/meal / workout planning, study/flashcard generation, email drafting, adversarial review, search engine on steroids, hardware troubleshooter, companion for elder care, music curator and DJ …
All of that without creepy ass cloud shit from Big AI.
I can go on, but “a lot” probably covers it.
EDIT: asked, answered and…down voted. Classic Lemmy anti-ai knee jerk. FWIW I work with AI in healthcare settings as well as code review for my own personal projects.
What I said are actual use cases, not a wishlist generated by Jippity. I can elaborate on any and all of them with actual real life experience.
Coding is okay, but the flagship free models usually crush whatever and to have decent memory you need hella VRAM Doc analysis is okay if they’re small, Home assistant isn’t bad but kinda overkill with an LLM when you can just set manual automations, Journalling, gaming, music, DJ’ing, elder care, imagine and video generation, are all relatively bad even on flagship models
Quick overview of searches is fine but I’ll use a free flagship model for that, in depth research tends to not be great
How many GPUs do you even need to have a usable, self-hosted AI? It looks like he has 6 on his rig. Probably each costs 2k or something. That’s not peanuts. I have a 12GB VRAM card. It probably can’t generate anything in any meaningful amount of time. Which brings me to the question: who is this for?
Regardless, impressive what he vibe-coded there.
I use an 6700 XTX and it’s working perfectly fine, depending on the model. Gemma4 takes a long time to generate answers, but the Qwen-Series is quick and starts generating answers in ~10 seconds.
What’s the quality of the answers though? And how much context can it hold? I imagine it’s only good for small, short questions, but have no concept of what is needed for that.
I’m assuming you’re using a 12b or 24b qwen model. The ones from deepseek go up to hundreds of billions of params and I can’t tell if bigger number is better or just meaningless posturing.
I’m using the 35b models.
Quality for qwen is mostly fine - sometimes it does hallucinate some shit while thinking, but it does correct itself almost every time. But the answers itself are, for the most part, precise and useful. Not what you know from the cloud models, obviously, but it’s absolutely fine for everyday use. What is actually annoying is the web search - not sure if that’s a qwen problem or a problem with open webui, but it actually takes a long time to finish the search.
I once had a situation where a model was running into an “infinite loop” while thinking, thinking the same line over and over again. And once, qwen just started outputting chinese halfway through the answer lol.
When it comes to context, I’m gonna be very honest - I don’t know. I have never hit any kind of problems or limits because of that since I’m not using AI over a long term project. I use it for small, concise cases and that’s it.
Thanks for the response. It’s interesting to read about the experience of others.
Didn’t downvote. I use AI, and not ashamed of it. I don’t write huge programs and I damn sure don’t release anything to the public mainly because, in the back of my mind, I can just see some poor chap using my code and now smoke is coming out of his server. It works for me. Usually it’s ‘write a script that does _________’ or Docker compose files. It seems pretty accurate for those uses and if I need a bash command sequence explained, it’s good for that too.
I also use AI when I master my audio tracks before I upload them. I am clinically deaf and there are some frequencies that I just can’t hear well enough to make a judgement call. It’s pretty good at that too.
My MacBook Air with 24GB of unified RAM is enough to run something simple and useful.
That’s like what, 5 or 6k?
Like 1k
Reasonable price!
I have a rx5600xt (6gb), 32gb ram, ryzen 3600. System hasn’t been updated since i built it during covid. QwenV3-vl35B is the heftiest thing I can run, it gets around 2 tokens/sec, in LM studio. It’s easier than most people seem to think.
How do you now run out of RAM? Does it offload to system RAM?
Yes, offloads into system. Oh and i forgot to mention that’s with the context set around 25k. That can vary greatly per model though, it’s taken some experimentation to figure that out.
Thank you. That’s good to know.
I think in one video it looked like 16 cards. I think he did multiple bifurcations of the pcie lanes. I think he is / was using it for protein folding as well.
That’s definitely not my level of disposable wealth/income. I can barely afford one card.

First one-click RCE is in: https://www.reddit.com/r/LocalLLaMA/comments/1ttls1y/just_found_a_1click_rce_in_pewdiepies_odysseus/ … smh …
And disclosed to the public before the project maintainer, too. This is shit from every angle.
One more harness, bro.
did he vibe code it on claude?
It sounds like it, on the homepage there’s a joke about prompting ai to build this
Acronyms, initialisms, abbreviations, contractions, and other phrases which expand to something larger, that I’ve seen in this thread:
Fewer Letters More Letters CF CloudFlare DNS Domain Name Service/System HA Home Assistant automation software ~ High Availability HTTP Hypertext Transfer Protocol, the Web HTTPS HTTP over SSL IP Internet Protocol SSL Secure Sockets Layer, for transparent encryption VPN Virtual Private Network VPS Virtual Private Server (opposed to shared hosting)
8 acronyms in this thread; the most compressed thread commented on today has 5 acronyms.
[Thread #324 for this comm, first seen 1st Jun 2026, 16:50] [FAQ] [Full list] [Contact] [Source code]
He’s done the main quest. Now he’s doing the side quests.
I’m a tiny bit confused as to what this actually is. I don’t use the Codex/ClaudeCode/Cursor stuff, but it seems like this is just an interface for connecting those services, isn’t it? It doesn’t seem like that actually protects your data at all.Can anyone help explain it a bit?
Edit: I realized I kinda glossed over all the stuff that seemed to be included in this, I more meant the start where he talked about this being privacy centric. Is he just trying to make self-hosting less painful?
Yeah essentially the other tools you had to use API keys, and none of them were FOSS, mostly paid only tools.
This lets you self host both the application interface itself (which can also be an IDE) and use a self hosted LLM
I love that guy. Remember hating him back in the days when he got popular by sitting and yelling while playing games. But damn the guy matured and put out epic content the past 10 years or so.
Honestly now that hes mostly retired his content is so chill now
Agree
Man, this is Ouroboros feedback loop.
I was finally becoming interested in Felix when he started dabbling in Linux and made some serious cool shit…Now that he’s made his own slop tool, I am losing interest in him again. It’s so fucking cursed, this slop nonsense needs to stop. I wish he would’ve stuck with ricing, making fun projects instead of open source washing corporate garbage.
Those are his fun projects. He’s not doing them for you. I honestly think its a cool project even if its not something crazy or anything. And honestly self-hosted AI projects are imo a lot better than just using claude tokens or whatever
It’s clear that this is his own project, that he is doing this for himself…It’s just sad this is what Felix is choosing to share with the community of people that follow him on YouTube. It’s bad enough that corporate idiots are peddling slop tools, but having a content creator with history and sway in the YouTube scene do it. Also, it’s an LLM not an “AI”, as AI is actually out of human reach unless humans actually do some real multidisciplinary work to make it happen. Techbros being able to conflate LLMs with AI was the worst thing to happen to the world. If only they were forced to advertise their slop tools correctly, we’d be in a different situation. 😮💨
Back in the BERT days I had a physics major friend that stuffed a bunch of Norwegian names in a file and trained a Norwegian name generator. He also made a Moby Dick sentence generator for funsies.
PewDiePie’s project is nothing different than a personal pet project like these cases. Nothing about being a YouTuber makes you an expert at machine learning. It should be treated the same way as any other pet project.
If the concern is someone with large amounts of influence causing disproportionate harm with their personal projects by name alone, at least in this particular case, I feel it’s appropriate to blame someone who trusts a YouTuber’s pet project in the first place.
fuck this nazi piece of shit and his sloppy ass slop and every one of you dipshits praising him and this garbage. “how much progress he’s made for FOSS” lol, lmao, lmfao, even.













