cross-posted from: https://sh.itjust.works/post/61139432

I seriously can’t believe how much progress he’s made for the FOSS community. He actually might take a bite out of the big 3’s profits with this

    • sbeak@sopuli.xyz
      link
      fedilink
      English
      arrow-up
      25
      ·
      11 days ago

      As per his video, he states that he wishes that he could magically make AI go away and that he “hates everything in his project”, but since that’s not possible, he would prefer if people using AI did so using their own hardware and not some company’s cloud servers.

      Make of that what you will. I think it’s pretty neat, not for me, but I’m sure someone will find it useful.

      • boonhet@sopuli.xyz
        link
        fedilink
        English
        arrow-up
        2
        ·
        10 days ago

        Funnily enough, you can use Vibe, Claude and Codex with local models if you configure them so. OpenCode supports it explicitly (while still retaining a cloud option because dev wants to make money)

        So this isn’t anything too new, but eh, at least it’s more attention to the self-hosting capability of LLMs for code agents.

        • sbeak@sopuli.xyz
          link
          fedilink
          English
          arrow-up
          4
          ·
          10 days ago

          Having a large internet personality like Pewdiepie advocate for privacy, self-hosting, and open-source is always good!

    • appauled@sh.itjust.worksOP
      link
      fedilink
      English
      arrow-up
      22
      ·
      11 days ago

      I kinda loved his “you should self host to decentralize from big tech” and “run graphene and Linux to avoid data collection” content, but idk what the local ai stuff is any good for

      • Encrypt-Keeper@lemmy.world
        link
        fedilink
        English
        arrow-up
        20
        arrow-down
        1
        ·
        11 days ago

        It’s good for the same things machine learning has always been good for. Language synthesis and analysis. Selfhosting something like Paperless for document management. It actually has a very rudimentary learning engine for document classification for a long time but feeding document content to a local AI model for organization tagging is very useful.

      • SuspiciousCarrot78@aussie.zone
        link
        fedilink
        English
        arrow-up
        4
        arrow-down
        2
        ·
        edit-2
        9 days ago

        Quite a lot, actually.

        Coding, document analysis, STT, home assistant, shopping assistant, gaming, journalling, image and video generation, OCR, language translation, recipe/meal / workout planning, study/flashcard generation, email drafting, adversarial review, search engine on steroids, hardware troubleshooter, companion for elder care, music curator and DJ …

        All of that without creepy ass cloud shit from Big AI.

        I can go on, but “a lot” probably covers it.

        EDIT: asked, answered and…down voted. Classic Lemmy anti-ai knee jerk. FWIW I work with AI in healthcare settings as well as code review for my own personal projects.

        What I said are actual use cases, not a wishlist generated by Jippity. I can elaborate on any and all of them with actual real life experience.

        • appauled@sh.itjust.worksOP
          link
          fedilink
          English
          arrow-up
          1
          ·
          7 days ago

          Coding is okay, but the flagship free models usually crush whatever and to have decent memory you need hella VRAM Doc analysis is okay if they’re small, Home assistant isn’t bad but kinda overkill with an LLM when you can just set manual automations, Journalling, gaming, music, DJ’ing, elder care, imagine and video generation, are all relatively bad even on flagship models

          Quick overview of searches is fine but I’ll use a free flagship model for that, in depth research tends to not be great

  • onlinepersona@programming.dev
    link
    fedilink
    English
    arrow-up
    26
    arrow-down
    1
    ·
    12 days ago

    How many GPUs do you even need to have a usable, self-hosted AI? It looks like he has 6 on his rig. Probably each costs 2k or something. That’s not peanuts. I have a 12GB VRAM card. It probably can’t generate anything in any meaningful amount of time. Which brings me to the question: who is this for?

    Regardless, impressive what he vibe-coded there.

    • realitaetsverlust@piefed.zip
      link
      fedilink
      English
      arrow-up
      8
      ·
      12 days ago

      I use an 6700 XTX and it’s working perfectly fine, depending on the model. Gemma4 takes a long time to generate answers, but the Qwen-Series is quick and starts generating answers in ~10 seconds.

      • onlinepersona@programming.dev
        link
        fedilink
        English
        arrow-up
        3
        ·
        11 days ago

        What’s the quality of the answers though? And how much context can it hold? I imagine it’s only good for small, short questions, but have no concept of what is needed for that.

        I’m assuming you’re using a 12b or 24b qwen model. The ones from deepseek go up to hundreds of billions of params and I can’t tell if bigger number is better or just meaningless posturing.

        • realitaetsverlust@piefed.zip
          link
          fedilink
          English
          arrow-up
          4
          ·
          11 days ago

          I’m using the 35b models.

          Quality for qwen is mostly fine - sometimes it does hallucinate some shit while thinking, but it does correct itself almost every time. But the answers itself are, for the most part, precise and useful. Not what you know from the cloud models, obviously, but it’s absolutely fine for everyday use. What is actually annoying is the web search - not sure if that’s a qwen problem or a problem with open webui, but it actually takes a long time to finish the search.

          I once had a situation where a model was running into an “infinite loop” while thinking, thinking the same line over and over again. And once, qwen just started outputting chinese halfway through the answer lol.

          When it comes to context, I’m gonna be very honest - I don’t know. I have never hit any kind of problems or limits because of that since I’m not using AI over a long term project. I use it for small, concise cases and that’s it.

          • irmadlad@lemmy.world
            link
            fedilink
            English
            arrow-up
            2
            ·
            11 days ago

            Didn’t downvote. I use AI, and not ashamed of it. I don’t write huge programs and I damn sure don’t release anything to the public mainly because, in the back of my mind, I can just see some poor chap using my code and now smoke is coming out of his server. It works for me. Usually it’s ‘write a script that does _________’ or Docker compose files. It seems pretty accurate for those uses and if I need a bash command sequence explained, it’s good for that too.

            I also use AI when I master my audio tracks before I upload them. I am clinically deaf and there are some frequencies that I just can’t hear well enough to make a judgement call. It’s pretty good at that too.

    • new_world_odor@lemmy.world
      link
      fedilink
      English
      arrow-up
      4
      ·
      12 days ago

      I have a rx5600xt (6gb), 32gb ram, ryzen 3600. System hasn’t been updated since i built it during covid. QwenV3-vl35B is the heftiest thing I can run, it gets around 2 tokens/sec, in LM studio. It’s easier than most people seem to think.

    • Dultas@lemmy.world
      link
      fedilink
      English
      arrow-up
      3
      ·
      11 days ago

      I think in one video it looked like 16 cards. I think he did multiple bifurcations of the pcie lanes. I think he is / was using it for protein folding as well.

    • frongt@lemmy.zip
      link
      fedilink
      English
      arrow-up
      17
      arrow-down
      3
      ·
      12 days ago

      And disclosed to the public before the project maintainer, too. This is shit from every angle.

  • Decronym@lemmy.decronym.xyzB
    link
    fedilink
    English
    arrow-up
    11
    arrow-down
    2
    ·
    edit-2
    9 days ago

    Acronyms, initialisms, abbreviations, contractions, and other phrases which expand to something larger, that I’ve seen in this thread:

    Fewer Letters More Letters
    CF CloudFlare
    DNS Domain Name Service/System
    HA Home Assistant automation software
    ~ High Availability
    HTTP Hypertext Transfer Protocol, the Web
    HTTPS HTTP over SSL
    IP Internet Protocol
    SSL Secure Sockets Layer, for transparent encryption
    VPN Virtual Private Network
    VPS Virtual Private Server (opposed to shared hosting)

    8 acronyms in this thread; the most compressed thread commented on today has 5 acronyms.

    [Thread #324 for this comm, first seen 1st Jun 2026, 16:50] [FAQ] [Full list] [Contact] [Source code]

  • Matt@lemmy.ml
    link
    fedilink
    English
    arrow-up
    9
    arrow-down
    1
    ·
    11 days ago

    He’s done the main quest. Now he’s doing the side quests.

  • MissesAutumnRains@lemmy.blahaj.zone
    link
    fedilink
    English
    arrow-up
    6
    ·
    edit-2
    12 days ago

    I’m a tiny bit confused as to what this actually is. I don’t use the Codex/ClaudeCode/Cursor stuff, but it seems like this is just an interface for connecting those services, isn’t it? It doesn’t seem like that actually protects your data at all.

    Can anyone help explain it a bit?

    Edit: I realized I kinda glossed over all the stuff that seemed to be included in this, I more meant the start where he talked about this being privacy centric. Is he just trying to make self-hosting less painful?

    • appauled@sh.itjust.worksOP
      link
      fedilink
      English
      arrow-up
      4
      ·
      11 days ago

      Yeah essentially the other tools you had to use API keys, and none of them were FOSS, mostly paid only tools.

      This lets you self host both the application interface itself (which can also be an IDE) and use a self hosted LLM

  • somethingDotExe@lemmy.world
    link
    fedilink
    English
    arrow-up
    14
    arrow-down
    8
    ·
    12 days ago

    I love that guy. Remember hating him back in the days when he got popular by sitting and yelling while playing games. But damn the guy matured and put out epic content the past 10 years or so.

  • A Sharky Anthro@fedia.io
    link
    fedilink
    arrow-up
    3
    arrow-down
    2
    ·
    12 days ago

    I was finally becoming interested in Felix when he started dabbling in Linux and made some serious cool shit…Now that he’s made his own slop tool, I am losing interest in him again. It’s so fucking cursed, this slop nonsense needs to stop. I wish he would’ve stuck with ricing, making fun projects instead of open source washing corporate garbage.

    • timestatic@feddit.org
      link
      fedilink
      English
      arrow-up
      6
      arrow-down
      2
      ·
      12 days ago

      Those are his fun projects. He’s not doing them for you. I honestly think its a cool project even if its not something crazy or anything. And honestly self-hosted AI projects are imo a lot better than just using claude tokens or whatever

      • A Sharky Anthro@fedia.io
        link
        fedilink
        arrow-up
        2
        ·
        12 days ago

        It’s clear that this is his own project, that he is doing this for himself…It’s just sad this is what Felix is choosing to share with the community of people that follow him on YouTube. It’s bad enough that corporate idiots are peddling slop tools, but having a content creator with history and sway in the YouTube scene do it. Also, it’s an LLM not an “AI”, as AI is actually out of human reach unless humans actually do some real multidisciplinary work to make it happen. Techbros being able to conflate LLMs with AI was the worst thing to happen to the world. If only they were forced to advertise their slop tools correctly, we’d be in a different situation. 😮‍💨

        • StarDreamer@lemmy.blahaj.zone
          link
          fedilink
          English
          arrow-up
          1
          ·
          12 days ago

          Back in the BERT days I had a physics major friend that stuffed a bunch of Norwegian names in a file and trained a Norwegian name generator. He also made a Moby Dick sentence generator for funsies.

          PewDiePie’s project is nothing different than a personal pet project like these cases. Nothing about being a YouTuber makes you an expert at machine learning. It should be treated the same way as any other pet project.

          If the concern is someone with large amounts of influence causing disproportionate harm with their personal projects by name alone, at least in this particular case, I feel it’s appropriate to blame someone who trusts a YouTuber’s pet project in the first place.

  • nublug@piefed.blahaj.zone
    link
    fedilink
    English
    arrow-up
    13
    arrow-down
    26
    ·
    11 days ago

    fuck this nazi piece of shit and his sloppy ass slop and every one of you dipshits praising him and this garbage. “how much progress he’s made for FOSS” lol, lmao, lmfao, even.