• adarza@piefed.ca
    link
    fedilink
    English
    arrow-up
    25
    ·
    21 days ago

    so these crazy prices i hear about being implemented (like at github) should actually be at least 10x higher?

  • some_guy@lemmy.sdf.org
    link
    fedilink
    English
    arrow-up
    20
    ·
    21 days ago

    It’s gonna come crashing down pretty soon. It’s gonna hurt all of us. It won’t hurt the people responsible nearly enough.

      • jj4211@lemmy.world
        link
        fedilink
        English
        arrow-up
        3
        ·
        20 days ago

        The thing is this really depends on the speed of some financial events, not some technical failing.

        Notably, if OpenAI has to cancel any of their commitments to buy hardware because they find they have neither the money nor can secure even more debt to cover, that event would potentially cause the bubble to pop, even for hypothetical companies that may have been more responsible and might have a viable business approach. Those commitments are coming up, and a lot of analysis struggles to see how they will fund those commitments.

        The thing with this bubble is that the investors don’t get the nuance and will flee at signs of trouble in any of OpenAI, Anthropic, or a handful of others, and Altman’s leadership has made trouble at OpenAI very likely, but the investors don’t believe it and won’t believe it’s unique to OpenAI, even if it would be.

      • some_guy@lemmy.sdf.org
        link
        fedilink
        English
        arrow-up
        1
        ·
        16 days ago

        Only because the hype has lasted longer than expected. Now that IPOs have been filed, the AI companies (Anthropic, OpenAI) released statements about slowing down to protect us. They’re setting the stage for lower growth. But I think you should invest every penny you have into “SpaceXMegaTwitterSuperCarAI”.

    • M0oP0o@mander.xyz
      link
      fedilink
      English
      arrow-up
      5
      arrow-down
      1
      ·
      20 days ago

      $1000 I would guess. They are just burning money at this point.

  • mfed1122@discuss.tchncs.de
    link
    fedilink
    English
    arrow-up
    14
    ·
    20 days ago

    I mean, this is no different than Walmart making prices low until other businesses die out and then raising them.

    It is no different than police shoving all the homeless people and drug addicts into one area of town to crash the property prices, and then evicting them once developers buy everything for cheap.

    They’re purposely operating at a loss in the expectation that they can get ingrained into a ton of workflows, and then gouge everyone absolutely to death while also worsening the quality of the service to make it cheaper for them to run.

    If it weren’t so horrible for the environment, I’d kind of like it, because all the dumbass executives that are signing up for this are going to get exactly what they deserve. You’d think they’d recognize a scheme when they see one.

  • nullspace@lemmy.world
    link
    fedilink
    English
    arrow-up
    15
    arrow-down
    3
    ·
    20 days ago

    I can’t imagine paying for AI when the open source tools have made it so easy to set up a model locally.

    • potustheplant@feddit.nl
      link
      fedilink
      English
      arrow-up
      13
      arrow-down
      5
      ·
      20 days ago

      Don’t be daft. The vast majority of people don’t have the knowledge or resources to set that up locally.

      • nullspace@lemmy.world
        link
        fedilink
        English
        arrow-up
        3
        ·
        19 days ago

        You’re right if we’re talking about the entire population of Earth. With these local models though, other people have already done all the hard stuff. Anyone with an RTX card and just a minimum level of patience can get going.

        • potustheplant@feddit.nl
          link
          fedilink
          English
          arrow-up
          1
          arrow-down
          1
          ·
          19 days ago

          Minimum for local models is 12gb imo. There are several “rtx” cards that have 8gb. Also, why nvidia? AMD works well too. My previous point stands, still. If you don’t already have the hw, buying a pc today is very expensive. I don’t know if you go out much but it ain’t pretty out there. People arrn’t precisely swimming in cash.

          Also, patience isn’t the only requirement. Keep in mind that some people struggle to even install a program.

    • ranzispa@mander.xyz
      link
      fedilink
      English
      arrow-up
      8
      arrow-down
      3
      ·
      20 days ago

      Easy to set up, but still needs a 15k $ graphics card and electricity bill. The price you pay openai/anthropic is much cheaper than that for that quality of model.

      Sure, you can setup a small model on a consumer graphics card, but the output will be considerably worse and the processing speed considerably lower.

      For 240€/year you got a subscription to anthropic which will happily ingest a whole repository and process it in about one minute. No matter what latest model GPU you installed on your computer, you won’t be able to do that.

      Sure, this guy was able to run a 26B model on an old CPU: https://point.free/blog/gemma-4-on-a-2016-xeon/

      But that was not easy at all and the speed you get is definitely not the same as the one provided for a very cheap price.

      • zbyte64@awful.systems
        link
        fedilink
        English
        arrow-up
        2
        ·
        19 days ago

        If you were paying the real price it would be 2 grand a year though. And in 5 years that 15k graphics card will be $200 and sip on electricity by comparison.

        • ranzispa@mander.xyz
          link
          fedilink
          English
          arrow-up
          1
          ·
          19 days ago

          A100 is 6 years old and is now sold at over 10k $. If you were paying a higher price it could be cheaper to buy the card, since the prices are low that is not the case.

        • kuerbiskernoel@feddit.org
          link
          fedilink
          English
          arrow-up
          1
          ·
          19 days ago

          Currently nearly 5 year old used graphics cards are being sold for their initial price. Not sure how much they’ll get cheaper…

      • CovfefeKills@lemmy.world
        link
        fedilink
        English
        arrow-up
        1
        ·
        19 days ago

        There is a middle ground. Crypto farmers have transitioned into running AI workloads for money. There are things sort of like folding@home but you can let people use your GPU and you earn tokens which are used to buy compute or sold to people who want to buy compute on the network. So you can setup a bigass open source model for private on demand use it’s still not cheap but a lot closer to reality for a lot of people than a 15k initial purchase.

  • AdolfSchmitler@lemmy.world
    link
    fedilink
    English
    arrow-up
    11
    ·
    20 days ago

    Trust me bro we’re so close to profitability bro, just need this IPO to secure funding one last time bro then we’ll be profitable bro I swear.

  • Cornpop@lemmy.world
    link
    fedilink
    English
    arrow-up
    9
    ·
    20 days ago

    What is the actual “cost” after they buy the hardware, is that $1000 really pure power usage cost?

    • HereIAm@lemmy.world
      link
      fedilink
      English
      arrow-up
      4
      ·
      20 days ago

      I’m curious as well. My knowledge is probably quite outdated, but from what I understood the training part is what’s expensive and then querying the model is pretty cheap. Is it still true (or was it ever) that the generated answers on search engines are cheaper to generate than the actual search results?

      • CheeseNoodle@lemmy.world
        link
        fedilink
        English
        arrow-up
        8
        ·
        20 days ago

        I find that hard to believe, I recently had to uninstall co-pilot after it weaseled its way into my search bar. Its not an exageration to say that my PC literally ran cyberpunk 2077 with pathtracting better than it ran the fucking windows search bar with co-pilot.

      • Shteou@lemmy.world
        link
        fedilink
        English
        arrow-up
        3
        ·
        20 days ago

        It is sorta. Training is orders of magnitudes more intensive than inference, but we infer billions of times within a model generation.

    • rumba@lemmy.zip
      link
      fedilink
      English
      arrow-up
      3
      ·
      20 days ago

      that’s the $84,000 question. They’re filling datacenters with the fastest possible equipment and need it to be 10x faster, That hardware is dinosaur fodder a year after they install it.