Someone made a GPT-like chatbot that runs locally on Raspberry Pi, and you can too

Lee Duna@lemmy.nz · 2 years ago

QuadratureSurfer@lemmy.world · 2 years ago

Direct link to the GitHub repo:
https://github.com/nickbild/local_llm_assistant?tab=readme-ov-file

It’s a small model by comparison. If you want something that’s offline and actually comparable to ChatGPT 3.5, you’ll want the Mixtral 8x7B model instead (running on a beefy machine):

https://mistral.ai/news/mixtral-of-experts/

Deceptichum@kbin.social · 2 years ago

Sick, I only need 90GB of VRAM!

QuadratureSurfer@lemmy.world · 2 years ago

I’ve got it running with a 3090 and 32GB of RAM.

There are some models that let you run with hybrid system RAM and VRAM (it will just be slower than running it exclusively with VRAM).
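For a rough sense of why the hybrid split matters, here is a back-of-the-envelope sketch. It assumes Mixtral 8x7B has about 46.7 billion total parameters (the figure from Mistral's announcement), ignores the KV cache and activations, and uses a hypothetical 22GB usable-VRAM budget for a 24GB card. The function names are made up for illustration and the numbers are ballpark, not a measurement of any particular runtime.

```python
# Rough weight-memory estimate for a model at different precisions.
# Assumption: about 46.7 billion total parameters for Mixtral 8x7B;
# KV cache and activation memory are ignored.

TOTAL_PARAMS = 46.7e9


def weight_gb(params: float, bits_per_param: float) -> float:
    """Approximate size of the weights in GB (1 GB = 1e9 bytes)."""
    return params * bits_per_param / 8 / 1e9


def split(weights_gb: float, vram_gb: float) -> tuple[float, float]:
    """Return (GB that fit in VRAM, GB that spill over to system RAM)."""
    in_vram = min(weights_gb, vram_gb)
    return in_vram, weights_gb - in_vram


if __name__ == "__main__":
    vram_budget_gb = 22.0  # hypothetical: a 24GB card minus some headroom
    for bits in (16, 8, 4):
        size = weight_gb(TOTAL_PARAMS, bits)
        in_vram, in_ram = split(size, vram_budget_gb)
        print(f"{bits:>2}-bit: {size:5.1f} GB total, "
              f"{in_vram:4.1f} GB in VRAM, {in_ram:5.1f} GB in system RAM")
```

Under those assumptions, 16-bit weights come to roughly 93GB, while a 4-bit quantization is a bit over 23GB, which is why a single 24GB card plus some system RAM can get by.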

Deceptichum@kbin.social · 2 years ago

Yeah but damn does it get slow.

I always find it interesting how text is so much slower than image generation. I can do a 1024x1024 in probably 20s, but I get like 1 word a second with text.

ferret@sh.itjust.works · 2 years ago

Languages are complex and, more importantly, much less forgiving of errors.

DarkThoughts@fedia.io · 2 years ago

Hopefully we see more specific hardware for this, like extension cards with pretty much just tensor cores and their own RAM.

Deceptichum@kbin.social · 2 years ago

I’d love to see some consumer-level AI stuff. Sadly, it all seems to be designed for server farms, and by the time it ages out into consumer prices it’s so obsolete there’s no point in getting it.

raldone01@lemmy.world · 2 years ago

Do they want consumer AI cards to exist, though?

Think about the data!

Deceptichum@kbin.social · 2 years ago

Card makers? They only want money. If there’s enough consumer-level demand, they will make them.

raldone01@lemmy.world · 2 years ago

I guess you’re right.

anticurrent@sh.itjust.works · 2 years ago

Can we have smaller, more domain-specific models that shouldn’t require more than casual hardware? Like a small model for coding, one for medicine, one for history, and so on?

fruitycoder@sh.itjust.works · 2 years ago

Check out Hugging Face! Honestly, fine-tuned models for specific domains seem very popular (if for nothing else, because training smaller models is just easier!).

DarkThoughts@fedia.io · 2 years ago

Unfortunately, the roleplaying chatbot type models are typically fairly sizeable and demanding. I’m curious how this will develop with more specific AI hardware though, like extension cards with primarily tensor cores plus their own RAM, so that you don’t have to use your GPU for that. If we can drag down the price of such hardware, then locally run models could become much more viable and mainstream.

Pantherina@feddit.de · 2 years ago

Dude, sorry to say, but roleplay is not as important as medicine or coding XD

DarkThoughts@fedia.io · 2 years ago

For me they are. I have no use for medicine or coding bots.

long_chicken_boat@sh.itjust.works · 2 years ago

But you do have a use for the very software you’re using daily, or for medical developments.

I play D&D from time to time, but saying that roleplaying is more important than medicine is just nuts.

DarkThoughts@fedia.io · 2 years ago

Not so much for the latter, but I’m pretty specifically talking about my personal use case here, lol. “Roleplaying” in this scenario isn’t really referring to actual tabletop type RPGs, btw. It’s the LLM roleplaying specific characters or personas that you then chat with in specific (or not so specific) scenarios. Although that same tech is also being experimented with for NPCs in video games. But who knows, a specifically trained model could potentially make a half-decent dungeon master too.

Kilnier@lemmy.ca · 2 years ago

There’s also a huge amount of training, medical and otherwise, that’s done through role-playing. I could definitely see medical students getting use out of learning telemedicine with LLMs that were ultimately adapted from TTRPG character generator schemas.