
Run a ChatGPT-like AI on Your Laptop Using LLaMA and Alpaca



Popular conversational models like ChatGPT, Bing, and Bard all run in the cloud, in huge datacenters. However, thanks to new language models, it is possible to run a ChatGPT or Bard alternative on your laptop. No supercomputer needed. No huge GPU needed. Just your laptop! Is it any good? Let’s find out.

PHNX the super-slim smartphone cases:
This is an affiliate link.

llama.cpp: https://github.com/ggerganov/llama.cpp
llama model download: https://github.com/shawwn/llama-dl
alpaca.cpp: https://github.com/antimatter15/alpaca.cpp
alpaca model download: https://github.com/tloen/alpaca-lora
Stanford Alpaca: https://github.com/tatsu-lab/stanford_alpaca

Twitter: https://twitter.com/garyexplains
Instagram: https://www.instagram.com/garyexplains/

#garyexplains


41 Comments

  1. I totally agree with your explanation that not everything should be stored in the cloud. I somewhat believe in ownership of stuff, including computers and software. A cloud service can be turned off tomorrow; your PC cannot.

  2. Explaining the 4-bit quantization: it's not reducing the resolution of the image, it's turning a perfectly fine 24-bit color cat into a 4-bit cat, essentially EGA graphics (16 colors, nothing else).
    But apparently this works just fine for AIs.
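The commenter's analogy can be made concrete with a short sketch of block-wise 4-bit quantization. This is a simplified illustration of the general idea, not llama.cpp's actual GGML q4 on-disk format: the block size and the symmetric mapping onto codes 0–15 around a per-block scale are illustrative assumptions.

```python
def quantize_q4(weights, block_size=32):
    """Map each block of floats to 4-bit codes (0..15) plus one float scale per block."""
    blocks = []
    for i in range(0, len(weights), block_size):
        block = weights[i:i + block_size]
        amax = max(abs(w) for w in block)       # largest magnitude in the block
        scale = amax / 7 if amax else 1.0       # map roughly [-amax, amax] onto [-7, 7]
        # shift by 8 so codes fit in an unsigned 4-bit range, then clamp to 0..15
        q = [max(0, min(15, round(w / scale) + 8)) for w in block]
        blocks.append((scale, q))
    return blocks

def dequantize_q4(blocks):
    """Reverse the mapping: each 4-bit code becomes scale * (code - 8)."""
    out = []
    for scale, q in blocks:
        out.extend(scale * (c - 8) for c in q)
    return out

weights = [0.12, -0.53, 0.31, 0.02, -0.98, 0.77, -0.05, 0.44]
restored = dequantize_q4(quantize_q4(weights, block_size=8))
```

Each restored value lands within one quantization step of the original, which is why a 7B model squeezed from 16-bit floats to roughly a quarter of the size still produces sensible text.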

  3. One thing to note about the extra text that gets generated is that it's from another conversation that your computer is having with itself based on your parameters. You can actually see a clear example of this when you run Tavern AI locally, through a terminal. What it does is, every time you submit text, it breaks the text down to weights. This is what is used to decide the subject matter and how the AI responds based on predefined characteristics. It then has a series of conversations along those lines. Then it decides on a response to give. From there, the reply it posts is generated based on specific characteristics of the character. This is an answer refinement stage, which is put into action as the response is generated.

  4. Now I feel like it's "Blade Runner" (1982) and I'm working for the Tyrell Corporation 😉 Just started the alpaca chat on my Mac mini M1 with 8GB RAM, having downloaded the trained model ggml-alpaca-7b-q4.bin from another site. Thank you for this tutorial, Gary!

  5. If your friend has come over at 5am, it's obvious you are wide awake, have let him in through the door, and are standing in your kitchen at the fridge. That is the point at which you ask the question of what to open first. The answers about opening your eyes or the door first are INCORRECT or IRRELEVANT to the context, because the context is clearly that you are deciding what food to make, and thus which items to open and serve first!

  6. Can you help me? I'm installing this with w64devkit on Windows with a 4GB model, and I get totally random answers. Do you know what might be going wrong with my version?

  7. The question about your friend coming for breakfast isn't a good one to test. The prompt already tells the model it's your friend at the door who has come for breakfast, so it infers you've already opened your eyes and the door.

  8. Yeah, absolutely right. I always expect things to run locally, or at least as locally as possible, so that they are reliable. Creativity and inspiration are not always available, so I really need a hand when I'm in need of them.

  9. 7:57 Small correction: most screens today use 8-bit color, and some advanced LED screens use 10-bit color. 32-bit color can't be reproduced on any commercial screen that I know of.

  10. If an EMP goes off in major cities in the coming years and takes down the big servers, it's exactly these home systems with AIs running on them that will keep civilisation at an advanced level. The sooner we can train our own AIs to learn and to program, the better.

  11. Repository unavailable due to DMCA takedown.
    This repository is currently disabled due to a DMCA takedown notice. We have disabled public access to the repository. The notice has been publicly posted.

    If you are the repository owner, and you believe that your repository was disabled as a result of mistake or misidentification, you have the right to file a counter notice and have the repository reinstated. Our help articles provide more details on our DMCA takedown policy and how to file a counter notice. If you have any questions about the process or the risks in filing a counter notice, we suggest that you consult with a lawyer.

  12. I have heard that they have banned ChatGPT-4 in Italy, but I think they are overreacting! It seems to me that ChatGPT has a long way to go before it will be anything more than an amusing toy. The weakness of ChatGPT, insofar as I have experimented with it, is that it can only give answers based on the information that has been fed to it. Quite an interesting example is to ask it to calculate the weight, trajectory, and fuel required to go to the moon. Calculations suggest a moonshot is impossible and that nobody has ever been to the moon. But the received information contradicts this, and ChatGPT will tell you that men did in fact go to the moon. You can give ChatGPT a headache the way Captain Kirk did in the Star Trek episode "The Changeling". In that episode, an alien probe called "Nomad" had acquired a dangerous level of power. Kirk convinces it to self-destruct, uttering the famous line, "Nomad, you are wrong! You are a mistake!" Thanks for uploading.