
Local AI Just Got Easy (and Cheap)



This Google TPU makes local AI simple…

Full Blog Tutorial:

Product Links (some are affiliate links)
– Coral USB TPU 👉 https://amzn.to/3vUkUJH
– Zima Board 👉 https://amzn.to/42dYHCr
– Raspberry Pi 5 👉 https://amzn.to/3HBSBSI
– Coral PCIe TPU 👉 https://amzn.to/3vLKfVW
– M.2 Adapter 👉 https://amzn.to/3OnchgZ

Explore the Coral AI Mini PCIe accelerator by Google, a game-changer for home automation and DIY projects, in my latest video, where I integrate this chip with the ZimaBoard and ZimaBlade for superior performance and cost-effectiveness. Discover how this setup outperforms others, like the Raspberry Pi 5, in speed and thermal efficiency, and follow my journey from troubleshooting software issues to successfully running Frigate, an advanced home-lab computer vision system. Learn how this affordable, under-$100 setup can revolutionize your home tech projects!

Monitor your security cameras with locally processed AI
Frigate is an open source NVR built around real-time AI object detection. All processing is performed locally on your own hardware, and your camera feeds never leave your home.
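
If you're curious what Frigate is asking the Coral to do on every frame, here's a minimal sketch of running an Edge TPU-compiled TFLite detection model through tflite_runtime. The model path and the dummy frame are placeholders, and Frigate's real pipeline (decoding, motion detection, tracking) is much more involved.

```python
# Minimal sketch: push one frame through an Edge TPU-compiled TFLite model.
# Assumes libedgetpu and tflite_runtime are installed; the model path below
# is a placeholder, not a file shipped with Frigate.
import numpy as np
from tflite_runtime.interpreter import Interpreter, load_delegate

interpreter = Interpreter(
    model_path="ssd_mobilenet_edgetpu.tflite",              # hypothetical path
    experimental_delegates=[load_delegate("libedgetpu.so.1")],
)
interpreter.allocate_tensors()

inp = interpreter.get_input_details()[0]
h, w = inp["shape"][1], inp["shape"][2]

# Stand-in for a decoded camera frame (Frigate feeds real RTSP frames here).
frame = np.zeros((1, h, w, 3), dtype=np.uint8)

interpreter.set_tensor(inp["index"], frame)
interpreter.invoke()

# SSD-style detection models typically expose boxes, classes, and scores.
outputs = [interpreter.get_tensor(o["index"]) for o in interpreter.get_output_details()]
print([o.shape for o in outputs])
```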

https://coral.ai/products/m2-accelerator-ae
https://frigate.video/
https://mqtt.org/
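
Frigate can publish its detection events to an MQTT broker, which is why mqtt.org is linked above. As a rough sketch, assuming the paho-mqtt client, a local Mosquitto broker, and Frigate's default frigate/events topic (all assumptions about your setup), listening for events could look like this:

```python
# Rough sketch: watch Frigate's MQTT event stream with paho-mqtt.
# Broker address, topic, and payload field names are assumptions based on
# Frigate's defaults; adjust them to match your own install.
import json
import paho.mqtt.client as mqtt

def on_connect(client, userdata, flags, rc):
    client.subscribe("frigate/events")

def on_message(client, userdata, msg):
    event = json.loads(msg.payload)
    # Frigate events carry "before"/"after" snapshots of the tracked object.
    after = event.get("after", {})
    print(after.get("camera"), after.get("label"), after.get("top_score"))

client = mqtt.Client()               # paho-mqtt 1.x-style constructor
client.on_connect = on_connect
client.on_message = on_message
client.connect("localhost", 1883)    # assumes a broker on the same box
client.loop_forever()
```

From there you could trigger notifications or automations off specific labels without any camera data leaving your network.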


42 Comments

  1. Nice work, binge-watching your stuff! What's the screen recording software you're using, with the rounded-border face cam in the bottom right?

  2. Suddenly, someone needed to sell a bunch of slow, 3-year-old inference chips with very limited use. This is their story.

    An RTX 4090 is more energy-efficient than an equivalent number of these things, assuming no sparsity, at 660 INT8 TOPS for ~380W (and that's ignoring the dual Epyc or PCIe switch cards you'd need to run 160 of them in the same machine and still come in below the 4090's base-clock INT8 performance). I suppose you could attempt to run that many off USB somehow on a 16-lane consumer processor, but it won't be able to cope with the interrupt management and USB overhead, since USB is a garbage protocol. That number actually understates the 4090, since it will be running quite a bit faster than base clock at 380W draw, but it doesn't matter: 0.576 W/TOP vs 0.8125 W/TOP. Sparsity potentially halves the 4090's number. And most models that can be quantized to INT8 will run in INT4 as well, so you could be running everything at ~1.3 INT4 PetaOPS.

    Asus makes the only PCIe cards that hold 16 of them per card, and you shouldn't expect any of their products to work correctly. They're either going to require a 1x1x1x1… sixteen-way bifurcation that nothing supports, or Asus is going to charge you $500-600 for a two-generation-outdated PCIe switch that they've probably programmed wrong, if their redrivers on Threadripper boards are any indication. The fact that they need a 2-slot card to dissipate 54W on these is a good indication too. Their new boards with USB4 state that the cables shouldn't be removed or swapped until all power to the motherboard is killed. You know, USB-C connectors, those famously stable things. Their last Threadripper board managed to have slots that stopped working if you used other slots, on a platform with 128 lanes available, and lacked enough 8-pin PCIe power to the motherboard to run more than 2 GPUs… and it still sounded like less of a disaster than their old X99 boards.

    TL;DR: if you're going to buy outdated tech junk, instead of feeding money to whatever goofdouche here is promoting, hop on eBay and treat your home network to an upgrade: P2P 56Gb Ethernet / InfiniBand dual-port cards and some SR4 transceivers, so you can quit being insulted by things like 2.5Gb Ethernet in brand-new machines. Or grab any of the FPGAs out there, which I'm just gonna go ahead and bet will destroy the performance of these things (plus people might actually hire you for "Languages known: VHDL / Verilog" on a resume; they won't for "I SET UP PYTHON AND DETECTED FACES!!!!!!11").
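
For anyone sanity-checking the W/TOP comparison in the comment above, here is the arithmetic using only the commenter's own figures (660 INT8 TOPS at ~380W for the 4090, 4 TOPS per Edge TPU, and the quoted 0.8125 W/TOP for the Corals):

```python
# Quick check of the W/TOP comparison using only the figures quoted above.
rtx_tops, rtx_watts = 660, 380          # commenter's no-sparsity INT8 figures
coral_tops = 4                          # nominal TOPS per Edge TPU module
coral_w_per_top = 0.8125                # figure quoted in the comment

print(rtx_watts / rtx_tops)             # ~0.576 W/TOP for the 4090
print(rtx_tops / coral_tops)            # ~165 modules to match 660 TOPS
print(coral_w_per_top * coral_tops)     # ~3.25 W per module implied
print(coral_w_per_top * rtx_tops)       # ~536 W for a Coral fleet at 660 TOPS
```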

  3. I'm curious if it's possible to use the PCIe version via USB with a simple adapter. I understand that it might affect speeds, but could it at least hypothetically function? I'm trying to research this ATM. Also, the USB version is about $30 more than the PCIe version, so… ya know. Pinching pennies where I can and all that.

  4. Person detection. Training the AI to hunt us more efficiently. lol
    This is an amazing little thing. I do have an issue with AI that runs online. Having a local device that can be used to monitor and do what it has to do on an offline local network is amazing.

  5. I'm curious whether the TensorFlow models are updatable. I.e., could an enterprising person add a model which detects an approaching person (vs. departing)… same with a vehicle.

  6. The USB one isn't only slower because of the interface, it's also underpowered. The one you got isn't the greatest either; you should have gotten the Dual Edge TPU, which has by far the best price:performance ratio.

  7. Using ffmpeg just like that is… dumb. You need serious re-encoding performance AND you need to crank the settings as high as possible, so that the image quality is decent and the inference can make actual f*cking guesses instead of missing all the time because you're sending it a pixelated 320×240 mess. OR (hear me out): you could use a proper webcam or surveillance cam which pumps out the stream itself.

  8. The annoying thing with Google TensorFlow is how often stuff breaks with new releases. Over the last 20 years, I've lost count of how many times Google decided to break backward compatibility in their projects. Google's idea of open source kinda blows.

  9. My goal is to build an LLM setup one day, but I don't have a good enough graphics card to handle larger models.

    Can you give me an idea of what you would do for strictly an LLM setup, one that can handle a 70B model?

    Ideas for small but powerful builds, or something specific to that, would be cool. I was ready to buy a ZimaBlade, ZimaBoard, and the Coral thing, but I want the best without buying an A100.