homehome Home chatchat Notifications


This Engineer Put an AI Language Model on a USB Stick and It Actually Works

A Raspberry Pi-powered USB stick runs a lightweight AI model, making LLMs portable and plug-and-play.

Tibi Puiu
February 17, 2025 @ 9:45 pm

share Share

In a world where artificial intelligence often seems to demand the computing power of a small city, one engineer has managed to shrink it down to the size of a USB stick. Meet the pocket-sized language model, a feat of ingenuity that proves big ideas don’t always need big hardware.

Large language models (LLMs) like GPT and LLaMA have become the rock stars of the AI world, capable of generating human-like text, answering questions, and even writing code. But these models typically rely on billions of parameters and require massive data centers to function. Enter YouTuber Binh, a tinkerer who decided to challenge the status quo by cramming an LLM onto a USB stick.

This isn’t your average flash drive. Inside its custom 3D-printed case lies a Raspberry Pi Zero W, a tiny computer no bigger than a stick of gum. Running on this modest hardware is llama.cpp, a lightweight version of the LLaMA model from Meta. But getting the software to work on the Pi wasn’t easy. The latest version of llama.cpp is designed for ARMv8 processors, while the Raspberry Pi Zero W runs on the older ARMv6 architecture. So he had to painstakingly remove the ARMv8 optimizations.

His persistence paid off, and he successfully adapted the model to run on the older hardware. The result is a portable AI that fits in your pocket — no cloud computing required.

Plug-and-Play AI

The real magic of this project lies in its simplicity. Binh designed the USB stick to be a composite device, meaning it can interact with any computer without requiring special drivers. To use the LLM, all you need to do is plug in the USB stick, create an empty text file, and give it a name. The model automatically generates text and saves it to the file.

While it’s not as fast as its cloud-based counterparts, the USB-based LLM is a groundbreaking proof of concept, as first seen on Hackaday. “I believe this is the first plug-and-play USB-based LLM,” Binh said. And he’s probably right.

This project isn’t just a clever hack; it’s a glimpse into the future of AI accessibility. By making language models portable and easy to use, Binh has opened the door to new possibilities. Imagine students in remote areas using USB-based LLMs for homework help, or journalists in the field generating drafts without an internet connection.

It also raises questions about the environmental impact of AI. Large models require vast amounts of energy, contributing to carbon emissions. Smaller, more efficient models like this one could help reduce that footprint.

Of course, there are limitations. The Raspberry Pi Zero W has only 512MB of RAM, which restricts the size and complexity of the model it can run. But as hardware improves, so too will the capabilities of these pocket-sized AIs.

For now, Binh’s USB stick is a reminder that innovation doesn’t always mean building bigger and faster. Sometimes, it’s about thinking smaller. And in this case, small is mighty.

share Share

We can still easily get AI to say all sorts of dangerous things

Jailbreaking an AI is still an easy task.

Scientists Solved a Key Mystery Regarding the Evolution of Life on Earth

A new study brings scientists closer to uncovering how life began on Earth.

AI has a hidden water cost − here’s how to calculate yours

Artificial intelligence systems are thirsty, consuming as much as 500 milliliters of water – a single-serving water bottle – for each short conversation a user has with the GPT-3 version of OpenAI’s ChatGPT system. They use roughly the same amount of water to draft a 100-word email message. That figure includes the water used to […]

Smart Locks Have Become the Modern Frontier of Home Security

What happens when humanity’s oldest symbol of security—the lock—meets the Internet of Things?

A Global Study Shows Women Are Just as Aggressive as Men with Siblings

Girls are just as aggressive as boys — when it comes to their brothers and sisters.

Birds Are Singing Nearly An Hour Longer Every Day Because Of City Lights

Light pollution is making birds sing nearly an hour longer each day

U.S. Mine Waste Contains Enough Critical Minerals and Rare Earths to Easily End Imports. But Tapping into These Resources Is Anything but Easy

The rocks we discard hold the clean energy minerals we need most.

Scientists Master the Process For Better Chocolate and It’s Not in the Beans

Researchers finally control the fermentation process that can make or break chocolate.

Most Countries in the World Were Ready for a Historic Plastic Agreement. Oil Giants Killed It

Diplomats from 184 nations packed their bags with no deal and no clear path forward.

Are you really allergic to penicillin? A pharmacist explains why there’s a good chance you’re not − and how you can find out for sure

We could have some good news.