Embedded AI - Intelligence at the Deep Edge

“Intelligence at the Deep Edge” is a podcast exploring the fascinating intersection of embedded systems and artificial intelligence. Dive into the world of cutting-edge technology as we discuss how AI is revolutionizing edge devices, enabling smarter sensors, efficient machine learning models, and real-time decision-making at the edge.

Discover more on Embedded AI (https://medium.com/embedded-ai) — our companion publication where we detail the ideas, projects, and breakthroughs featured on the podcast.

Help support the podcast - https://www.buzzsprout.com/2429696/support

All Episodes

Embedded AI - Intelligence at the Deep Edge

Squeezing AI into your Pocket

May 28, 2026 • David Such • Season 5 • Episode 30

0:00 | 19:30

Send us Fan Mail

By 2026, language models have moved off the cloud and onto the device in your pocket. What was a research demonstration two years ago is now a routine engineering capability, and the centre of gravity for artificial intelligence has begun to migrate from distant data centres to local silicon.

The episode traces the four engineering moves that made this possible. Quantization, which shrinks a model by storing its parameters with less precision. Optimized key-value caches, which let a model hold a long conversation without exhausting memory. Neural Processing Units, the dedicated AI accelerators now standard in flagship phones. And specialized frameworks such as LiteRT-LM and llama.cpp, which finally make all three usable from a single application.

The consequences reach further than performance figures. Privacy becomes the default rather than a feature, because data never leaves the device. The cost structure of AI applications changes, because there are no per-query cloud fees. And the link between training capital and deployment capability begins to decouple, opening the door for small teams to ship genuine intelligence on hardware they already control.

Support the show

If you are interested in learning more then please subscribe to the podcast or head over to https://medium.com/@reefwing, where there is lots more content on AI, IoT, robotics, drones, and development. To support us in bringing you this material, you can buy me a coffee or just provide feedback. We love feedback!