debugging
How to Run a 400B Parameter LLM on a Phone (Yes, Really)
A 400B LLM ran on an iPhone 17 Pro. Here's how flash offloading and aggressive quantization make the impossible possible.
llm, on-device-ai, mobile-development
Google's new 24-hour sideloading delay for unverified APKs breaks common distribution workflows. Here's how to fix your pipeline.