LLMs as a parallel to Operating Systems

Question

LLMs (Large language models) have some incredible parallels with OSes (operating systems). Andrej Karpathy recently published a fantastic primer on LLMs. Highly recommend the watch: https://www.youtube.com/watch?v=zjkBMFhNj_g

An LLM OS diagram looks quite similar to a traditional OS diagram. For example, you can think of the context window as RAM. It has the equivalent of a disk, Internet connectivity, and I/O (audio, video, text, etc), which are all aspects of a traditional OS.

Even more interesting, this comparison extends into market dynamics as well. Windows and macOS led the market for consumer OS, with a smattering of open source companies and products to follow. And similarly, OpenAI’s GPT-4 and Anthropic’s Claude-2 are shaping up to lead the production of consumer LLMs with a large open source community starting to follow. LLMs seem to have a similar distribution of proprietary and open source models.

Additionally, just like OSes had security challenges (think of iOS jailbreaks), LLMs also have a newfound security question. There are lots of attack vectors that allow malicious actors to jailbreak LLMs: prompt injections, “sleeper agent” via keywords in the training data, multimodal adversarial inputs (an image can have a hidden message etc).

Of course, it’s still very early in the space, although it’s moving rapidly. Always interesting to think about the second and third order effects of the work being done here.

madechorus · Accepted Answer

Karpathy's video was great. Keeps it high level for everyone to get an idea of what's going on, but also dropping some tidbits about the latest frontier in the space. Security will definitely be one of the most interesting challenges, but researchers are already coming up with very clever ways to solve theses issues.

tragictaiba · Answer

When LLMs can also have long term memory and Type 2 Thinking, is when we’ll begin to have breakthroughs for AGI

skycomputer · Answer

I think the more apt comparison is the CPU, not necessarily the whole operating system, but LLMs are the CPU and they draw on all these peripherals such as Internet browsing to serve what the user needs. The CPU also uses things like disk, Internet access, your keyboard / mouse to make things work.

LLMs as a parallel to Operating Systems

[1hr Talk] Intro to Large Language Models

About

Tech

Members