What Are Agent Skills Really About?

Watch on YouTube

Now Playing

Transcript

94 segments

0:00

Why is everyone talking about agent

0:02

skills? What are they really about?

0:05

Let's take a closer look. Before skills,

0:08

we placed entire guides and tool

0:09

definitions into a single massive

0:11

prompt. This approach hits the model's

0:14

context limit quickly. Models also

0:17

perform worse as instructions get

0:19

longer. They forget rules, drift from

0:22

brand voice, or hallucinate. Skills flip

0:25

this setup. Instead of carrying every

0:27

instruction all the time, the agent

0:30

keeps a small catalog of capabilities

0:32

and pulls the right skill only when a

0:34

task actually needs it. Instead of one

0:37

giant prompt, you can get a small

0:39

reusable playbooks loaded on demand. So

0:42

what is a skill? Technically, a skill is

0:46

a folder. It has a skill.md plus

0:49

optional scripts and resources. A

0:51

skill.md is a how to use this skill

0:54

guide. It contains metadata, clear

0:57

instructions, guardrails, script entry

1:00

points, and the rules for running them.

1:02

Scripts are runnable code like Python

1:04

files. They handle deterministic steps

1:07

that must behave the same way every

1:09

time, like parsing a PDF, validating a

1:12

spreadsheet, or applying a migration.

1:15

Resources are the static stuff the code

1:18

and instructions rely on, like

1:20

templates, schemas, and reference docs.

1:22

Now the important part is how skills get

1:25

used. The agent does not inject every

1:28

skill into the prompt. It starts with a

1:30

tiny index basically just a skill names

1:33

and oneline descriptions. That is enough

1:36

for the model to decide when to load a

1:38

skill. When the user asks for something,

1:41

the model decides to use a skill if

1:43

needed, then requests the full skill.md

1:46

for that one skill by emitting a tool

1:48

call. The agent runtime loads a

1:50

skill.mmd, appends it to the context,

1:53

and feeds it back to the model. This is

1:56

called progressive disclosure. Like a

1:58

well organized manual that starts with a

2:01

table of contents, then a specific

2:03

chapters, and finally a detailed

2:04

appendix, a skills let the model load

2:07

information only as needed. Let's walk

2:10

through one concrete example from a

2:12

start to finish with two skills. You

2:14

upload a Q4 budget Excel sheet and say,

2:18

"Audit this spreadsheet and draft a

2:20

short email summary. First, the agent

2:23

runtime appends the lightweight skill

2:25

index to your request and sends the

2:27

prompt to the model. The model sees only

2:30

that index, not the full skill

2:32

instructions. Next, the model decides it

2:35

needs a spreadsheet audit and it emits a

2:37

tool call asking the runtime to load a

2:40

skills a spreadsheet audit a

2:42

skills.mmd."

2:43

Then the runtime loads that skill.mmd

2:46

and appends it to the context. Now the

2:49

model has the detailed audit procedure

2:51

and knows which script to run like

2:53

audit.py.

2:55

The runtime executes the script in a

2:57

sandbox and returns a structured results

3:00

like a JSON report of formula issues and

3:02

outliers. The model reads those results

3:05

instead of guessing what's in the sheet.

3:08

Finally, the model notices the draft and

3:10

email part. Because the model can see

3:13

all available skills through the index,

3:15

it knows a style guide still is

3:17

available. It requests the style guide

3:20

skill. The agent reads the skill which

3:23

contains the formatting rules, tone

3:25

constraints, and examples. The model

3:28

uses those rules to turn the audit

3:30

findings into a clear draft email a

3:32

finance team can scheme quickly. In

3:35

short, skills make instructions portable

3:38

and versioned. You can package a

3:40

procedure once, test the script, update

3:43

it, and reuse it across teammates and

3:46

environments. You do less prompt

3:48

surgery, keep context smaller and more

3:51

relevant, and get more consistent

3:53

behavior.

Interactive Summary

Ask follow-up questions or revisit key timestamps.

Agent skills address the limitations of large prompts in AI models by allowing agents to selectively load capabilities only when needed. Instead of one massive prompt, skills provide small, reusable playbooks. A skill is a folder containing a skill.md with instructions and optional scripts for deterministic tasks, and resources. The agent uses "progressive disclosure," starting with a small index of skill names, then loading the full skill content on demand. This approach leads to portable, versioned instructions, smaller context, and more consistent model behavior.