Copilot Cowork Walkthrough

Watch on YouTube

Now Playing

Transcript

446 segments

0:00

Hi everyone. In this video, I want to

0:02

talk about Copilot Co-work, an awesome

0:06

new capability that is exposed today as

0:08

a separate agent available under

0:10

Copilot. We can see it super quick. Here

0:14

I'm in a Frontier tenant,

0:16

and I can see under my agents, I have

0:18

this nice new Co-work

0:21

capability.

0:23

So,

0:25

what exactly is this? So, if I think

0:27

about Copilot in general,

0:32

we have a number of different

0:34

capabilities. It's fantastic for

0:36

brainstorming, finding information,

0:38

accomplishing a task, summarizing. I

0:42

have obviously the interactive I can

0:44

have like a a chat experience both

0:46

directly in Copilot within all the

0:49

different experiences.

0:51

There's things like analyst

0:53

to help me as my own personal data

0:56

expert. There's things like researcher

1:02

where I can go and do that very deep

1:04

longer thinking, understanding how to

1:07

solve and get me all the information

1:09

about a certain thing. They're tuned for

1:11

different types of tasks.

1:13

And so, now what we have as another type

1:16

of agent

1:17

1:20

Co-work.

1:22

And this is for when I have very

1:24

complicated requests potentially going

1:27

across multiple systems,

1:29

many, many different steps required to

1:31

solve it, but for potentially a very,

1:34

very long time. Now, straight away with

1:38

the name Copilot Co-work, the question's

1:41

going to be is it just a skinned Claude

1:43

Co-work? Not at all.

1:45

This is Microsoft Copilot's own

1:49

implementation of a Co-work

1:51

functionality. I.e.,

1:54

many tasks over a long period to address

1:58

a a very complicated thing you want it

1:59

to do.

2:01

So, the way it's working is

2:03

it has its own

2:06

Co-work

2:08

agent runtime.

2:13

So, there's a secure isolated sandbox

2:16

where it does the things. This is the

2:18

orchestrator of everything it's going to

2:20

do. Now, yes,

2:22

this Co-work agent runtime obviously has

2:25

to go and talk to

2:26

a reasoning model, a large language

2:29

model. So, it's going to go and for that

2:31

reasoning

2:33

talk to a large language model. The

2:35

specific large language model

2:38

is likely going to change over time as

2:39

models evolve, new ones come out, things

2:41

improve.

2:43

Now, part of what has made Copilot

2:46

Co-work possible with this very

2:48

long-running, very complicated set of

2:50

tasks is yes, Anthropic made a big leap.

2:54

I think it was November 2025 for complex

2:57

reasoning. It's Opus 4.6 model made a

3:00

huge leap in the ability to reason for a

3:03

very long time. I think days

3:06

before it went off the rails.

3:09

So, now what you can have is models have

3:11

this agentic loop that can reason to

3:15

complete the task. It can tell me what

3:18

to do next once it's worked it out.

3:20

And so, today at time of recording,

3:23

the Copilot Co-work uses the Anthropic

3:26

model for its reasoning capabilities to

3:29

create the plan

3:31

given the context and tools that are

3:33

available and to work out what it should

3:35

do.

3:37

Now, Copilot Co-work natively

3:40

understands

3:42

and leverages

3:48

Work IQ.

3:50

And that native grounding on Work IQ, so

3:53

think about yes, all of the M365,

3:56

Dynamics 365, etc., etc. knowledge,

3:59

but then

4:01

the context, the the things it has

4:04

learned about how data relates to other

4:07

data, how people relate to data, how

4:09

people relate to people, the rhythm of

4:12

business, how work is done, the types of

4:14

activities, and then specific skills and

4:17

tools. It is grounded on all of that

4:20

information,

4:22

and it's not just using the Work IQ API

4:26

piece by piece. It is also natively

4:29

hooked into

4:32

things like SharePoint

4:36

both from a data and API to do things,

4:39

OneDrive,

4:44

but then things like Outlook,

4:47

Teams,

4:49

um things like Fabric IQ,

4:55

and Dynamics 365.

4:58

It can use third-party

5:02

app connectors, services, APIs. And so,

5:06

this is a big deal about this directly

5:08

built on top of it. Other

5:11

solutions will use things like MCP APIs,

5:15

maybe computer used to interact with

5:17

client apps to do certain things, to

5:19

complete actions.

5:21

The Copilot Co-work is natively using

5:24

the solutions. It's going to be more

5:25

consistent, more reliable for any of

5:28

those interactions.

5:30

So, I you hear an analogy sometimes.

5:33

It's built natively on all of this, so

5:35

it's got the full context rather than

5:38

that per API call sipping through a

5:41

straw.

5:42

So, you get one query responded to at a

5:45

time, so the time taken, you may not

5:47

find all of the the perfect information

5:49

you want.

5:51

Now, another big thing to understand

5:54

about Copilot Co-work,

5:56

it it is running in the cloud.

6:00

It is not

6:02

running locally on your machine,

6:03

consuming your local resources,

6:06

having

6:07

access to everything on your local

6:08

machine. It is purposefully a cloud

6:11

agent. And because it's running in the

6:13

cloud, it's therefore observable. It's

6:17

auditable. I can use Purview on what

6:19

it's doing. I get better compliance,

6:21

better manageability.

6:23

And again, because it is a cloud agent

6:26

with all of these capabilities, but it

6:28

is not talking directly to your local

6:31

machine. There's no local device access.

6:34

As it creates artifacts,

6:36

it's going to actually go and save them

6:42

into your OneDrive.

6:44

And so, the things it creates become

6:48

additional knowledge. It will get the

6:50

proper labels, and we can see this. So,

6:52

if I jump over for a second,

6:55

and I just go and look at my OneDrive

6:58

folder,

7:00

and I'll look at my documents, I'll see

7:03

Co-work.

7:05

I'll see a bunch of different session

7:08

folders for where I have done work. So,

7:11

there's my sessions, and these are the

7:13

different interactions I have had with

7:15

it. If I select one of these folders, I

7:17

see any data about inputs, I can see the

7:19

outputs, and there's a number of

7:21

different files here

7:23

based on

7:25

things and work I have had Co-work do

7:28

for me.

7:29

And we're going to come back to that.

7:34

Now,

7:36

it does have the same concepts like

7:39

skills and plugins with Anthropic's

7:42

Co-work.

7:43

So, the nice thing here is it will be

7:45

able to use those skills and plugins

7:47

from the other platform within Copilot

7:49

Co-work.

7:51

Now, one of the things I think to really

7:52

understand this

7:54

is to see it in action, to see it doing

7:56

some really long reasoning, multi-step

7:59

work, how it breaks a problem down into

8:02

various steps. So, what I have prepared

8:05

is a folder.

8:08

And now you'll understand why I'm

8:09

wearing like a superhero type t-shirt.

8:13

I have a villain incident report folder

8:17

with a series of 20 different incidents

8:20

involving super villains. Just open one

8:23

of these up.

8:25

And it talks about what happened, where

8:28

it happened, when it happened, sort of a

8:31

description,

8:33

injuries, damage, how it broke down in

8:36

terms of exactly what happened, current

8:38

status. So, there's an incident report.

8:41

There's 20 of these incident reports I

8:43

have created.

8:45

And what I want to do,

8:48

as is pretty obvious,

8:50

I'm going to get Co-work to help me

8:53

understand what actually happened across

8:55

all those incident reports.

8:57

So, I'm going to ask it for a Word

8:59

document and a PowerPoint presentation

9:03

diving into everything, looking for

9:05

certain patterns. So, let's go and look

9:09

at our Co-work. So, we're going to start

9:12

a brand new session,

9:14

and I'm going to ask it about all of

9:17

those different villain reports.

9:20

Now, I'm not going to type all this in.

9:21

I'm going to paste it. I've prepared

9:22

this prompt already.

9:25

And what we can see

9:29

9:31

I'm asking it. I'm saying it, "Hey,

9:32

they're in my OneDrive folder,

9:35

so create an executive overview Word doc

9:37

and a PowerPoint presentation. Analyze

9:40

it for correlations. Look for patterns

9:42

across geo-clustering, severity by

9:44

villain,

9:45

trends.

9:46

Tone should be authoritative. Top five

9:49

priority villain ranking, recommended

9:51

resources allocation by region, etc."

9:54

So, I'm going to just

9:56

tell it to stop?

9:59

Okay. Go and work

10:01

on this particular ask.

10:04

So, it's thinking.

10:06

And you can see straight away, it starts

10:08

to break down

10:10

what it thinks it should be doing, the

10:11

types of tasks it's going to create over

10:13

here a little window

10:15

so I can see everything it's going to go

10:18

and work on.

10:19

But you'll notice for a second, I still

10:21

have

10:24

a prompt. It's thinking, it's doing

10:26

things,

10:28

but it's not

10:30

offline to me. So, I'm going to actually

10:32

give it another instruction. I've

10:33

decided, actually, what would be really

10:36

cool as well

10:38

is I would like an interactive web app

10:41

that shows all this in a HTML file.

10:44

So, I'm going to queue this up.

10:47

So, now sending it that additional one.

10:48

It accepted it straight away.

10:50

So, it's still working

10:52

on the other task, but now I've added to

10:56

what I asked it to do. I can work with

10:58

it. I can interrupt it. Maybe I could

11:00

ask it to hey, go and schedule a meeting

11:02

once this is done with Bruce and Clark

11:05

to discuss all of the things you're

11:06

going to find.

11:08

So, this is just going to go off and

11:09

it's going to carry on. It's going to

11:10

think for

11:12

probably many, many minutes.

11:14

So, I'm going to cheat. I'm actually

11:16

going to stop this

11:18

because if I go to tasks,

11:22

as you would expect, I did this earlier

11:24

on today.

11:26

Now, when I look at this tasks view,

11:28

firstly, these are all the ones that

11:29

I've done before.

11:31

You can see I basically ran exactly the

11:33

same request earlier on this morning.

11:37

But I can also view it in a board view,

11:40

so the ones that are currently in

11:41

progress, ones that have completed.

11:44

One of the first things I ever tried

11:46

with it was a financial analysis, and I

11:49

could see again the full progress of

11:51

everything it did. There's the output

11:53

folder of the deliverables it created

11:56

for me,

11:57

and I could go and see all of the work

11:59

it did. It took half an hour

12:01

to do a financial sort of deep dive

12:04

report for me, massive numbers of steps

12:07

and investigations and information here.

12:11

But,

12:13

let's focus on the same thing you just

12:15

saw me ask it to do.

12:18

So,

12:19

there's all the output. So, we go back.

12:21

I did exactly the same thing.

12:23

I asked it the same hey, 20 volume

12:25

reports. I waited for 1 minute, then I

12:28

added on this idea of a self-contained

12:30

HTML file.

12:33

And

12:34

here you could see it did that same kind

12:35

of thinking about what it should do,

12:38

all of the details.

12:40

It went and retrieved the contents of

12:42

the data. It tried different ways to get

12:44

the data.

12:46

It then did various query graphs to get

12:49

information from it.

12:53

It read the results.

12:55

You created to do things.

13:01

All of the data.

13:03

Then it starts structuring what it wants

13:05

to create.

13:08

Just really working out. It went through

13:10

the various incidents, all the different

13:12

correlations. Just a massive amount of

13:15

work going on, all these different

13:17

things over a fairly long period of

13:18

time. I think it maybe took 10 15

13:21

minutes in total

13:23

to complete

13:25

what ended being, as we scroll down, so

13:28

it's creating the Word doc, creating the

13:29

presentation.

13:31

There's the Word, there's the

13:32

PowerPoint, there's the

13:35

actual application it created

13:37

until it delivered everything.

13:39

So, it delivered

13:41

my Word doc, my PowerPoint, and that

13:44

HTML file that I asked it to create.

13:47

And so, there were the outputs.

13:50

So, we can open up.

13:52

There's the Word doc.

13:57

So, it gave me that full analysis,

14:00

really nicely presented, very clear.

14:04

All the information I asked it for, the

14:05

prioritizations.

14:08

It created me the PowerPoint document.

14:13

Again, same nice formatting.

14:15

You can go through, very clearly see

14:19

all the different clusterings of

14:21

information, which is exactly what I

14:22

asked it to do.

14:24

However,

14:26

the app didn't actually work.

14:29

And so, you see there was a second

14:30

prompt. And all I did was say the HTML

14:32

map does not seem to work. Can you fix

14:34

it, please?

14:36

And it then went away,

14:39

and once again reasoned for a certain

14:40

amount of time, found some problems,

14:44

and it fixed it.

14:47

So, again, it is going through, looked

14:49

at the different issues that it may

14:50

find,

14:53

and fixed all of the problems. So, I

14:55

then had the incident map app that it

14:58

created for me.

15:00

And you can see the different options. I

15:01

can run it over a certain period of

15:02

time.

15:04

It's going to show me this various

15:05

things, and I can just hit play,

15:08

and it starts over a time period showing

15:10

me all of those incidents on a nice

15:12

little map.

15:17

The amount of damage, so there was an

15:20

increased clustering of them, until I

15:22

get all 20 incidents. I can select one

15:27

to see the detail about it.

15:31

But it's just this fantastic. And again,

15:33

I can move the little slider,

15:36

and it created

15:37

an app for me.

15:39

One prompt.

15:42

Hey, look at all those 20 different

15:44

files.

15:45

And then, oh, I added an additional ask

15:47

into it say, hey, go and create me an

15:49

app as well while it was thinking on the

15:52

first thing

15:53

to generate all of that content, the

15:56

Word, the PowerPoint, and my app. And

15:59

when the app didn't work the first time,

16:00

hey, I just asked it to fix it, and it

16:03

was done.

16:04

And again, it's written into my

16:06

OneDrive. That was actually the folder I

16:08

showed you earlier. And [snorts] if

16:10

you're curious,

16:12

I used Co-work to create the 20 incident

16:15

reports. Again, I just gave it a single

16:18

prompt.

16:21

I'm working on a Co-work demo.

16:23

I need 20 different incident reports for

16:25

DC bad guys that have dollar damage,

16:28

what happened and when. I'm going to use

16:30

it to create Word doc and PowerPoint and

16:32

a fun web app. Can you create for me?

16:35

And it went and thought about it, and

16:37

that was the only prompt I gave,

16:40

and it went and created me

16:42

the 20 documents that you see. It

16:45

noticed they had a problem on one of

16:46

them,

16:47

and it's saying, hey, there was a

16:48

transient issue you need to go and fix

16:50

and rename that.

16:52

So, I actually used Co-work for the prep

16:55

that you saw

16:56

to actually make it go and do all of

16:59

this stuff.

17:01

So,

17:03

really crazy powerful.

17:06

As long as you give it some fairly

17:07

decent instructions that can have many,

17:09

many requirements,

17:11

it goes and works it all out for you.

17:13

That is the benefit here for that longer

17:16

running, more complex, this is what

17:18

Co-work does. Now, as mentioned, the

17:20

model's going to change over time. Today

17:22

is an Anthropic model. They have a sub

17:24

process, which means they have the same

17:25

global data privacy guardrails that

17:29

Microsoft has internally for all of your

17:31

services and your data.

17:34

And that's it. I mean, I really just

17:36

wanted to show it to you for the most

17:38

part because I think seeing really makes

17:40

it click the difference with what

17:42

Co-work brings over existing kind of

17:45

per task, smaller duration interactions.

17:50

So, just tell it, hey, I need this

17:53

result, and it will go and work out how

17:56

to do it. It really is a complete game

17:59

changer compared to me doing one tiny

18:02

piece at a time.

18:04

So, I hope that helped. As always, till

18:06

next video, take care.

Interactive Summary

Ask follow-up questions or revisit key timestamps.

The video introduces Copilot Co-work, a new agent designed for highly complex, multi-system, and long-running requests. Unlike other Copilot capabilities, Co-work leverages its own secure agent runtime, currently using Anthropic's reasoning model (Opus 4.6) for intricate planning. It boasts native integration with Microsoft's Work IQ, M365 services like SharePoint and Outlook, and third-party apps, providing comprehensive context. Operating as a cloud agent, it ensures observability, auditability, and compliance, saving all generated artifacts to OneDrive. A demonstration showcases its ability to analyze 20 villain incident reports, creating a detailed Word document, a PowerPoint presentation, and an interactive web application, even self-correcting an initial bug in the app, highlighting its power to manage complex tasks autonomously from high-level prompts.