HomeVideos

Builders Unscripted: Ep. 3 - Matias Castello, Product Lead at Alchemy

Now Playing

Builders Unscripted: Ep. 3 - Matias Castello, Product Lead at Alchemy

Transcript

606 segments

0:13

(speaking french)

0:15

(speaking french)

0:17

Matias, thank you for joining.

0:19

Thank you for having me. Really happy to be here.

0:21

I'm really excited to chat because you have a very unique background.

0:25

You've crossed a bunch of worlds.

0:27

You work in consumer products.

0:29

You worked on developer platforms like at Facebook in the early days

0:32

and now leading product for Alchemy in the crypto space.

0:35

But what I find even more interesting is that you're not an engineer by training,

0:40

and yet you're building more than many engineers that I know.

0:44

How do you go about it?

0:46

I think it just comes down to being interested in a lot of things

0:50

and trying a lot of things over time

0:53

and getting better and better as models, I guess, get better as well.

0:57

That's awesome.

0:58

So today you're using AI a ton at work and outside of work.

1:03

But let's start with, like, Alchemy.

1:06

Like what was, aha moment you had maybe kind of bringing AI to work?

1:11

The first thing that I remember the first time we used AI,

1:14

I guess it was Codex in Slack.

1:16

Right.

1:17

It's probably a year ago or so at this point.

1:19

Yeah.

1:20

The first thing we used it for was to make small edits

1:23

to our documentation, our developer docs from Slack.

1:26

So we went from having to run the site locally and doing that whole process

1:30

that is fairly involved.

1:32

To just being able to add Codex on Slack in the docs channel in our company Slack

1:36

and to be able to just tell it to make changes.

1:38

We do that all the time at OpenAI too.

1:41

We still do it.

1:41

Now we have a bit more of a sophisticated setup

1:43

but that was really the first thing that we used

1:47

Codex for and AI in general at the company.

1:50

From there, the other big moment I remember

1:53

that was probably the bigger turning point, was using code review.

1:56

And so there was this instance, we had a small incident

2:00

that was related to a big migration that had happened months prior.

2:03

And in big migrations, some bugs will happen.

2:06

It was like a lot of code.

2:08

We had an internal postmortem about the incident

2:12

and we basically identify what the race condition was.

2:15

We fixed it and someone on the team had the idea to retroactively run code review, Codex,

2:21

to see if Codex would have caught the bug.

2:24

And it did.

2:25

Wow.

2:25

And so we did that a couple of times just to see.

2:28

People were starting to get curious and

2:30

you could tell that was a bit of a turning point in terms of model capability

2:34

and like a lot of the ingredients were right for for teams to start adopting those tools more.

2:38

From there, I remember a couple of days after seeing one of the engineers on the team

2:43

essentially using Codex review in pull request comments as his teammate.

2:49

He was just he submitted a PR that I saw go through Slack.

2:52

I click through it and I saw him do at Codex review,

2:55

address the comments that Codex would come up with, then review again.

3:00

Do that again and he was just doing this back and forth with Codex.

3:03

You could tell that people were getting over the hurdle of,

3:08

of like thinking that LLMs weren't good enough to help in a professional setting

3:13

with a really large complex code base,

3:15

with code that needs to be good enough to be served to a lot of people.

3:19

That’s really cool.

3:19

That was really a turning point.

3:20

Yeah, that's a very interesting point on code review.

3:22

I remember talking to many companies were just getting into like, you know, AI coding.

3:28

Code review was the first moment when they realized

3:31

that many incidents they had could be caught actually by Codex automatically.

3:35

And they replayed some of those similar to Alchemy.

3:38

I remember Datadog, for instance, back in January, said that

3:41

like more than one incident out of five would have been saved by Codex.

3:47

But I'm sure the model capabilities now are so good

3:49

with GPT 5.5, it could be more than half, easily.

3:52

Nine out of ten, maybe.

3:54

Maybe most of it.

3:55

Yeah. So code review was really a turning point for many companies

3:59

to get into Codex and really realize the power of AI.

4:04

Can you tell me a bit more about how you're using Codex at work today?

4:08

I use it a lot for a lot of different things.

4:11

A lot of those typical product management sort of tasks, writing documents.

4:16

We've created all these skills internally to help us

4:19

do the PM job more easily and better and faster.

4:23

That includes writing PRDs, analyzing customer feedback, like doing all those things.

4:29

I use Codex for that

4:30

and several members of the team reuse those skills as well.

4:33

We have this shared, repo of skills across the company,

4:37

across all the different functions, so that not only we can do the same

4:41

job and the same tasks faster and better

4:44

but also more people who are not necessarily PMs are able to do those things.

4:48

I'm sure, yeah.

4:49

And I think it's also telling a lot about, like, how we think about developers now,

4:54

not just as humans but also as agents consuming our own platforms and infrastructure.

4:59

We have to think about these agents and making sure that, like, tools like Codex

5:03

can actually integrate, like OpenAI’s API or Alchemy’s infrastructure very quickly.

5:07

Yeah, it's really several things that change at the same time.

5:10

First, the way we work has changed and Alchemy has a lot of very technical products.

5:16

It's an infrastructure company first.

5:19

And so all the developers internally have changed the way they work.

5:22

And so it's that's very clear to everyone now.

5:25

But that also means as builders of a developer platform, we have to change

5:30

the way we build tools so that other developers that we know

5:34

are also changing their workflows with AI are able to use Alchemy better.

5:38

So it's, the developer we're building for has evolved.

5:42

Now, we are very much assuming that 100% of developers are building

5:46

software with the help of AI.

5:48

Right.

5:48

And so it's evolving your platform to fit that new reality as one thing.

5:53

And then the other thing that's a bit more nascent

5:55

but that's also very interesting is, I guess,

5:58

this new type of developer that's an agent sometimes, autonomous agent.

6:01

And they don't always need the same tooling.

6:03

They need different things.

6:04

And if an autonomous agent shows up on Alchemy

6:07

and as far as implementation decisions go, they're fully autonomous.

6:11

They have to be able to sign up and integrate something

6:15

and use the blockchain to perform some kind of tasks.

6:18

We need to build the tools for that.

6:20

And it's building for those two.

6:21

And over time those two probably converge.

6:23

But for now they're still fairly different needs.

6:25

And my job is basically to try to figure out what we do for both.

6:28

Awesome.

6:29

Zooming out of Alchemy a little bit, I was reflecting recently

6:33

about my own startup.

6:34

And you were a founder before, as well.

6:37

How do you think now, about this time where you had to build without AI?

6:42

It was very different.

6:43

And it was hard, even if you were not an engineer.

6:45

Even more difficult, like the typical way you'd go about it 6 or 7 years ago

6:50

was to build some kind of prototype, to try to raise enough money

6:54

to hire a small team, to then build the real thing.

6:56

And that that's pretty much what I did with with my company at the time,

6:59

built the first prototype myself.

7:01

It was very difficult because it was essentially copy pasting code left and right until something worked.

7:07

Managed to raise a bit of money, hired a team of 3 or 4 engineers

7:10

to help build the first MVP and not only that took,

7:15

a decent amount of money and several people, but also took months.

7:18

I mean I haven't thought about this much

7:19

but I wonder how long it would take me to build the first version of the app

7:22

we’re a building at the time with AI today, just by myself.

7:26

My guess is that it would take less than a week.

7:29

I don't know what your experience was like.

7:30

You are building a much more complicated product than mine, I guess.

7:34

How long did I take it back then and how many people?

7:37

It took me a team of like 15 engineers for maybe a year and a half

7:42

to get to like the first like milestone of a V1 that we shipped

7:46

to like many customers.

7:48

It's pretty incredible how far we've come.

7:50

Like it's just never been easier to do something.

7:53

Yeah.

7:54

And to start something from scratch.

7:55

Yeah.

7:56

It's probably the best time to be a founder.

7:58

Like any idea you have, you can just build it.

8:00

Yeah.

8:00

Anyone who has an idea now has the tools to basically try it out.

8:04

Speaking of things that you do on the side, it's a great segue

8:07

because I wanted to ask you about how you're using AI outside of work.

8:11

It sounds like you're building so many projects these days

8:14

it's almost like one a day.

8:15

You know, it's amazing because it's never been easier to build things.

8:19

And yet I just kept feeling this sort of anxiety

8:22

that I, I wasn't doing enough pretty much all the time.

8:25

I think I'm a bit past that.

8:27

But the reason I am is because I try to work

8:30

on a setup that allows me to spend little time on my computer

8:33

and have essentially Codex work for me for hours while I'm doing something else.

8:37

It's like

8:39

I had gotten to a point a few months ago where I felt bad about going out

8:43

because I felt like I should be taking advantage of this

8:46

amazing technology we have and I should just be building things all the time.

8:49

I didn't want to do anything else.

8:50

I didn't want to feel like that either.

8:51

I didn't like that trade off.

8:53

And so I invested in a set of skills and a process and several things

8:58

that I set up for myself to basically be able to very quickly

9:01

either describe a new idea and have Codex come up

9:04

with a plan, implement it, test it, and let me know when it's done.

9:07

And it's very easy to to get Codex to work for hours at that point.

9:11

Or if I have an existing product and I want to explore new things,

9:14

I came up with this way of essentially having Codex

9:17

do research for me on what new features I should build.

9:20

It will look at other competitors.

9:22

It will look at other ideas.

9:24

It will try to figure out what my goals are

9:25

and what my own preferences are.

9:28

And I created this set of skills that basically gets Codex

9:31

to build all those things.

9:33

So it comes up with ideas.

9:34

It builds them in a modular way.

9:37

So it's going to feature flag them and build them

9:38

as experiments in the main app so that I can prompt Codex before going to bed

9:43

to go do some research about an app that I have.

9:45

Tell it to build the top ten features it can come up with,

9:48

build them as experiments in the product.

9:50

Wake up to ten feature flags that I can toggle on and off

9:54

automatically to basically

9:57

not be the bottleneck when it comes to generating new ideas

10:00

but still having the control of deciding what is good enough to stay in the project or not.

10:04

I love that. That's really cool.

10:05

It's like without spending a ton of time on a computer

10:08

have these moments, like during my lunch break at work,

10:11

I have my personal laptop and I try to send things to Codex

10:14

so that it can work while I go back to work

10:16

before going out on weekends to dinner, or to have drinks before going to bed.

10:20

I have these Codex moments, I guess, where I just try to dispatch a bunch of work,

10:25

use all these processes and these workflows I have with skills

10:28

and other actions so that Codex can work while I'm not paying attention.

10:33

And it's amazing how much you can do if you kind of invest in it that way.

10:36

That's so cool.

10:37

I mean, honestly, I would love to see your setup and how you're using the Codex app

10:41

and kind of bringing all of these tools together.

10:43

Yeah. Let's check it out.

10:44

Let's check out linear to start.

10:46

So what you see here is basically a collection of projects.

10:50

You see there's this 12 of them in progress. At the same time.

10:53

I'm not working on all of them

10:55

in one go, but maybe we can pick one and I can show you how this works.

10:59

This one, for example is a mac app that since then

11:02

I've also built it as an iOS app that uses Codex App server

11:06

to essentially help me write better while I'm at my computer.

11:09

It's something that I built for myself

11:10

for work first, and that, I kept building out since.

11:14

The way it works is that you have a global shortcut.

11:16

In my case, it's like command shift spacebar,

11:19

and it brings up this little window that you see there.

11:21

That's cool.

11:22

And this is using Codex App Server.

11:24

So in the settings it's using my GPT subscription behind the scenes.

11:28

And if you were to write something in my case, I'm not even going to write.

11:33

I usually talk to my computer just because it's so much faster for everything.

11:36

Like people at work a year ago would make fun of my little headset.

11:40

I basically looked like a call center.

11:42

Call center operator.

11:44

And now a lot of people have those similar headsets, so it's okay.

11:47

But if you say you're on slack and you want to write something,

11:52

Yeah.

11:52

but you need to clean it up

11:53

and you want AI to make it sound more professional or fix typos or things like that

11:57

like I'm going to record something that would be like,

11:59

“Hey, I'm not totally sure I understand what you're saying.

12:01

Maybe we should just get together and figure it out” or something like that.

12:06

Obviously, this is not a great sentence.

12:08

I just dictated that to my computer.

12:10

And then if you hit command return in this case,

12:13

it's going to rewrite it for me in a way that sounds more professional

12:17

because the professional mode was selected.

12:20

This is basically using Codex Server behind the scenes and GPT 5.5.

12:23

You have a model picker.

12:25

I just added 5.5 since it just came out.

12:28

And this is just an example of a little app that I made for myself for Mac.

12:32

I put the same setup in an iOS app after to have

12:37

like a keyboard extension and help you rewrite things quickly and so on.

12:40

I don't know if a lot of people use their,

12:42

their Codex subscription for things that are not just coding.

12:44

I know that people use the Codex app server to build other

12:47

coding interfaces, but in this case I basically used it

12:50

because I wanted the inference to be done, through my ChatGPT account.

12:55

And if we go back to to the process and how to find time,

12:59

what you see here is this linear project.

13:02

I didn't create this project.

13:04

There are like 159 issues that are done, a bunch that are in review

13:08

and some that are in progress.

13:10

I did not create or write a single one of these.

13:12

Wow.

13:13

Codex did.

13:13

What I did is to talk to Codex about what I wanted

13:18

through these skills that I wrote for myself, to basically help

13:23

me create a plan that turns into milestones

13:27

that you see here, like the foundation and the preferences and so on, a bunch of the features.

13:31

Codex will create that plan,

13:33

break it down into milestones, create the tasks,

13:36

work on them in a way that makes sense based on this agent’s .md file that I created,

13:42

that essentially summarizes the way I like to work.

13:44

And I put in a bunch of time and, putting that in a single file.

13:48

So when I create a new project or I have a new idea, I can basically just

13:53

initialize that project with that agent’s .md file.

13:56

From there, I use this skill I have to create a plan

14:00

and after that it's just telling Codex, build the plan.

14:04

That's really cool because you're using linear as the interface

14:08

as like your backlog and your release milestones and things like this.

14:11

But like Codex is not just writing

14:15

all the issues, it's also picking them up to close them and work on them.

14:18

Yeah.

14:19

It's of both delegating the, I guess, the project management

14:22

and the setting up of the project and the milestones and the tasks as well as the

14:27

the execution of the tasks too.

14:29

I’m initiating the project, sharing the initial idea,

14:34

giving Codex enough information about my preferences

14:36

and what I want the product to be so that it builds

14:39

the version of the product that I want.

14:41

I think the biggest gap when you code with LLMs, when

14:45

if the LLM produces something that's surprising to you, in a negative way,

14:49

it didn't do what you wanted, usually it's because it had to make one or

14:53

multiple assumptions and it just didn't make them in the same way you would have.

14:57

And so in order to avoid that, I have this process

15:00

that requires me to clarify all those things upfront.

15:03

And once Codex has all of that, it can just go and build.

15:06

That's really cool.

15:07

And so that's the first part and if you go back to

15:10

to this writing app as an example, the next step after

15:14

setting up a new project, building a V1 of a product

15:18

and having Codex essentially do all the work for me, usually while I'm not even on my computer.

15:23

I wanted to find a way to generate

15:26

new features and keep working on a

15:29

on an existing product without being the bottleneck.

15:32

I didn't want all the ideas to have to come from me

15:35

but I still wanted some level of control

15:38

when it comes to deciding whether or not I like that feature.

15:41

Right.

15:41

So I came up with this process that is basically a way

15:44

to build experiments with Codex on an existing project.

15:47

And it's a skill, again, that

15:51

essentially has me tell Codex I'm interested

15:54

in building features in this direction, or this is the high level goal that I have.

15:58

Go do some research about what types of features you could build.

16:02

Go figure out what the competitors to my products are building, things like that.

16:06

And it will come up with a lot of features based on what I'm saying.

16:10

And it's going to set them up and build them with feature flags

16:15

as experiments in the code base.

16:17

And what that means is that you're able to do things like what you see here.

16:22

You have

16:23

You can flip any switch of any feature.

16:25

That's right.

16:26

So you have a long list of of experimental feature flags.

16:29

These were all built in one go.

16:31

One evening while I was sleeping.

16:33

So before going to bed I said,

16:36

“Hey Codex, I'm building this AI writing assistant for myself.

16:39

Go figure out who else is building that.

16:41

What kind of features do they have?

16:43

Score the ones that are most interesting, most impactful, something like that.

16:48

Build all of them but build them in a modular way so that when I wake up,

16:52

I can toggle each one on or off independently to see if I like them.

16:57

And so you basically see a list of all of those there.

17:01

We can pick a few if you want to see.

17:02

Like we're going to output language is basically going to translate

17:05

Sure.

17:06

just going to

17:06

It’s fascinating that you did not use Codex just to make the plan and implement

17:10

but also use Codex

17:12

even before to make the research online about what the feature should even be.

17:16

Yeah.

17:16

And so Codex is great.

17:18

It's it's way better than me at doing research.

17:21

It doesn't it doesn't get bored of searching for things.

17:24

And so as long as you tell it what to research

17:27

and how you expect the findings to be structured

17:30

and in my case, you also tell me what to do with those findings.

17:33

It's basically just a sequence of fairly small steps.

17:36

but it will do all of them in one go, because it's all encoded in a skill or a set of skills.

17:41

And so it's able to come up with all of this build all of it,

17:44

build in a way that makes it easy for me to test

17:47

and makes it easy for me to decide whether or not I want it to stay in the product.

17:51

And so we selected a few here.

17:53

If we go back to the rewriting.

17:56

They just showed up instantly.

17:57

They just show up there.

17:58

And so in this case you see that you have a translation feature.

18:02

In this case you can change the

18:03

the output format of the message as an example.

18:06

It's like Slack style.

18:07

Yeah.

18:07

And so those are all small features that I got Codex to ideate

18:13

and build in a way that,

18:15

makes it really easy for me to essentially decide

18:18

if I want that to stay in the product or not.

18:19

So, those are all the things that I ended up coming up with to try not to spend

18:24

too much time building in a way but still end up building quite a few things.

18:28

It's really cool.

18:29

Instead of starting from scratch, it's basically

18:33

trying to think about the things you do repeatedly, or even trying to think

18:37

about the ways you like to do things, putting that into a skills,

18:41

putting some of that in an agent’s .md file.

18:43

Sometimes it's additional tooling and other things that make you go even beyond that.

18:47

And then having AI do most of the work.

18:51

That's awesome.

18:52

Have you built any other like projects on top of the Codex Observer?

18:55

Maybe more like on the coding side?

18:57

I built a few things like, this is an AI writing assistant.

19:01

It's a mac app. It's a consumer.

19:04

I use OpenClaw and I've built a couple of other things on my phone, even an Apple Watch app.

19:10

That are

19:11

It was just me tinkering, trying to see if I could have

19:16

different or new entry points to start a coding job, essentially.

19:20

So, in OpenClaw, for example, I set up my OpenClaw that my assistant's

19:24

name is Lou has got a pretty funny profile picture if you want to see it.

19:28

Yeah.

19:29

It’s

19:31

look at this guy.

19:33

[Laughs]

19:34

So what do you do with your OpenClaw here?

19:36

What do you talk about?

19:38

This little guy, Lou, it basically, it mostly has two jobs.

19:42

The first one is to make jokes in group chats with friends.

19:46

It's not very useful but it's quite funny.

19:48

The second one is to just help me code.

19:50

And so it's connected at home.

19:52

It's running on a dedicated machine.

19:54

It's got access to Codex and it's got integrations

19:58

that make it easier for me to do certain things on the go.

20:00

For example, I have an OpenClaw discord and Codex integration,

20:04

so that I have a bunch of discord channels.

20:07

Each one is bound to a specific repo.

20:09

And so if I want to do something from my phone on the go and I wanted to

20:15

to be able to run at home and do things that maybe aren't so easy

20:18

to do yet on the cloud or things like that.

20:19

I can just do it while I'm walking around somewhere.

20:23

I talk through discord by talking to Lou and it's going to work on Codex.

20:28

That's an example.

20:29

Another thing that I did recently was try to see if I could

20:33

build an Apple Watch complication on the home screen

20:36

so that you could record a short voice memo and have that trigger a Codex job.

20:42

So, like, imagine you found a typo on a website or something like that.

20:46

Obviously it mostly works for one shot things but you would just

20:50

very briefly like you would record a voice memo to say, “Hey,

20:55

fix this typo on the landing page of this website.”

20:58

And you have the watch and you have the companion iPhone app to do that.

21:01

Yeah. So, let me show you this, this thing that I built.

21:04

This is my Apple Watch.

21:06

What I essentially built here is a way to just send a voice memo

21:12

like create a test.md file in my skills repo.

21:16

And that basically sent that memo to this iPhone app.

21:22

That is now, it transcribed the message, it understood the intent.

21:27

Based on that message is going to route it to the correct place.

21:30

In this case, it's going to understand that the skills repo means my own personal skills repo

21:35

because it has context on my GitHub account

21:38

and then it's going to run the execution of that task.

21:43

So in a couple of minutes I'm basically going to get a notification that says it created a

21:47

That second one is now successful?

21:48

Yeah.

21:48

So now it's running the execution here.

21:51

And this is all running with like Codex app server behind the scenes?

21:54

Yeah.

21:54

So if you if you go to the settings here,

21:56

you basically see that, there's two different ways it can run.

21:59

There's the typical you put an openAI API key and you can do it that way.

22:04

And the other one, what is here called back in session

22:07

is basically Codex app server running somewhere else.

22:10

That's so cool.

22:11

Roaming off of my account, connected to my GitHub with all my context.

22:15

I love this.

22:16

I can basically just start Codex jobs on the go.

22:19

It's like

22:22

it's not useful for everything but it's one of the ways

22:25

that I kind of like to play around with what becomes possible.

22:28

It's like one step closer to making

22:31

I guess, the entire implementation and implementation detail.

22:34

It's like I just talked to my watch for 10 seconds and it did absolutely everything the right way.

22:38

Incredible.

22:39

I don't think we're always there yet, depending on, you know, how complex

22:42

the actual ask is, but for small things it's kind of convenient.

22:46

It was kind of fun to play around with just to see how to work with Codex App Server.

22:50

And yeah, I just kind of try to do that.

22:52

And by the way, this is exactly what I love to see, right?

22:55

Like we open sourced the Codex CLI, the Codex harness,

22:58

and we also open sourced the Codex app server just for like fostering this like open ecosystem.

23:04

Like a builder, like you can extend the way

23:07

you use Codex in this case from your watch or your phone.

23:10

Even though, you know, we have not shipped that yet.

23:13

So this is really, really cool how you can use your coding agent the way you want.

23:17

I think you guys should build that

23:18

I think we should.

23:20

That complication on Apple Watch and on my iPhone home screen so I don't have to do it myself.

23:24

That'd be great.

23:25

Sounds like a great idea.

23:26

Let's see.

23:26

Look at that.

23:27

The whole request just arrived.

23:29

The request is right there.

23:30

Alright, out of this test.md file obviously not the most useful task but

23:34

but it just shows that it works. It's kind of fun to build as well.

23:37

Pretty cool.

23:39

That's really cool that you build this.

23:40

And by the way this is exactly why we open source not just to the Codex CLI

23:45

and the Codex harness but also the Codex App Server.

23:48

We want builders like you

23:49

to be able to kind of poke at it and also integrate that in their own ways.

23:53

It's really cool that you can have a coding agent

23:55

from your watch, which we don't have it just yet, you know.

23:58

You better build it.

23:59

We we probably should build it.

24:01

Yeah.

24:01

It's a great idea.

24:02

I'm curious,

24:03

like, we launched a ton of things in the past, like, few weeks,

24:06

both on the Codex side and GPT 5.5 our latest model.

24:11

Have you had some, like, surprising moments with these two?

24:14

Yeah, there's a couple of things.

24:16

The first one I use computer use recently that was,

24:19

it was a lot better than what I remembered or I guess I was surprised by how good it was.

24:23

I was at home last weekend doing something on a Raspberry Pi

24:26

and I needed to to do something pretty tedious.

24:29

Like copy paste a bunch of different domain URLs and go put them in some admin panel somewhere.

24:34

And I didn't really want to do it by hand.

24:36

So I told Codex to basically SSH into the Raspberry Pi.

24:40

It was like dangling from a TV to go do it for me.

24:43

And it just did.

24:45

Wow.

24:46

It found a way to just connect to the Raspberry Pi through my browser.

24:51

It basically found all these URLs that I had in a file that I gave it.

24:55

It just did the whole thing.

24:56

What was funny is that I was trying to use computer use, I guess, to save time.

24:59

And I spent the entire time just watching it.

25:02

That’s so funny.

25:02

Doing the thing for me because I was impressed by how I was going to the right place,

25:06

opening the right thing, copy pasting stuff for me.

25:09

It was just the funny thing.

25:10

I love how the cursor moves also in that you can kind of follow along the animations.

25:15

It's pretty but you don't want to touch it but you kind of do at the same time.

25:18

Because you can actually let it disappear in the background too.

25:20

Yeah, that was pretty cool and recent.

25:22

The other thing I guess is 5.5 just launched

25:26

and I have this old hackathon project that I've had for over ten years

25:30

that I use almost like my personal eval in a way, that is,

25:34

I know that models nowadays can one shot this project

25:37

but every time a new model comes out I try to do it again

25:40

just to see how much better it gets.

25:41

How many more details it gets right. How the design improves.

25:45

It's a bit of a silly project but I can tell you about it.

25:47

It was, it's a thing that was called SnapCat, which is a bit absurd.

25:51

It was basically this app for cats to take selfies.

25:54

And the way that it works is that, I mean, I guess I could just show you.

26:00

But it's essentially

26:02

a black screen with a red laser dot that moves around.

26:07

And

26:07

For the cat to catch?

26:09

Right. And so if you are a cat, we can pretend you're a cat for a second.

26:12

And you try to catch that. Every time you did

26:14

and if you don't mind trying to tap to catch the dot.

26:17

I guess I'm not as fast as a cat.

26:19

You're really not a very good

26:20

I just did.

26:21

I think I got it.

26:21

There you go.

26:22

So as you do that, every time you hit the viewport

26:25

you’re basically taking a photo with the front facing camera.

26:27

Woah

26:28

And so you can exit this cat mode with the volume buttons,

26:31

which cats don't really have easy access to.

26:34

At that point you go to the gallery.

26:36

You see the upside down pictures you just took of yourself as a cat.

26:40

You can basically select one, share it and that kind of stuff.

26:43

And what's cool here is two things.

26:45

The first one, it easily one shot this project.

26:49

It's something that over ten years ago we did at a hackathon.

26:51

It took us over a full day to build.

26:53

We had a team of five people.

26:55

It was quite difficult to do really quickly.

26:57

Now this is one really well crafted prompt and a couple of skills

27:01

and it can build the whole thing for you in one go.

27:03

How did you build the UI?

27:05

Was it like 5.5 building the UI? Was it image gen?

27:08

That's the second thing, so the first thing, it can one shot the skeleton of this app really easily.

27:14

What it did in this case after that was

27:17

I described the style of an app I wanted.

27:21

I said I wanted something light and colorful and playful and that kind of stuff given the theme.

27:26

I generated an image of that UI, and then I told Codex to build that.

27:31

And once it did that with the camera roll view that you're looking at right now,

27:35

I like that style.

27:36

And I told it “Go redesign the settings page based on the same style.”

27:39

And it generated another image and then implemented it.

27:42

Then it did the same for the home screen.

27:44

So it's a pretty different and new way to go about building UI.

27:50

So that's your eval, every time there's a new model you're just rebuilding SnapCat?

27:53

I basically just look at how easy it is for cats to take selfies.

27:56

Is it the best you've seen so far?

27:58

It's the best by far.

28:00

It is, it’s a lot better than the last time I tried this a few months ago.

28:05

It is infinitely better than what we managed to build in a full 24 hours.

28:09

Wow.

28:09

And I basically made this in one shot last night in preparation for today.

28:15

Incredible, I love this.

28:17

Well, I guess now you have selfies of me among your cats.

28:20

I'll text you one of them.

28:23

That's so cool.

28:24

Thanks so much for showing me your setup.

28:26

I love watching users building with the Codex app.

28:29

Maybe to to close out,

28:31

what would be your advice for other founders or builders, who would like to master the codex the way you do?

28:37

You know, I think it boils down to a few assumptions

28:39

that I've made early on that I think have been really, really helpful for me.

28:43

The first one is assume it's possible.

28:45

Like if you have an idea, more often than not, it can probably be done.

28:51

The second one is assume you can do it.

28:53

That one's a bit more personal

28:54

but a lot of people could be building these things.

28:57

And I think the blockers that they think they can't.

28:59

I think assume you can.

29:01

In this case, maybe be in denial even that you can't do it.

29:05

It's usually true.

29:08

And the third one is when you try to get something out of an LLM and it doesn't work,

29:12

this one's a bit difficult,

29:13

you have to put your ego aside, assume it's your fault.

29:16

Don't assume that the tool isn't capable.

29:19

Assume you haven't found a way to communicate it or convey what you want just yet.

29:24

And I guess try again. Like I've done a lot of trying again.

29:26

And I think that's what's gotten me quite a long way into building things.

29:30

So that's probably the advice I give to people.

29:33

I love that and it feels like the perfect note to end.

29:37

Anyone can just build anything.

29:38

It's true.

29:39

It sounds cheesy but it's true.

29:41

Thank you so much Matias. This was a great conversation.

29:44

Merci.

29:44

Thank you.

29:45

(speaking french)

Interactive Summary

In this conversation, Matias shares how he leverages AI, particularly Codex, to accelerate product development and personal projects. He highlights how AI has fundamentally transformed his workflow, moving from manual coding and team-based development to an agent-driven process where he delegates tasks to AI models. Matias demonstrates practical setups, including using AI for code review, generating project roadmaps, and building fully functional apps with minimal manual effort. He emphasizes the importance of clear communication with LLMs, assuming that most ideas are achievable, and maintaining a growth mindset when refining prompts to get the best results.

Suggested questions

4 ready-made prompts