HomeVideos

I Spent 200 Million Tokens Vibe Coding With Gemini 3.1 Pro

Now Playing

I Spent 200 Million Tokens Vibe Coding With Gemini 3.1 Pro

Transcript

595 segments

0:00

video,

0:00

I'm going to be sharing with you what I built yesterday

0:02

after working for over 17 hours and spending two hundred

0:07

and fourteen point six million tokens on the newly released

0:10

Gemini 3.1 Pro.

0:12

So the first thing I want to talk about is the benchmarks

0:15

with this model.

0:15

And I'm going to be showing you guys the benchmarks and I'm

0:18

also going to be showing you what I built with it yesterday

0:20

after spending this amount of tokens.

0:23

So let's just dive in, take a look at the benchmarks.

0:25

One thing I want to say is I have been absolutely blown

0:29

away by how this model has performed so far.

0:32

I was expecting the model to release yesterday,

0:34

but I wasn't expecting it to be this good.

0:36

It's a massive jump from Gemini 3 Pro to Gemini 3.1 Pro.

0:41

So if you look at the Arc AGI 2, look at this,

0:43

31.1% to 77.1%.

0:46

That tests abstract reasoning puzzles.

0:48

Humandy's last exam, same thing, 7% jump,

0:51

6% jump respectively.

0:53

Live Code Bench Pro, 2,439 to 2,887.

0:57

And then Sweet Bench Pro, a very large jump of over,

1:00

it was 4.4%.

1:01

So Sweet Bench Pro,

1:02

it does score under Opus 4.6 by a little bit.

1:06

But one thing I want to say is that there are areas where

1:08

this model excels in coding and I'm going to be sharing

1:12

with you guys a couple of the instances that I had

1:15

yesterday where this model just performed incredibly well

1:18

in real world vibe coding tasks.

1:21

So with that being said, guys,

1:22

I do have a like goal of 200 likes on this video.

1:26

And if you haven't already joined the fastest growing vibe

1:28

coding community on the internet,

1:29

make sure you check the link in the description down below,

1:31

as well as in the pinned comment and join the bridge,

1:34

my discord community.

1:35

And with that being said, let's dive right into the video.

1:38

All right.

1:38

So the first thing that I want to cover is a little bit of

1:40

the results from the storage bench.

1:42

So I have put it through the creative HTML tasks.

1:45

So for example,

1:46

here is what it did for the space invaders demo.

1:49

So you can see this is what it came up with.

1:51

And what I will say is that this model,

1:54

the capability of the UI functionality of it,

1:57

its capabilities to be able to write unique in modern UI

2:01

elements is very, very noticeable.

2:05

Okay.

2:05

So you can go over here, right?

2:07

And you can go, let's just go to the lava lamp.

2:08

And I actually already have it pulled up.

2:10

So this is the Gemini 3.1 pro lava lamp.

2:13

And this is the Opus 4.6 lava lamp.

2:16

And you guys can make your judgments of which you think is

2:18

better, but I can notice right off the bat, Hey,

2:20

this is better.

2:22

This is a better lava lamp than this.

2:23

And this is Opus 4.6, right?

2:25

So now like, let's go a little bit past,

2:28

you guys can go check out the bridge bench.

2:29

I haven't yet put it in the leader board yet for the

2:32

benchmark results where we put it through 130 tasks

2:35

associated with vibe coding,

2:36

but I have put it in the creative HTML.

2:39

You guys can go check it out at bridge of mind.ai.

2:41

But I want to show you that what I did yesterday and how I

2:45

spent all of these tokens.

2:46

So the first thing is I had it completely refactor pretty

2:49

much probably I would say like 20 to 30 pages on the

2:52

website.

2:53

And I'm going to show you just a couple of the highlights.

2:55

So first of all, do you guys see this video?

2:58

This video was created using Gemini 3.1 pro.

3:02

And it was created using Gemini 3.1 pro and remotion.

3:05

And all I had to do was use cursor and say,

3:08

look at the website and look at the products in the bridge

3:11

mind vibe coding suite and create a marketing video that is

3:13

accurate and represents the bridge mind brand and theme.

3:17

And it came up with this.

3:19

And I would say that, Hey,

3:20

this is where you guys can see that as these models get

3:23

better,

3:23

the capabilities are going to go beyond just coding, right?

3:27

We're creating marketing videos now.

3:29

So that's one thing that it created.

3:31

Another thing is I want to go over to the bridge MCP and

3:33

show you guys a really interesting example.

3:34

So if you guys see this,

3:36

all this entire UI was created by Gemini 3.1 pro.

3:39

And if you look at this,

3:40

you guys see how the open clock codex cursor,

3:42

Claude windsurf, prior to using Gemini 3.1 pro,

3:45

I didn't have the actual brand assets for each of these

3:49

brands.

3:50

And it was just like placeholder icons.

3:52

But I told Gemini 3.1 pro,

3:54

I want you to go on the internet and I want you to actually

3:57

go grab the actual logos for each of these companies and

4:00

then create a unique customized component for this.

4:04

And this is what it came up with.

4:05

It was able to actually go and grab the logos off the

4:08

internet, which I think shows a lot more than just the,

4:11

you know, if you go back to the benchmarks,

4:13

that example right there is showing the intelligence

4:17

capabilities of it knowing and being able to understand,

4:20

okay, I need to go here and I need to, you know,

4:22

look at the header and grab, you know,

4:24

go to the brand assets and download this file.

4:26

And then I need to copy it over to the project and I need

4:29

to input that PNG here.

4:30

And it did it flawlessly.

4:32

You guys can see this looks great, right?

4:34

So it also was able to create another marketing video.

4:36

So this all the entire UI,

4:38

the marketing video that you're seeing was created using

4:40

Gemini 3.1 pro.

4:42

You can see this here, all of it, Gemini 3.1 pro.

4:45

This animation was created using Gemini 3.1 pro very,

4:48

very good animation,

4:49

reflecting the Kanban capabilities of the bridge mind MCP.

4:54

So let's go back up.

4:55

And another thing that I want to show you guys is just like

4:57

these other pages, right?

4:58

So look at this animation that it created.

5:00

So this used three JS and it was able to create this unique

5:03

animation that shows the capabilities of bridge space and

5:06

its ability to run 16 agents in parallel.

5:09

It created this unique component here to kind of give it

5:11

that fresh, unique look.

5:13

And here's what we're seeing from Gemini 3.1 pro in terms

5:15

of styling.

5:17

Gemini 3 pro is already good at styling.

5:19

This is a step up 100%.

5:21

I noticed the capabilities of this.

5:23

This is incredibly good at styling.

5:27

I'm not going to be using Opus 4.6 for my styling ever

5:30

again.

5:31

This is the go-to model now for styling.

5:33

Okay.

5:34

And I'm going to get into backend and database in a little

5:36

bit because I did try it there.

5:38

But I do want to continue to just show you guys what it was

5:41

able to do.

5:41

Look at this.

5:42

Another video, right?

5:43

Bridge space.

5:44

And it was able to create a unique video showing the

5:47

capabilities of bridge space for marketing purposes.

5:49

It was even able to improve the performance by compressing

5:52

the video so that it rendered faster and improve my site

5:55

speeds.

5:56

Okay.

5:56

That's another thing.

5:57

Bridge voice.

5:58

Same thing.

5:59

Look at the three JS animation.

6:00

Look how unique it is.

6:02

Look at the like, just look at what it's doing, right?

6:04

So a lot of people, they say, Oh, your,

6:06

your website looks vibe coded.

6:07

It's not unique.

6:08

And that was before we had this model.

6:10

I just went through and I revamped the website.

6:12

I had it create unique custom components.

6:15

And I've just been very,

6:16

very impressed on what it's been able to do for me.

6:18

I had it rewrite the pricing page here.

6:20

And this is what I'm seeing.

6:22

This is a very, very good model in terms of UI.

6:26

And a lot of people, they say, Oh, Gemini 3 in,

6:29

in just Gemini models, they can't be used for it,

6:31

for backend purposes, right?

6:32

They're great at front end, but not backend.

6:34

And this is one example I'll show you.

6:36

So I'll go out to cursor and I can't like,

6:37

it's not a great example,

6:38

just cause I'm showing you guys the conversation,

6:40

but I used Gemini 3.1 pro to be able to go through and

6:47

completely refactor my auth system.

6:49

I had an issue with my auth system where it was just a

6:53

complicated issue.

6:53

Okay.

6:54

I was using Opus 4.6.

6:55

It was like 1AM in the morning last night.

6:57

I was using Opus 4.6.

6:58

I was throwing everything I had with it with Opus 4.6.

7:02

It couldn't get it.

7:03

I give Gemini 3.1 pro the issue.

7:05

I put it in plan mode and cursor.

7:06

I have it generate the plan and I run the plan and it was

7:10

able to refactor the entire auth system across the API,

7:14

the bridge mind web app, the bridge mind admin portal,

7:16

and the bridge mind UI, four different repos.

7:18

It was able to refactor the entirety of the auth system,

7:21

backend, front end, auth guards, complex logic,

7:24

and be able to completely refactor it in a sensitive way

7:28

where it completely fixed the issues that I was

7:30

experiencing with the auth system in one shot.

7:33

And I can't really show that to you guys because it was

7:36

just something that I experienced offline,

7:38

but I want you to know that that is what I experienced.

7:41

So we'll see in the coming days if that continues.

7:44

But what I will say is that I'm very impressed so far with

7:47

that.

7:47

But with that being said,

7:49

I think that gives you guys like a look at like just some

7:51

of the examples, right?

7:52

Even on the bridge bench, this was all refactored.

7:54

The styling here was rewritten by Gemini 3.1 Pro.

7:58

All of this stuff is done.

8:00

It was redone by Gemini 3.1 Pro.

8:02

It completely revamped my website.

8:05

And then I even used a, it was called a copywriting skill.

8:08

So I created a copywriting skill inside of cursor and I

8:10

gave the skill to Gemini 3.1 Pro and I had it rewrite all

8:15

of the content on my website so that it better fit my

8:18

brand, right?

8:19

So even like bridge code, your terminal, your AI teammates,

8:22

if you go to bridge space to run 16 agents in parallel,

8:26

right?

8:26

It was the one doing the copywriting for this.

8:28

So it's also good at writing.

8:31

So if you are a vibe coder, hey,

8:33

if you're doing copywriting,

8:34

just create a skill in your brand voice and Gemini 3.1 is

8:37

just going to do a phenomenal job.

8:39

That's what I'm seeing.

8:40

So now that I've covered a little bit about what I did

8:42

yesterday, and there's a lot more, I just can't show it.

8:45

Like I literally can't show you guys everything that I did

8:48

yesterday would not be possible.

8:49

I did so much, but those are just some of the highlights.

8:52

Okay.

8:52

Now I want to get a little bit into what we're seeing on

8:55

the benchmarks in terms of speed.

8:58

And this is like one thing to definitely highlight is that

9:01

this model is very, very fast.

9:03

If you look at artificial analysis, the speed here,

9:06

106 on artificial analysis compared to Opus 4.6 at 73.

9:10

And then, you know, GPT, GPT, where even is it right here?

9:13

85, but look at open router.

9:15

This is actually a better place to look for speed.

9:18

Look at Google Vertex.

9:19

This is what I'm seeing 60 tokens per second on Google

9:22

Vertex.

9:23

That's the best place to look.

9:24

And then if you compare that to Sonnet 4.6,

9:26

so you're looking at 42 tokens per second there.

9:28

So it's about a 50% improvement, and that's big.

9:33

That is noticeable.

9:34

I noticed the speed improvement.

9:35

So know that it's a big speed improvement.

9:39

And then when you look at the cost,

9:41

this is another reason to be using this model.

9:43

Look at the cost $2 per million on the input,

9:46

$12 per million on the output compared with Opus 4.6 at $5

9:50

and $25.

9:51

It's like, Hey,

9:52

does it make sense to use a model that is highly performant

9:56

at half the cost, more than half the cost?

9:59

Yes, it does.

9:59

So there's a lot of people that because they've had bad

10:02

experiences with Gemini models,

10:03

they don't want to use them,

10:04

but I want to draw your attention to probably one of the

10:08

biggest improvements that I've seen with Gemini 3.1 Pro.

10:12

So if we add this model here, let's just add it real quick.

10:14

Check this out guys.

10:16

Look at the artificial coding index.

10:18

It ranks number one on the artificial analysis coding

10:22

index, which is a very, very important benchmark.

10:24

Look at this 56 compared with GPT 5.2 49 Opus 4.5.

10:30

Can I add Opus 4.6 to this list?

10:32

I think I can.

10:33

I know I can, but it's, this is having an issue here.

10:36

Here,

10:37

let's scroll all the way up here and then add it Opus for

10:40

this artificial analysis being so annoying.

10:42

It's they, they definitely vibe coded this up.

10:43

So I can't add in Opus 4.6 just because they have this bug

10:46

here with the scroll bar, but I think, what is it?

10:49

I think it's like 53.

10:50

It definitely does beat it out.

10:51

I think I have it on my X actually.

10:53

Hold on.

10:53

Let me pull my extra real quick and then let me go to this

10:55

so you guys can see it here.

10:57

So where is it?

10:58

It's right.

10:59

Hold on chat.

11:00

Okay.

11:01

Right here.

11:01

All right.

11:02

So yeah, here it is.

11:03

So Opus 4.6 got 48.

11:05

It got 48 on the artificial analysis coding index in Gemini

11:08

3.1 preview 56, which is just insane.

11:12

And I have another benchmark that I want to look at and

11:15

it's this one here.

11:16

So this is another one off of artificial analysis.

11:19

This is one of the biggest benchmarks to look at because

11:22

this is what measures the hallucination rate.

11:25

And one thing about Gemini models that a lot of people have

11:27

struggled with is that they've had these incredibly high

11:30

hallucination rates,

11:31

which means that the model hallucinates more and it just

11:34

runs into more issues, right?

11:35

You ask it something to do,

11:36

and it goes off it on a bunny trail that you never wanted

11:38

to do.

11:39

It misunderstood your prompt, right?

11:42

Look at how Gemini 3.1 Pro performs.

11:44

50%.

11:45

This is the lowest hallucination rate for a frontier model

11:50

that we are seeing because if you look at Opus 4.5 at 58%

11:55

and you look at GPT 5.2 at 78% and then you look at Gemini 3

12:00

.1 at 50% they are getting this hallucination problem

12:04

figured out and the biggest thing I look at is even the

12:08

comparison from the last iterations of the 88% to 50%.

12:12

Google is doing something like they do not count Google

12:15

out.

12:16

They are going to do very well and if you like just because

12:20

they had bad models that had high hallucination rates

12:23

previously you have to stay up to date with this stuff and

12:25

this is why.

12:26

The hallucination rate I was noticing it.

12:29

It's like when I had this one when I had it one shot that

12:31

auth issue across four different repositories.

12:34

I noticed that right.

12:36

Opus 4.6 couldn't solve my problem.

12:38

Gemini 3.1 Pro did.

12:40

So take note of these benchmarks because they are very

12:43

important.

12:43

Another important benchmark that I am going to take a look

12:45

at real quick and it is not a great benchmark to look at

12:48

but I do want to share it and just kind of share my

12:50

perspective on it.

12:51

So here is Gemini 3.1 Pro in LM Arena.

12:54

You can see that it is performing sixth but it is a little

12:57

bit preliminary.

12:58

So scores based on pre-release testing and may shift as

13:01

community prompts and votes evolve after public launch.

13:04

So it is a hundred behind Opus 4.6 which is a massive

13:06

difference right.

13:08

But one thing I will say is that we need to give it time

13:10

but that is one thing to notice is LM Arena that is where

13:12

models get put head to head.

13:14

It does not perform that well in that.

13:16

It also has not performed very well in design arena.

13:20

So if we look at Gemini 3.1 Pro it scored at 1321 which is

13:24

way behind Opus 4.6 at 1392.

13:27

So again this is very preliminary you know with some of

13:31

these benchmarks it is people voting right.

13:33

So you have to give it like a week to get a really good

13:36

accurate representation of the model.

13:39

But one thing I will say is that my personal experience

13:42

with the model and being able to create you know these

13:45

unique UI components has been very impressive so far.

13:49

Like you guys can look at this and it is like yeah I mean

13:51

the benchmarks and it being behind in you know LM Arena or

13:56

it being behind in design arena.

13:58

I think we need to give it a little bit of time because

14:00

what I've seen so far is it's very I'm very impressed.

14:04

It's very hard for models to get the bridge mind stamp of

14:07

approval.

14:07

I don't give it to a lot of models.

14:09

You know Opus 4.6 has the stamp of approval.

14:12

Sonnet 4.6 has the sample of approval.

14:14

GPT 5.3 Codex has the stamp of approval.

14:17

And I am going to give the bridge mind stamp of approval to

14:20

this Gemini 3.1 model.

14:22

I am going to be using it in my vibe coding workflows

14:25

because the capability of it to create unique UI

14:29

components, it's writing capabilities,

14:32

it's instruction following.

14:34

I've been very impressed and I'm going to be using this in

14:38

my daily vibe coding workflow.

14:41

Now I want to actually put Gemini 3.1 Pro through my vibe

14:45

coding workflow and I want to do this inside of anti

14:48

-gravity.

14:48

So I haven't used it here that I used it for fixing a video

14:55

preview bug and then I also had it redesign that bridge

14:58

bench page that I showed you guys.

14:59

I have not used it since yesterday for probably like a

15:02

couple months and the reason for that is that too up to

15:05

this point every time that I used anti-gravity it had a

15:07

bunch of issues but I did use anti-gravity yesterday and I

15:11

was thoroughly impressed by its capability to use the anti

15:15

-gravity browser tools.

15:17

I've never seen anything like it.

15:18

They have significantly improved this and I want to show

15:20

you guys what I saw yesterday.

15:22

So the first thing I'm going to do is I'm just going to add

15:24

birds mind UI.

15:25

I want you to navigate the website particularly the blog

15:28

page and I want you to review the blog page and come up

15:32

with improvements to the UI for the blog page as well as

15:35

the blog ID pages.

15:36

It needs to be very modern and unique.

15:39

So let's just drop it in like just like this right.

15:42

So it's going to probably open that up and you guys are

15:44

going to see this in a second and again I'm using bridge

15:46

voice right now.

15:47

Bridge voice is one of the tools in the bridge mind suite

15:50

of projects.

15:52

This is the this is like pretty much a near perfect

15:55

transcription time for basically voice to text.

15:59

It's faster than whisper flow.

16:00

It's cheaper than whisper flow.

16:01

I highly suggest that you guys use it so you can see here

16:04

that it immediately starts this and it's going to run anti

16:07

-gravity in the browser and be able to navigate through the

16:10

website.

16:11

So that's going to run.

16:12

I'm going to launch another conversation.

16:13

I'm going to add bridge mind UI.

16:16

Okay.

16:16

Look at this guys.

16:17

So this did just okay.

16:18

This just launched.

16:20

We need to let this launch.

16:21

So this is anti-gravity right now.

16:23

You can see that it's it's getting the Dom.

16:25

It's right now.

16:25

This is actually it says the site can't be released.

16:27

It's refused to connect.

16:28

So let's see if that figures that out.

16:30

But let's drop in bridge mind UI and I'm going to say I

16:34

want you to use the chrome dev tools MCP and I want you to

16:39

navigate through every single page of the website and

16:42

evaluate any console errors that are happening and fix

16:45

them.

16:46

So I'm going to give it that that prompt as well and I'm

16:49

going to kick off another one and I'm going to drop in the

16:51

bridge mind API.

16:54

I want you to review this NestJS project and evaluate it

16:56

for bugs and vulnerabilities, security vulnerabilities.

17:00

I also want you to evaluate it for performance improvements

17:03

that we could make in terms of making it a more performant

17:06

API.

17:07

Do an in-depth review, do not update any code,

17:09

but just output your findings after an in-depth review.

17:13

So we're going to drop in that one as well.

17:15

And then I'm also going to start another conversation.

17:17

I'm going to drop in bridge mind.

17:19

Actually, let's do bridge voice.

17:20

And what I want to do is I want to say,

17:22

I want you to review the themes and the different options

17:25

that users can use to customize bridge voice and bridge

17:28

space.

17:29

And I want you to create better themes that are more on

17:32

that are more customizable,

17:34

offer more themes that are cooler, techno,

17:37

and review what's in existence and then add new themes and

17:41

improve existing themes.

17:43

So we're going to drop in both bridge base,

17:45

Tari and bridge voice,

17:46

and we're going to improve the themes using, you know,

17:49

that styling that I talked to you guys about.

17:51

So we're going to pass in that.

17:52

I want you to make sure that the themes are consistent in

17:55

both Tari applications.

17:57

So we're going to drop that in and we're going to paste it

17:59

in and then let's launch another one and let's drop in.

18:02

We're going to do bridge mind.

18:03

Let's do bridge mind.

18:05

Let's do.

18:07

Let me see here.

18:07

Okay, I have an idea.

18:08

So this is one thing that definitely needs to be done.

18:10

So let's go to birdspace.

18:11

Tari, I'll work on this on stream today.

18:13

I want you to review the logic associated with dragging and

18:16

dropping a workspace up to the header in order to create a

18:20

new workspace with that pane.

18:22

This functionality does not work and I've tried to fix it,

18:25

but it continues to have errors and not work.

18:29

So I want to restart and remove all of the functionality

18:32

associated with grabbing a terminal pane and dragging it up

18:36

to the header to drag it and drop it and create a new

18:39

workspace.

18:40

I want to completely remove the existing functionality.

18:43

Do an in-depth review and then remove that functionality

18:46

because we are going to build that functionality from

18:48

scratch and I want all of that functionality removed.

18:51

Do an in-depth deep dive.

18:52

Create a structured plan for what code you need to remove

18:55

without breaking functionality and then update the project

18:58

respectively.

18:59

So we're going to drop that in and again you guys can see

19:02

bridge voice is pretty much near immediate transcription

19:05

times and we used opus 4.6 the other day or no it was

19:09

sonnet 4.6 that we used to be able to 10x the improvement

19:12

and the response in the performance of bridge voice bridge

19:15

voice has gotten very good.

19:17

So right now we have several we have one, two, three, four,

19:20

five agents working inside of anti-gravity and again what

19:23

I'll say is that I have noticed improvements inside of anti

19:26

-gravity.

19:27

I probably will use anti-gravity a little bit more in my

19:30

workflow.

19:30

It's been very impressive to use.

19:33

You can see here it says hey MCP versus MCP server not

19:36

found.

19:37

Let's see if it knows how to add that MCP that'll be

19:40

interesting if it understands how it's able to add that MCP

19:42

but all these agents are working and I'm going to let them

19:45

work and we'll you know we'll probably work on this on

19:48

stream but I want to what I want to talk to you guys about

19:50

is just Google anti-gravity has gotten better especially

19:54

with this model you know I think that hey if you want the

19:56

best harness for Google models for Gemini models you want

20:00

to use their native suite right it's like hey if you want

20:03

the best out of anthropic models you probably want to be

20:06

using cloud code right if you want the best out of Gemini

20:08

models you probably want to be using the Gemini CLI or anti

20:11

-gravity.

20:12

One thing to note is that Gemini 3.1 Pro is still not

20:15

available in the Gemini 3 or the Gemini CLI.

20:19

I have the Google AI Ultra plan and it's still not

20:22

available.

20:23

I got Gemini 3.1 Pro a little bit early inside of the anti

20:27

-gravity yesterday which was nice but in the Gemini CLI

20:29

it's still not available so that's one important thing to

20:31

note.

20:32

So these are all working let's see this let's see this here

20:35

so it's able to go and it wasn't able to navigate this so

20:38

it said oh I wasn't able to open this but look at this it

20:40

said hey now navigate here right so now it's navigating

20:44

here can I go back over to oh look at this yes check this

20:46

out guys so now look at this this is the cool part about it

20:49

so you're able to navigate and see like okay this is what

20:53

happened it records it it's able to navigate the site even

20:56

here do you guys see this look at how it's improving it's

20:58

already improving it but Gemini was clicking around the

21:01

website and it has full control of my browser and the

21:05

ability to click through like oh look at this so it's

21:08

literally navigating my website and like scrolling down and

21:12

evaluating it and screen recording it and evaluating it and

21:15

taking screenshots and this is something about anti-gravity

21:19

that is completely different than what you would be getting

21:22

out of something like cursor.

21:24

Cursor does not have this functionality so I'm gonna let

21:27

this work I think I actually just interrupted it and messed

21:29

it up probably but what I will say is that Google anti

21:32

-gravity do not sleep on it because that browser use tool

21:35

is very impressive like look at this you're gonna be able

21:37

to see it look at it clicked on one of the that's just like

21:40

incredible right so this is different this is definitely an

21:43

improvement and this will continue to get better but based

21:46

off of my usage of anti-gravity yesterday I just want you

21:49

guys to know that this tool anti-gravity is getting better

21:54

and especially with this new model Gemini 3.1 Pro it's

21:57

getting even more it's getting more improvement so that's

22:02

one thing that you want to be you know you know clued in on

22:04

is that this is going to get better if you want to use

22:08

Gemini 3.1 Pro models or just Gemini models in general

22:11

we're going to start using anti-gravity and the Gemini CLI

22:14

a little bit more on stream just because I think that this

22:17

model is really good again I'm putting the bridge line

22:20

stamp of approval on this model I'm going to be using it in

22:23

my vibe coding workflow I'm going to start using anti

22:26

-gravity more because look at this use case a couple months

22:28

ago this you know browser tool use this didn't work

22:32

consistently and what you guys are seeing is that now it

22:35

does so this is just one thing that I wanted to highlight

22:39

and say hey the anti-gravity is getting better chat like

22:41

this is this is getting better and we want to stay up to

22:44

date with the latest tools because you can't do this inside

22:47

a cursor cursor doesn't do this guys it uses playwright

22:49

uses some goofy browser tools anti-gravity is able to do

22:52

like screen recordings and navigations that is incredible

22:56

you could test entire UI and auth flows and pretty much

22:59

everything about your site just by prompting anti-gravity

23:02

so definitely take note of this Gemini 3.1 Pro bridge mind

23:07

approved we're going to be using anti-gravity more I'm very

23:10

impressed with how it performs I'm impressed with the speed

23:13

I'm impressed with the intelligence I'm impressed with the

23:15

cost we'll continue to use it and that could change over

23:18

time and as new models release that could change as the

23:21

workflows improve or different models come out or different

23:24

you know tools come out but what I will say is that I'm

23:27

going to be integrating this model into my vibe coding

23:29

workflow I'm going to start using the Gemini CLI more I'm

23:32

going to start using anti-gravity more because Google is

23:35

cooking and we need to stay up to date with what they're

23:38

releasing because I've been thoroughly impressed by this

23:40

latest release so with that being said guys I'm going to

23:43

end the video here if you haven't already liked and

23:45

subscribed or joined the bridge mind discord community make

23:48

sure you do so and with that being said I will see you guys

23:50

in the future

Interactive Summary

The speaker shares insights on Google's newly released Gemini 3.1 Pro model, highlighting its significant improvements over previous versions and competitors like Opus 4.6. After 17 hours and 214.6 million tokens spent, the model demonstrated substantial leaps in benchmarks such as abstract reasoning, live coding, and especially a significantly reduced hallucination rate of 50%, the lowest for a frontier model. Gemini 3.1 Pro also shows superior speed (60 tokens/sec on Google Vertex) and cost-effectiveness ($2 input/$12 output per million tokens) compared to Opus 4.6. In real-world applications, it excelled in creating unique and modern UI elements, generating marketing videos, intelligently retrieving brand logos, and flawlessly refactoring complex backend authentication systems across multiple repositories where Opus 4.6 failed. The speaker also notes its strong copywriting abilities and praises its styling capabilities. While preliminary benchmarks in LM Arena and Design Arena show it lagging slightly, the speaker gives Gemini 3.1 Pro the "bridge mind stamp of approval" due to its impressive performance in UI, writing, and instruction following, integrating it into daily workflow. Furthermore, the video highlights significant improvements in Google's native Anti-Gravity tool, which now offers advanced browser interaction, navigation, and screen recording capabilities, making it a powerful tool for testing and evaluating websites, especially when combined with Gemini 3.1 Pro.

Suggested questions

5 ready-made prompts