What will protect our code from artificial intelligence?
MUSICAL INTRO
APPLAUSE
-Dear comrade programmers,
tell me, are you afraid of artificial intelligence?
Because I am afraid of it.
I am afraid of it.
But not for the reasons you might think, not because it will fire me,
but because it will not fire me, but will sit next to me
and will program.
This scares me a lot.
I will tell you about my fears and explain,
how I prepare for them, what countermeasures I take.
First, a brief overview of what makes life hard for all of us programmers,
besides, of course, management, and how we feel about it.
You are programmers, I understand, yes, most of you?
Who loves their profession? Raise your hand.
Wonderful. I do too, but there are things that hinder us.
For example, I have divided them into three categories,
let's quickly go through them.
The first - defects, bugs.
We write programs, make mistakes,
for example, in this small program, the error is highlighted.
You understand what this will lead to. We have defects at runtime.
Defects come to us in the backlog, we fix them,
and they certainly annoy us.
But maybe it's not entirely right to view defects
as an annoying factor.
Many years ago, in a wonderful book, Steve McConnell showed,
or rather explained, that per thousand lines of code,
programmers on average make between ten and twenty mistakes.
Good programmers, bad programmers, we all make mistakes.
Mistakes are an integral, essential part of the development process,
whether we like it or not.
It makes sense to change our attitude towards this component,
to approach it calmly and, moreover, with love and joy.
If there are defects, then there are those thousands of lines of code that we write.
We recently conducted a small study and looked at
how many mistakes programmers now make
per thousand lines of code in open-source projects.
Twenty years ago, it was between ten and twenty.
Now, look at modern Open Source products
in different programming languages.
The right column shows the same figure,
the same calculated ratio:
the number of defects per thousand lines of code.
You can see that the numbers are different.
It's not from ten to twenty, but it's not zero.
And it's not a million.
We are roughly in a certain range.
Some have more, some have less.
I am surprised that the Linux kernel has the smallest number,
while VSCode has the largest.
What is your number in the project?
How many errors do you make per thousand lines of code?
I'm sure you don't know this number, and that's not good.
It's worth paying attention to such a metric.
So how many mistakes are you making?
It's great if you are making them.
Treat mistakes as fuel that drives the development team.
The more errors you find in your code,
the more you register them,
the fewer users of the product will find them, of course.
So mistakes are a good thing,
but they are one of the annoying factors.
The second is security holes.
You, as programmers of a financial organization,
understand how important this is.
What is wrong with this code?
The lines at the bottom.
Why did I bring it up as an example of mistakes?
Louder?
Yes, concatenation.
And why is that bad? Concatenation and...
SQL injection, that's right.
As the variable x, someone can substitute
a fragment of an SQL expression,
which will split the entire query into several queries,
and there can be anything inside.
This is a leak or a security hole,
which does not lead to a functional problem.
The product will work, but at some point
someone might take advantage of the error
and steal or destroy our data.
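To make this concrete, here is a minimal JDBC sketch of the same problem; the table and variable names are illustrative, not taken from the slide. The first method concatenates user input into the query text, the second uses a parameterized query, which is the standard fix.

```java
import java.sql.Connection;
import java.sql.PreparedStatement;
import java.sql.ResultSet;
import java.sql.Statement;

public class UserLookup {
    // Vulnerable: if x arrives as "1; DROP TABLE users", the attacker's
    // text becomes part of the SQL itself, the injection described above.
    ResultSet findUnsafe(Connection conn, String x) throws Exception {
        Statement stmt = conn.createStatement();
        return stmt.executeQuery("SELECT * FROM users WHERE id = " + x);
    }

    // Safe: the driver passes x as data, never as SQL text.
    ResultSet findSafe(Connection conn, String x) throws Exception {
        PreparedStatement stmt =
            conn.prepareStatement("SELECT * FROM users WHERE id = ?");
        stmt.setString(1, x);
        return stmt.executeQuery();
    }
}
```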
In a wonderful book, again from about 20 years ago,
the authors explained that security holes
and security errors in code
are an inevitable part of the development process.
We will make such mistakes.
Our future is filled with these problems.
I recommend this book to anyone
who is in one way or another connected to software
related to the banking sector.
And the third problem that annoys us all
is complexity, I grouped them all together.
Roughly speaking, it's dirty code.
This is what our conference is about.
Dirty code is complex code,
it's code that duplicates itself,
and code that has what is called
"Code Smells" or "Antipatterns."
You know very well what that is.
For example, the code in the bottom right.
No one wants to read or maintain it,
or understand it, at least not me.
It's something complex, something unclear,
written in violation of many programming rules.
Complexity was beautifully described by a good author,
as follows.
If you, as a programmer,
believe that complex code,
written by you, is your achievement,
if your code is hard to understand
and only you can make sense of it,
then most likely you misunderstand your profession.
Complex code is a sign of a bad programmer.
A good programmer writes simple code,
which is easily understood by them,
their colleagues, and even a junior programmer.
Let's strive to write simple code.
All of this, so far, is just the introduction to the main part.
The second is duplication.
The author of a good book,
one that has long since become a classic,
said that duplication is the main enemy
of a well-designed system.
The main one.
If you took a piece of code from one place
and did a copy-paste to another place,
then you have opened the door to the main enemy.
It will be difficult to maintain such code,
because you will change it in one place
and forget that it also needs
to be changed in another. If you propagate duplication throughout the codebase, in the end you will have many problems.
The second problem is duplication. And the third is technical debt or various anti-patterns.
Technical debt: again, a wonderful book, highly recommended. Technical debt is what
accumulates in a product as a result of careless programming. Somewhere you use the wrong
pattern, somewhere you don't use a design pattern at all, somewhere you wrote something too complicated,
somewhere you used the wrong algorithm, which is fine for now but in the long run should be
fixed. You need to treat your code like a garden that you take care of. I told you all this
to show that, as programmers, we all live in a state of constant stress
as a result of low-quality software: technical debt, bugs, security leaks, and
so on. With the advent of artificial intelligence, the situation is worsening. Let me explain why. Robots
are starting to program alongside us or instead of us. They write code, they review it, they complete it,
they help programmers finish code blocks directly in the IDE. Who is already using Copilot or
something similar? Some AI systems. You understand perfectly what I'm talking about. So robots
are alongside us, and the problems we've had for the last 50 years are starting to worsen.
Let me explain why. First of all, we think that robots write correct code. We trust machines. We do not
expect that a machine can make mistakes: a calculator cannot be wrong, and a compiler cannot be wrong.
We know that if we input one number into a calculator and then multiply it by another, we do not
double-check the calculator. We know it calculated correctly. The same goes for the compiler. If you
give it a program in Java, you know that it generated the correct bytecode. You do not check it.
And artificial intelligence works like a calculator too. At least, it looks like
a calculator. We input something, and it immediately gives us an answer, most of the time without hesitation. However,
research from an article last year shows otherwise. After analyzing code generated by artificial
intelligence and subsequently reviewed by humans, it was found that the code produced by the LLM is
of low quality, contains errors, and is prone to crashes and hangs, much more so than code
that a human would write. Secondly, another study from this year shows that still
a third of the code instances written by machines have serious flaws. But at the same time,
let’s remember, we still consider it a calculator that does not make mistakes. If it were a
calculator that we initially did not trust, there would be no problem. We would know that we need to check,
but we think it is a machine that cannot make mistakes. Imagine that you hired
a junior programmer who pretends to be a senior. He is so grown-up, overweight,
in glasses, with a beard, and seems to know everything. But his knowledge is at the level of a sophomore, and he writes along with
you and sends you code. And you believe him. Well, because it's hard not to believe. You have a high
level of trust in a senior developer. But inside, he is a junior developer. That's how
an LLM works right now. Behind the guise of a senior developer hides a weak programmer. And this is a serious
problem. Imagine that you are working with a programmer. Next to you sits a coder. You
are contributing to the codebase together. You are trying hard, while he makes mistakes at the level of a junior programmer.
Do you understand what threats this creates? And the third article, also from this year:
if you assign the LLM to check the code, that is, to conduct code reviews, then programmers
quickly shed responsibility, thinking that the machine checks everything thoroughly. Again,
imagine that a developer is sitting next to you. You send him the code. He checks it and
says "very good, very good." But it turns out he is a junior. He doesn't even really understand what you are writing. And you have relied on
him for all these six months. And after six months, you realize that you have been deceived. Your expectations were based
on an incorrect assessment of your partner's quality. Second. It turns out that LLMs have a negative
influence on us. Specifically, research from this year shows that when writing code, LLMs introduce into the software
code a programming style that they like. Not yours, not your repository's, but the style of programming
they were trained on. And they were trained, as you can imagine, on all possible
examples of code from the internet that they could find. And they have processed all this mess,
learned something. They have their own programming style, their own understanding. They come
into your project and program the way they want. Let's return to our
analogy: imagine that sitting next to you in your team is a junior developer who makes
it seem like he is a senior, and he encourages everyone to write the way he thinks is right.
He tells each programmer: I know exactly how to write, let's program in my style.
And people start to listen to him. As a result, your codebase will lose quality. You will
lose the style, you will lose the principles that you have developed within the codebase, you will
lose yourself, lose your quality. And this is inevitable, say the
researchers. The third is data leakage. Again, let's use our analogy. Imagine that
this junior developer, who pretends to be a senior and turns everyone against
himself, is also communicating with a competitor at the same time, telling them what you
are programming here and how it happens. Imagine: you open your IDE,
write code in it, and a colleague from the neighboring department calls you and says that something in
production is not working. To check, you insert the password and
login for the production server into the code just to run a test on your local computer,
just to verify that there is a request going there. You don't commit these passwords to
the repositories, you are an honest programmer, a careful programmer, but you click
the Copilot button and ask it to help you write. What does it do? It takes all
your source code along with the password and login, sends it to the OpenAI server, and
receives a response from there. Yes, let's hope that OpenAI immediately deletes your
request. But that doesn't happen. Most likely, your request is not deleted;
it remains on the server, whether it's OpenAI or GigaChat. Look at what
Gemini writes in its policy regarding such issues. They do not say that we do not
save your data. I read the entire policy carefully. Nowhere do they
state that we will immediately delete your data along with the login and password. They will not
delete it. They simply say, don't enter confidential information. It's your problem if you
did that. And we will take and use it. If they take your requests and use them
to train the model, then the login and password will go into the next version of the model. And
the whole world will be able to access the password and login that ended up there
by chance. You didn't even click the "send" button. You just
asked Copilot how to fix the algorithm on lines 15 and 16. But
lines 50 and 60 contained a password, and it was sent to the server. And what does
GigaChat say on this matter? GigaChat is even stricter. It says: it is prohibited
to send confidential data to our server. Google
says do not send, while GigaChat says, we prohibit sending. But neither of them actually
says, if you send them, we will delete them. So you understand that you are sitting with
a junior developer who constantly tells the competitor what
you are working on. And finally, the pressure from management. Management thinks that robots
can write code faster and better than humans. And they put pressure on
programmers. They say: why are you taking so long to work on this problem when I
can ask ChatGPT, and it will answer me in five minutes?
Why do I need a programmer sitting here for five days working on one problem?
Management thinks that we are just like this impostor. That in quality
and in level we are the same as a junior developer pretending to be a senior
developer. I conducted a small survey in my Telegram channel; maybe it's a little
hard to see. The question was: programmer, what is the difference between you and a robot
that can write code? I asked programmers. There were five
answer options. Two of them: the first, I write code better and make fewer mistakes;
the second, I write cleaner code. The other options were different: for example, I care
about design; I go to parties, and a robot can't do that; and the last one, I'm not
better, I'm just more expensive. So the other three options are more like
jokes and do not relate to the quality of the code. What conclusion can we draw from this?
Only a third of programmers believe that they are better than robots. Two-thirds
engage in self-deprecation. We are convinced that we are worse than machines. And this was a question
to coders, I asked programmers. So this is the opinion of the people themselves: people think they are worse
than machines. Management doesn't even need to pressure them. Management just walks into the room,
and we are all already ashamed. We already understand that yes, indeed,
ChatGPT will do everything better. Here is the result. I was extremely surprised. I
thought that people would say: no, we write better. After all, research shows this; we
know from the numbers, from articles, that people still write code better. Maybe in
five years the situation will change, and such a survey will make sense. But right now
it doesn't. So management pressures us, and we bend, thinking that yes,
indeed, we should be ashamed that such a knowledgeable specialist is next to us, while we
program so slowly. What should we do? We are approaching the most interesting part.
What to do? How to fight against such robots, which will be introduced into your
teams one way or another? Either you will do it yourself, incorporating various assistants into
the IDE or into the command line. Or management will come, and one way
or another, today or tomorrow, force us to use robots
that will give you suggestions, help, and so on. How to protect yourself from their negative...
influence? How to ensure that they do not ruin the codebase? How to make sure
that they do not introduce garbage into the code that we will then have to clean up ourselves? I see five
ways. I'll start from the simple to the complex. The first one. The team needs to learn to do code
reviews in small portions, small ones. In most teams, unfortunately, this does not
happen. In most teams, my experience shows, people accept large
pull requests, significant ones, and expect that each pull request will contain a large, cohesive
block of work. If you do it this way, the LLM will do the same. It will send you
large portions of code, and you won't be able to check what this very
impostor wrote there. You won't have the opportunity to do a quality review. So before
the robots arrive, let's start writing code in small portions. Google conducted
research. True, it is already quite old, from 2018, so seven years have passed,
but I think not much has changed since then. Google believes that there is a direct
correlation between the size of a pull request and the quality of code review. The smaller the code
in the pull request, the better you can review that very robot that
tries to insert faulty code into your
repository, without any conscience, responsibility,
or emotional attachment to your team.
To combat it, do small code reviews and small
pull requests.
What does "small" mean?
Another study, which is much older,
it is almost 12 years old, and yet, it is an analysis of open-source repositories,
in which the authors showed what the average size of a pull request is.
44 lines of code, 25, 32, sometimes 263, which is a lot, 78.
This is the average amount.
It is clear that there are super small pull requests, fixing 2-3 lines
and immediately merging.
But on average, we should still strive to reduce the size of
the pull request.
In my understanding, more than 100 lines in a pull request
is already a risk.
If you allow programmers to make large pull requests,
LLMs will learn: they will see how you do it, and will
do the same.
That is, implement the practice of small changes.
Small changes will give you a way to protect yourself
both from clueless junior programmers and from the robots
that resemble them.
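As an illustration of how such a limit can be enforced automatically, here is a small sketch of a CI gate in Java; the 100-line threshold and the base branch name are my assumptions, not the speaker's.

```java
import java.io.BufferedReader;
import java.io.InputStreamReader;

public class PullRequestSizeGate {
    public static void main(String[] args) throws Exception {
        // Ask git for per-file added/deleted line counts of this change.
        Process git = new ProcessBuilder(
            "git", "diff", "--numstat", "origin/master...HEAD").start();
        int total = 0;
        try (BufferedReader r = new BufferedReader(
                new InputStreamReader(git.getInputStream()))) {
            String line;
            while ((line = r.readLine()) != null) {
                String[] parts = line.split("\t");
                if (parts[0].equals("-")) continue; // binary file, skip it
                total += Integer.parseInt(parts[0]) + Integer.parseInt(parts[1]);
            }
        }
        git.waitFor();
        System.out.println("Changed lines: " + total);
        if (total > 100) {
            System.err.println("Pull request too large; please split it up");
            System.exit(1);
        }
    }
}
```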
The second.
Test coverage control.
Well, probably everyone writes tests, right?
Although I remember when I spoke at conferences 5-7, 10 years ago,
when asked who writes unit tests, 5 people in the room raised their hands.
Who writes unit tests now?
Don't scare me.
Well, many do, right?
Good.
So, many do write them.
But now I will ask you a second question, and you will see how
few hands will be raised.
Who among you controls the build based on the percentage of test coverage
of the source code?
Who controls test coverage?
One, two, three, well, here are four.
Five.
There is such a metric.
Test coverage means the percentage of the source code covered
by tests during the execution of a full test cycle.
The higher the percentage of code covered by tests,
let's say in simple terms, the more lines
of code are involved while the tests are running,
the better.
Ideally, 100%.
Ideally, all of your code should be involved
during testing.
All lines of code should be executed in one way or another.
What is this percentage?
Few people know, on one hand, and even fewer control it.
What does it mean to control?
They do not allow programmers to go below a certain
boundary.
If we have, for example, 80% coverage in the repository, then
we will not allow any programmer to drop it to 79%.
If a programmer drops to 79%, we will reject their
pull request and require them to add tests.
Many consider this a good practice.
Others disagree; I know programmers
who believe that there is no need to control
coverage.
Let's listen to the company Google.
In an article from 2019,
Google says: we do not apply any mandatory
coverage thresholds.
A threshold is what I just told you about.
They say we do not apply it.
Okay, we do not apply it.
However, right in this article, in the same one we mentioned,
there is a voluntary notification system that defines
five levels of thresholds.
We do not enforce it, but there are five levels.
What are these five levels?
They are defined in this same article.
If coverage automation is completely disabled, as it is for many,
this is level 1 in your project.
If you are using coverage automation, that is, you are reading
the number, but you do not prohibit programmers in any way from
going below the number, you have level 2, so to speak,
in your repository.
And if you have 90% coverage, then you have level 5.
This is how the company Google does it.
I recommend that you do exactly the same.
It doesn't have to be 90, but you need some number.
Set up a system for collecting information about test coverage and
use that information to decide whether
to accept changes or not.
When the robots come to you, impostors pretending
that they can do everything, they will send you code with weak coverage,
and you will be able to stop them because the system will already be
ready for this.
You will be ready for this threat.
And the robots will be forced to write a sufficient amount of
tests.
When they write a lot of tests, they encounter
their own mistakes and are forced to fix them.
If a robot is given the freedom to write without tests or with
low coverage, it easily handles the task, but
makes a lot of mistakes, far more than if we
were controlling it.
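For illustration, here is a sketch of such a control in Java: it reads the XML report produced by the JaCoCo coverage tool and fails the build when line coverage drops below a threshold. The report path and the 80% threshold are assumptions for the example; in practice this check is usually configured directly in the build tool (the jacoco-maven-plugin, for instance, has a check goal with coverage rules).

```java
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.NamedNodeMap;
import org.w3c.dom.Node;
import org.w3c.dom.NodeList;

public class CoverageGate {
    public static void main(String[] args) throws Exception {
        double threshold = 0.80; // e.g. the current repository level
        DocumentBuilderFactory f = DocumentBuilderFactory.newInstance();
        // JaCoCo reports reference a DTD; don't try to download it.
        f.setFeature("http://apache.org/xml/features/nonvalidating/load-external-dtd", false);
        Document doc = f.newDocumentBuilder().parse("target/site/jacoco/jacoco.xml");
        // Report-level <counter> elements are direct children of <report>.
        NodeList children = doc.getDocumentElement().getChildNodes();
        for (int i = 0; i < children.getLength(); i++) {
            Node n = children.item(i);
            if (!"counter".equals(n.getNodeName())) continue;
            NamedNodeMap a = n.getAttributes();
            if (!"LINE".equals(a.getNamedItem("type").getNodeValue())) continue;
            double missed = Double.parseDouble(a.getNamedItem("missed").getNodeValue());
            double covered = Double.parseDouble(a.getNamedItem("covered").getNodeValue());
            double ratio = covered / (covered + missed);
            System.out.printf("Line coverage: %.1f%%%n", ratio * 100);
            if (ratio < threshold) {
                System.err.println("Coverage below threshold, rejecting the change");
                System.exit(1);
            }
        }
    }
}
```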
Third.
Coding standards.
An interesting thing.
It's called a "manifest."
I don't know if you've heard of this modern
term?
It's called "agent coding manifest."
I don't know how to translate it correctly; this is an example from
our repository, there's a link on GitHub, this is an example
of the text we use.
What is it about?
When you work with a robot, with an LLM, it, while editing
your code, knows what it knows.
The programming principles that it is familiar with.
It doesn't understand your principles.
It doesn't know, for example, that you like to name variables
in camelCase, or conversely, that you prefer kebab-case.
It doesn't know that.
It can, of course, look at your previous code and
draw conclusions from that, but in general, it approaches
you with a blank slate, regarding your repository.
You can write a so-called coding manifest, in which
you outline all your programming standards, telling
it how you want it to program.
Imagine that a Junior Developer comes to your project, and you
are training them at the start, saying, "Sit down,
grab a coffee, and we will spend 3 hours explaining how we write
in Java."
Not how to write in Java in general, but how we write
in Java.
So you need to create such a document and place it
in the repository.
This is necessary; if you don't do it, the Junior Developer will
work with you blindly.
They do not understand your repository and your style.
Creating such a document is extremely challenging.
It will take time; you need to write it in a way that is
concise and compact, and at the same time,
it should cover everything the Junior Developer needs to know about
your programming style. There is a repository I recently
found where leaked or stolen prompts are published,
or so-called system prompts that are used by
ChatGPT, Anthropic, and so on. Open repositories
on GitHub. Check there to see how such coding manifestos are written.
In what style they are written. Take the most valuable parts from there, compile
your coding manifesto, and place it in the repository. In this way,
you will prepare for the war with robots. Many probably
know this famous tweet, published not long ago,
stating that the hottest programming language today
is English. Indeed, looking at the situation with the coding manifesto,
this statement can be confirmed. You will indeed have to
program in English. You will have to learn to
write in English what you would explain to a programmer who came
to your office using gestures or code examples.
Imagine I come to your project and ask
you, well, how do you program here? Tell me. And you start
to teach me: well, you know, we place the files like this,
and we name the classes like this, and the methods
we don't name just anyhow, but like this. These are the patterns we use,
and these we don't: we tried them, and they didn't work for us. And all of this
you would be conveying into the programmer's mind
day by day, maybe month by month. With an LLM, you need to
do this quickly, in the form of a manifest. Try
writing one right now; without it, things will be difficult.
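To give a feel for what such a manifest can look like, here is a short hypothetical excerpt; the actual rules in the speaker's repository differ, and these are invented purely for illustration.

```text
Coding manifest (hypothetical excerpt)
- Name Java variables and methods in camelCase; do not abbreviate.
- Keep methods under 30 lines and classes under 250 lines.
- Constructors contain only assignments, never logic.
- Handle file-system errors by throwing exceptions; never return error codes like -1.
- We use decorators; we do not use singletons, we tried them and they didn't work for us.
```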
The fourth: style checking. When you write
code, there is a certain style that you adhere to.
Who even thinks that style in programming
is not important, but what's important is that it works? Raise your hands. Wonderful!
There are no such people, although maybe you are just afraid
to raise your hands, because we are at a conference about clean
code, but there are such people, I have encountered them in my life,
who say that style doesn't matter, everyone has their own style,
I wrote this class in my style, and the neighboring programmer...
wrote it in his style, he has tabs there, I have...
spaces, he has line breaks arranged in such a way,
I have it differently, and supposedly it doesn't matter. It does matter. And you
agree with me, since no one raised their hand. So,
style should be controlled not only through conversations
with programmers, not just through mutual, so to speak, agreement
about how we program, but with tools,
especially since the robots are coming to us.
There should be tools, configured by you, which
punish programmers for violating
the style. For example, here is a short list of the
most popular tools for style checking; you can
pick the ones for your programming language, or take
others, and configure them to your needs. You all probably
know this, these are obvious things,
but my point is that the stricter you configure the style
control tools, the easier it will be for you to deal with
the robots. Again, the robot comes to you, not understanding
your style, it needs to encounter a barrier, it needs
to write code and have someone slap its hands
and say, this is not how you should format it, this is not right. It will understand,
it will understand how to do it correctly, but it needs
feedback, and you won't have the time, strength, and energy
to manually explain each line to the robot so that it
rewrites it. You should have a style control system
in the repository. Take one of these tools, and
look at the interesting numbers: the number of rules
inside each style
checker. A huge number. Imagine: in ClangTidy,
almost 700 rules. You won't remember 700 rules manually, you
won't be able to explain to any robot, nor to any programmer,
that he did something wrong, keeping 700 rules in mind.
And a style checker can, if you, of course, enable all the rules.
I recommend you use a style checker with the strictest
possible configuration. When you take it
into the project, integrate it, and turn the configuration
up to the maximum. Let it be painful, let the programmers
complain, let there be difficulties at the start,
but it's better to go through them; then you will be fully
armed when the robots come and start trashing
your repository. And this is exactly what they will be doing.
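As a tiny illustration, in Java with a checker like Checkstyle in mind (which rules fire depends entirely on your configuration): code like this compiles and may even pass the tests, yet a strictly configured checker rejects every line of it.

```java
// Compiles fine, but a strict style checker has plenty to complain about.
public class order_processor {       // class names must be UpperCamelCase
    public double Calc(double p) {   // method names must be lowerCamelCase
        double result = p * 1.2;     // magic number: 1.2 needs a named constant
        return result;
    }
}
```

With the configuration turned up to the maximum, the build fails on each of these lines, and the robot gets its feedback automatically, without you spending a minute on it.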
Well, imagine once again a Junior who comes
into your project, pretending to be a senior, and starts writing
in the style that he wants. And no one
stops him, except for you, doing the code
review, and he sends you a huge number of files, like a thousand files,
formatted in a haphazard way, but the tests pass. And you do not
understand what the design is, what patterns are there, how
he did it. Well, it seems that the tests pass, so you accept,
and you have what you have. And there will be no turning back,
since, well, he has no conscience; you won't punish him
for this. And here are a few exotic style checkers
that might be useful for you to check out.
For example, ShellCheck. Who has ever used ShellCheck? One
person, two wonderful people. This is a checker for
bash scripts. And who writes scripts in bash?
I mean, you write scripts in bash, but you don't use
ShellCheck. Try using it. It will tell you so many
interesting things about your bash scripts; it will show you such interesting
mistakes that you make. I get a lot of pleasure
programming in bash now, after I spent a whole
year struggling with the rules of ShellCheck. In the end, I understood what
it considers to be correct, necessary, and safe
programming in bash. The same goes for other
checkers, for files of various kinds.
For example, Dockerfiles. We all write Dockerfiles; how many of you have used
Hadolint? This is a checker for higher-quality, more competent
Dockerfiles. Not just the formatting and
where to place spaces, but which commands to use,
in what order, what may be called and what
may not. Checkers will help you and protect you.
And finally, the fifth, the most difficult one; here I don't have exact answers for you.
This is an open question that needs to be explored, as I understand it.
Description of architecture. The knowledge of the developers and architects on the project must somehow be translated into English,
so that the LLM can understand how you program.
I will give a specific example. Look at two blocks of code, both written in C++.
In the left example, we are trying to read some content from a file.
If we fail, we return -1. If we succeed, we return 0.
Quite understandable error handling.
In the right case, in the right example, we are also trying to do something with the file, only in this case to save the file.
If we fail to save, we throw an exception.
Both files are functional. They may have been created by the same programmer.
Perhaps even within the same time frame.
But they are a vivid example of an inconsistent method or way of handling errors.
In one part of the program, you handle file system errors one way, in another part, you handle them differently.
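The slide shows C++; here is the same inconsistency sketched in Java, to stay consistent with the other examples in this text (the method names are illustrative):

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;

public class FileOps {
    // Style one: report failure with a return code.
    static int readContent(Path file, StringBuilder out) {
        try {
            out.append(Files.readString(file));
            return 0;   // success
        } catch (IOException e) {
            return -1;  // failure
        }
    }

    // Style two, in the same codebase: report failure with an exception.
    static void saveContent(Path file, String content) throws IOException {
        Files.writeString(file, content); // propagates IOException on failure
    }
}
```

Both methods work, and each one taken alone would pass review; the defect becomes visible only when you see them side by side.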
No style checker will catch you.
No code reviewer will catch you. And no LLM will catch you on large volumes.
If you, of course, show the LLM these two short snippets of C++ code, the LLM will say, yes, you have inconsistency here.
But if you open a repository with a thousand files, in one place it will be one way, and in another file, far from it,
it will be different, you are defenseless. No one will help you catch such an error.
This is a defect at the architectural level. And modern science and technology are powerless against this, well, almost powerless.
I will show you something in a moment. But this example is, of course, trivial compared to the threats we face.
That is, programmers write as they want. And now imagine, you invite a junior developer robot to your project,
who pretends to be a senior developer, and you say, write me the third snippet in the third file.
And it writes in some third way. And makes another mistake, similar to this, also handling errors in its own way.
How will you catch it? How will you be able to counter this? Because both pieces of code will easily pass code review.
Well, unless you are an architect who keeps a finger on the pulse and closely monitors all changes,
you might see that this is not acceptable: I remember that we always return -1 in case of an error,
so the code on the right, no-no-no, we don't do it this way. But most likely, you won't catch it manually.
That is, some tool or mechanism is needed where we could describe in text form,
how we, for example, handle errors, and the LLM will read and understand that it can only be done this way.
Our architecture is like this. That is, it is necessary to invent some language for describing architecture,
or maybe some tools for describing architecture that could be placed in the repository
and referred to each time when working with robots. What language is this? I don't know, it seems it doesn't exist.
Well, for example, there is a rather old study that says that if you have a good readme file,
then your repository tends to be more popular.
Why am I bringing this study up here? Because the readme file seems to be the place where you can describe the architecture.
That is, you have a starting readme file in the repository which, on one hand, people will read to understand
what is said there and how your repository works. And on the other hand, robots will also pay attention to it.
And it will be difficult for the LLM robot to go against what is stated in the readme file. How does the LLM work?
It tries to find an answer to your question while minimally deviating from all the context that surrounds it.
That is, you ask something, and it builds an answer that matches as closely as possible what you asked,
what it knew beforehand, and what you have in your repository.
It tries to give you an answer that is closest, among all these coordinates, to where you are.
The readme file will help you. And finally, there is a tool called ArchUnit. Who has ever heard of this?
Hardly anyone. Only two people. So there are three of us. We are trying to use it here and there, and it can help in some way.
For example, we can say that from the Source module, you can access the Target module, but you can never access the Foo module.
This rule can be formulated as a unit test. That is, you run a unit test, written with JUnit, which scans the entire bytecode
and checks whether there were accesses from the Source module to the Foo module. This is a picture from their website.
If such an access occurred, the unit test will fail. Quite a good way to control.
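Here is a minimal sketch of such a test, using ArchUnit's Java API under JUnit; the package names ("..source..", "..foo..") are illustrative stand-ins for the Source and Foo modules from the picture.

```java
import com.tngtech.archunit.core.domain.JavaClasses;
import com.tngtech.archunit.core.importer.ClassFileImporter;
import org.junit.jupiter.api.Test;

import static com.tngtech.archunit.lang.syntax.ArchRuleDefinition.noClasses;

public class ArchitectureTest {
    @Test
    void sourceMustNotAccessFoo() {
        // Scan the compiled bytecode of the whole application.
        JavaClasses classes = new ClassFileImporter().importPackages("com.example.app");
        // The rule: classes in "source" must never touch classes in "foo".
        noClasses().that().resideInAPackage("..source..")
            .should().accessClassesThat().resideInAPackage("..foo..")
            .check(classes); // fails the test if a forbidden access exists
    }
}
```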
But how much can you express in ArchUnit? It turns out, not that much. We tried to write something more or less substantial, but unfortunately, it doesn't work out very well.
So, it's an open question. Think about how to do this. We don't know; we are discussing it, and there are no serious solutions yet.
It seems that a special language is needed: English on one side, technical on the other.
And in this language, it should be possible to write and write, explaining how our architecture is structured.
And then, if we have described everything, set up style checkers, established coverage control,
then we can invite robots in large volumes without worrying that they will cause harm.
They will then contribute, contribute, adhering to our style and our architecture.
This is where we need to get to. Unfortunately, we are not there yet.
That's all. Subscribe to my Telegram channel. Thank you very much.