HomeVideos

Azure Update 10th April 2026

Now Playing

Azure Update 10th April 2026

Transcript

253 segments

0:00

Hi everyone, welcome to this week's

0:01

Azure update. It's the 10th of April.

0:04

Pretty small number of updates this

0:05

week, but as usual you can jump to

0:07

whichever one you care about the most.

0:10

Just one new video this week. It's been

0:12

a crazy week and this took a lot of

0:14

research and investigation. And it's

0:16

really about optimizing Cosmos DB.

0:18

Cosmos DB is amazingly powerful,

0:21

but there are specific types of skew and

0:24

configuration, there's data modeling, so

0:26

there's a lot of considerations

0:28

to make sure you're optimizing

0:31

what you pick as the service, and then

0:33

how you pick things like partition keys

0:36

and if I want global secondary indexes.

0:38

And so I go into all of that detail so

0:41

you can make sure you get the most

0:42

optimal Cosmos DB environment.

0:45

So on to what's new on the computer

0:46

side.

0:48

So the AKS CNI overlay

0:51

for expanding the IP address space has

0:54

gone GA.

0:56

So the whole point of CNI overlay is it

0:58

uses a separate side range from that of

1:01

the node. So the node is joined to a

1:03

certain subnet, it uses those IP spaces,

1:06

and then the pods use a separate IP

1:09

space, so it saves from that underlying

1:11

node IP space you're using for your

1:13

VNet.

1:14

Now each node gets assigned a /24

1:17

from the side range that's assigned to

1:19

the pods. So what I can now do is expand

1:22

that pod side address range. I cannot

1:25

shrink it. I cannot try and add a

1:27

non-contiguous

1:29

space to it or add a brand new range,

1:31

but I can take the existing range and

1:34

expand it. I.e. I could change that pod

1:36

side range from a /18 to a /16.

1:41

This is only supported for IPv4 side

1:44

ranges and Linux nodes only today.

1:48

On the Azure function side,

1:51

it now has MCP resource triggers in GA.

1:56

So the whole goal here is I can host an

1:57

MCP server on an Azure function. So

2:00

obviously MCP is fantastic for it. It's

2:02

a standard way to talk. It's a standard

2:04

way for an AI application to basically

2:07

ask the MCP server, "Hey look, what are

2:09

your capabilities?" The MCP server

2:11

reflects back its capabilities, and then

2:14

there's a standard way to

2:16

translate that and convey that to the

2:18

large language model so it can then use

2:20

it.

2:21

So Azure functions could already expose

2:23

tools.

2:25

So hey, I want to perform a certain

2:27

action, but now it can expose resources,

2:29

think about knowledge as well.

2:31

So it could now expose static or dynamic

2:34

content. So I would now add a resource

2:36

trigger for the Azure function so it

2:39

respond to the re- source request from

2:42

the app.

2:43

So my Azure function-based MCP server

2:45

can now be more complete in its

2:46

functionality offering both tools and

2:49

resources, i.e. knowledge.

2:53

So for AKS, I can now disable the HTTP

2:56

proxy in GA. So the whole goal of HTTP

2:59

proxy is, "Hey look, I have outbound

3:01

traffic. I need to flow through a

3:04

required set of infrastructure, maybe

3:05

inspection, maybe it's just limited who

3:07

can talk to the internet or whatever

3:09

else." I can now disable the HTTP proxy

3:12

for an existing cluster. Now, it will

3:15

result in a re-image of the nodes,

3:17

so you'd want to make sure you're using

3:18

things like disruption budgets to ensure

3:22

the disruption to the bots pods pods

3:25

is controlled, so you're safeguarding

3:27

any kind of critical workload.

3:31

Still talking about AKS, so there's been

3:33

some observability improvements in GA

3:35

and specifically thinking about

3:36

namespace and workload views of the

3:39

data,

3:40

well this now can also utilize data from

3:43

an Azure monitor workspace. So that's

3:45

where it gets data powered by

3:47

Prometheus, which has that deep

3:49

understanding of Kubernetes.

3:51

So now when I'm looking at node,

3:54

namespace, workload, pod resource

3:57

utilization,

3:59

it's going to surface all of that data

4:02

in those various views, which is going

4:04

to give me a better understanding of

4:05

resource utilization, but also then

4:07

better enable me to troubleshoot, look

4:09

at trends.

4:12

Uh the Azure Red Hat OpenShift

4:15

clusters and the nodes now support using

4:18

skews that are powered by Nvidia H100

4:21

and H200 GPUs. So basically what this

4:25

now lets me do is, "Hey, I can use those

4:26

VM skews with GPUs." So thinking of the

4:29

workloads I could run in those clusters,

4:32

well anything that uses a GPU, AI

4:34

workloads, high performance computing,

4:36

anything where those GPU-accelerated

4:38

containers would be useful.

4:41

On the networking side,

4:44

so the Azure Network Watcher now has a

4:46

rule impact analysis in preview.

4:50

So I hey, I'm going to make some

4:51

security admin rules

4:53

and I'm going to change them

4:55

before they get applied to the

4:57

environment, it will help me understand

4:59

what the implications are. So I'm going

5:02

to be able to look at what the impact

5:04

will be of any proposed rule change

5:07

before I go and apply it. Maybe I've got

5:09

a misconfiguration that will break

5:10

everything. So this will help me

5:12

understand that before I go ahead and

5:13

roll the thing out.

5:16

For the network um security perimeter

5:18

feature which allows

5:20

multiple pass services to be put within

5:22

the same perimeter, so then they can all

5:24

talk to each other, but also I can

5:27

singly configure inbound and outbound

5:29

connections to that group of services

5:31

within the perimeter,

5:33

well I can now include Azure Service Bus

5:35

as one of those services that can sit

5:37

within the network service perimeter.

5:39

Imagine for example I've got my network

5:42

um

5:42

perimeter with Azure Service Bus and

5:44

then Azure Key Vault.

5:46

So now it'll be really easy for that

5:48

Service Bus to go and talk to the Key

5:49

Vault to get the key it's using so I

5:51

want to use customer-managed key. So

5:53

there's a whole bunch of scenarios this

5:54

becomes really useful.

5:56

On the storage side,

5:58

so Azure Migrate now supports Azure

6:01

Files as part of its assessments in

6:03

preview. So it can look at your on-prem

6:06

SMB or NFS file shares and then evaluate

6:09

the suitability and the business case

6:12

for migrating them to Azure Files. Azure

6:14

Files supports both SMB and NFS over its

6:16

various offerings.

6:18

So this would also tell you, "Hey, which

6:20

skew of Azure Files you should use based

6:24

on resiliency requirements, performance

6:26

requirements, and obviously, hey, what

6:27

region does this need to live in?"

6:30

Database,

6:33

so there's a consolidation of

6:35

maintenance notifications for

6:37

PostgreSQL. So if I have multiple

6:39

different servers even distributed over

6:41

subscriptions within the same region, I

6:43

will receive a consolidated notification

6:46

of the maintenance for all of the

6:48

servers within the region. Previously,

6:49

it was a separate message per server, it

6:51

got kind of bloated. This really just

6:53

improves that usability of the

6:56

notifications.

6:58

And then PG Bouncer 1.25.1

7:01

support is now GA.

7:03

PG Bouncer is the connection pooling

7:04

capability.

7:06

So when I think about scaling the

7:08

connections to my server, it exposes

7:10

itself on a different port,

7:12

but if I have a lot of idle connections,

7:14

if I have lots of short-lived

7:15

connections, connection pooling makes it

7:18

far more scalable for those connections.

7:21

And the 1.25.1 update just has a bunch

7:23

of performance, uh stability, security,

7:27

and protocol improvements. And this will

7:28

just it's managed, it's going to go

7:30

ahead and hap- happen for you.

7:33

And then miscellaneous,

7:35

so three new models in public preview

7:39

for state-of-the-art

7:41

speech transcription

7:44

across 25 languages at 50% reduced GPU

7:47

compared to existing capabilities.

7:50

A high-fidelity speech generation model

7:53

can generate 60 seconds of speech in 1

7:56

second.

7:58

And then Maya Image 2,

8:00

which is a high-capability text-to-image

8:03

model. So these are out of Microsoft

8:06

focusing on very specific scenarios to

8:11

hey, light up a whole bunch of different

8:13

types of AI

8:14

and capabilities you may want. Now

8:16

Microsoft Foundry is constantly, I think

8:17

every day there's new models being

8:18

added, but the GLUE 4.2 is now

8:22

available. This is a

8:24

general-purpose large language model

8:26

that's obviously part of the GLUE 4

8:27

family, but it's designed for

8:29

reasoning-intensive

8:31

real-world problem solving. So I'm just

8:33

calling this out as an example of I

8:35

think there's nearly like 12,000 models

8:37

there, that's one of them.

8:40

And Foundry Local went GA.

8:42

So this is all about running models on

8:44

the local device. Now it's a super light

8:46

runtime, but it does all the work to

8:48

acquire the model, manage the model,

8:50

utilize hardware acceleration on the

8:52

local device, so GPUs, MPUs,

8:54

and then actually use the model for

8:56

inferencing and it uses the ONNX

8:59

uh runtime.

9:01

It's only about 20 meg gets added to

9:03

your app package and it's going to use a

9:05

curated model catalog. So it's not going

9:07

to expose every model in Foundry, but

9:09

they wouldn't run.

9:10

It focuses instead on optimized models

9:13

for specific use cases, applications

9:16

need where I want to run it locally. So

9:19

there's a big push today about sort of

9:21

hybrid capability where I can, let's use

9:24

the capability of the local device, and

9:26

then scale and offload certain things to

9:28

the cloud.

9:29

And

9:31

that was it. As always, I hope that was

9:32

useful. Till the next video, take care.

9:37

>> [snorts]

Interactive Summary

This Azure update for April 10th covers several key enhancements across compute, networking, storage, and AI. Highlights include optimization strategies for Cosmos DB, the General Availability (GA) of the AKS CNI overlay expansion, and new MCP resource triggers for Azure Functions to better support AI applications. Additionally, Azure Red Hat OpenShift now supports Nvidia H100 and H200 GPUs, Azure Migrate adds assessment for Azure Files, and Foundry Local has reached GA, enabling efficient on-device AI model inferencing.

Suggested questions

5 ready-made prompts