Automatic CPU and NUMA pinning
Formal Metadata
Title: Automatic CPU and NUMA pinning
Series: FOSDEM 2022
Number of parts: 287
License: CC Attribution 2.0 Belgium: You may use, alter, reproduce, distribute, and make the work or its content publicly available for any legal purpose, in altered or unaltered form, provided you credit the author/rights holder in the manner they specify.
Identifier: 10.5446/57013 (DOI)
FOSDEM 2022, talk 180 of 287
Transcript: English (auto-generated)
00:06
Hello everyone, I'm Van Vattenberg from VEDAT, and I will speak today about automatic CPU and NUMA pinning. It's a new feature in oVirt.
00:21
Three years ago at FOSDEM we introduced in oVirt a new VM type: high performance. It was useful for CPU-intensive workloads, especially SAP HANA VMs. This VM type automatically configured some VM properties for you.
00:47
Such as making the VM headless and dropping the USB controller, and more. But it wasn't complete: you still needed some manual modifications to get the real benefit of the high-performance type in terms of CPU.
01:14
A little bit about the CPU and its topology. You have the CPU, which is basically split into sockets — in this example, two sockets.
01:26
Within the sockets you have the cores, which are the processors. Each one of them can be split into threads. We don't deal with dies in oVirt. As for NUMA: NUMA is Non-Uniform Memory Access.
01:44
Each NUMA node has separate CPUs, a memory controller and memory, and I/O controllers and devices. Distance between nodes is measured in terms of locality.
02:00
Usually each NUMA node has one socket, and each CPU and core is basically assigned local memory to use. So if you configure the pinning right, the VM uses specific, local memory
02:22
that is physically close to the physical CPUs, and in terms of performance it's faster. Here's how we configure CPU pinning in oVirt.
02:41
So, it's a string that we specify in the VM edit configuration. It's pretty difficult to understand and difficult to write. You can limit virtual CPUs to one or more physical CPUs.
03:00
It basically reduces the movement of the vCPUs across physical processors. Here's an example of a CPU pinning string. As you can see, it's not very easy to read. It means, for example, that virtual CPU 0 is assigned to physical CPU 3, virtual CPU 2 is assigned to physical CPUs 1 or 2, and so on.
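To make the string format described here concrete, below is a small parser sketch. It assumes the oVirt-style syntax of "vcpu#pcpuset" pairs joined by underscores, with ranges allowed in the pCPU set (e.g. "0#3_2#1-2"); the function name is mine, and the real syntax also has extras (such as "^" exclusions) not handled here.

```python
# Sketch of a parser for an oVirt-style CPU pinning string such as "0#3_2#1-2",
# meaning: vCPU 0 pinned to pCPU 3, vCPU 2 pinned to pCPUs 1-2.
# Assumed syntax: "<vcpu>#<pcpuset>" pairs joined by "_", where a pcpuset may
# contain single CPUs, ranges ("1-2"), and comma-separated parts.

def parse_pinning(pinning: str) -> dict[int, set[int]]:
    mapping: dict[int, set[int]] = {}
    for pair in pinning.split("_"):
        vcpu, pcpus = pair.split("#")
        cpus: set[int] = set()
        for part in pcpus.split(","):
            if "-" in part:
                lo, hi = part.split("-")
                cpus.update(range(int(lo), int(hi) + 1))
            else:
                cpus.add(int(part))
        mapping[int(vcpu)] = cpus
    return mapping
```

For the example in the talk, `parse_pinning("0#3_2#1-2")` yields vCPU 0 → {3} and vCPU 2 → {1, 2}.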
03:31
And there are limitations to using this method. It's a static configuration: once you edit it on the VM, it requires pinning the VM to the host.
03:46
These CPUs are shared. This means the physical CPU is not exclusive to the VM or virtual CPU you pin: other VMs and processes can run on the same physical CPUs.
04:06
And configuring meaningful pinning for a number of VMs on a host is a tedious task. As you can see, with a VM with many virtual CPUs and a host with many physical CPUs,
04:23
when you wish to pin it, maybe for one VM it's fine, but when you're doing it for multiple VMs it starts to be hard to do. For the high-performance VM there is a manual procedure, basically guidelines
04:43
for SAP HANA VMs, that defines the pinning. Here is an example of the manual pinning. You select a host and you get its topology. Once you have the CPU topology and the NUMA topology,
05:02
you change the VM CPU topology to fit this host topology. For example, if you had a host with one socket, three cores, and two threads, then you set your VM with one socket, two cores — which is one core less — and two threads.
05:26
This is the resize process, and the dropped core is basically left to the host itself. For a high-performance VM you also pin the I/O thread and the emulator, usually to the first core.
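The resize rule described here — copy the host's sockets and threads, drop one core per socket for the host — can be sketched as follows. The `Topology` type and function name are mine, not oVirt's; this is a minimal sketch of the stated guideline, not the engine's actual implementation.

```python
# Sketch of the manual "resize" step: grow the VM topology to match the host
# (same sockets and threads per core), but drop one core per socket so the
# host's own processes keep a core.

from dataclasses import dataclass

@dataclass
class Topology:
    sockets: int
    cores: int    # cores per socket
    threads: int  # threads per core

def resize(host: Topology) -> Topology:
    # Keep at least one core so very small hosts still yield a usable VM.
    return Topology(host.sockets, max(host.cores - 1, 1), host.threads)
```

With the talk's example of a host with one socket, three cores, and two threads, this yields a VM with one socket, two cores, and two threads.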
05:47
This is the idea behind it. You change the virtual NUMA to fit the host's physical NUMA in terms of node count, and then you run the script on the desired host.
06:04
It generates the CPU pinning string for you based on the NUMA nodes and the host topology. So it pins according to the socket, of course. And because it's a script, it only supports some topologies, not all of them.
06:27
Then you need to copy the output of the script — the CPU pinning string — into the VM configuration, and manually pin the virtual NUMA to the physical NUMA accordingly.
06:42
In oVirt 4.4 we introduced a new feature: CPU and NUMA auto-pinning. We assigned the CPUs based on the host topology. We had one policy, resize and pin, that resizes the VM topology and the VM NUMA nodes
07:05
based on the previous manual procedure for SAP HANA. It took effect on VM edit, which means that when you set it in the configuration and click OK, all the static fields — the CPU pinning, NUMA, and so on — are set.
07:25
And it did not change on VM start; the configuration stayed as it was set in the VM's static configuration. OK, a bit about the CPU pinning policy. So we introduced a new configuration: the VM CPU pin policy.
07:47
The resize and pin option does, as I said, the manual procedure automatically. We extended the support; for example, the script only supported a full-thread topology on the host,
08:04
and now we support a one-thread topology as well. In the future, we plan to make it generic for any number of threads. And we have the same limitation that you need to pin the VM to one or more hosts.
08:28
As well, we introduced another policy, called pin. The pin policy did not change the topology — it didn't do the resize part. It means you keep the CPU topology the VM was configured with,
08:46
and the algorithm runs against the host and basically gets you the best pinning it can with the current CPU topology.
09:01
Finally, one major flaw is that we could use the same physical CPUs on the host for multiple VMs. For example, when you run two VMs, and the host has two sockets, and your high-performance VMs were supposed to use one socket each,
09:23
both would use the same socket, which is not good, leaving the second socket free without any high-performance VM. At the moment, we are discussing an alternative solution to add this policy back.
09:44
It could use the feature of dedicated CPUs, which I will talk about a bit later, or use shared CPUs as well, like resize and pin, changing the algorithm
10:01
to decide which physical CPUs are free to use and not reuse the same ones. Here is the view in the UI, where it can be easily configured: you just edit the VM, go into the Resource Allocation tab, and set the CPU pinning policy to resize and pin NUMA.
10:29
Also, the API is pretty simple: you just need to provide the CPU pinning policy set to the desired policy.
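The API call mentioned here might look like the sketch below. This is hedged: the `cpu_pinning_policy` element and the `resize_and_pin_numa` value follow my reading of the oVirt 4.5 REST API model, and the helper function is mine — check the official API reference before relying on the exact names.

```python
# Hedged sketch of updating a VM's pinning policy via the oVirt REST API.
# The endpoint path and the "cpu_pinning_policy" element are assumptions based
# on the oVirt 4.5 API model; verify against the official API reference.

def build_update_body(policy: str) -> str:
    """Build the XML body for a PUT to /ovirt-engine/api/vms/{vm_id}."""
    return f"<vm><cpu_pinning_policy>{policy}</cpu_pinning_policy></vm>"

# Example payload requesting the resize-and-pin-NUMA policy:
body = build_update_body("resize_and_pin_numa")
```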
10:43
Here is an example of how we do the resizing part. In this example we have a host, which you will see in the next slide, with two sockets, three cores, and one thread.
11:02
We have the initial VM with one socket, one core, one thread. We just increase the VM topology to have two sockets, as the host has, and two cores instead of three — we drop one.
11:23
And one thread as well. We also set the VM with two virtual NUMA nodes, like the host has. Here is how the pinning itself is done afterwards.
11:45
After we have increased and resized the topology, we now pin it. As you can see, we leave cores 0 and 3 — the first core in each socket — to the host, and we pin each core to a physical core accordingly.
12:01
Core 0 in the VM goes to core 1, core 1 to core 2 — socket 0 to socket 0, basically. And the same for the second socket, socket 1. We also pin the NUMA: virtual NUMA 0 to physical NUMA 0.
12:20
The CPU pinning, in this simple example, is 0 to 1, 1 to 2, and so on. This is a high-level example, a pretty simple one. Once you add threads and so on, it becomes a bit more complicated and pretty long.
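The pinning step just described, for the single-threaded two-socket example, can be sketched like this. The function and its output format are my illustration of the described behavior (first core of each socket skipped, remaining cores pinned in order), not the engine's actual code.

```python
# Sketch of the pinning step for the example above: a host with two sockets
# and three cores per socket (pCPUs 0-2 and 3-5). The first core of each
# socket is left to the host; the remaining cores are pinned in order,
# producing an oVirt-style pinning string.

def pin_vcpus(host_sockets: int, host_cores: int) -> str:
    """Return a pinning string such as '0#1_1#2_2#4_3#5'."""
    pairs = []
    vcpu = 0
    for socket in range(host_sockets):
        first_pcpu = socket * host_cores
        # Skip the socket's first core, pin the rest in order.
        for pcpu in range(first_pcpu + 1, first_pcpu + host_cores):
            pairs.append(f"{vcpu}#{pcpu}")
            vcpu += 1
    return "_".join(pairs)
```

For the talk's example, `pin_vcpus(2, 3)` gives "0#1_1#2_2#4_3#5": vCPU 0 to pCPU 1, vCPU 1 to pCPU 2, and the second socket's vCPUs to pCPUs 4 and 5.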
12:45
This ensures that the virtual CPUs use a virtual NUMA node that is pinned to the right physical CPUs and the right physical NUMA node,
13:00
basically getting closer to bare metal in terms of CPU, because the virtual side maps to the physical side and stays in the same local place — that's the idea. And while doing so, we also fixed the incorrect splitting of the vCPUs into the vNUMA nodes.
13:27
oVirt generates the CPU set for each NUMA node behind the scenes, and with the previous algorithm a core could be divided into two different NUMA nodes.
13:41
That causes a problem: within the guest, you could get a CPU topology that is not the one you actually set in the VM configuration. And that's not what we want in terms of performance. As I said earlier, in terms of NUMA, you want the threads to stay in the same core,
14:11
and the same core to stay on the same NUMA node. Here is an example of how it was done by the previous algorithm.
14:21
You can see that we had an eight-CPU VM with one socket, four cores, and two threads. And we had three virtual NUMA nodes. So the previous algorithm just took the number of CPUs,
14:42
divided it by the virtual NUMA count — eight divided by three — and once there was a remainder, it just added one virtual CPU to each NUMA node until no remainder was left.
15:06
Now the algorithm tries to build the NUMA nodes correctly, grouping the threads into their cores, to get the same CPU core into the same NUMA node instead of splitting it.
15:25
It is better performance-wise, and also better in terms of not misleading the underlying OS in the guest, and you get the topologies that are expected.
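The two splits contrasted above can be sketched for the eight-vCPU example (one socket, four cores, two threads, three vNUMA nodes). The old division spreads the remainder one vCPU at a time, which can cut a core's two threads across nodes; the new one distributes whole cores. The round-robin distribution strategy in `grouped_split` is my simplification — the key invariant from the talk is only that a core is never split.

```python
# Sketch contrasting the old and new vCPU-to-vNUMA splits described above,
# for an 8-vCPU VM with 2 threads per core and 3 virtual NUMA nodes.

def naive_split(vcpus: int, numa_nodes: int) -> list[list[int]]:
    """Old behaviour: divide the vCPU count by the vNUMA count and spread the
    remainder one vCPU at a time, so a core's threads can land in two nodes."""
    base, rem = divmod(vcpus, numa_nodes)
    out, start = [], 0
    for node in range(numa_nodes):
        size = base + (1 if node < rem else 0)
        out.append(list(range(start, start + size)))
        start += size
    return out

def grouped_split(vcpus: int, threads: int, numa_nodes: int) -> list[list[int]]:
    """New behaviour: distribute whole cores (groups of `threads` vCPUs),
    so a core is never split across vNUMA nodes."""
    cores = [list(range(c, c + threads)) for c in range(0, vcpus, threads)]
    out: list[list[int]] = [[] for _ in range(numa_nodes)]
    for i, core in enumerate(cores):
        out[i % numa_nodes].extend(core)  # simple round-robin over nodes
    return out
```

With 8 vCPUs and 3 nodes, `naive_split` yields [[0, 1, 2], [3, 4, 5], [6, 7]] — the core holding threads 2 and 3 is split between nodes 0 and 1 — while `grouped_split(8, 2, 3)` keeps every thread pair together.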
15:42
And in oVirt 4.5, a new feature is coming, called dedicated CPUs, and all of this pinning work was required for it. The new policy makes the CPU pinning exclusive: each vCPU gets exclusiveness over a physical CPU,
16:04
and other vCPUs won't be able to use it. Which means that each VM's vCPUs get their own physical CPUs, and other VMs won't be able to use
16:21
the same physical CPUs. Other processes and anything else running on the host are still able to use them, but not our VMs. The effort was to make the CPU pinning policy similar to that of OpenStack.
16:44
And based on that, it requires CPU assignment at runtime, and here we get a little bit of a chicken-and-egg problem. The old resize and pin flow was fairly simple:
17:01
you just pin the VM to the desired host and select the policy, and once you click OK, the engine sets the CPU topology, the CPU pinning string, the NUMA pinning — all the pinning itself — into the static configuration of that VM.
17:24
And now, in the new resize and pin flow, we select the policy for the VM and we don't set anything. We run the VM and the engine selects a host for us,
17:47
and only then is the pinning set. So this comes back to the chicken-and-egg problem. We do all the validations and resource handling based on the static configuration, but in this flow,
18:05
as we wish to do it in the run phase, there is no static configuration for that. We don't have anything, and we haven't chosen a host yet. So we had a problem: we need to calculate the intended configuration and save
18:28
it in a special place in order to validate and schedule the VM on a host, and only then do we know what we're currently using.
18:42
Once the VM goes down, we need to reset it. And of course, we drop the limitation here: we no longer need the VM to be pinned to a host. So we basically ended up setting these as dynamic properties that can be changed.
19:02
And we check them in the run phase and calculate what's needed in order to make things work and be aligned with dedicated CPUs. So, what is next and what is left? Of course, the pin policy, which is under discussion.
19:21
And the huge pages configuration, which completes the high-performance configuration. It's a problematic configuration because we don't know the user's requirement for the huge page size, and it requires preparation to have enough huge pages of the right size set on the host.
19:49
Otherwise, running the VM can fail, which we don't want to happen. And one-gigabyte huge pages can affect the migration flow in the convergence part.
20:02
So, currently, we don't do it automatically. Here are links for the dedicated CPUs policy and for the hosts and high-performance VMs. And thank you all — I'm ready for any questions.
20:21
Thank you.
21:10
Can auto-pinned VMs migrate? They may require the same hardware.
21:21
Previously, if you used the auto flow in 4.4, it might be a problem, because it needs the same hardware between those hosts. But now, with the run phase, migration actually can work, because the pinning is recalculated when the VM is starting.
21:54
And I will repeat the questions and answers. Can auto-pinned VMs migrate? Will they be pinned on the destination host?
22:02
And what happens if the NUMA topology doesn't match the destination host? As my presentation is about high-performance workloads: the auto pinning uses shared CPUs,
22:26
and it consumes basically all of the host's physical CPU hardware. So if you use a VM with a heavy, CPU-intensive workload, it will consume your host.
22:44
So running more VMs will cause less effective performance. For SAP HANA, it's basically recommended to run one such VM on the host. I think this is it, mostly.
23:05
You don't even need to pin the VM to a host now with the current implementation.
24:11
Is the auto-pinning feature oVirt-specific? Or will it be available on a plain Linux distro with QEMU or KVM, for instance?
24:21
Yes, it's oVirt-specific. We do all the logic and the algorithm in oVirt, in the manager. So it is specific to oVirt.
24:50
Just to add, of course, you can do it manually when you configure your VM
25:03
and run the commands.
26:21
Okay, I guess there are no more questions and the time is up. So, thank you all for joining and listening. I hope it will be useful for you. And see you all.