
vhost-user-blk: a fast userspace block I/O interface


Formal Metadata

Title
vhost-user-blk: a fast userspace block I/O interface
Number of Parts
542
License
CC Attribution 2.0 Belgium:
You may use, change, and reproduce the work or its content, and distribute and make it publicly available in unchanged or changed form for any legal purpose, provided you credit the author/rights holder in the manner they specify.

Content Metadata

Abstract
vhost-user-blk is a userspace block I/O interface that has traditionally been used to connect software-defined storage to hypervisors. This talk covers how any application that needs fast userspace block I/O can use vhost-user-blk, and its advantages over network protocols. A client library called libblkio, available to C and Rust applications, will also be introduced. The protocol is summarized for those wishing to understand how it works or implement it from scratch. This talk is intended for developers interested in connecting applications to SPDK or qemu-storage-daemon and those who want to know more about software-defined storage interfaces.
- Local block storage interfaces
- Kernel vs userspace
- Notifications vs polling
- Message-passing vs zero-copy
- What is vhost-user-blk?
- Implemented by qemu-storage-daemon and SPDK
- virtio-blk and VIRTIO
- How to connect using libblkio (C/Rust)
- How to implement a server using libvhost-user (C) or vhost-user-backend (Rust)
- How to integrate with the Linux kernel block layer using VDUSE
Transcript: English (automatically generated)
Hi, my name is Stefan Hajnoczi, and I work on QEMU and Linux. And today I want to talk about vhost-user-blk, a fast user space block IO interface. So what is vhost-user-blk? vhost-user-blk allows an application to connect to a software-defined storage
system that is running on the same node. So in software-defined storage, or in storage in general, there are three popular storage models: block storage, file storage, and object storage. And vhost-user-blk is about block storage. So for the rest of this presentation, we're going to be talking about block storage.
And block storage interfaces, they have a common set of functionality. First of all, there's the core IO, reads, writes, and flushes. These are the common commands that are used in order to store and retrieve data from the block device. Then there's data management commands.
These are used for mapping and allocation of blocks. Discard and write zeroes are examples of these kinds of commands. There are also auxiliary commands, like getting the capacity of the device. And then finally, there can be extensions to the model, like zoned storage, that go beyond the traditional block device model.
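For concreteness, these command categories map onto a small set of request type constants in the virtio-blk device model that vhost-user-blk builds on (values as defined in the VIRTIO specification; the capacity lives in the device's config space rather than behind a command):

```c
/* virtio-blk request types, as defined in the VIRTIO specification
 * (see also linux/virtio_blk.h). */
#define VIRTIO_BLK_T_IN            0   /* core I/O: read */
#define VIRTIO_BLK_T_OUT           1   /* core I/O: write */
#define VIRTIO_BLK_T_FLUSH         4   /* core I/O: flush */
#define VIRTIO_BLK_T_GET_ID        8   /* auxiliary: device identifier */
#define VIRTIO_BLK_T_DISCARD       11  /* data management: discard */
#define VIRTIO_BLK_T_WRITE_ZEROES  13  /* data management: write zeroes */
```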
vhost-user-blk supports all of these things, and it's at a similar level of abstraction to NVMe or to SCSI. So let's start by looking at how vhost-user-blk is a little bit different from things like NVMe or SCSI,
things that are network protocols or hardware storage interfaces. vhost-user-blk is a software user space interface. So let's begin by imagining we have a software-defined storage system that is running in user space. And it wants to expose storage to applications. So if we're using the kernel storage stack, what will happen is we'll need
some way to connect our software defined storage to the kernel and present a block device. Ways of doing that might be NVMe over TCP, or as an iSCSI LUN, or maybe as an NBD server, and so on.
And so that's how a software defined storage system might expose its storage to the kernel. And when our application opens a block device, it gets a file descriptor and then it can read or write using system calls from that file descriptor. And what happens is execution goes into the kernel's file system and
block layers. And they will then talk to the software defined storage system. Now that can be somewhat convoluted because if we've attached, say, using NVMe over TCP, the network stack might be involved and so on. And at the end of the day, all we're trying to do is communicate between
our application and the software defined storage processes that are both on the same node. They're both running on the same operating system. User space storage interfaces, they leave out this kernel storage stack. And instead they allow the application to talk directly to the software defined storage process.
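To ground the comparison, this is what the kernel path looks like from the application's point of view: open a block device node and issue system calls against the file descriptor, with everything behind the descriptor being the kernel's business (a minimal sketch; the device path is just an example):

```c
#include <fcntl.h>
#include <stdio.h>
#include <unistd.h>

int main(void)
{
    /* Example device node exposed by the kernel block layer. */
    int fd = open("/dev/nvme0n1", O_RDONLY);
    if (fd < 0) {
        perror("open");
        return 1;
    }

    char buf[4096];
    /* Each pread() enters the kernel: file system and block layers,
     * possibly the network stack (e.g. NVMe over TCP), before finally
     * reaching the software-defined storage process on the same node. */
    ssize_t n = pread(fd, buf, sizeof(buf), 0);
    if (n < 0) {
        perror("pread");
    }

    close(fd);
    return 0;
}
```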
Now there are a number of pros and cons to using a user space interface. And I'll go through them here. So I've already kind of alluded to the fact that if you have a user space interface and you don't go through the kernel storage stack, then you can bypass some of that long path that we discussed.
For example, going down into the kernel, coming back out using something like NBD or iSCSI in order to connect to another process on the same node. There must be a faster way of doing that, right? So with vhost-user-blk, it turns out we can actually get rid of system calls entirely from the data path.
So reads and writes and so on from the device don't require any system calls at all. And we'll have a look at how that's possible later on in this talk. But speed is one of the reasons why a pure user space interface for block IO is an interesting thing. Another reason is security.
Typically, in order to connect a block device to the kernel, you need to have privileges because it can be a security risk to connect untrusted storage to your kernel. And the reason for that is that there's a bunch of code in the storage stack that's going to run and it's going to process and be exposed to this
untrusted data. If you think about a file system and all its metadata, that can be complex. And so there's a security risk associated with that. And therefore, privileges are required to create block devices. An ordinary unprivileged process cannot attach and mount a block device. So in a scenario where you do have an untrusted block device and you
would like to remove the attack surface there, then using a user space interface allows you to avoid that. Also, if you don't have permissions, if you simply don't have permissions, then you won't be able to create a kernel block device. So then a user space interface is beneficial as well.
Now, those were the pros. Of course, there are drawbacks to having a user space interface. First of all, it's complex. Compared to simply opening a file and reading and writing from the file descriptor, you're going to have to do a lot more because all the logic for actually doing IO and
communicating is now the responsibility of the application and not the kernel. So there's that. In addition, if you think about existing programs that you might want to use to access your storage, they won't have support for any new interface that is user space only. They are probably using the POSIX system calls and read and
write and so on, and that's what they expect. So you'll have to port those applications in order to access your software-defined storage system if you rely on a user space interface. Another disadvantage is that if you have a user space interface, then the kernel storage stack isn't involved.
So if you decide you need a feature from the kernel storage stack, whatever that may be, or if you have a legacy application that you cannot port and that needs to talk to a kernel block device, then again, you have a problem because your software-defined storage system is isolated.
Its block devices aren't connected to the kernel. What we're going to do today is look at both these pros and cons, and also see how, with vhost-user-blk, we can actually overcome these cons. So let's start by looking a little at some of the performance aspects,
how this can be fast. I said no system calls are required, so how does that even work? If the software-defined storage system and the application need to communicate, how can they communicate without system calls? All right, so one of the important concepts in IO is how to wait for the completion of IO.
When you submit an IO request, maybe you have no more work for your process to do. Maybe the CPU is essentially idle until that IO request completes, and at that point, you'll be able to do more work. The normal thing to do in that case is to then de-schedule your application
and let other threads, other tasks on the system run. And maybe if there are no other tasks, then the kernel will just put the CPU into power-saving mode. It'll put it into some kind of low power state, and it will awake once the completion interrupt comes in. And you can see that at the top of this slide, at the top diagram,
you can see that there's the green part where we submit the IO, and at that point, we run out of things to do because we're going to wait for completion. So then there's this gray part where other tasks are running, power-saving is taking place, and during that time, the first portion is spent with the IO actually in flight. That's where we're legitimately waiting
for the IO request to complete so that we can proceed. But then what happens is that the IO request completes, and we need to somehow get back to our de-scheduled process. Now, depending on what other tasks are running, their priorities, the scheduler, and so on, our task might not get woken up immediately. Or maybe if the CPU is in a low power state,
it'll just take some time to wake up, handle that interrupt, restore the user space process, and resume execution. So this leads to a wake-up latency, an overhead that is added. And so this is why notifications, also sometimes called
interrupts, can be something that actually slows down your IO processing. An alternative is to use polling. So polling is an approach where, once you have no more work to do, instead of de-scheduling, you repeatedly check whether the IO is complete yet. And by doing that, you're not giving up the CPU.
So you keep running and you keep consuming CPU. The advantage is that you don't have this wake-up latency. Instead, your process will respond immediately once the IO is complete. The drawback, of course, is that you're hogging the CPU and you're wasting power while there's nothing to do.
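To make the contrast concrete, here is a minimal sketch of both strategies; the eventfd and the shared completion flag are illustrative stand-ins, not part of any particular API:

```c
#include <poll.h>
#include <stdatomic.h>
#include <stdint.h>
#include <unistd.h>

/* Notification-based waiting: de-schedule until the kernel wakes us.
 * The CPU is free for other tasks, but wake-up latency is added. */
static void wait_notification(int efd)
{
    struct pollfd pfd = { .fd = efd, .events = POLLIN };
    poll(&pfd, 1, -1);                 /* blocks; task is de-scheduled */

    uint64_t count;
    read(efd, &count, sizeof(count));  /* consume the event */
}

/* Polling: spin on a completion flag in shared memory.
 * Lowest latency, but the CPU is busy the whole time. */
static void wait_polling(_Atomic int *complete)
{
    while (!atomic_load_explicit(complete, memory_order_acquire)) {
        /* busy-wait: no system calls, no de-scheduling */
    }
}
```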
So these are two techniques, and I think we're going to keep them in mind because we'll see how they come into play later. The next performance aspect I want to mention, one that's important to understanding how vhost-user-blk is different from using a network protocol or an existing storage interface, is message passing versus zero copy.
As programmers, we learn that when we have a large object in our program, we shouldn't pass it around by value because it will be copied and that will be inefficient. And instead, what we do is we use references or we use pointers, allowing the function that receives the object to just go and access it in place
rather than taking copies. And in inter-process communication and in networking, there are similar concepts. By default, things are message passing. We build a message. It gets copied through various buffers along the network path. Eventually the receiver receives it into its buffer and then it parses it.
And so that model is the traditional networking model. It's also the IPC model. It has strong isolation. So for security, it's great because it means that the sender and the receiver don't have access to each other's memory. Therefore they cannot interfere or crash each other and do various things. But the downside is that we have these intermediate copies
and that consumes CPU cycles and it's inefficient. So the zero copy approach is an approach where the sender and receiver, they've somehow agreed on the memory buffer where the data to be transferred lives. And that way, the sender, for example, can simply place the data directly into the receiver's buffer
and all it then has to do is let the receiver know, hey, there's some data there for you. It doesn't actually have to copy the data. So this is another important concept that we're going to see with vhost-user-blk. So now that we've got those things out of the way, let's look at vhost-user-blk. What is it? It's a local block IO interface.
So it only works on a single node, on a single machine. It is not a network protocol. Two, it's a user space interface. It's not a kernel solution in itself. It's a pure user space solution. That means it's unprivileged. It doesn't require any privileges
for two processes to communicate in this way. It's also a zero copy solution, and the way it does that is it uses shared memory. And finally, vhost-user-blk supports both notifications and polling. So depending on your performance requirements, you can choose whether you want to de-schedule
your process and receive a wake up when it's time to process an IO completion, or you can just poll and consume CPU and have the lowest possible latency. And vhost-user-blk is available on Linux, BSD and macOS. And the implementations of this started around 2017.
It came from SPDK and QEMU working together. So those communities implemented vhost-user-blk, but there are also implementations in other hypervisors like crosvm and Cloud Hypervisor. So primarily this kind of came from virtualization,
from this problem of how do we do software-defined storage and let a virtual machine connect to it? But that's not all that vhost-user is good for. It's actually a general storage interface. It's generic, just like NVMe or SCSI is. So you could use vhost-user-blk if you had some kind of data-intensive application
that needs to do a lot of storage IO and needs high performance or needs to be unprivileged. And that's why I'm talking about vhost-user-blk today. So let's have a look at the protocol. The way that this is realized is that there's a Unix domain socket for our user space storage interface.
And we speak the vhost-user protocol over this socket. What the socket and the vhost-user protocol allow us to do is set up access to a virtio-blk device. So a block device that lives in the software-defined storage process. So when we have two processes running on a system,
a software-defined storage process and an application, the application is using vhost-user in order to communicate with the virtio-blk device. And that's how it does its IO. So what is virtio-blk? virtio-blk is a standard. You can check out the VIRTIO specification.
VIRTIO has a number of other devices, but it includes virtio-blk. Some of the other devices are virtio-net or virtio-scsi and so on. But virtio-blk is the one we'll focus on here. And it consists of one or more request queues where you can place IO requests. And each one of these has a little structure. You can do all the requests I mentioned
in the beginning of the talk: reads, writes, flushes, discard, write zeroes and so on. And you have multiple queues. So if you want to do multi-queue, say you're multi-threaded, you can do that as well. And it has a config space that describes the capabilities of the device: the disk size, the number of queues and so on.
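The request layout those queues carry is documented in the VIRTIO specification; paraphrased in C, it looks roughly like this (in the real layout the data buffers and status byte are separate queue descriptors rather than one flat struct):

```c
#include <stdint.h>

/* virtio-blk request, paraphrased from the VIRTIO specification.
 * The driver places these in a request queue; the device fills in
 * the data (for reads) and the status byte. */
struct virtio_blk_req {
    uint32_t type;      /* VIRTIO_BLK_T_IN, _OUT, _FLUSH, _DISCARD, ... */
    uint32_t reserved;
    uint64_t sector;    /* offset in 512-byte sectors */
    /* ... followed by the data buffer(s), and finally: */
    uint8_t  status;    /* VIRTIO_BLK_S_OK, _IOERR or _UNSUPP */
};
```

The config space is a separate structure; among other fields it holds the capacity (in 512-byte sectors) and the number of queues.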
So that's what you can think of virtio-blk as. That's the model we have here. And that's the block device that our application can interact with. If you think of any other storage interfaces or network protocols that you're familiar with, this should be more or less familiar. Most of the existing protocols also work in this way.
You can inquire about a device to find out its size and so on. And then you can set up queues and you can submit IO. So underneath virtio-blk, we have the vhost-user protocol. And the vhost-user protocol is this Unix domain socket protocol that allows the two processes to communicate.
But it's not the data path. So vhost-user is not how the application actually does IO. Instead, it's a control path that is used to set up access to these queues, these request queues that I've mentioned. And the IO buffer memory and the queue memory actually belongs to the application. And the application sends it over the Unix domain socket.
It sends that shared memory over so that the software-defined storage process has access to the IO buffer memory and the queue memory. The application and the software-defined storage process, they share access to that memory. That way we can do zero copy. So this is going back to the message passing versus zero copy thing.
We don't need to transfer entire IO buffers between the two processes. Instead, the software-defined storage process can just read the bytes out of the IO buffer that live in the application process. And it can write the result into the buffer as well.
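The mechanics behind this are ordinary Unix file-descriptor passing: the application backs its queue and I/O buffer memory with a memory fd and hands that fd to the server over the Unix domain socket, so the server can mmap the same pages. A simplified sketch of the application side (error handling elided; the real vhost-user messages carry more layout information than this):

```c
#define _GNU_SOURCE
#include <string.h>
#include <sys/mman.h>
#include <sys/socket.h>
#include <sys/uio.h>
#include <unistd.h>

/* Create a shareable memory region for queues and I/O buffers. */
static void *create_shared_region(size_t len, int *fd_out)
{
    int fd = memfd_create("io-buffers", 0);
    ftruncate(fd, len);
    void *mem = mmap(NULL, len, PROT_READ | PROT_WRITE, MAP_SHARED, fd, 0);
    *fd_out = fd;
    return mem;
}

/* Pass the fd over the Unix domain socket with SCM_RIGHTS. The
 * receiving process mmaps the same fd, and both sides now see the
 * same pages -- no data copies are needed afterwards. */
static void send_fd(int sock, int fd)
{
    char data = 0;
    struct iovec iov = { .iov_base = &data, .iov_len = 1 };
    union { struct cmsghdr hdr; char buf[CMSG_SPACE(sizeof(int))]; } u;
    struct msghdr msg = {
        .msg_iov = &iov, .msg_iovlen = 1,
        .msg_control = u.buf, .msg_controllen = sizeof(u.buf),
    };
    struct cmsghdr *cmsg = CMSG_FIRSTHDR(&msg);
    cmsg->cmsg_level = SOL_SOCKET;
    cmsg->cmsg_type = SCM_RIGHTS;
    cmsg->cmsg_len = CMSG_LEN(sizeof(int));
    memcpy(CMSG_DATA(cmsg), &fd, sizeof(int));
    sendmsg(sock, &msg, 0);
}
```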
So if you want to look at the specification and the details of how vhost-user works, I've put a link on this slide. But really, if you're writing an application, I think the way to do it is to use libblkio. libblkio is a library that has both C and Rust APIs and that allows you to connect to vhost-user-blk
as well as other storage interfaces. So vhost-user-blk is not the only thing it supports, but for the purpose of this talk, we'll just focus on that. libblkio is not a framework. It's a library. It allows you to integrate it into your application regardless of what your architecture is. That means it supports blocking IO,
it supports event-driven IO, and it also supports polling. So no matter how you've decided to structure your application, you can use libblkio, and you won't have to change the architecture of your application just to integrate it. I have given a full talk about libblkio. So if you want to understand the details
and also some of the background and everything it can do, then please check out that talk. I put a YouTube link on this slide for you. I'll give you a short code example here. So this shows how to connect to a vhost-user-blk socket using libblkio.
And this is pretty straightforward. We essentially just need to give it the path of the Unix domain socket, and then we connect and start the blkio instance. And then in order to do IO, we can submit a read request. That's just a function call. That's straightforward as well. And notice here that we do get the queue.
We call the blkio_get_queue function in order to grab a queue. That's because libblkio is a multi-queue library. If you have a multi-threaded application, you could create one dedicated queue for each thread and then avoid any kind of locking and synchronization. All the threads can do IO at the same time. So for completion, what this example shows
is blocking completion. So here, the program is actually going to wait in the blkioq_do_io function until the IO is complete. But as I mentioned, the library also supports event-driven IO, and it also supports polling. So whatever you like, you'll be able to do that.
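Since the slide itself isn't reproduced in this transcript, here is a rough equivalent against the libblkio C API, based on my reading of the libblkio documentation (error handling elided; verify the driver and property names against your libblkio version):

```c
#include <blkio.h>
#include <stdio.h>

int main(void)
{
    struct blkio *b;
    struct blkio_completion completion;

    /* Connect to the vhost-user-blk Unix domain socket. */
    blkio_create("virtio-blk-vhost-user", &b);
    blkio_set_str(b, "path", "/tmp/vhost-user-blk.sock");
    blkio_connect(b);
    blkio_start(b);

    /* vhost-user-blk needs I/O buffers to live in memory regions
     * registered with (and shared with) the server. */
    struct blkio_mem_region region;
    blkio_alloc_mem_region(b, &region, 4096);
    blkio_map_mem_region(b, &region);

    /* Grab queue 0; a multi-threaded program could use one queue
     * per thread and avoid locking. */
    struct blkioq *q = blkio_get_queue(b, 0);

    /* Submit a read and block until it completes. */
    blkioq_read(q, 0, region.addr, 4096, NULL, 0);
    blkioq_do_io(q, &completion, 1, 1, NULL);
    printf("read completed: %d\n", completion.ret);

    blkio_destroy(&b);
    return 0;
}
```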
When you develop your application, you'll need something to test against. And I think the easiest way to test against a vhost-user-blk device is to use qemu-storage-daemon. It's packaged for all the main Linux distros as part of the QEMU packages. And you can just run the storage daemon. You can give it a raw image file
and tell it the name of a vhost-user-blk Unix domain socket that you want to have. And then you can connect your application to it.
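For example, something along these lines starts a vhost-user-blk export backed by a raw image (option syntax as in the qemu-storage-daemon documentation; the paths are placeholders):

```sh
qemu-storage-daemon \
    --blockdev driver=file,node-name=disk0,filename=disk.img \
    --export type=vhost-user-blk,id=export0,node-name=disk0,writable=on,addr.type=unix,addr.path=/tmp/vhost-user-blk.sock
```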
All right, so that's how you can do that. If you want to implement a server and you're already in the SPDK ecosystem, using Intel's Storage Performance Development Kit to write your software-defined storage system, then it's very easy because vhost-user-blk support is already built in. So I've put a link to the documentation. There are also RPCs if you want to invoke it from the command line.
And just for testing, you can create a vhost-user-blk server using this. Now, if you're not using SPDK, and instead you're writing your own C daemon, your own process, then one way of using vhost-user-blk is to use the libvhost-user library.
So this is a C library that implements the vhost-user protocol, the server side of it. So this will allow you to accept vhost-user connections. It doesn't actually implement virtio-blk. That's your job. That's the job of the software-defined storage system. But virtio-blk consists of basically just processing the IO requests
like reads and writes and so on, and also setting the configuration space so that the disk size is reported there. And you can find an example of a C program that implements vhost-user-blk using libvhost-user. I've put a link on the slide here for you. So that's how you can do it in C.
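What that job amounts to is mostly a dispatch over the request type from the layout shown earlier. A sketch, with hypothetical backend_* helpers standing in for your storage logic and the libvhost-user queue plumbing elided:

```c
#include <stddef.h>
#include <stdint.h>

/* Status codes from the VIRTIO specification. */
#define VIRTIO_BLK_S_OK      0
#define VIRTIO_BLK_S_IOERR   1
#define VIRTIO_BLK_S_UNSUPP  2

/* Hypothetical backing-store accessors, for illustration only. */
extern int backend_read(uint64_t sector, void *buf, size_t len);
extern int backend_write(uint64_t sector, const void *buf, size_t len);
extern int backend_flush(void);

/* Handle one virtio-blk request; called for each queue element
 * that libvhost-user hands us. Returns the status byte. */
static uint8_t handle_request(uint32_t type, uint64_t sector,
                              void *buf, size_t len)
{
    switch (type) {
    case 0: /* VIRTIO_BLK_T_IN: read into the application's buffer */
        return backend_read(sector, buf, len) == 0 ?
               VIRTIO_BLK_S_OK : VIRTIO_BLK_S_IOERR;
    case 1: /* VIRTIO_BLK_T_OUT: write */
        return backend_write(sector, buf, len) == 0 ?
               VIRTIO_BLK_S_OK : VIRTIO_BLK_S_IOERR;
    case 4: /* VIRTIO_BLK_T_FLUSH */
        return backend_flush() == 0 ?
               VIRTIO_BLK_S_OK : VIRTIO_BLK_S_IOERR;
    default: /* discard, write zeroes, ... if unimplemented: */
        return VIRTIO_BLK_S_UNSUPP;
    }
}
```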
In Rust, similarly, there is a library available for you. So there's the vhost-user-backend Rust crate, and it plays a similar role to the libvhost-user library for C. So this allows you to easily implement whatever vhost-user device you want. And in this case, it's your job to implement
virtio-blk, just as I mentioned. Okay. Now, I still wanted to touch on one con that we hadn't covered yet, because we've explained how, although a user space interface is complex and is more work than just using file descriptors and read and write,
I think that libblkio and libvhost-user and so on, these libraries that are ready for you to integrate into your applications or software-defined storage systems, they take away that complexity and they make the integration easier as well. You don't need to duplicate code or write a lot of stuff. But we're still left with one of the disadvantages.
How do we connect this back to the kernel if it turns out we want to use some functionality from the kernel storage stack, or if we have a legacy application that we can't port to use the user space interface? So for vhost-user-blk, there is a solution here.
There's a Linux VDUSE feature, which is relatively new, and what it does is it allows a vhost-like device to be attached to the kernel. So even though your software-defined storage system is in user space, this gives you a way of attaching your block device
to the kernel, and then in the kernel, the virtio-blk driver will be used to communicate with your device. And what happens is that a /dev/vda or /dev/vdb block device node will appear, and your application can open that like any other block device, and it can read and write and do everything through there.
One of the nice features of this is that because it's quite similar to vhost-user-blk, the code can be largely shared. I think the only difference would be that instead of having the vhost-user code, you would have the VDUSE code, which opens the character device
that the VDUSE driver in the kernel offers instead of a Unix domain socket. And the setup and the control path are a little bit different, but the actual data path, the virtio-blk part, is still the same, so you can reuse that code. So that's an effective way of doing it.
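As an example of how this looks in practice: qemu-storage-daemon can create a VDUSE export directly, and the vdpa tool from iproute2 then attaches it to the kernel's virtio-blk driver (option names as I understand them from the QEMU and iproute2 documentation; verify against your versions):

```sh
# Export a block node via VDUSE instead of a vhost-user socket.
qemu-storage-daemon \
    --blockdev driver=file,node-name=disk0,filename=disk.img \
    --export type=vduse-blk,id=vduse0,node-name=disk0,name=vduse0,writable=on

# Bind the VDUSE device to the in-kernel virtio-blk driver;
# a /dev/vda-style block device node then appears.
vdpa dev add name vduse0 mgmtdev vduse
```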
There's another new Linux feature that I wanted to mention that is interesting here, and also a little bit more general, even outside of vhost-user-blk, and that's ublk. ublk is a new Linux interface for user space block IO, so that your software-defined storage system can present host kernel block devices. So you can have your block device
and process it in user space, and it uses io_uring. It's an exciting feature, and it's pretty interesting, so I've left the link here. The only thing with this is that compared to VDUSE, it does not reuse or share any of the vhost-user-blk stuff. So if you already have vhost-user-blk support
in your software-defined storage system, or you just want to streamline things, then ublk is kind of a whole different interface that you have to integrate. So that's the only disadvantage, but I think it's pretty exciting too.
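For comparison, with ublk the user space server is driven through the kernel's ublk driver and io_uring; the ublksrv project's demo tool can expose a file as a block device roughly like this (command syntax from the ublksrv README; treat it as illustrative):

```sh
# From ublksrv: create /dev/ublkb0 backed by a file, with a user
# space daemon servicing the requests over io_uring.
ublk add -t loop -f disk.img
```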
Okay, so to summarize: if you need a user space block IO interface for performance, or because you need to be able to do unprivileged IO, or for security, then implement vhost-user-blk. There are open specs, code, and community. Please let me know if you have any questions, and thank you. Have a great FOSDEM.