
Stack walking/unwinding without frame pointers


Formal Metadata

Title
Stack walking/unwinding without frame pointers
License
CC Attribution 2.0 Belgium:
You are free to use, adapt and copy, distribute and transmit the work or content in adapted or unchanged form for any legal purpose as long as the work is attributed to the author in the manner specified by the author or licensor.

Content Metadata

Abstract
Sampling CPU profilers periodically fetch the stacks of the profiled processes that are running on-CPU at a given time. Walking the stacks of native processes is possible with a little work when frame pointers (FPs) are present. But most binaries in the real world are not compiled with FPs, so profilers that have to walk stacks without frame pointers face a far more complicated job. In this talk, we discuss how stacks can be walked using the DWARF CFI (mainly .eh_frame). We also discuss how eBPF helps us do that, and how extending the current stack-walking facilities can be useful for interpreted languages, such as Ruby, as well as runtimes with JITs, like the JVM.
Transcript: English (auto-generated)
Okay. Welcome, everyone. Today we are going to talk about walking native stacks in BPF without frame pointers. My name is Vaishali; I work at Polar Signals as a software engineer. I mostly work on profilers and eBPF-related things. Before that, I worked on different kernel subsystems as part of my job. My name is Javier, and I've been working at Polar Signals for a year, mostly on native stack unwinding, the work we're going to introduce today. Before that, I was working on web reliability, tooling for developers, and performance at Facebook.
Yeah. So before we dive into the topic, I wanted to go through the agenda. First, we want to talk about why there is a need for a DWARF-based stack walker in eBPF, because that's the most-asked question. Then we will go into the design of our stack walker, talk about how we went from the prototype to making it production-ready, and finish with a bunch of learnings so far and some future plans. As we said, we work on a production profiler. Sampling profilers collect stack addresses at particular intervals and attach values to them. Note that profilers generally need the user- and application-level stacks as well as the kernel stacks, and collecting them involves iterating over all the stack frames and gathering the return addresses. Historically, there has been a dedicated register for that, called the frame pointer, on both x86 and x86-64.
But in recent times, because of compiler optimizations, frame pointers have been disabled by default in most runtimes and distributions. That makes things really hard: instead of a couple of memory accesses per frame, which is quite fast, the stack walker has to do much more work. Note that stack walking is also common practice in debuggers, as you all know. So what's the current state of the world? Well, it's not a problem for the hyperscalers, because they already compile all of their applications with frame pointers enabled in production. This is important because when things break and you want to inspect them, it's really helpful to have the whole stack available when it's needed. There was also a recent discussion on the Fedora mailing list; feel free to go through it. The TL;DR of that discussion is that frame pointers are going to be enabled starting with, I think, Fedora 38, which is about to be released in late April.
macOS binaries have always had frame pointers enabled. There's also amazing work going on by Oracle engineers on a simpler unwind format to use instead of DWARF, and we hope that work goes through and helps the ecosystem. So that's the current status, but we want this right now, and we want support across all the distros as well as all the runtimes, where today it's scattered here and there. For example, the Go runtime has enabled frame pointers since Go 1.7 on x86-64, and since 1.12 on ARM64. So now, some of you might be wondering: if not frame pointers, what do we have? Take Rust, for example, where frame pointers are disabled by default,
but when a panic happens, you still get a full backtrace. So how does that work? Well, compilers always have this information. We generally need to know the value of the stack pointer in the previous frame, and without a frame pointer it can be at any offset. With that, we can always find the return addresses and continue unwinding the stack. This information is generally encoded in the .eh_frame section, or in the .debug_frame section of DWARF. There's also another approach: unwind tables can be synthesized from the object file, which is what the ORC format does; it has been used in the kernel for a while now. We will talk about .eh_frame in detail in a minute,
but first, let's see whether anybody else is already using .eh_frame, and of course someone is. The profiler we have developed is not the first to use it. perf, the popular profiler from the Linux kernel, has had DWARF support since the perf_event_open syscall was added, which was in 3.4, and it can collect the registers of the profiled processes, as well as a copy of the stack, for every sample. While this approach has been proven to work, there are a bunch of drawbacks to it. For example, perf collects whole stacks and copies them into user space. The second thing is that the implications of one process holding another process's data in user space, at scale, can be quite complicated,
and it's also a lot of data, especially for CPU-intensive applications. So why BPF? Stack walking in BPF makes a lot of sense for our profiler, because we don't have to copy the whole stack. A lot of the information stays in the kernel, which provides higher safety guarantees, especially for a stack-walking mechanism. Once it's implemented, we can leverage the perf subsystem to sample CPU cycles, instructions, L3 cache misses, and so on. It can also help us develop other tools, like allocation tracers and runtime-specific profilers, for example for the JVM or Ruby. Some of you who are familiar with BPF may know that there is already the bpf_get_stackid helper, so why implement something different? Well, as it turns out, the problem with that helper is that it also requires frame pointers: it uses them to walk the stack. And for historical reasons, a fully featured DWARF unwinder is unlikely to ever be part of the Linux kernel.
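For illustration, here is roughly what using that built-in helper looks like; a minimal sketch with libbpf-style declarations, where the map name and sizes are our assumptions:

```c
#include <linux/bpf.h>
#include <linux/bpf_perf_event.h>
#include <bpf/bpf_helpers.h>

/* The built-in stack-trace map that bpf_get_stackid() fills in. */
struct {
    __uint(type, BPF_MAP_TYPE_STACK_TRACE);
    __uint(max_entries, 1024);
    __uint(key_size, sizeof(__u32));
    __uint(value_size, 127 * sizeof(__u64)); /* PERF_MAX_STACK_DEPTH slots */
} stack_traces SEC(".maps");

SEC("perf_event")
int do_sample(struct bpf_perf_event_data *ctx)
{
    /* Walks the user stack by chasing frame pointers. With binaries
     * built with -fomit-frame-pointer this yields truncated stacks. */
    long stack_id = bpf_get_stackid(ctx, &stack_traces, BPF_F_USER_STACK);
    if (stack_id < 0)
        return 0;  /* the frame-pointer walk failed */
    /* ... count the sample under stack_id ... */
    return 0;
}

char LICENSE[] SEC("license") = "GPL";
```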
So, before we dive into how we use .eh_frame with BPF, let's look at what .eh_frame actually has to offer. The .eh_frame section contains one or more pieces of call frame information (CFI). The main goal of the call frame information is to answer how to restore every register of the previous frame at any point of the code's execution. Directly storing a table containing every program counter and, for each register, its location, such as whether it has been pushed to the stack or not, et cetera, would generate huge unwind tables. For that reason, DWARF CFI is actually quite compact and very space-efficient. The unwind tables encoded in the CFI format come in the form of opcodes, and those opcodes need to be evaluated. In our stack-walking case, once they have been evaluated, we generate a table that contains, for each instruction boundary, how to restore the value of each register.
There are two main layers to it. The first helps with repetitive patterns that compress well and allows a more compact representation of the data; in some cases there are specialized opcodes that consume, say, one to four bytes rather than four bytes every time. The second layer is a special opcode that contains another set of opcodes encoding arbitrary expressions, and those need to be evaluated, which is a lot of work. The main difference between the two is that for the first we just need a couple of values, but for the second we actually need to evaluate arbitrary, close-to-Turing-complete expressions. For that reason we would need a full-blown VM implemented in BPF itself, which is not very practical. For those who don't know how BPF applications generally work, this is what the flow looks like from a very high-level point of view.
In user space you have the driver program, written in Go in our stack; we are using Go bindings for libbpf there. We create the maps and attach the BPF program to a CPU-cycles perf event. The driver reads, parses, and evaluates the .eh_frame section of the process and of all its dynamic libraries. And in the BPF program, using the PID, we fetch the table, and then an unwind algorithm processes the DWARF-derived information.
We will go in depth on each component, but first let's see what the algorithm looks like. It's really simple: we read three important registers. The first one is the instruction pointer, RIP; the next one is the stack pointer, RSP; and the third one is, of course, the frame pointer, RBP. Then, while the frame count is less than or equal to the maximum stack depth we have defined, we find the unwind table row for the current program counter, add the instruction pointer to the stack trace, calculate the previous frame's stack pointer, update the registers with the values calculated for the previous frame, and continue with the next frame.
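Put as code, the loop might look like this minimal C sketch of the BPF side. It reuses the illustrative unwind_row from earlier; find_unwind_row and add_frame are assumed helpers, and error handling is elided:

```c
#define MAX_STACK_DEPTH 127

static __always_inline void walk_stack(struct bpf_perf_event_data *ctx)
{
    __u64 ip = ctx->regs.ip;  /* RIP: instruction pointer */
    __u64 sp = ctx->regs.sp;  /* RSP: stack pointer */
    __u64 bp = ctx->regs.bp;  /* RBP: frame pointer, possibly repurposed */

    for (int frame = 0; frame < MAX_STACK_DEPTH; frame++) {
        struct unwind_row *row = find_unwind_row(ip);  /* table lookup */
        if (!row)
            break;
        add_frame(ip);  /* record this frame's address */

        /* CFA: where the previous frame's stack pointer points. */
        __u64 cfa = (row->cfa_reg == CFA_REG_RBP ? bp : sp) + row->cfa_offset;

        /* On x86-64 the return address sits right below the CFA. */
        __u64 ra = 0;
        bpf_probe_read_user(&ra, sizeof(ra), (void *)(cfa - 8));
        bpf_probe_read_user(&bp, sizeof(bp), (void *)(cfa + row->rbp_offset));

        ip = ra;   /* continue unwinding in the caller */
        sp = cfa;
    }
}
```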
So, in a nutshell, that's the algorithm in BPF. Now, the important part is how we store that unwind information and what we have done to make it scalable. Javier will talk about that.

Cool. So now we have some unwind information, in a format we'll describe later, and we need to put it somewhere, right? One possibility would be to store this unwind info in the memory space of the applications we are trying to profile. We could do that, for example, with a combination of ptrace, mmap, and mlock: use ptrace to hijack the process's execution and make it allocate a new memory segment, and then lock it, because in BPF we need to make sure that the pages we access are never going to be swapped out. But this has a big problem: it would alter the execution flow of the program, and that is something we never want to do.
One of the reasons is that this memory would live in the process, which means accounting for it would be odd: developers would find a new memory segment that appeared out of the blue, and their metrics would change behind their backs, which is not great. But it's also because the lifecycle of managing this memory chunk is very complicated. For example, what do you do if our profiler dies before the processes it is introspecting? How do you clean it up? It involves a lot of problems, and solving them would require crazy engineering, which we were not planning to tackle, because it would overcomplicate the project a lot. The other problem is that sharing memory would be way harder, and accounting for our own overhead would also be very hard. If you think about it, libc is probably linked into most of the applications on your machine, so why keep the same information for every single process, right? So how do we actually store this information?
We use BPF maps, which are data structures that, as Vaishali said, can be written and read both from kernel and from user space. In particular we use hash maps, which in BPF are basically a mapping of bytes to bytes where you can put arbitrary information. This memory is always locked: BPF has a flag that lets you avoid locking the memory, but for the type of BPF program we use, locking is mandatory. Otherwise, as we said before, these pages could be swapped out, and in the type of BPF programs we have, page faults are forbidden.
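Declaring such a map could look like the following sketch; the names and the shard count are assumptions, while the roughly 250,000-rows-per-value limit is the one measured in the talk:

```c
#define MAX_UNWIND_ROWS 250000  /* measured per-value limit */
#define MAX_SHARDS      64      /* assumption: sized to available memory */

struct unwind_shard {
    __u64 len;                               /* rows currently in use */
    struct unwind_row rows[MAX_UNWIND_ROWS];
};

/* Written by the user-space driver, read by the in-kernel unwinder.
 * Values are pre-allocated, so they stay locked in memory. */
struct {
    __uint(type, BPF_MAP_TYPE_HASH);
    __uint(max_entries, MAX_SHARDS);
    __type(key, __u64);                      /* shard id */
    __type(value, struct unwind_shard);
} unwind_tables SEC(".maps");
```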
In the end, we can reuse these mappings, because they live in a global BPF map that we control: we can store, say, libc's table in one particular area, and then add metadata for every single process that has that mapping, saying where it is. So here is a logical representation of unwind information for different executable segments: imagine libc, MySQL, zlib, systemd, and then an unused chunk. This logical view assumes one contiguous chunk of memory, but in reality we cannot allocate an arbitrary contiguous chunk of memory in BPF; we cannot ask for 200 megabytes, and we cannot do a malloc, right? So we've been doing some experiments, and on the systems we have tested and the kernels we want to support, we can add up to 250,000 unwind entries to each value of a BPF map.
Because we want to be able to fit larger unwind tables, we use the same solution you would use in any other data-intensive application: partitioning, or sharding. We split the unwind table into different shards and, depending on the memory your machine has, we allocate more or fewer shards ahead of time. That results in a CPU-to-memory trade-off, because when the shards get full we regenerate them; but we'll talk about that later. For example, take systemd's unwind table and suppose it's a bit bigger than 250,000 entries, so it doesn't fit in a single shard. Because it doesn't, we have to chunk it up: we store the first chunk in the first shard, and the little bit that's left in the second shard. Because we've added all these layers of indirection, we have some bookkeeping to do, and this metadata is also stored in BPF maps. So we have a process that has many mappings, each mapping can have one or more chunks, and each chunk maps to a particular shard. Because a shard may hold anywhere from one unwind entry up to 250,000, we keep some extra metadata that tells us exactly where the information lives; an illustrative layout follows.
Thanks to this way of organizing the data, we're able, as we said before, to share these executable mappings, and thanks to that we skip table generation for most of the executables on your box. As a result, we only spend 0.9% of CPU cycles on the processing Vaishali was talking about before, which is not the most complex process in the universe, but it does consume CPU cycles, because we need to read some information from your ELF file, find the section, read it, parse it, and interpret the DWARF information. Now we do it far fewer times: on your machine, we only do it once per unique executable section. So what happens if we run out of space? What we have implemented is basically a bump allocator. We keep appending information to the shards, which logically you can see as one big chunk of memory. Once it's full, at some point we decide to wipe the whole thing and start again.
But we do it in such a way that we give every single process a fair chance of being profiled. So let's take a look at how we do this. We start with the PID of some process on your machine that happens to be running on-CPU at that point. The first thing we do is check whether it has unwind information. If it does, we need to find the mapping covering the current instruction pointer for that frame. Then we need to find the chunk; we can do that with a linear search, because we know the minimum and maximum program counter covered by each chunk. Once we have the chunk, we have the shard information, and once we have the shard information, we have the unwind information. The problem is that the unwind information, as we said before, is basically an array, and we need to search it, and it can hold up to 250,000 items. If you have done anything in BPF, you know that you basically have to build whatever you need yourself: there is no binary search executed in kernel space for you, so you need to implement it yourself, which is not a big deal in general. Implementing binary search is not rocket science; the hard part, most of the time, is getting all the off-by-ones right. The problem here is that in kernel space we have a lot of limitations, which we'll talk about later, along with how we overcame them, because this produces quite a bit of code that has to be split logically. So not only are the data structures sharded, the code is sharded too. Anyway, we binary-search this with up to, well, seven iterations, let's say eight if you're feeling pessimistic. A sketch follows.
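A verifier-friendly bounded binary search might look like this sketch; the explicit bound and bounds checks are what keep the verifier happy, and on older kernels the real code additionally splits the search across tail calls:

```c
#define MAX_SEARCH_STEPS 18  /* log2(250,000) is about 18; the talk splits
                              * the work across tail-called programs */

/* Find the last row whose pc is <= the target pc. Returns the row
 * index, or -1 if pc precedes the first row. */
static __always_inline long find_row_index(struct unwind_shard *shard, __u64 pc)
{
    __u64 lo = 0, hi = shard->len;
    long found = -1;

    for (int i = 0; i < MAX_SEARCH_STEPS; i++) {  /* bounded for the verifier */
        if (lo >= hi)
            break;
        __u64 mid = lo + (hi - lo) / 2;
        if (mid >= MAX_UNWIND_ROWS)               /* explicit bounds check */
            break;
        if (shard->rows[mid].pc <= pc) {
            found = (long)mid;                    /* candidate; search right */
            lo = mid + 1;
        } else {
            hi = mid;                             /* search left */
        }
    }
    return found;
}
```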
Then we get the unwind action: which operation do we need to apply to the current registers to recover the previous ones? We add that frame to the stack trace and continue with the next frame, as Vaishali explained before. We have the luxury of knowing whether the stack is correct, because when we have no unwind information and RBP is zero, the x86-64 ABI specifies that we've reached the bottom of the stack. So if the stack is correct, we hash the addresses, add the hash to a map, and bump a counter. This is reasonably cheap, and I will show you some data on it later.
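That aggregation step could look roughly like the following sketch; the map names, sizes, and the hash_addresses helper are assumptions:

```c
/* Aggregation maps: unique stacks keyed by hash, and per-stack counts. */
struct {
    __uint(type, BPF_MAP_TYPE_HASH);
    __uint(max_entries, 16384);
    __type(key, __u64);                    /* stack hash */
    __type(value, __u64[MAX_STACK_DEPTH]); /* the addresses themselves */
} stack_traces_map SEC(".maps");

struct {
    __uint(type, BPF_MAP_TYPE_HASH);
    __uint(max_entries, 16384);
    __type(key, __u64);
    __type(value, __u64);                  /* sample count */
} counts SEC(".maps");

static __always_inline void aggregate(struct unwind_state *state)
{
    __u64 stack_hash = hash_addresses(state->addrs, state->len); /* e.g. FNV-1a */
    bpf_map_update_elem(&stack_traces_map, &stack_hash, state->addrs, BPF_ANY);

    __u64 *count = bpf_map_lookup_elem(&counts, &stack_hash);
    if (count) {
        __sync_fetch_and_add(count, 1);  /* existing stack: bump the counter */
    } else {
        __u64 one = 1;
        bpf_map_update_elem(&counts, &stack_hash, &one, BPF_ANY);
    }
}
```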
Then every couple of seconds, I think it's every 10 seconds or so, we collect all this information from user space and generate the actual profiles that we send to some server. As I said before, BPF has some interesting challenges for us. I think it's the closest I've been to coding like it's the 90s or 80s, because we have very little stack space: 512 bytes, if I'm not mistaken. To overcome that, we use BPF maps as a sort of heap, as sketched below.
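The usual pattern for this, and what we mean by a heap, is a single-entry per-CPU array acting as scratch memory; the struct contents here are assumptions:

```c
/* Unwinder state that is far too big for the 512-byte BPF stack. */
struct unwind_state {
    __u64 ip, sp, bp;
    __u64 len;
    __u64 addrs[MAX_STACK_DEPTH];
};

/* One entry per CPU: effectively a pre-allocated scratch buffer. */
struct {
    __uint(type, BPF_MAP_TYPE_PERCPU_ARRAY);
    __uint(max_entries, 1);
    __type(key, __u32);
    __type(value, struct unwind_state);
} heap SEC(".maps");

static __always_inline struct unwind_state *get_state(void)
{
    __u32 zero = 0;
    /* The verifier still requires a NULL check on the result. */
    return bpf_map_lookup_elem(&heap, &zero);
}
```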
Then there's the problem I mentioned before about memory locking: that memory can never be swapped out, and it lives in kernel space. So we want to make sure we allocate the minimum space needed, and we need to do it properly, because every single environment has a different cgroup configuration, and as some talks explained yesterday, it's quite tricky to know how much memory your machine actually has available. For the program size, there are two main issues. One is that there's a limit on the number of instructions you can load into the kernel. The other is that the BPF verifier, the machinery that makes sure your program is safe, that it will finish, that you're not dereferencing any null pointers, and that in general you're not going to crash the kernel, has a limit on the number of iterations it performs internally. This is a problem for us, because a full binary search already hits these limits. So we need to use techniques like BPF tail calls, which are similar to Lisp tail calls. We also use bounded loops. And if you're lucky (we are not), you can use the new bpf_loop helper, basically a function that invokes a callback multiple times, creating a sort of loop in BPF; but we cannot use it, because we want to support older kernels, and it's a pretty new feature.
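A tail call chains one BPF program into another without returning, which is how work gets split past the verifier's complexity budget; a minimal sketch:

```c
/* Program array; user space places this program's fd at index 0. */
struct {
    __uint(type, BPF_MAP_TYPE_PROG_ARRAY);
    __uint(max_entries, 1);
    __uint(key_size, sizeof(__u32));
    __uint(value_size, sizeof(__u32));
} programs SEC(".maps");

SEC("perf_event")
int walk_user_stack(struct bpf_perf_event_data *ctx)
{
    /* ... unwind a bounded number of frames, persist progress in the
     * per-CPU heap map ... then re-enter this program to continue.
     * bpf_tail_call() does not return on success, and the kernel caps
     * the chain length, which bounds the total work. */
    bpf_tail_call(ctx, &programs, 0);
    return 0;  /* reached only if the tail call fails */
}
```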
So let's switch to something else. We have written our user-space application in Go, and since we are a profiler, we want the overhead on your machine to be as low as possible. Unfortunately, many Go APIs aren't designed with performance in mind. I am new to Go; I didn't know it was like this, and every single time I profiled our profiler and found these things, I thought: how? How is this possible? Go has DWARF and ELF parsing libraries in the standard library, which is great, but they are not designed for performance-sensitive environments, let's say. There are also two functions, binary.Read and binary.Write, that we use all the time, because we need to convert bytes back and forth, and they allocate in the fast path. But anyway, we profile our profiler all the time, we have found lots of opportunities that we keep fixing, and of course there's more work to do.
One of the areas where we try to be pretty comprehensive is testing. We have thorough unit testing for most of the core functions to make sure we don't regress. But the thing that, in my opinion, has helped us the most is snapshot testing. If you're not familiar with this technique, it's very simple: you generate some textual representation of your data structures and write it to disk, or keep it in memory, it doesn't matter; then, after making changes to your code, you generate it again and compare the two. This is how it looks in our case. We have a Git sub-repository called testdata, with the textual representation of the unwind tables. You don't have to understand it all, but the idea is that this covers a full function, whose program counter starts at the one over there and ends at the one over there. Then we have the information for every single program counter, and it tells you what to do at each one. The first row says CFA type two, which I know means RBP: take the current RBP, add eight, and that gives you the previous frame's stack pointer. Anyway, the interesting thing is that this is very easy to implement, as you can see from our very advanced setup in our Makefile: we build our binary, dump the tables to disk, and ask Git to show us the changes; if anything has changed, we fail. Thanks to this, we have found a lot of bugs,
and it has allowed us to iterate with confidence. One of the important things in this project has been de-risking it, because it's quite complex. When I started working on this, I had no idea about DWARF unwinding; I had no idea about unwinding without frame pointers at all. So we had to make sure all these avenues were properly covered: that the DWARF parser was properly implemented, that all the interactions with BPF were covered, and that the BPF unwinder itself worked well. For this, we always tried to have a plan B at every stage of the project, and we tried to go in depth as well as in breadth. Anyway, I apparently have five minutes left. So: we have a lot of automated testing, and one of the things we did was add kernel tests, which is super important, especially for BPF programs, because the BPF subsystem changes a lot over time, and there are features we need to make sure we don't use, because otherwise the unwinder wouldn't work on other kernels. So we have a kernel testing system that basically runs our application on multiple kernels and reports the result. One more thing I want to talk about: production, as usual, brings a lot of interesting challenges. By deploying our profiler to production, we found a lot of things we didn't know about, and we were able to find some of them thanks to continuous profiling, running our own profiler on our profiler.
As you know, different hardware and different configurations are the biggest sources of performance differences, as well as of incidents, in production. So I want to show you two things we found recently. One of them: we were spending almost 30% of CPU time opening files in our production environment, something that never showed up on my NVMe drive. The reason, it turns out, is that cloud disks are very slow. We have now fixed this. Another very interesting thing we fixed the other day happened when we rolled out our profiler to production and it started crashing. If you are interested, we will upload the slides, so feel free to check the pull request, because everything is open source. Basically, what happened was that, for reasons, Go has a signal-based profiler, and we have it enabled for even more reasons, and it was only enabled in production. So SIGPROF was interrupting our program's execution while we were trying to load the BPF program. The BPF program takes a little while to load, because the verifier has to run a bunch of algorithms to statically ensure that everything is safe, and it was getting interrupted all the time. libbpf, the library we use to load the BPF program, retried this up to five times until it basically said: I tried, this didn't work, sorry. And obviously, we need the BPF program to be loaded for anything to work.
There are many other considerations in this project, like short-lived processes, which we haven't optimized for but are still pretty decent at: if your program runs for one second, we're probably going to catch it. But if this is something you care about, feel free to message us; it's something we would optimize. And then, yeah, this is our current format. I probably have one minute left or something like that. You don't have to understand it all, but the point is that we represent every single row with two 64-bit words, and we are working on making it a bit smaller. This is how our size compares to DWARF: we are bigger, because DWARF is optimized for disk space, while we are optimized for raw speed. For example, our whole table for one shard pretty much fits in L2 cache. Do I have any more time? Probably not, right? Two minutes? Oh, okay, sorry, maybe I sped up too much. So: we need to support parsing
every single DWARF CFI opcode, because otherwise we wouldn't be able to make progress. But we cannot unwind from every single program counter, which sucks, though this is not a problem in practice. The most typical way to recover the previous frame's stack pointer (which is called the CFA in DWARF, but that doesn't matter) is that you are given a register to which you apply some offset, and that yields the previous frame's stack pointer. We support that, but the problem is that it could be any arbitrary register, and right now we only support RBP or RSP offsets, which cover 99% of the cases. So this is something that we're going to work on soon.
The other problem, as Vaishali said before, is that DWARF expressions define a VM that you need to implement and that can encode more or less any expression. [Audience: it's not Turing-complete.] Not exactly Turing-complete, but it's almost there; that's why, in the Infinity project, they had to add a new opcode. You need to implement a VM that basically has a set of virtual registers, plus a stack machine. Well, we can talk about the details later: the first level is a stack machine, 100%, but the second level, I can show you our code; it's messed up. Like, it's messed up.
But anyway, the thing is that we are very lucky here, and you can read more about this in this PR. There are two DWARF expressions that account for 50% of all the expressions we have seen in most distributions. They are the expressions the dynamic linker needs, basically: the expressions for procedure linkage tables, or PLTs. The other good news, as I said before, is that offsets from registers other than RBP and RSP rarely occur, and all the other possibilities I haven't talked about almost never occur; we've seen them very, very few times. [Audience: which architectures are you supporting?] Oh, good question. You must be playing with ARM64, because the GCC ARM64 backend generates CFA expressions; that's what I was talking about. Right now we only support x86-64, but I'm also going to talk about this later. Sorry, sorry. Anyway. Okay, done? Okay, well. Wait, we have the minutes buffer for the next talk, right? Five minutes. Five minutes. Okay. I have two more slides.
Well, anyway, we tried to make our BPF program as fast as possible. This was running on my machine with a bunch of applications that have 90 frames or more, and even the maximum time it takes is 0.5 milliseconds, which is not terrible on my CPU, which is from late 2017. This is in big part because we have optimized everything for memory: everything is aligned properly, and we try to fit as many things as possible in the CPU cache. What about high-level languages? There is a project that I happen to work on called rbperf. This is something we're going to be adding
in the future. Basically, for dynamic languages, you need to know the ABI of every interpreter version, and the stack walkers are also implemented in BPF. But instead of collecting return addresses, because the return addresses there are not meaningful to you, you directly extract the function names and other information from the process heap. Our project is called Parca. There are a couple of things we're going to be doing, like mixed-mode unwinding, which as far as we know no one else does in profiling (debuggers do it, for sure). The idea is that different sections of the stack will be unwound using different techniques. So, for example, if you have a JIT, like Node.js, that emits frame pointers, you unwind that part with frame pointers; but once you reach the actual code of your interpreter, which is compiled and has unwind information, we use the DWARF unwind information. ARM64 support is coming later this year. This feature is currently disabled by default, but it is stable enough that we're going to enable it in a month. And then we're going to add other runtimes, such as the JVM or Ruby. And then just to say that we are open source: the user-space code is Apache 2.0, and the BPF code is GPL. And yeah, all the links are here, and thank you so much.