Graphics Performance Analysis with FrameRetrace - TIB AV-Portal

Formal Metadata

Title

Graphics Performance Analysis with FrameRetrace

Subtitle

A Responsive UI for ApiTrace

Title of Series

Number of Parts

644

Author

License

CC Attribution 2.0 Belgium:
You are free to use, adapt and copy, distribute and transmit the work or content in adapted or unchanged form for any legal purpose as long as the work is attributed to the author in the manner specified by the author or licensor.

Identifiers

10.5446/41344 (DOI)

Publisher

Release Date

Language

Content Metadata

Subject Area	Computer Science
Genre	Conference/Talk

FOSDEM 2018: 63 / 644

00:00

Performance appraisalFrame problemWeb pageGoodness of fitMultiplication signPresentation of a groupComputer animationLecture/Conference

00:39

Android (robot)Computing platformGraphics processing unitBlock (periodic table)Computer hardwareSource codeNumberPresentation of a groupComputing platformPhysical systemExtension (kinesiology)Projective planeWorkloadOpen sourceSpacetimeINTEGRALFunktionalanalysisGraphics processing unitGoodness of fitVapor barrierException handlingSinc functionMultiplication signFigurate numberGame theoryProfil (magazine)Position operatorCartesian coordinate systemComplex analysisWindowSoftware bugSoftware developerBitSource codeFlow separationComputer hardwareMathematical analysisCodeProduct (business)Device driverComputer animation

04:37

Mathematical analysisIntelKernel (computing)Graphics processing unitCASE <Informatik>Frame problemMathematical analysisProjective planeExtension (kinesiology)NumberTracing (software)Computer animation

05:10

Computing platformKernel (computing)Graphics processing unitIntelMathematical analysisSoftware development kitDifferent (Kate Ryan album)ImplementationCross-platformWindowDevice driverComputing platformWorkloadComputer hardwareDirectory serviceFrame problemBranch (computer science)Profil (magazine)Right angleComputer animation

05:55

Mathematical analysisDevice driverSet (mathematics)Computer architectureDifferent (Kate Ryan album)NeuroinformatikCASE <Informatik>VolumenvisualisierungFrame problemComputer hardwareLecture/Conference

06:31

Computing platformGraphics processing unitKernel (computing)IntelMathematical analysisShader <Informatik>Metric systemVolumenvisualisierungLogical constantUniform convergenceElectronic visual displayState of matterVisualization (computer graphics)StapeldateiComputer hardwareImplementationLoop (music)VolumenvisualisierungExtension (kinesiology)Block (periodic table)Cartesian coordinate systemTracing (software)Computer animation

07:30

Mathematical analysisMetric systemGraphics processing unitVolumenvisualisierungShader <Informatik>Uniform convergenceLogical constantElectronic visual displayState of matterVisualization (computer graphics)StapeldateiStapeldateiFrame problemLatent heatMultiplication signVolumenvisualisierungSystem callMetric systemSoftware developerDevice driverComputer hardwareWorkloadComputer animation

08:19

Mathematical analysisGraphics processing unitVolumenvisualisierungMetric systemShader <Informatik>State of matterElectronic visual displayLogical constantUniform convergenceVisualization (computer graphics)StapeldateiProfil (magazine)Representation (politics)Maxima and minimaFrame problemComputer hardwareVolumenvisualisierungCore dumpDemo (music)Logical constantDebuggerShader <Informatik>Error messageMathematicsLatent heatUniformer RaumState of matterComputer animation

09:54

Demo (music)Shader <Informatik>Metric systemUniform convergenceVolumenvisualisierungPixelCore dumpFrequencyGraphics processing unitAverageThread (computing)Bit rateInterprozesskommunikationFloating-point unitBinary fileHybrid computerMathematicsThree-valued logicComputer multitaskingPolygonSoftware testingStreaming mediaVertical directionGeometryVertex (graph theory)TesselationState of matterSystem callStapeldateiSource codeSteady state (chemistry)Read-only memoryCache (computing)Total S.A.TLB <Informatik>Process (computing)MeasurementLevel (video gaming)Computer hardwareMaß <Mathematik>Hand fanFrame problemTape driveHierarchyFrame problemVolumenvisualisierungDescriptive statisticsMetric systemGraph (mathematics)Computer hardwareVertex (graph theory)Electronic mailing listTable (information)PixelCache (computing)Latent heatOnline helpComputer animation

11:18

Digital filterAlpha (investment)Gamma functionTexture mappingCountingData bufferGeometrySource codeSteady state (chemistry)Vertex (graph theory)Compilation albumPlane (geometry)Euclidean vectorBlock (periodic table)ProgrammschleifeLogical constantSummierbarkeitSurfaceError messageSynchronizationData typeSkewnessCache (computing)TLB <Informatik>Operator (mathematics)Chi-squared distributionPointer (computer programming)LaserMUDAddress spacePrice indexWorkloadArtistic renderingMereologyObject (grammar)Shader <Informatik>Single-precision floating-point formatDevice driverVolumenvisualisierungForm (programming)Semiconductor memoryFunction (mathematics)Computer hardwareVapor barrierBitFluid staticsSoftware developerMedical imagingTouchscreenIntermediate languageSicTriangleStapeldateiPixelProgrammer (hardware)Frame problemSimulationSound effect

14:19

Core dumpGraphics processing unitMetric systemVolumenvisualisierungStapeldateiSystem callState of matterShader <Informatik>VolumenvisualisierungPixelMetric systemElectronic mailing listDepth of fieldFocus (optics)Clique-widthSound effectGraph (mathematics)Cartesian coordinate systemComputer animation

15:15

Shader <Informatik>PixelGraphics processing unitCore dumpMetric systemVolumenvisualisierungStapeldateiSystem callState of matterIntelGeometryThread (computing)Vertex (graph theory)PolygonComputer multitaskingSoftware testingTexture mappingVolumenvisualisierungPixelStandard deviationDifferent (Kate Ryan album)Shader <Informatik>Vertex (graph theory)BitComputer animation

16:17

Metric systemVolumenvisualisierungStapeldateiState of matterShader <Informatik>System callDigital filterVolumenvisualisierungUniformer RaumMessage passingArtistic renderingComputer animation

16:59

Metric systemWide area networkCore dumpRevision controlGeometryTesselationSteady state (chemistry)Source codeSystem callStapeldateiState of matterShader <Informatik>VolumenvisualisierungVertex (graph theory)CompilerTexture mappingCompilation albumDigital filterGreen's functionCloningView (database)Menu (computing)Daylight saving timeScalable Coherent InterfaceAlpha (investment)Nichtlineares GleichungssystemFingerprintPrimitive (album)PolygonClique-widthVolumenvisualisierungGraph coloringMenu (computing)State of matterSet (mathematics)FlagShader <Informatik>HierarchyMusical ensembleFraction (mathematics)Subject indexingAlpha (investment)Computer configurationDirection (geometry)Error messageString (computer science)Disk read-and-write headRhombusDifferent (Kate Ryan album)TriangleMultiplicationNetwork topologyBack-face cullingComputer animation

19:48

Metric systemVolumenvisualisierungSystem callStapeldateiState of matterShader <Informatik>VolumenvisualisierungState of matterSemiconductor memoryVapor barrierFrame problemComputer animation

20:13

VolumenvisualisierungMetric systemShader <Informatik>StapeldateiSystem callState of matterDigital filterVolumenvisualisierungDemo (music)MathematicsFrame problemCategory of beingState of matterComputer animationLecture/Conference

20:40

Device driverSource codeElectronic visual displayTexture mappingAndroid (robot)VolumenvisualisierungVisualization (computer graphics)Computer hardwareGeometryPolygon meshData bufferProof theoryPixelElectronic visual displayVertex (graph theory)Volumenvisualisierung1 (number)Buffer solutionImplementationTexture mappingComputer hardwareSoftware developerDifferent (Kate Ryan album)FunktionalanalysisSet (mathematics)MereologyDiscrepancy theoryBitSystem callWindowProfil (magazine)GeometryDevice driverVisualization (computer graphics)Computing platformLevel (video gaming)Figurate numberLecture/ConferenceMeeting/InterviewComputer animation

22:26

Android (robot)Cartesian coordinate systemComputer hardwareComputing platformData storage deviceTracing (software)BitSimilarity (geometry)Online helpLecture/Conference

22:51

ProgrammschleifeUsabilityVideo trackingMetric systemMetric systemKey (cryptography)Mathematical analysisComputer programmingState of matterWorkloadElectronic visual displayCartesian coordinate systemUsabilityOnline helpProcess (computing)Computer animation

23:50

Cheat <Computerspiel>VolumenvisualisierungSystem callIntegrated development environmentCore dumpFrame problemMusical ensembleTracing (software)Address spaceWorkloadStapeldateiCartesian coordinate systemSingle-precision floating-point formatVertex (graph theory)Multiplication signSoftware developerPatch (Unix)Computer programmingCollaborationismWindowTexture mappingComputer fileStandard deviationDevice driverPresentation of a groupLecture/ConferenceMeeting/Interview

27:12

CollaborationismGoogolService (economics)Program flowchart

Transcript: English(auto-generated)

00:06

Good morning. Welcome to FOSDEM. I hope you guys had a fun time last night. It's good to see so many people have recovered from the festivities. Hopefully I'll have a seat to sit down when I'm done with my talk. It's very crowded. I was told before I came

00:24

that this was the worst possible time slot to give a talk at FOSDEM, first thing Saturday morning, right? But I'm happy to be here. I'm glad that Luc has organized the graphics devroom again this year and that he made time for me to give a short presentation

00:43

on my work, so thanks a lot. I've been working on Linux platforms for more than a decade. Several of those years I spent building graphics performance tools based on a Windows tool that was used throughout the industry. In that position I was able to see how important

01:06

performance analysis tools are for graphics workloads. My project over the past few years has been to try to enable the same workflows for Linux platforms. I've also spent a lot of time automating the integration system for Mesa at Intel, which has helped Mesa's

01:27

productivity and quality quite a bit. But this is really the project that I've been most interested in since I started with the Mesa team. So a little bit about GPU tools and why you don't really have very many good solutions

01:43

in the Linux space. In general, when you have GPU tools, there's a graphics card vendor that understands it's very difficult to go and find out performance bottlenecks or what's happening on the GPU. They've gone and funded some tools specific to their own

02:02

hardware to help developers or their own driver team figure out what the performance profile is of specific applications. But they are very reluctant to go and enable the same capabilities for their competitors. If you do find a good GPU analysis tool, you'll

02:22

find it only works with an AMD GPU or an NVIDIA GPU. Some of the exceptions in the Linux space are made by Microsoft or other entities that care more about cross-vendor functionality.

02:41

Most of the tools are written for Windows, with Linux as an afterthought. They're either closed source, or the extent to which they're open source is just two commits where they've dumped a huge pile of code into a GitHub account. As for whether it compiles, you may find that it does not. This is changing a little bit. Intel has

03:06

some engineers that are working on performance tools, like myself, and Lionel Landwerlin and Robert Bragg have worked on GPUTop. So there is more native support for performance tools. RenderDoc is another example where Valve has gone and funded a developer to really

03:25

invest in native Linux graphics analysis tools. One thing about a lot of the tools is that tracing and retracing is often not reliable. This can be because the tool was initially written for Windows DX11 or DX10 games. And

03:44

then when they go to implement tracing for OpenGL, they find the complexity of the extensions makes it hard to really capture the workload that you want to investigate. Another reason why tracing is often unreliable is there's not that many users. You might

04:05

have a tools team that goes and tries to build a tool, but unless you have lots of developers going and applying it and looking at different workloads, you're not going to discover the bugs in your tracing system. And up until recently, a big barrier has been the support for GPU performance counters

04:23

in Mesa. Since Linux 4.13, that's enabled now for Intel GPUs, and AMD Performance Monitor is available for some of their newer hardware as well. So now that Mesa is exposing these extensions, there's a whole lot more that we can do.

04:43

So my tool is called FrameRetrace. It's built on top of ApiTrace. I chose ApiTrace because I think it's the most widely used GPU analysis tool. There's a lot of people that use it for quality assurance to make sure that the frames retrace properly. And

05:03

because it has a large number of users, there's often a lot of corner cases of tracing that they've gone and fixed. It's a community-supported project, so there's lots of people working on it. Right now, FrameRetrace is just a directory in a branch of ApiTrace. It's just a UI

05:21

that is built on top of it. Because ApiTrace is cross-platform, FrameRetrace is also cross-platform, so it will investigate OpenGL workloads on Windows just as well as it will on Linux, and that's an important capability for driver teams. Because if you have two different driver implementations for different platforms, you can compare the

05:43

performance profile for the workloads and find gaps in your implementation or in the Windows implementation. Our counter support begins with Haswell. There were hardware counters prior to Haswell, but the architecture was different enough that the driver team decided not to enable

06:02

them. So your performance will be better with a newer computer anyways, right? So the Mesa driver team has been using this tool heavily to go and find issues in their driver, and there's a whole set of examples of different special cases that

06:22

they've missed and we found basically by looking closely at each render in a frame and understanding what the bottleneck is. Right now, I'm trying to add support for Radeon hardware and Raspberry Pi through the AMD_performance_monitor extension, and there's some other folks that are looking

06:44

at that with me, and it's going pretty well. There's a few stumbling blocks for the Radeon implementation of that extension. I think that cross-platform support in this tool is one of the main things that needs to be finished before it's a good candidate

07:02

for being upstreamed into ApiTrace. I think that you'll see that the tool is pretty compelling and useful and superior to the ApiTrace UI in some ways, so I'd like to see it go upstream. So what does this tool do? Most graphical applications have a render loop, and the render

07:26

loop just renders the frame over and over again. So if you are looking just at the renders in those frames, you can divide up the frame into each specific draw call, and this tool will give you the metrics associated with each draw call, and you can

07:44

see exactly which render is the one that's taking all the time in your frame. Without it, I mean, generally, you just have a huge asynchronous workload going off to the GPU, and you have no idea why you're missing vsync. You can explore the frame by selecting specific renders, and it'll show you the render targets throughout the frame, which

08:04

is helpful to understand how a frame is composed. It has an API log, which is pretty standard. For driver developers, it's pretty helpful to have batch disassembly, so the batch commands, which are sent directly to the hardware, are disassembled and associated with the render that you've selected. So this is a capability that, at least on Intel

08:24

hardware, up until now required you to dump hundreds of gigabytes of data for any kind of meaningful frame, and then sift through the data to find out exactly which render went wrong. This will give you a much more performant

08:41

implementation and let you see exactly what's going to the hardware for each draw. One of the main features that end users and game developers need is a shader debugger or some way to experiment with their shaders and find out why their shaders are mis-rendering. So with FrameRetrace, you can go to a specific render, look at the shader, change the

09:02

shader, edit it, compile it, and it'll render again, and it'll give you a new performance profile for that shader, or an error if you've made a mistake. You can do the same thing with uniform constants. Just go and see what the constants are and change them, and the frame will render again. There's a couple of experiments that you can do to help you

09:22

try to figure out what the max performance would be for a specific render, and the thing that I've just been editing now is a hierarchical representation of all the GL state so that you can change the cull face and see what happens. So if you have a

09:40

problem with your GL state that's affecting rendering or performance, you can muck with that. So those are the things we'll go through in the demo. So I'm taking a risk. Let's have a demo and see what happens. This is the UI for FrameRetrace, and this blue bar is actually a graph of renders with

10:09

no metrics, but you'll see here there's a long list of GPU metrics associated with the L3 cache, the pixel shaders, the vertex fetch hardware. A lot of these are somewhat

10:26

inscrutable if you're not familiar with the hardware or don't do a lot of GPU programming. The one that you really want to look at, if you want to see why something is slow, is how many clocks were required to render the frame. And so this is a graph where each bar is a specific render. There are quite a lot of them, but by far the

10:47

most expensive one is here, and there's a table that will show you the metrics. So here are the clocks, and you can see that it's more than 10% of the entire frame, just for this one render. So if you're curious about what a GTI L3 bank L2 read

11:07

is, there's a longer description for that metric that will help you decipher what it means. But typically you can go through here and find an explanation for why this might be the bottleneck for your workload. If you want to see the render target at this

11:25

part in the frame, you'll see that our heroine has found the object of her desire. The rendering of this frame, if you want to see what's actually being rendered, it's rendering the whole screen. In the API calls, it's just drawing a couple of triangles

11:43

for the rect. So it's a little bit puzzling why this might be long, but there's also this glMemoryBarrier, which is probably something that we'd be interested in looking at. If you want to search for glMemoryBarrier, you can look at the different renders which

12:02

contain glMemoryBarrier calls. So for the experiments, if you wanted to see, okay, how fast would this be rendered if I just had a simple shader that drew pink, you can select that and you can see that the cost is much lower. We go to

12:23

the shaders, and in the fragment shader there's now just a substituted fragment shader that draws pink. So let's disable that and go back to the shaders. So we're now in the fragment shader again, and you can see that there's quite a long fragment shader. So it looks like it's processing all the pixels with some effect, I guess.
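
The simple-shader experiment above yields two clock counts for the same render: one with the original fragment shader and one with the substitute that just draws pink. A minimal Python sketch of how that difference might be read follows. The function name and the clock values are made up for illustration and are not taken from FrameRetrace.

```python
def fragment_shader_share(original_clocks: int, simple_shader_clocks: int) -> float:
    """Estimate the fraction of a render's GPU clocks spent in the fragment shader.

    Assumes the substitute shader costs almost nothing, so whatever remains
    after the swap is work done outside the fragment shader.
    """
    if original_clocks <= 0:
        raise ValueError("original_clocks must be positive")
    saved = max(original_clocks - simple_shader_clocks, 0)
    return saved / original_clocks


# Hypothetical clock counts for one render, before and after the swap.
share = fragment_shader_share(original_clocks=40_000, simple_shader_clocks=6_000)
print(f"about {share:.0%} of this render is fragment shading")
```

If the share is small, the bottleneck is probably elsewhere (geometry, memory barriers, state changes) and editing the fragment shader is unlikely to help much.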

12:47

The vertex shader, if you look at it, it's a whole lot of nothing until you get to the very bottom and it just does nothing. So we capture the intermediate representation and the static single assignment form that's output by the Mesa driver. NIR is our new

13:06

intermediate representation and the SIMD8 is what's actually sent down to the hardware. Same thing for the fragment shader. You can see exactly how the shaders are compiled. So this is very helpful for a driver engineer, or I guess if you're an elite OpenGL programmer,

13:25

maybe you could make sense of this. So we spoke about the batch. This is an example of the batch. If you look at a handful of renders, you can select one and you can see this is the binary packet that's sent down for the rendering. Again, more for

13:43

driver developers. All right, so let's go back to experiments. If we look more closely at these renders, let's look at the render target. You can see that if we stop at render,

14:02

that means it's going to show the render target immediately after this render. If you advance through these renders, you can see that it becomes progressively blurrier, so there's a little blur and it's going to get even blurrier on this render. And then finally, it's going to compose those blurry images based on the depth of

14:22

each pixel. So in the background, there's a light here that's quite blurry. And if you look at the first render, it's in sharp focus. So it's a depth of field effect that they're achieving with these final renders. It's just one example of how you can experiment. So this is an expensive render, but it may

14:46

be expensive because there's quite a lot of pixels. So if you want to look for expensive per-pixel metrics, you can graph on the second axis. So I'm just going to narrow the list of metrics that are displayed. So now the width of each bar represents roughly

15:04

how many pixels are drawn. And so you might look for narrow, tall bars representing very expensive renders. So let's disable this one to make it larger. So you might focus in

15:22

on this tiny shader here, which I guess because of the way it's drawing this particular texture. It's not very many pixels at all, but it's quite expensive per pixel. All right, so what I want to do now is explore a little bit. So let's go to standard

15:54

bar and I'm going to look for vertices. So I want to go and look on the render

16:11

target for where our heroine is rendered. You can see the different render targets that are drawn in this pass. And if we highlight, we'll see that those are the renders that

16:26

are drawing her body. And so let's see. We'll start here, I think. There we go. So this is the full rendering of the character. If you clear before the render and stop after,

16:44

all you'll get in the render target is the character itself. So the reason I wanted to do this is to show how you can go to the uniforms. These are all the uniforms that are bound for the render. You can just change one of them, hit return, go back to the render

17:00

target, and we've gone and moved her head. So for people who aren't really familiar with OpenGL, this is a really kind of interesting way to look and dissect a more complicated frame and understand some of the techniques or how the API is used. So let's put her head back on. Oh, I mentioned shaders. So let's go to the vertex shader. Somewhere

17:28

at the bottom it's going to assign a color. Let's just go ahead and modify that. I mean, if I compile this, I should get a syntax error saying I've made a mistake, but let's

17:44

do zero. One point zero. So I'm just going to make the red channel zero with that multiplication. And we'll go look at the render target, and now we have a hulkified heroine. So

18:05

that really demonstrates that you can mess around with the shader, try to figure out why it's mis-rendering. You can see how quick this is. I mean, the fact that you can do this in a fraction of a second is far better than what you had before with the other tools. So what I've been working on recently is this hierarchical state tree. So you

18:26

can collapse different items that you don't want to look at. If you don't know how I've organized them, you can search for sub-strings, like maybe I'm looking for the scissor state. If you want to go and change something, the menu shows you the full set

18:41

of available options in the GL for this particular blend feature. And a lot of the different GL state settings have a set of four values, so it'll give you the index of each one. It might be red, green, blue, alpha. It might be some kind of enabled flag. So

19:04

if I go and disable green, I mean, our heroine was green before, but if I say, hey, there's no green, and we look at the render target, we'll see that she's kind of fading away. So it's fun to play with. But here's another one where culling is enabled for

19:26

this character. That means that the triangles on the back of the character are not rendered because they're facing the wrong direction. If I change it to cull the front of them instead of the back, I go look at the render target, I turn the character around, and she's decided that it's too dangerous to go after the diamond and is going to avoid

19:43

disaster and walk right back out. So that's just an example of how you can mess around with these things. One thing that's interesting, if we go back and look at the final render, I've gone and changed the state, but the character hasn't turned around for the final

20:02

render. And the reason for that is that I actually disabled that draw with the memory barrier in my experiment. So if I turn the frame back on so it's rendering properly and look at the render target, I'll see that the final frame is rendered with the changes. So that's my demo of the features. I think there's a lot more that can be

20:23

done in each tab. There's a whole lot of GL state that I haven't gone and implemented, but I think what I've tried to do is demonstrate that each category of state is supported in a relatively easy way to expand, and there's a bunch of experiments that need

20:40

to be added, but the proof of concept is there. All right, so back to the things that still need to be done. Well, one thing I didn't talk about too much is that the fact that you can have this exact same performance profile for Windows is very important for driver developers because differences in rendering

21:05

will stand out starkly when you compare two different sets of this UI running on different platforms because the renders are exactly the same, it's running the same GL calls, and so you can easily find discrepancies in your implementation. Things that need to be done. There's no tab for looking at the textures. If you're

21:25

texture-bound, having an experiment that will clamp the mipmap level down so there's not so much texture data going down is important to see if you've just made textures that are too large. There's no display of the geometry or the vertices, so that's something that I think is of interest to end developers to try to figure out, okay, maybe

21:43

there's just so many vertices that I'm stuck at that part of the fixed-function pipeline. The depth buffer is not displayed. Unity specifically asked for overdraw and hotspot visualizations in the render target, where if you've drawn twice to the same

22:01

pixel in the render target, it'll show up as more expensive, help them figure out if they've got a problem with their engine. There's a bunch of UI improvements. This is all written in QML, and so you have to do quite a bit of hand-tweaking to get the display exactly how you want. Adding support for hardware is, I think, the most important

22:22

thing, which is what I'm working on right now. Another very important thing to enable is Android. There's a whole lot of 3D applications coming to Linux platforms in the Android Play Store. None of those can be analyzed for your driver or for your hardware, and so we need to get ApiTrace working on Android so that we can then capture the traces and

22:45

then analyze them in this way on similar hardware. I've had a little bit of help from some folks I've mentioned before. Lionel has helped me a lot with the performance analysis metrics, and I think his tool, I wish it was being demoed at FOSDEM as well, because

23:04

it's very interesting, so if you find him here, get him to show you what he's done. One thing, when you take a GL program and you relink it, you need to reattach a whole lot of state from the previous program, and that process can be somewhat intricate.
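
The relinking step can be pictured as copying uniform values from the old program to the new one by name. A minimal Python sketch follows, modelling each program as a plain dictionary rather than calling OpenGL. The function and the uniform names are hypothetical, and real code would also have to restore attribute bindings, samplers, uniform blocks and so on.

```python
from typing import Dict, List, Tuple

Uniforms = Dict[str, Tuple[float, ...]]


def reattach_uniforms(old: Uniforms, new_names: List[str]) -> Tuple[Uniforms, List[str]]:
    """Carry uniform values over to a relinked program.

    Returns the values to set on the new program, plus the names that could
    not be restored because the old program never had them.
    """
    restored = {name: old[name] for name in new_names if name in old}
    missing = [name for name in new_names if name not in old]
    return restored, missing


old_program = {"u_color": (1.0, 0.5, 0.2, 1.0), "u_scale": (2.0,)}
# The edited shader keeps u_color, drops u_scale, and adds u_tint.
restored, missing = reattach_uniforms(old_program, ["u_color", "u_tint"])
print(restored, missing)
```

Reporting the names that could not be restored matters here, because a uniform silently left at its default value would change the render and make the edited shader look broken.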

23:21

For the workloads I've looked at, I've done it properly, but whenever there's more features in the GL that an application might have used, that's where the path becomes unpaved. Radeon metrics is what I'm implementing now. Unfortunately, the AMD performance monitor doesn't display metrics. It just exports raw counters, and then you need another application

23:44

to go and compose those counters into usable metrics like we had displayed, so that's a key problem I'm trying to fix now. If anyone's interested, there's a whole lot of features that can be worked on independently, and I'd welcome collaborators. Thanks for listening. Any questions? Yeah.
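
Composing raw counters into a usable metric typically means combining two or more counter deltas into a ratio. A minimal Python sketch under that assumption follows. The counter names are invented for illustration and are not real AMD_performance_monitor counters.

```python
from typing import Dict


def busy_percent(before: Dict[str, int], after: Dict[str, int],
                 busy_counter: str, clock_counter: str) -> float:
    """Derive a 'percent busy' metric from two raw counter snapshots."""
    clocks = after[clock_counter] - before[clock_counter]
    if clocks <= 0:
        raise ValueError("clock counter did not advance")
    busy = after[busy_counter] - before[busy_counter]
    return 100.0 * busy / clocks


# Hypothetical raw counter values sampled before and after one render.
before = {"GPU_CLOCKS": 1_000_000, "SHADER_BUSY": 400_000}
after = {"GPU_CLOCKS": 1_050_000, "SHADER_BUSY": 430_000}
metric = busy_percent(before, after, "SHADER_BUSY", "GPU_CLOCKS")
print(f"shader busy: {metric:.1f}%")
```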

24:05

The reason that this doesn't address Vulkan at all is because there's no Vulkan tracing support in ApiTrace, but Vulkan certainly could be addressed with a similar tool. There is a tracing infrastructure that's implemented by LunarG, and RenderDoc has a certain amount

24:22

of tracing, and so there's no reason why the features couldn't be mapped on. I just haven't done that yet because I'm focusing on the GL workload. This is a very cool tool. I was wondering how do you communicate the batch data and the extra details of the shader? Sure. In the i965 driver, you can set

24:44

an environment variable to dump the batch, and you can set an environment variable to dump the SIMD16, so we just capture that on standard out. The batch data is, like I said, it's so much, so there's a special patch that you apply to Mesa and recompile

25:01

it to let you turn on and off that environment variable just before you begin your render so that you don't have to pay that penalty for the whole render. Yeah.

25:35

How did I capture the frame? To get the frame, you use ApiTrace. You tell ApiTrace to trace

25:49

this GL workload, and it serializes every single GL call into a file. Before I started the presentation, I played through the trace up until frame 150, which is the one we

26:01

were looking at, and stopped. Almost every GL program on Linux is traceable by ApiTrace; if one isn't, the developers have then changed ApiTrace. Yeah, sure. That's what application

26:34

engineers do all the time. They capture, you know, whatever. Grand Theft Auto, and there's actually some teardowns of Grand Theft Auto on Windows where they go through

26:42

the different renders and then show you the techniques. You could conceivably go and export the vertex data and the texture data, and that wouldn't be legal, but you can go and hack away. Any other time? Okay. Thank you.
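
The capture-and-replay workflow described in the answers above can be sketched as two command lines. The Python below only builds and prints the argument lists. It assumes the `apitrace` binary is on the PATH, and the application path and trace file name are placeholders.

```python
import shlex
from typing import List


def trace_command(app: List[str], trace_file: str) -> List[str]:
    """Build an `apitrace trace` command that records an OpenGL application."""
    return ["apitrace", "trace", "--api", "gl", "--output", trace_file, *app]


def replay_command(trace_file: str) -> List[str]:
    """Build an `apitrace replay` command that plays the recorded calls back."""
    return ["apitrace", "replay", trace_file]


# Placeholder application and trace file names.
capture = trace_command(["./my_gl_app"], "my_gl_app.trace")
replay = replay_command("my_gl_app.trace")
print(shlex.join(capture))
print(shlex.join(replay))
```

Passing either list to `subprocess.run` would execute it; the sketch stops at printing so that it runs on machines without apitrace installed.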