
ProxySQL Cluster: challenges and solutions to synchronize configuration across multiple decentralized cluster nodes


Formal Metadata

Title
ProxySQL Cluster: challenges and solutions to synchronize configuration across multiple decentralized cluster nodes
License
CC Attribution 2.0 Belgium:
You are free to use, adapt and copy, distribute and transmit the work or content in adapted or unchanged form for any legal purpose as long as the work is attributed to the author in the manner specified by the author or licensor.

Content Metadata

Abstract
As a reverse proxy for MySQL databases, ProxySQL is used in infrastructures of all sizes, and it is not surprising to see deployments with thousands of nodes running. Such large deployments introduce some interesting challenges because nodes can be initialized or destroyed at any time. This session will describe the challenges in configuring such large deployments of nodes, the most common external tools to configure ProxySQL, and then focus on improved ProxySQL native clustering solutions that allow auto-discovery, bootstrap, and distributed, decentralized reconfiguration.
Transcript: English (auto-generated)
Hello, and thanks for joining this session about ProxySQL Cluster, where we will discuss the challenges and solutions to synchronize configuration across multiple decentralized cluster nodes. A quick introduction: my name is René Cannaò. I have been a DBA for most of my career, a bit more than two decades.
I started writing the ProxySQL software back in 2013, and in 2016 the company with the same name was formed to provide services around the ProxySQL product, including support, consultancy, and development. Now, what is ProxySQL? Because the topic of this session is ProxySQL's cluster capability, I will not invest a lot
of time in explaining what ProxySQL is; I will only mention that it is a reverse proxy that understands the MySQL protocol. Clients connect to ProxySQL, which evaluates each request and performs various actions, like transforming the request itself, filtering, and, if the request has to be executed (this is not always the case), determining where.
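As a small illustration of that "determining where" part, here is a hypothetical query rule (hostgroup numbers are placeholders, not from the talk) that sends SELECT statements to one group of backends while everything else follows the connecting user's default hostgroup:

    INSERT INTO mysql_query_rules (rule_id, active, match_digest, destination_hostgroup, apply)
    VALUES (1, 1, '^SELECT', 20, 1);   -- SELECTs are routed to hostgroup 20 (readers)
    LOAD MYSQL QUERY RULES TO RUNTIME; -- unmatched queries go to the user's default hostgroup
    SAVE MYSQL QUERY RULES TO DISK;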
As mentioned, I won't go into the details of ProxySQL's features, but I recommend visiting our website for more information. Once we understand what ProxySQL is, one of the most common questions is where to deploy it. Because it speaks the MySQL protocol on both ends of the wire, the client side and the backend side,
it can be deployed anywhere between the two, and each type of deployment has benefits and drawbacks. I will go through a quick overview of them. The first deployment is the app server deployment. This is a very common deployment where ProxySQL is installed alongside the application, and it has benefits but also drawbacks.
Some of them are listed on the slide itself. Relevant to our topic today is configuration management: if ProxySQL is deployed alongside each application server, you need a way to keep those configurations aligned with each other and consistent. Then we have the ProxySQL layer deployment, a setup where application servers connect to a single ProxySQL instance.
In this case, too, there are benefits and drawbacks. What is important to point out is that this specific scenario is extremely rare, because it doesn't provide any sort of high availability. Therefore, the middle layer is normally composed of more than one instance, with some sort of HA.
And of course, the details of the HA are out of the scope of this session, so I will skip them for now. In that single-instance setup, the configuration is very simple: there is just one node to configure. But we can also have a much more complex setup, like this form of cascading setup, where ProxySQL instances are installed both on the application servers and as a middle layer.
We can have even more complex setups, where ProxySQL instances are installed on the application servers, on a middle layer, and on the database servers. So really, ProxySQL can run on any layer. But no matter how you create this complex setup, it is clear that configuring it properly
becomes more challenging than in the previous setups, because there are way more instances on different layers. In all the setups we showed so far, we are basically facing a distributed system, where you have multiple ProxySQL instances, possibly with different roles and different needs,
that need different configurations, and that communicate and coordinate with each other to achieve a common goal. The reason why we rely on a distributed system is to achieve scalability and redundancy: the number of instances can scale up or down based on the amount of traffic it needs to serve, and we need to avoid a single point of failure.
Any node can fail at any time, and the system should continue working without downtime. So, the next question is how to configure such a distributed system, and it is here that configuration management becomes relevant.
Configuration management is the set of processes that maintain a system, in our case a distributed setup of ProxySQL instances, in a state that is desired, consistent, and according to requirements. In other words, it needs to ensure that the various instances that are supposed to perform specific actions in a certain way are configured the way we want them to behave, and that the configurations are consistent with each other.
Now, the question is: are there configuration management systems for this? Yes, there are plenty that already support ProxySQL. For example, we have Ansible; probably many of you are already familiar with it.
It is a pretty popular configuration management tool, and thanks to community effort it supports ProxySQL. It has several modules, and each of them is able to configure a different module of ProxySQL. The fact that there are so many modules is also evidence that ProxySQL itself is extremely configurable.
Next, in pure alphabetical order, we have Chef, another very popular configuration management tool that supports ProxySQL, again thanks to community effort. Finally, we have Puppet, which also supports ProxySQL thanks to community effort. Judging by the number of downloads, it seems to be the most popular among the configuration management tools listed so far.
But the story doesn't end with configuration management tools. In fact, it is very common to have ProxySQL instances configured through network configuration and discovery services like Consul.
How does it work? A Consul server receives a configuration that is then propagated to the other Consul servers, and ultimately it reaches the Consul clients. Upon receiving a new configuration, a Consul client can be configured to connect to a ProxySQL instance, which is often local,
and to issue a series of commands to reconfigure it. This means that a discovery service like Consul can be repurposed as a configuration management tool. Without going into many details, the same can be said about other discovery services like ZooKeeper. Consul and ZooKeeper are pretty popular solutions often used to reconfigure ProxySQL.
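As a minimal sketch of that pattern (hostnames and hostgroup numbers are hypothetical, and the exact statements depend on your setup), a handler triggered by Consul could reconfigure the local ProxySQL instance by sending standard admin statements to its admin interface, by default on 127.0.0.1:6032:

    -- replace the backends for hostgroup 10 and apply the change
    DELETE FROM mysql_servers WHERE hostgroup_id = 10;
    INSERT INTO mysql_servers (hostgroup_id, hostname, port)
    VALUES (10, 'db-primary.example.com', 3306);
    LOAD MYSQL SERVERS TO RUNTIME;   -- apply immediately
    SAVE MYSQL SERVERS TO DISK;      -- persist across restarts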
Finally, we have ProxySQL Cluster. This is our own implementation of configuration management, and it is native to ProxySQL itself. This is very important: it means it doesn't rely on any external configuration management tool.
In fact, the main idea is that it can configure a cluster of ProxySQL instances and reconfigure them in real time without the need for any external dependency. This doesn't mean that you should not use configuration management tools, but it gives you the freedom not to rely on any of them. Furthermore, it has an interesting advantage compared to configuration management tools.
When a new version of ProxySQL is released, native ProxySQL Cluster will support any new feature or extension that was introduced. By contrast, if you're using a configuration management tool, it might work for a specific version of ProxySQL and not for another version, or it might not support new features for a long time.
It also enables a great deal of other enhancements, but I will go into details later. What is it able to synchronize? Currently, it is able to synchronize MySQL users, MySQL servers, and the various MySQL cluster configurations, whether you're using standard MySQL asynchronous replication, group replication, Galera, AWS Aurora, and so on.
It is also able to synchronize MySQL query rules, the list of ProxySQL nodes that are part of the cluster, and also MySQL variables, LDAP variables, and admin variables.
Each of them is considered a configuration module and can be enabled or disabled independently of the others. For example, you might want to configure only the synchronization of MySQL users and query rules, but not synchronize the MySQL servers because, for example, you want to use Consul or ZooKeeper for that.
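A minimal sketch of that last example, using the per-module counters described later in this talk (setting a module's counter to zero disables synchronization of that module):

    -- keep users and query rules in sync, but never sync mysql_servers
    UPDATE global_variables SET variable_value = '0'
      WHERE variable_name = 'admin-cluster_mysql_servers_diffs_before_sync';
    LOAD ADMIN VARIABLES TO RUNTIME;
    SAVE ADMIN VARIABLES TO DISK;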
So these are the configurable areas. In a nutshell, the way ProxySQL Cluster works is the following: you have various ProxySQL nodes that connect to a single instance that is considered the configuration source of truth.
When any configuration is changed on this specific instance, the others will detect it and will pull the configuration from it. Okay, many of you are probably already pointing fingers at a pretty evident drawback of this scenario: there is one single point of failure, and that is the ProxySQL instance that holds the configuration.
So, let's extend the topology and remove the single point of failure. Here, in this diagram, we have four ProxySQL cluster nodes, and each instance is monitoring the others. Basically, all four are always monitoring all the others. As soon as they detect that a node has a new configuration, they will all pull the new configuration from it.
What is important to note about this setup is that, currently, there is no leader, although we plan this for the future. Synchronization is time-based: each configuration has an epoch associated with it, and the newer configuration is the one propagated.
It is also worth reiterating what I already mentioned a few times: nodes check each other at regular intervals and they pull the configuration, so there is no actual push of configuration. Okay, how do you configure ProxySQL Cluster? Well, first of all, as you know, in ProxySQL the configuration is visible in tabular format, and ProxySQL Cluster is no exception.
So, first of all, ProxySQL nodes know which instances they need to monitor because they are configured in a table called proxysql_servers. The table is pretty simple, as you can see here: there is the hostname and the port of the instance, a weight that is currently not used, and a comment.
Please note that this table itself is replicated by the cluster. So, if a ProxySQL instance is configured with just one ProxySQL node in proxysql_servers, it is able to connect to it and pull the whole list of proxysql_servers that are part of the cluster. It will automatically be able to discover the rest of the cluster.
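For example, a minimal sketch of how a new node might be pointed at the cluster (the hostname is hypothetical):

    -- on the new node, via the admin interface
    INSERT INTO proxysql_servers (hostname, port, weight, comment)
    VALUES ('proxysql-core1.example.com', 6032, 0, 'seed node');
    LOAD PROXYSQL SERVERS TO RUNTIME;
    SAVE PROXYSQL SERVERS TO DISK;
    -- the node will then pull the full proxysql_servers list from the seed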
Then, we have a lot of variables related to the cluster. Here I'm only displaying a subset of the variables that are most relevant to this talk. First of all, we need a username and password pair: those are the credentials that will be used to connect to the other instances, specifically to their admin interface. Then we have a check interval that defines how often the various configurations are checked.
Then, for each module, we have a counter that defines for how many consecutive checks the remote configuration needs to differ from the local one before triggering a synchronization. It is important to note that if any of those variables is zero, synchronization of that specific module is disabled.
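A rough sketch of that setup on each node (credentials and values are placeholders; note that the cluster user also needs to be listed in admin-admin_credentials so that peer nodes can log in):

    UPDATE global_variables SET variable_value = 'admin:admin;cluster_user:cluster_pass'
      WHERE variable_name = 'admin-admin_credentials';
    UPDATE global_variables SET variable_value = 'cluster_user'
      WHERE variable_name = 'admin-cluster_username';
    UPDATE global_variables SET variable_value = 'cluster_pass'
      WHERE variable_name = 'admin-cluster_password';
    UPDATE global_variables SET variable_value = '1000'   -- check once per second
      WHERE variable_name = 'admin-cluster_check_interval_ms';
    UPDATE global_variables SET variable_value = '3'      -- sync after 3 consecutive diffs
      WHERE variable_name = 'admin-cluster_mysql_servers_diffs_before_sync';
    LOAD ADMIN VARIABLES TO RUNTIME;
    SAVE ADMIN VARIABLES TO DISK;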
Great, let's see this in action. We have another table, called stats_proxysql_servers_checksums. In this table we can see a series of information for each ProxySQL instance and for each module. There is really a lot of information here, so I will zoom in on just one specific module, the MySQL servers.
Great. Here we can see again that the information is available for each cluster instance we are able to connect to. For each configuration, we can see the version, the epoch, the checksum of the configuration itself,
when a specific server got that configuration, and the last time this information was refreshed. Finally, if the remote configuration is different from the local one, we count for how many checks the configuration has been different. Now, let's dive into this output, and to do that we need to start from the checksum.
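A minimal sketch of the kind of query shown on the slide, restricted to the MySQL servers module:

    SELECT hostname, port, name, version, epoch, checksum, changed_at, updated_at, diff_check
    FROM stats_proxysql_servers_checksums
    WHERE name = 'mysql_servers';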
This is the checksum of the configuration itself, and when nodes monitor each other, they simply monitor that value. Basically, nodes are exchanging heartbeats with checksums at regular intervals. This drastically reduces bandwidth and computation, because there is not much that is being exchanged, and
to verify whether the configuration has changed or not, we only need to verify the checksum. Also, because they are basically exchanging heartbeats, each ProxySQL node is able to detect failures: if a node disappears, this is a perfect way of detecting it. Epoch is pretty self-explanatory.
Version is a very interesting metric. Every time a configuration is loaded to runtime, no matter from where, the version is increased by one. By monitoring the version, we can detect if any node is loading configuration too frequently or not loading it at all. But it can also tell us something important: if the version drops to one, it means that the node was restarted.
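Each node also exposes its own local checksums, versions, and epochs through the admin interface; a quick way to inspect them on a single node is:

    SELECT name, version, epoch, checksum
    FROM runtime_checksums_values
    ORDER BY name;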
When it loads from disk, it is the first time it is loading the configuration, so its version becomes one. Great. So restarting a node creates an interesting problem: what if you have a cluster running for a very long time with a configuration that doesn't change for weeks, and then a node is restarted?
The node may be restarted with a different configuration; for example, the server is completely redeployed with absolutely no configuration. The restarted node will have a different checksum, but a newer epoch. In theory, all nodes should consider it the newer configuration and sync from it.
In reality, this doesn't happen: we prevent it. The reason is that the restarted node will have version equal to one, and ProxySQL nodes do not accept configuration from a node with version equal to one, because it is likely to be a node that is not yet configured or is incorrectly configured.
Basically, this prevents a new or not yet validated configuration from becoming the source of truth. On the other hand, a node with version equal to one knows that it is itself likely a node with incorrect configuration, even if its epoch is newer. It will then sync from the rest of the cluster, so not to the rest of the cluster, but from it.
Because version equal to one has a special meaning, this also means that if all the nodes have version equal to one during an initial bootstrap, they won't be able to sync. As soon as a load-to-runtime command is executed on any of the nodes, the cluster is bootstrapped.
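A minimal sketch of that bootstrap step, run once on whichever node holds the desired configuration:

    -- bumping each module's version above 1 makes this node a valid sync source
    LOAD MYSQL SERVERS TO RUNTIME;
    LOAD MYSQL USERS TO RUNTIME;
    LOAD MYSQL QUERY RULES TO RUNTIME;
    LOAD PROXYSQL SERVERS TO RUNTIME;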
Note that this is a safety mechanism in case of serious failure, when several or all of the nodes are restarted. This is basically by design. Also, pay special attention to the fact that every timestamp is in seconds.
If you configure synchronization to be too fast, it is possible for ProxySQL Cluster to try to synchronize while the configuration is still changing. This means that too-fast synchronization can lead, at best, to repeated synchronizations, or at worst, to conflicting configurations that the cluster won't be able to resolve automatically. Now, let's have a look at some poor design patterns that we have seen in the past.
In the past, we witnessed some poor design patterns in which ProxySQL Cluster doesn't work well. On the previous slide I already mentioned that things can go quite wrong if synchronization is configured to be too fast, so allow two or three seconds before synchronization; by default, it is three seconds.
We also saw situations in which multiple ProxySQL cluster instances were being configured at the same time with conflicting configurations. This can lead ProxySQL Cluster to enter a state in which it is not able to automatically resolve the conflict. Please note that ProxySQL Cluster supports identical configuration applied to multiple instances at the same time, or within a short period of time,
so you can easily combine ProxySQL Cluster with a configuration management tool; but if the configurations are conflicting, ProxySQL Cluster cannot automatically resolve the conflict. Another interesting poor design we saw is the configuration of all ProxySQL cluster nodes with the same DNS name.
Why is this basically a very bad design? Because you can easily end up in a situation in which an instance is connected to another instance, it detects a new configuration, and it then opens a new connection to pull that configuration, but because it relies on name resolution,
it ends up connecting to a different instance and pulling the wrong configuration. We have seen cases in which this could also lead to configuration being rolled back, because a node was detecting a new configuration on one node, but pulling the old configuration from a different node
and advertising it as an even newer configuration, so the end result was that the configuration was being rolled back. In the next release of ProxySQL we already prevent this from happening, because the node checks the configuration before applying it, and if it is not what it expected, it simply skips that configuration and goes back to checking the rest of the cluster.
Still, even if we are able to prevent this, please do not use a single DNS name for the whole cluster. Similar to the DNS horror story, we also had horror stories with several ProxySQL instances configured behind a load balancer.
The configuration was being checked on one instance, and then pulled from another instance, just because they were all behind the same load balancer. This too is prevented in the next ProxySQL release, but please do not put multiple ProxySQL instances behind a load balancer as cluster peers.
It completely defeats the purpose of using ProxySQL Cluster. ProxySQL Cluster was not able to synchronize variables up to version 2.1.0. This limitation was technical: we could add new variables in any minor release of ProxySQL, and therefore two instances with a different number of variables were never able to generate the same checksum,
so they would never be able to synchronize, and they could enter a never-ending attempt to synchronize. In version 2.1.0, we finally solved this problem in a very simple way: during the initial handshake, ProxySQL instances advertise their version, and if the versions do not match, they drop the connection.
The effect is that only ProxySQL instances of the same version can be part of one cluster. But this also creates an interesting new problem. What if we want to run multiple ProxySQL instances on the same server, maybe listening on the same port for the MySQL listener, but on a different port for the admin listener?
What if instead we want to specify the IP address of the listener, and the IP address itself is specific to the instance where ProxySQL is going to run? For this reason, we added new variables named cluster_sync_interfaces. If false, the admin and MySQL variables related to the listener interfaces are not synchronized via ProxySQL Cluster.
Note that these are global variables configured only in proxysql.cnf, because they cannot be part of the variables synchronized by the cluster itself.
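A rough sketch of the relevant part of proxysql.cnf (values are placeholders, and the interface-sync flag is named here as in the talk, so treat its exact spelling as an assumption to verify against your ProxySQL version):

    admin_variables =
    {
        admin_credentials = "admin:admin;cluster_user:cluster_pass"
        mysql_ifaces = "0.0.0.0:6032"
        cluster_username = "cluster_user"
        cluster_password = "cluster_pass"
        # cluster_sync_interfaces = false   # keep listener settings local; name per the talk
    }
    proxysql_servers =
    (
        { hostname = "10.0.0.11", port = 6032, comment = "core1" },
        { hostname = "10.0.0.12", port = 6032, comment = "core2" },
        { hostname = "10.0.0.13", port = 6032, comment = "core3" }
    )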
So far, we have seen two cluster topologies: a single source of truth with multiple instances pulling the configuration from it, and a full mesh topology. In reality, the most common topology is a setup in which a few nodes monitor each other, and all the other nodes only monitor those few nodes, as displayed in this diagram. Or in this diagram. Okay, I'm sorry, I'm very bad at diagrams, so forget about all those little ProxySQL instances you see, and let's try to conceptualize it.
In short, we basically have two groups of instances that can be seen as two layers. You have a core layer, where a few ProxySQL instances, for example three or four, monitor each other, and a satellite layer, where a large number of ProxySQL instances monitor the ProxySQL instances in the core layer. This setup can scale very well to thousands of nodes.
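A minimal sketch of how the proxysql_servers table would look in this topology (hostnames are hypothetical):

    -- on every node, core and satellite alike: list only the core nodes
    INSERT INTO proxysql_servers (hostname, port, weight, comment) VALUES
      ('proxysql-core1.example.com', 6032, 0, 'core'),
      ('proxysql-core2.example.com', 6032, 0, 'core'),
      ('proxysql-core3.example.com', 6032, 0, 'core');
    LOAD PROXYSQL SERVERS TO RUNTIME;
    SAVE PROXYSQL SERVERS TO DISK;
    -- core nodes therefore monitor each other, while satellites only monitor the core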
A very important hint here: traffic should only be served by the satellite nodes, while core instances should not serve traffic and should only focus on configuration. An important thing to remember is that if a node in the satellite layer is manually modified, it becomes out of sync with the rest of the cluster
unless the core publishes a new configuration. In a setup like this, the instances in the satellite layer know about the instances in the core layer, but what does the core layer know about the nodes in the satellite layer? Up to 2.3.2, which is the latest release,
the answer is very simple: nothing. The core nodes know nothing about the other proxies. In the next release, whose code is already available, not only do the satellite nodes know about the core nodes, but they also generate a UUID that is saved to disk, so it becomes persistent across restarts,
and they advertise it to each node they connect to. In this way, the core nodes will know about the satellite nodes as well. As usual, this information is visible in a table, called stats_proxysql_servers_clients_status. For a ProxySQL instance that is currently connected,
or was connected in the past, the instance knows the UUID of the node and the IP and port it connected from, which is very useful for debugging, and also the configured admin interface of that specific node. In this way, if we want to connect back to that ProxySQL instance, we know how to. Finally, we record the last time we have seen that node.
This is the last time it sent a query, no matter whether it was for pulling a configuration, for registering, or simply for pulling the checksums.
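A rough sketch of how this could be inspected on a core node (the table name is as described in the talk; the exact columns are an assumption to verify on your version):

    SELECT *
    FROM stats_proxysql_servers_clients_status;
    -- expected to show, per client: its UUID, the address it connected from,
    -- its configured admin interface, and when it was last seen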
Next: at the beginning of the session I mentioned that native ProxySQL Cluster has some benefits that other configuration management tools do not have. Specifically, the ability of the nodes to automatically register when they connect to another node allows more flexibility and further features to be implemented. Because the core nodes become aware of the satellite nodes, they can be aware of how the cluster is growing, and we can implement tools that connect to and monitor those instances,
retrieve logs, perform various types of troubleshooting, and so on. For example, if the core nodes become aware of every new satellite node, we can automatically register such nodes into Prometheus and monitor them, or even connect directly to their Prometheus exporter and collect their statistics in real time without Prometheus.
Finally, our next web UI will be able to automatically connect to a remote instance for troubleshooting and to display metrics without the need for Prometheus. Now, a few quick notes about leader election. As I mentioned, in the current implementation of ProxySQL Cluster we do not rely on leader election. Unless conflicting configurations are pushed at the same time to more than one node,
the lack of a leader election is not a problem. That said, the implementation of leader election is probably not complex to execute, as ProxySQL already has built in a series of features that would make the implementation simpler. In fact, it is already possible to configure the admin interface to be in read-write or read-only mode,
thus accepting or rejecting writes of new configuration. Admin can already reply to the read_only monitor check, so admin can already describe itself as a server in read-write or read-only mode. And using the read_only monitor, we can create a replication hostgroup pair,
where the leader is in the writer hostgroup while the followers are in the reader hostgroup. What that means is that we can implement a system in which you can connect to any ProxySQL instance and the requests are automatically forwarded to the leader.
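A hedged sketch of that idea, reusing the existing replication-hostgroup machinery (hostgroup numbers and hostnames are hypothetical; this illustrates the concept described in the talk, not a shipped feature): each ProxySQL node's admin interface is treated as a backend, and the read_only monitor keeps the current leader in the writer hostgroup.

    -- writer hostgroup 100 = current leader's admin interface, reader hostgroup 101 = followers
    INSERT INTO mysql_replication_hostgroups (writer_hostgroup, reader_hostgroup, check_type, comment)
    VALUES (100, 101, 'read_only', 'ProxySQL admin leader election (sketch)');
    INSERT INTO mysql_servers (hostgroup_id, hostname, port) VALUES
      (100, 'proxysql-core1.example.com', 6032),
      (100, 'proxysql-core2.example.com', 6032),
      (100, 'proxysql-core3.example.com', 6032);
    LOAD MYSQL SERVERS TO RUNTIME;
    -- a query rule could then forward configuration writes to hostgroup 100 (the leader)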
Finally, I want to thank you all for attending this session, and I invite you to visit our website for further information, to join our mailing list for any questions, to find us on GitHub to file issues or simply to get updates about development, and to follow us on Twitter. Thank you.