Merken

PGCon 2014 Lightning talks

Zitierlink des Filmsegments
Embed Code

Automatisierte Medienanalyse

Beta
Erkannte Entitäten
Sprachtranskript
on the well you will not be able to see it on you and on the other end of the program you have well you say finally we think the you to hear
from you know what from the title was
placed on the right track In my mind I want you to know all of the other and that involved here that almost sees it up for you and for us because we have 2 microphones and 2 microphones in books and you need both because of the way in which the energy difference what is for people here and 1 is to report in and for local many this check check that I've made influence of the enemy about whether to start the legend that I think this going be fairly hectic but you know that that's lightning talks going on I think you'll be fine but 1st of all we have URI from 3 gives you your work and they're both the both of them in both of would require removal of his work with you you you you you can use all of theory and get the right there I mean the animal but I like it a lot I'm liking for those that can indicate that he had thrown also layer because I was to English you speakers so I ask you this and I had the right the past and I I do like to introduce our but could you did that and database right on and product based on PostgreSQL yeah so the name is taking a database of 80 km 18 year for so many ways and that and developed a
day TDL fall Mr. simply because he's been why new weak users of possible escape of database for mission-critical systems so he unsound and this cat so this chapter is a performance measurement of and by then i . si by people's redeployments is possible enterprise consuls jumping the plan and I didn't know that the best included in preparation to the CPU core number so that was the best you high high did I have it here a region hence and also has a high performance future cell and there were as we had this is why I have this in the July we chose an impossible to get full so this rock about
HEAD so a caveat has still point of fast point is all those that we use and how was that has a lot of the US and Europe was addressed yeah and infringment additional function of problem to avoid and made it changes the colossal scores and another point is put into into the Redstone technology layers the hardware and software so we propose and this and leave provide a CD of the optimized stay on for 10 and as the ones from story from this way for a solid and we provide a can be out in the the state data of were hardwareredundancy configuration and self dual operation designed so just like easier for anyone to operate a set up and operate so without assuming of even of precast systems in this region like idea is a parody is making an investment of how the way this is how the
spasticity there's so when you open the front cover so you can see the star back and and then the what the talk state is duplicated so that other sort by the predicted by possibly canister ringing the precondition and about how to is the right so now also we have available to assess the need to upload the I O bottleneck in I have enough time so I right the introduced possibly due to performance by its the data sets in this way axioms highly that the and high performance of all possible so next I had the right to it frank about software which provides a prior on some of them so oppressed model that integrates of where and how do we so
this government there are all over all hardware all and and was addressed year so when did the best last approximately that identify where eyes and its use on the so where so the example of the east you on the screen so use not only creates that the public but so In a prize money vessel it into 4 so said does so some from the finish here you will be and so they thinking
that the probability of linking more useful stated the aim of thanks to but let's try year always good to have powerful and this is I
think when of the idea that the the media presentation example 2 variables are not at all
the up to you know you need to to you cannot do that because you of what you are right what yet another 1 ones that you have to do that on purpose both of them to altogether and at the end of 1 of thank you very few of them died because a race or a probable more like the 1 with area part of my mark you're helping the style of presentation to the Web as a curse sales and from bone organ for you for when offense so today checked and asked to do up
weighting talk asked Thomas if
he could uh similar patch
I have keep trying and the next year so yes
toward see what his cousins and doing so concerned you'll see you can pick them up at the interaction more that you have
on 1 of the things that have opened and this is due to the a lot of of the of the the of the 1 of the the of the course
of that you have if you like and don't
try to homonyms is David then I wanna kill middleware and I have been working with that integrating how we do this that right a writer of the working with integrating some kind of database stuff and web applications for about 20 years now and to say that it seems ridiculous but yes it's been a long time so we have data storage we have rendering and we've got this thing in the middle and what it's done in the past is astonished at your request handling connectivity didn't relations business logic HTML creation and it sends it back and forth and we've got something talking to the database and HTTP and HTML in the back that was the old days today what we have is we have a variety of finance that you can't see because it's too dark and a similar rendering data-manipulation business logic in a sting operation are all on the front end now the back and we've got data storage we still have that but we have data manipulation we have business logic and in the middle all we have is http request handling and connectivity that's what we have now OK but what we can have with the advent of course Chris 9 . 2 and above is we can have this all those things that are in the database plus the Jason type and if we add http request handling in there we get this magic situation where we get to bypass everything in the middle and ask the question what is it for so this is what I want OK I have a demo of this working a created an extension using digging new open-source http request handling created that extension of ones in as a part of the database accepting J. Sun and and running start procedures and that are described as adjacent request returns jason back to the browser works perfectly if anybody would like to see it or would like to contribute war like to just know more about this please come see me after
thank you
fj
you will like people were quick thank you all library so I thought
I was looking at your next to useless in the mind and it
on the other right you where you are on this that you the Europe but
I those in the queries and to
and that of our last updated and saying if you play that give to that
it's it's smaller and intermission do you do do do do do do do static uh and that didn't the new in world community know what the other guy gave you 1 your and you will will note here of you felt very and guarantee that the double the man I think there is no torque calmer about something I imagine that 0 yeah we adopt the new period here I have been done where the the Baghdad going over the right eye and the way they are perhaps you need anything else related to the time I I'm run on I walked for that it will a French position company things you may have gotten the French but and I need to talk to you about my latest that project the g equals that's so what is it about its uh extension which we collect statistics about pretty that's reason enough to Piaget's that's that's nice but what's the same and in fact most of the code was rich source which is that the of uh in and it's avirulent subjects and as this thing really is because it's not been tested extensively like so what does it do it's it's installed as a benefit which included into a fast query full signal on the phone so good and some at Somoza around all some good and some of the operator of some constants or something like that and then you can query statistics of this what is it good for what doesn't when the correctly it's elevated with can help you find which of your occurrence you'll requesting against the most frequently and the next logical step is considered suggest missing and this is based on the statistics so if you will always scoring 1 uh 1 column with another you will need a composite index fund was the school and that that etc. so it's the index and lies on the next step to
this is to be able to suggest a facility indexes and looking that going to use concerns and when it's just for Rangers win was doing good and subsequently join together so just and then also some with this states to ensure it doesn't question but in all of some use for it this is the opportunity send you a few that that is the effect of the of the
PR or both as far as have to off on your right we visually you
begin to make hello my name is more sonar mirror charge you about um was called were bi-directional replication around it's saying upcoming we hope extension impose 1st in about 2 years ago biology for erroneous probably this now 5 times and wants to on but 2 years ago we saw a need within pose 1st so we didn't have I by we want our about that later by bi-directional replication is basically built off of what is now where everyone knows the streaming replication of we had a specific need for William replication around being multi-master there some other products out there that do multi-master that are tightly coupled nodes and but we kind of needed something that was going to spend the global and another thing that can do is filter replication so this is filtered at the database level right now if you do streaming rep you get everything in instances most people probably where another goal is easy least we wanted to look at and so is it look like as a said it looks a lot like streaming replication does right now and unfortunately to get it you still have to recompile and but basically you tweak your post 1st feel comfortable loads and promoter preloads libraries you set up some connections but and then as per database and then you go so it was working on this and so I would like to say thank you the 2nd quadrant they're doing pretty much all the development work on Intel security is where I work that's formerly McAfee were helping with some funding and a little QA work docs so they're on the wiki for oppose her skill that word it is totally open everybody can it's been donated back to P D anything that we produce and same with documentation and everything like that there are a several pieces of the framework that have come out of this and other people look like they're using for things that we hadn't even originally intended which I think is some other point of uh giving back to the community by installing a few
of you you keep in mind that you know all the time but they're
called had its own you from side to state of an outsider status rebuilding a scalable sequel that scales through dozens or hundreds of machines today on the talk about the colonists for it so we open source economist or what appears to be how many of you have heard about what the column like health maybe 10 20 per cent at a really high level it's basically instead of storing data are viral your storing if they need of column by column and this is an extension that releases the foreign data approved about math back and review building it you know when a bill that from scratch we looked around and there was this open source specification out there which was called the RC file with demonstrable optimize rc file format that came out of Facebook a couple years ago and basically the 1st version because the final version was when they were making up their data policy city of 100 million rows and 3 columns column a column B call them see the idea was they put like 1 million call maze by 1 million colonies followed by 1 million column cities and then of the new moving column they publish the specification on I I believe and it picked up particularly on the whole the world of things it's out there they also have some performance comparisions on the published paper about 2 years later they developed a 2nd generation of it so that's the optimize rc file format it's basically again a specification and optimizations they abandoned RC file format were a lightweight indexes min-max indexes and enabling different compression methods so this is again out there on the internet if you're interested these 1 take a look at the basic pick that up and made of alpha data rapid that integrates nicely into post sequel solely to this great extension and create the table and I start running your queries so this is nice this 1 which is stable compression ratios again this is TPC-H which of ideal benchmark because most of of our customers so looking into using this are they typically have hundreds of columns and you you look into this when you're about to make a liar what again time to any columns integers big-integers numeric types states so it's basically there and the velocity file formats basically equiv prison groups all time change in here you see the compression ratios in the benchmark where 1 is 3 to 4 x so depending on your use case this could effectively increase your memory utilization if Europe where there's a memory if you're using necessities you get like most bang for the buck because you're assistance storage effectively increases and also if you're rotational disks on necessities you benefit from reading the data your disk throughput like 3 to 4 x and even if it's integers like up to 10 x 15 x so that's the compression ratios than the last along the line item table years the biggest table that's
that's light and this is the other the slide which again using the TPC-H benchmark which is not again the ideal benchmarking because for most of our customers they have hundreds of columns and and their queries are only taking a few but then again this is an easy to instance on the rotation of this and all of of this is here just like because it's important in here is saturating the entire disk bandwidth but we're using less CPU coffee sold running these queries you gradually so that this bond with the saturated but the CPU we have all the CPU cores are not utilizing and the benefits are not again looking into you more or less about 2 x and the benefits depend a lot on the query itself or queries where most critical isn't that select by CPU such as query number 6 see strong provides bigger benefits because then you reading fewer chunks so they talk like only the columns you're interested in and the compressed data from this 1 and other communities where it takes the postcritical longer to execute say query number 3 in the TPC-H benchmark the benefit is relatively small knowledge but then again there is a benefit particularly when you're about to make on this guy and and those are the 3 slides that I have 1 of these
things I have that you know the heroes
speaker with them by you don't allow you to update it that there is no knowledge about the viability of a work the State of New Guinea anyway yeah when I think of that the status quo began where the mother of all databases
the present and in the bank and in all the words in the sentence as upon the deep and I was not investigated and we didn't have a sequence of anything or not and then you can only have come out in his head with
the right and has light and it was the 1st day and chemical Sternberg Michael Stone that'd be began interests and that was the 1st generation anatomical so notably squares and that was the 2nd generation and the Americans and they're
going off and the other of other things that we don't really care about examination and principles in them and if you tried any of them but when they are concerned with the generations of was began olestra and
all questions will see eaten that Informix which was eaten that idea and that was the 3rd generation and political
yet and this URL and and be this thing called my SQL which I don't think everybody uses anymore but I had mentioned and
because they get Postgres 95 and post 95 beget political SQL and that was the 1st generation and
got published grammars 124 common and yet many things including bridge in the field of interest equal my favorite name and perhaps progress in WordNet database and manifolds with you all know much infertility with us anymore and consequently
begat pervasive post-growth and the the 1 of these is still with us can you guess which 1 and Conference QA
begat PG cluster and PG cluster to those with this anymore so then we began at the
because x and post see yet strong deviance from BBB get POST grows XL and post was going
to be Telegraph CQ and CQ began to the user
and the system of the Atlas was all our and posters on the translators lattice and then post goes into the realm of data warehousing
and they get many think it that's not to use them and it began to grow as and this was the yet green plot and particularly at after data and tools which you'll be at the excelent parents that began with it was redshift that we are not done in the realm of data warehousing because
post you already briefly with and begat had to which began that and also began Yahoo Everest and the cytostatic and here you can
see the family of many generations of because you
have a moment of a of all our we yes you you might
call the model the whole language in front of the camera and the whole development of modern people working in the area that we're going with it right so I'll talk about 1 of the people he gets here post-growth Excel and yet
another pose present from related cluster of the got from X there's some code there there's other parts that are different than planet executer actually fairly different so as I said some same properties cycle cluster in the acid all that but adds in data noted in a direct communication and MPP parallelism also being and right scalable it'll does later
scale out there was a TPC-H slide earlier here's this is kind of outdated but does give you nice performance boost for us to BCH type of workloads and also so earlier somewhat presented about upon their swords and it should be pretty easy for us to adapt that and so we should be aware of the benefits of interview parallelism and compression so we could probably shrink these another you know 2 3 4 times whatever that the benefit was on the other 1 so I'm I'm excited to to go that integrate that and on the road of and
policy create tables to specify a distribution key in this case and distributed on that what plans look
like would be if you're joining on their respective and joint columns it's going to send this plan to both data nodes in this example and facing in a push it down both to do a local has joined in it's basically merge those results not to get different cases where we're doing a joint on 1 call this on the house distribute column another 1 on the other table it's not it's going to simply send out a plant each DataNode but when it executes 1 part of that tree it knows all of I need to have on column 2 and send it over to the other nodes so conjoined with column 3 so we try to post the people of the plants as much as possible and make it and MPP architectures that kind of an open source of Greenplum
type thing so we do have some differences compared the oppose CUI performance that multi-tenant security as well we tried a lot that down to the database PG user you can take a look at that you can see your own information if someone's motivated wants to pull that out and put it into post-process feel free we also collected this additional statistics in terms of the data the project we want emphasize stability and bugfixes over adding new features so that people feel comfortable that their data is safe and because of the scale of cluster focus on performance and we want to be a very inclusive can of communities so if people have some of the 1 of participate roadmap Discussions or priorities of you know we wanted to be driven by the users of so we want to be inclusive of encourage you to get involved
on I was in the X project also from the beginning so as my final slide I I also like to ask which is Suzuki of entity if you'd be interested in being a co project lead on on postcards excel and working on the project thank you touch where we
have a non-canonical might understand analyzing rarely hear me OK and 1 of the networks coupled line alone but we don't know while the party tonight so basically what it
is the bringing up over here so people can see if you're not that this is
not use the website in a way
that no want to know what these thoughts yeah they don't like people like you know a great use those little event the major social
event is that Johnny make use them out in Gori my going and looking around with the right across from what we on the board of money with that you can find it on google make sure that you bring your back because that is very important and that you tiny on this will be like a book that I have going on here with the Diamond had make up because you know what I have 4 minutes to talk to you have to have a population and the other the thing that I will forget that you have to understand that the body to manually about of those at reception I've been told that probably gone but you walk good right somebody's but they didn't really know who but but we we would to buy to bias the bias in the room maybe the engages celebrating or whatever hell that thing at the that that I get another 3 minutes here but is not solemn negatively you offer that I suppose that goes to the back do wherever diabetes against the barber something from the there will be alcohol everybody will have a lot of fun and that
Physikalisches System
Metropolitan area network
Indexberechnung
Optimierung
Computeranimation
Subtraktion
Datenhaltung
Automatische Handlungsplanung
Zellularer Automat
Zahlenbereich
Physikalisches System
Biprodukt
Zentraleinheit
Computeranimation
Metropolitan area network
Physikalisches System
Energiedichte
Weg <Topologie>
Rechter Winkel
Softwareschwachstelle
Maskierung <Informatik>
Speicherabzug
Client
Speicherabzug
Computerunterstützte Übersetzung
Unternehmensarchitektur
Zentraleinheit
Einflussgröße
Lokales Netz
Offene Menge
Dualitätstheorie
Server
Punkt
Datensicherung
Computeranimation
Eins
Metropolitan area network
Informationsmodellierung
Software
Standardabweichung
Datennetz
Gamecontroller
Speicherabzug
Konfigurationsraum
Modul
Schreib-Lese-Kopf
Hardware
SCSI
Addition
Lineares Funktional
Nichtlinearer Operator
Hardware
Softwarepiraterie
Speicher <Informatik>
Überlagerung <Mathematik>
Physikalisches System
Quick-Sort
Software
Menge
Rechter Winkel
Dualitätstheorie
Axiom
Zentraleinheit
Aggregatzustand
Offene Menge
Server
Hardware
Speicher <Informatik>
Expandierender Graph
Computeranimation
Datensichtgerät
Wiederherstellung <Informatik>
Standardabweichung
Datennetz
MIDI <Musikelektronik>
Touchscreen
Hardware
Metropolitan area network
Benutzerbeteiligung
Variable
Flächeninhalt
Selbst organisierendes System
Mereologie
Hypermedia
Regulärer Ausdruck
Indexberechnung
Kombinatorische Gruppentheorie
Computeranimation
Eins
Portscanner
Webforum
Metropolitan area network
Patch <Software>
Gewicht <Mathematik>
Interaktives Fernsehen
Ideal <Mathematik>
Computeranimation
Middleware
Metropolitan area network
Logarithmus
Regulärer Ausdruck
Indexberechnung
Computeranimation
Gammafunktion
Einfach zusammenhängender Raum
Nichtlinearer Operator
Demo <Programm>
Datentyp
Open Source
Datenhaltung
Browser
Web-Applikation
Relativitätstheorie
Speicher <Informatik>
Indexberechnung
Mathematische Logik
Algorithmische Programmiersprache
Computeranimation
Eins
Middleware
Metropolitan area network
Middleware
Mereologie
Datentyp
Debugging
Maßerweiterung
Speicher <Informatik>
Varietät <Mathematik>
Middleware
Metropolitan area network
Rechter Winkel
Programmbibliothek
Regulärer Ausdruck
Indexberechnung
Vorlesung/Konferenz
Bitrate
Computeranimation
Nichtlinearer Operator
Statistik
Ortsoperator
Element <Mathematik>
Varianz
Indexberechnung
Abfrage
Befehl <Informatik>
Quellcode
Nichtlinearer Operator
Hecke-Operator
Frequenz
Analysis
Code
Computeranimation
Konstante
Mustersprache
Metropolitan area network
Moment <Stochastik>
Chatten <Kommunikation>
Automatische Indexierung
Login
Statistische Analyse
Maßerweiterung
Gammafunktion
Metropolitan area network
Soundverarbeitung
Automatische Indexierung
Tabusuche
Schlüsselverwaltung
Automatische Indexierung
Partielle Differentiation
Indexberechnung
Computeranimation
Aggregatzustand
Punkt
Multiplikation
Regulärer Ausdruck
Ikosaeder
Framework <Informatik>
Computeranimation
Übergang
Datenreplikation
Programmbibliothek
Weitverkehrsnetz
Punkt
Softwareentwickler
Maßerweiterung
Kartesische Koordinaten
Umwandlungsenthalpie
Elektronischer Programmführer
Datenhaltung
Computersicherheit
Indexberechnung
Wiederkehrender Zustand
Übergang
Wiki
Biprodukt
Zustandsdichte
Verschlingung
Last
Rechter Winkel
Wort <Informatik>
Instantiierung
Metropolitan area network
Indexberechnung
Extrempunkt
Computeranimation
Minimierung
Gruppenkeim
Versionsverwaltung
Regulärer Graph
Fortsetzung <Mathematik>
Kardinalzahl
Drehung
Sommerzeit
Marketinginformationssystem
Computeranimation
Übergang
Metropolitan area network
Dämpfung
Notepad-Computer
Quellencodierung
Gerade
Benchmark
Umwandlungsenthalpie
Automatische Indexierung
Zentrische Streckung
Softwareentwickler
Gebäude <Mathematik>
Abfrage
Systemaufruf
Gefangenendilemma
Generator <Informatik>
Automatische Indexierung
Festspeicher
Dateiformat
Tabelle <Informatik>
Aggregatzustand
Geschwindigkeit
Partitionsfunktion
Facebook
Stabilitätstheorie <Logik>
Subtraktion
Mathematisierung
Regulärer Ausdruck
Virtuelle Maschine
Datensatz
Mini-Disc
Datentyp
Jensen-Maß
Maßerweiterung
Speicher <Informatik>
Tabelle <Informatik>
Open Source
Softwarewerkzeug
Indexberechnung
Paarvergleich
Elektronische Publikation
Modallogik
Personal Area Network
Indexberechnung
Regulärer Graph
Abfrage
Zahlenbereich
Drehung
Zentraleinheit
Computeranimation
Rechenschieber
Metropolitan area network
Spezialrechner
Trennschärfe <Statistik>
Mini-Disc
Speicherabzug
Bandmatrix
Ganze Funktion
Benchmark
Instantiierung
Folge <Mathematik>
Abstrakter Syntaxbaum
Datenhaltung
Wort <Informatik>
Computeranimation
Aggregatzustand
Relationale Datenbank
Generator <Informatik>
Quadratzahl
Rechter Winkel
Code
Vorlesung/Konferenz
Streaming <Kommunikationstechnik>
Computeranimation
Binärdaten
Computeranimation
Datenfeld
Arithmetische Folge
Datenhaltung
Formale Grammatik
Red Hat
Bridge <Kommunikationstechnik>
Ubiquitous Computing
Bridge <Kommunikationstechnik>
Topologische Mannigfaltigkeit
Computeranimation
Metropolitan area network
Data-Warehouse-Konzept
Verbandstheorie
Translation <Mathematik>
Physikalisches System
Computeranimation
Metropolitan area network
Data-Warehouse-Konzept
Vererbungshierarchie
MIDI <Musikelektronik>
Plot <Graphische Darstellung>
Computeranimation
Metropolitan area network
Informationsmodellierung
Generator <Informatik>
Flächeninhalt
Momentenproblem
Formale Sprache
Familie <Mathematik>
Indexberechnung
Leistung <Physik>
Softwareentwickler
Computeranimation
Telekommunikation
Zentrische Streckung
Kategorie <Mathematik>
Atomarität <Informatik>
Atomarität <Informatik>
Code
Computeranimation
Richtung
Rechenschieber
Metropolitan area network
Beanspruchung
Code
Datentyp
Dreiecksfreier Graph
Mereologie
Parallele Schnittstelle
Quellencodierung
Resultante
Tabelle <Informatik>
Distributionstheorie
Hash-Algorithmus
Open Source
Automatische Handlungsplanung
Systemaufruf
Computeranimation
Portscanner
Netzwerktopologie
Knotenmenge
Distributivgesetz
Mereologie
Schlüsselverwaltung
Innerer Punkt
Tabelle <Informatik>
Zentrische Streckung
Addition
Subtraktion
Stabilitätstheorie <Logik>
Statistik
Computersicherheit
Datenhaltung
Term
Fokalpunkt
Computeranimation
Rechenschieber
Metropolitan area network
Datentyp
Projektive Ebene
Information
Inklusion <Mathematik>
Webforum
Dualitätstheorie
Web Site
Softwareentwickler
Gruppe <Mathematik>
Datennetz
Dualitätssatz
Indexberechnung
Knoten <Statik>
Extrempunkt
Ereignishorizont
Systemaufruf
Computeranimation
Metropolitan area network
Faser <Mathematik>
Ereignishorizont
Hacker
Gerade
Personal Area Network
Gammafunktion
Metropolitan area network
Rückkopplung
Verschlingung
Rechter Winkel
Güte der Anpassung
Indexberechnung
Ereignishorizont
Disjunktion <Logik>
Ereignishorizont
Whiteboard
Computeranimation

Metadaten

Formale Metadaten

Titel PGCon 2014 Lightning talks
Serientitel PGCon 2014
Anzahl der Teile 31
Autor PGCon - PostgreSQL Conference for Users and Developers
Mitwirkende Crunchy Data Solutions (Support)
Lizenz CC-Namensnennung 3.0 Unported:
Sie dürfen das Werk bzw. den Inhalt zu jedem legalen Zweck nutzen, verändern und in unveränderter oder veränderter Form vervielfältigen, verbreiten und öffentlich zugänglich machen, sofern Sie den Namen des Autors/Rechteinhabers in der von ihm festgelegten Weise nennen.
DOI 10.5446/19090
Herausgeber PGCon - PostgreSQL Conference for Users and Developers, Andrea Ross
Erscheinungsjahr 2014
Sprache Englisch
Produktionsort Ottawa, Canada

Inhaltliche Metadaten

Fachgebiet Informatik
Abstract Various speakers & subjects. Database Appliance based on PostgreSQL OHAI, My name is Chelnik! BDR - Bi Directional Replication What the XL?

Ähnliche Filme

Loading...