Disparate data, technology fiefdoms and 65 pictures of your cat

Automated media analysis

Speech transcript
[unclear] We're going to talk about open data, so let's get going. Data comes in many shapes and forms, as you deal with it every single day, but we should note that data sits on an infinite spectrum of possibility, confined only by our common understanding of the universe. In this example we understand temperature, you probably get the idea of check-ins [unclear], and I've got a strong familiarity with ice cream. This fact is somewhat geographically contextual; it's coming from Foursquare. The point here is that I can understand this data point without much further explanation. It kind of makes sense. It's a solid general statistic, one which has been derived from a vast array of crowdsourced data, so it's coming from a whole bunch of other little data points to make one big one. It's also relevant to a greater understanding of things like ice cream and marketing, and perhaps the summertime habits of humans. So it sits on what we'll call a wide open spectrum. [unclear]

I started Sparkgeo, my company, four years ago, but before that I was deeply embedded in data: doing statistical analysis, doing GIS, doing a whole bunch of stuff with remote sensing, doing the spatial distribution of [unclear] types of Scots pine saplings in the soon-to-be independent country called Scotland. [unclear] Then I came across the pond to analyze forestry data and resource data. Since we started Sparkgeo we've been helping [unclear] social networking, so the magnitude has increased enormously and the type of data has changed a lot, but it still comes down to data, data, data. I would imagine that for a lot of you the story is somewhat similar to that.
Every day you're messing around with pretty weird data. That's one of your central value propositions: as a person, you know how to deal with data. So Sparkgeo is a technology company. There was a relatively recent time when clock speed, pixel density and specifications like that would drive technology. What has changed is that we care a lot more about features, because specifications are, by and large, good enough: computers go pretty fast. We care more about our experience, and experience more often than not is driven by data. That means that, for Sparkgeo, data actually drives your experience of the internet, and in fact data drives many other parts of the world. It's a measure by which Sparkgeo is graded; it's a measure by which, I would argue, we are probably all graded in one form or another. In the end it doesn't really matter how good your map is, because if the data is wrong, the technology has failed. It's galling when you go to so much effort to build a wonderful map where all the icons look lovely but are in the wrong place. That's when people get upset. I think many other companies and organizations probably see themselves the same way. I would argue that many companies which used to be something else are probably now data organizations [unclear], certainly municipalities.

So as a technology company we're also a data company. We live at the intersection of technology and data, particularly within the context of geography, and open data is also [unclear]. It's worth noting that in BC we're super lucky: the provincial government has done a fantastic job, there are lots of resources, and over the years the access to data has got better and better. So first of all I want to congratulate all those people who made that happen [unclear]; some of those people are at this conference. It's an evolving process, but I think this story of kind of
progressive openness around data is one that is witnessed across the [unclear]. I think we see a lot more openness and a lot more data publication: states, provinces, cities, regions. There's lots more data out there, so that's a great thing. Great job, guys. But, as was just mentioned earlier, I'm a Scotsman, which means I'm never actually happy or terribly satisfied with the situation. So [unclear] I'm going to tell you a story.
So I come from a little town in the north of British Columbia. This is a map of my little town. [unclear] My story starts with a hackathon we held in Prince George. We were looking specifically at open data for the city and the regional district. We just wanted to get a bunch of technologists together; we're a small town [unclear]. The data is really good, and [unclear] these cities have just been releasing datasets, which is really cool. So we had various teams do various different things there. One team in particular had a problem they wanted to solve. They had a simple idea. They thought, "Hey, we want to compare the budgetary financials of different municipalities with each other and find out where you get the best bang for your tax buck": understand where you should live to get the best services [unclear]. From a business perspective that makes perfect sense. The idea was to give the consumers, the citizenry, an idea of the best-value municipality [unclear]. It seemed reasonable, it seemed interesting, and it turned out to be quite a tall order. That's mainly because no one is really talking the same
language. By language I don't mean spoken or written languages, programming languages or even data transfer formats. I'm talking about the raw, absolute data points: the numbers. The numbers published by different municipalities mean different things, which means there is no opportunity for any level of comparative analysis. The hackathon team were left comparing apples with oranges. Because of the vast spectrum of data, the municipalities of BC found themselves seeing and measuring the financial world in slightly different ways. That slightly different perspective led to slightly different financial approaches, which meant completely different data products, which meant no dice for the hackathon. The point here is not to beat on municipalities [unclear]; they've come through very troubled waters to get to the point where they are releasing data. The point is to highlight that there is perhaps an opportunity cost in general around this kind of stuff, and
in reviewing the data available for the comparative analysis of budgets, we found that a whole bunch of different technologies were applied: different technologies, different platforms, a whole bunch of different stuff for providing data. [unclear] We also see a whole bunch of tools and technologies; there is a gazillion different tools for different jobs, and that is a good thing. The expectation of the hackathon team was not that they would find exactly the same thing; I think that would be unrealistic. It was that they would find maybe different dialects of the same language: things common enough that you can join them together in a meaningful way. [unclear] I said, "That's a great idea, guys, you should do that," secretly wondering if they would come up with some special sauce that I could steal and use in my work. [unclear] But I was personally quite validated by the fact that it didn't work. This is the problem I face every day, and thank God I hadn't missed a trick.

The real thing is that the barriers to this problem are many and complex. There are human barriers and technology barriers; there's the [unclear] environment, security, [unclear], licensing and vendors to consider. There's a bunch of considerations. But underneath, [unclear] does the data care about any of those considerations? And how long will it be before you move on to the next piece of software for serving or disseminating the data? When will the next [unclear] internet format come out? It's worth considering the process of just publishing open data to a website just because you can, or maybe because it's a good thing. With this in mind, the real value of data is of course the data, not necessarily the technology housing it, and this is an important thing, or the software supporting
its distribution. It's the actual ones and zeros, the data, the values in those tables. More so, the value of each data point increases every day as well: as its temporal extent increases, its actual value increases too, because you have more information. I mean, Landsat, for instance, is a hugely valuable dataset because of its longevity. And that happens entirely independently of the software or the technology. That happens because of the data, its age and the consistency of its capture. So we should make sure that we're capturing and publishing the right data, because if we're not, then again we face the idea of opportunity cost on our investment in it. [unclear] Context is really interesting.
Context is a really important thing. Without context you get a skewed impression of what our world actually looks like. You might be confident in knowing that your little piece of the world is just right, but unless you have a good idea of what's happening around you, you kind of end up with [unclear]. You end up with this idea that you've got your piece right and you carry on, but you don't have this idea of context. An extension of this is the idea that we could add enormous value to our data by publishing it in commonly understood manners. So let's think cats.
The University of [unclear] did a wonderful study. They found that there are 14 billion images of domestic cats on the internet, of which 2.7% have [unclear: "bred render heads"]. Indeed, there are only 220 million domestic cats in the world, which [unclear] means there are 65 pictures of every single cat on the internet. [unclear] The massively popular phenomenon of cats on the internet is a combination of cuteness, convenience and compatibility. Think about it this way: each cat data point is commonly understood by both the computer and the person. There are only a few popular image formats, and for the most part they are well documented and well understood. The ability to take a picture of a cat is somewhat ubiquitous; it's easy to do. And the data points are perhaps just slightly different dialects of the same language. So they're easy to share, they're easy to manipulate and they're easy [unclear].

Isn't that what we want from open data? Consider the multiplication factor that we had with the temporal nature of data, and then consider what the network effect is if we commonly publish comparable datasets, if we understand each other with different dialects of the same language. This is the data utopia I think we need to strive towards.
[unclear] All of this is really hard; this is really, really hard. But what's the first easy thing to do? The first easy thing you can do to make your data readily available to everyone else is to publish your data in well-understood projection formats. I'm sure that your local conic conformal projection measures area better and it's got better distance, but the rest of the world, the rest of the web mapping world, who want to join things together, care about [unclear]. It's typically either a button push or a single line of code to also publish your data in a commonly understood projection system, to get it in a commonly understood manner so that anyone who wanted to could say, "Hey, cool, I can consume that into my web map." [unclear] The hard bit here is not the technology. The hard bit here is actually the advocacy and the willingness to commit to what I'd like to call the commonwealth of data: a commonwealth of data formats, a commonwealth of data lumps that we can all access. I think that this is the key takeaway here. The key
thing is that, for instance, every individual municipality's data becomes more valuable the more it can be commonly understood in the context of other municipalities. I keep on beating up on municipalities; I just mean any entity that publishes data. [unclear] Companies could also be doing this. Every province's, territory's, state's or company's data becomes more useful when it can be placed within a much bigger context. I propose we congratulate ourselves on making a huge leap forward in publishing data, but that we start thinking a little bit more about what to publish, start talking to each other and, ideally, publish the same thing. And there's a picture of the
cat [unclear]. Thank you very much.

Thank you. So you said they should be publishing in 4326. Isn't that just the technology of the day? Where do you draw the line [unclear]? Would you agree?
I agree that 4326 [unclear] is indicative of the technologies we're using right now. But I think lat/long in general terms, WGS 84 in general terms, is probably not going to disappear [unclear], because it's most convenient, and frankly we're measuring latitude and longitude as kind of the de facto global measurement system [unclear]. If there is a better one [unclear] going out there, [unclear]. Google and Bing and all these other guys joined together in doing the same thing, [unclear] that's an awesome approach, and now we can publish in the same thing no matter what manner we want to display that data in.

[unclear] the OGC standards [unclear] which ones should we choose to support? [unclear]

I'd also argue that the OGC can provide a standard, but we could spend an awful lot of time jumping onto the standard and doing that, or we could do the thing that the good working platforms already have. So what I mean is that this is the pragmatic piece here: an easy thing to do to get your data to everybody who's using web mapping applications [unclear] is to press the button or write the line of code that says "transform". [unclear] It just seems like a very straightforward approach to getting over the hump [unclear]. It's a global kind of use of data, in that we want more people to share more data. I think that's something we'd all like to see, and a quick way of doing that is publishing in commonly understood projections.

You mentioned at the beginning that the municipalities didn't just have different data formats, but that the numbers meant different things. And so the
publishing standards are a good way of normalizing the publishing of that kind of data, but how do you get municipalities to start tracking the same numbers and talking to each other in the same language?

[unclear] It's a human decision about what to track. I think the trap, the open data trap, is, "Hey, we've got this thing, [unclear]," and the hard bit is to really think, "OK, we should actually have some kind of understanding of what is commonly useful to the community," and maybe that involves manipulation [unclear].

My experience is that that typically [unclear] makes it harder [unclear] the release of data [unclear] the velocity.

So that's a risk, sure. But I think the network effect of people talking the same language, being able to have some level of comparison with each other, is enormous. I think there's huge value there, and I think the [unclear] municipality or state, province or country can nicely leverage that themselves.

In terms of the sort of interoperability you're talking about, with data from multiple municipalities, multiple sources, besides the SRS, what parameters are you running into? [unclear]

The key thing is that people publish different stuff about the same thing. They call it the same thing, but it is an entirely different entity. So the columns are different; they hold different information. In essence it's comparing apples and oranges.

So this sounds like what you're really talking about is developing a set of ontologies.

Yeah, but I don't use those words; [unclear]. All it is is a semantic understanding of what would be useful
in the community [unclear].
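
The "button push or single line of code" that the speaker describes for publishing in a commonly understood projection could look roughly like the sketch below. It is not the speaker's code. It assumes the source coordinates are already longitude/latitude in EPSG:4326 (the "4326" raised in the question) and converts them to spherical Web Mercator (EPSG:3857), which is an assumption about the target that most web maps share. The function names and the sample coordinates are hypothetical; converting from a local conic projection would in practice go through a projection library rather than hand-written formulas.

```python
import math

# Radius of the sphere used by Web Mercator (EPSG:3857), in metres.
EARTH_RADIUS_M = 6378137.0
# Web Mercator is only defined up to roughly 85.0511 degrees of latitude.
MAX_LATITUDE_DEG = 85.05112878


def lonlat_to_web_mercator(lon_deg, lat_deg):
    """Convert EPSG:4326 longitude/latitude (degrees) to EPSG:3857 x/y (metres)."""
    if not -180.0 <= lon_deg <= 180.0:
        raise ValueError("longitude must be within [-180, 180] degrees")
    if abs(lat_deg) > MAX_LATITUDE_DEG:
        raise ValueError("latitude is outside the Web Mercator range")
    x = EARTH_RADIUS_M * math.radians(lon_deg)
    y = EARTH_RADIUS_M * math.log(math.tan(math.pi / 4 + math.radians(lat_deg) / 2))
    return x, y


def web_mercator_to_lonlat(x, y):
    """Inverse transform: EPSG:3857 x/y (metres) back to degrees."""
    lon_deg = math.degrees(x / EARTH_RADIUS_M)
    lat_deg = math.degrees(2 * math.atan(math.exp(y / EARTH_RADIUS_M)) - math.pi / 2)
    return lon_deg, lat_deg


if __name__ == "__main__":
    # Approximate, illustrative coordinates for Prince George, BC (an assumption).
    x, y = lonlat_to_web_mercator(-122.75, 53.92)
    print(f"x={x:.1f} m, y={y:.1f} m")
```

Publishing both the native projection and one of these widely consumed ones keeps local accuracy for local users while letting anyone else drop the data straight into a web map.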

Metadata

Formal metadata

Title: Disparate data, technology fiefdoms and 65 pictures of your cat
Series title: FOSS4G 2014 Portland
Author: Cadell, Will
License: CC Attribution 3.0 Germany:
You may use and modify the work or its content for any legal purpose, and reproduce, distribute and make it publicly available in unchanged or modified form, provided that you credit the author/rights holder in the manner they specify.
DOI: 10.5446/31722
Publisher: FOSS4G, Open Source Geospatial Foundation (OSGeo)
Publication year: 2014
Language: English
Producer: FOSS4G, Open Source Geospatial Foundation (OSGeo)
Production year: 2014
Production place: Portland, Oregon, United States of America

Content metadata

Subject area: Computer science
Abstract: Up in the frozen wastes of Northern British Columbia, we organized a hackathon. We based it on the ideas of open data and civic applications. Our hardy hackathoners pulled together a number of excellent ideas but met with a constant and obtrusive barrier: open data may be open, but without some level of standardization it's not actually very useful. Now, no one said that data had to be 'useful', and perhaps if we want the technology utopia of real open data interoperability we will need to "build it" ourselves, but it is worth noting that talking the same language as our neighbours is generally awesome. Indeed, perhaps rather than swearing fealty to our technology overlords and just pressing the "publish document to open data platform" button, we could think about the commonwealth of data. The value of any data increases wildly with density, and open data should be more valuable! The cats? Well, you'll have to tune in for that bit.
Keywords: open data, standards, technology
