A New Zealand Case Study: Open Source, Open Standards, Open Data
This is a modal window.
The media could not be loaded, either because the server or network failed or because the format is not supported.
Formal Metadata
Title |
| |
Title of Series | ||
Number of Parts | 95 | |
Author | ||
License | CC Attribution - NonCommercial - ShareAlike 3.0 Unported: You are free to use, adapt and copy, distribute and transmit the work or content in adapted or unchanged form for any legal and non-commercial purpose as long as the work is attributed to the author in the manner specified by the author or licensor and the work or content is shared also in adapted form only under the conditions of this | |
Identifiers | 10.5446/15505 (DOI) | |
Publisher | ||
Release Date | ||
Language | ||
Production Place | Nottingham |
Content Metadata
Subject Area | |
Genre |
00:00
Wave packetProgrammer (hardware)InformationIntegrated development environmentOpen sourceMetropolitan area networkEmulationWide area networkArithmetic meanService (economics)Open sourceInformation managementStandard deviationIntegrated development environmentSoftwareSoftware frameworkPresentation of a groupStrategy gameNumberCartesian coordinate systemHypermediaWater vaporIdentity managementPower (physics)Annihilator (ring theory)Disk read-and-write headMathematicsMultiplication signStaff (military)MultiplicationXML
01:43
Content (media)Open sourceWater vaporArtificial neural networkMetropolitan area networkDomain nameLocal ringPower (physics)Data transmissionInternet service providerIntegrated development environmentClient (computing)Standard deviationOpen setInformationCASE <Informatik>Open sourceIntegrated development environmentObservational studyDependent and independent variablesAreaSoftwareEvent horizonPhysical lawMusical ensembleWindowComputer configurationQuicksortFamilyOperator (mathematics)Computer animation
03:11
Server (computing)GeometryOpen sourceLevel (video gaming)Computer networkTomcat <Programm>Open setOpen sourceIntegrated development environmentWhiteboardVisualizationProgram flowchart
03:36
Magnetic stripe cardWorld Wide Web ConsortiumGrass (card game)Physical systemSanitary sewerDatabaseGeometryComputer networkMeta elementServer (computing)Level (video gaming)Computer-generated imageryOpen sourceQuery languageMUDSicMass flow rateStack (abstract data type)Service (economics)DisintegrationWeb serviceInformationWitt algebraArchaeological field surveyCopyright infringementBoss CorporationDigital photographyOpen sourceSoftware developerDomain nameSoftwareInformationService (economics)Integrated development environmentStandard deviationWeb serviceVisualization (computer graphics)Range (statistics)Latent heatClient (computing)INTEGRALArchaeological field surveyObservational studyCASE <Informatik>Library catalogQuicksortMetadataFunctional (mathematics)State observerWaveGeometryGame theoryVideo gameWritingFreezingRight angleMathematical analysisData managementDemosceneState of matterClosed setSymbol tableNumbering schemeComputer animation
06:36
System programmingArchaeological field surveySatelliteSanitary sewerBackupComputerServer (computing)Solitary confinementNP-hardCovering spaceElectronic program guideDatabaseScripting languageSet (mathematics)SoftwareComputer networkInternetworkingOpen sourceError messageLevel (video gaming)Variety (linguistics)Database normalizationEmailCuboidArchaeological field surveySatelliteCovering spaceServer (computing)DatabaseIntegrated development environmentEntire functionLibrary (computing)AreaGroup actionOrder (biology)Computer animation
07:50
Scripting languageLengthWeightFrequencySpeciesDigital photographyWeightMappingFrequencyLengthSpeciesGeneric programmingLine (geometry)Electric generatorScripting languageVisualization (computer graphics)ResultantPhysical systemWebsiteMultiplication signClosed setComputer animation
08:45
Workstation <Musikinstrument>Archaeological field surveyAreaTrailRaster graphicsVideoconferencingWhiteboardGreatest elementWater vaporMIDIGeometryDatabaseScripting languageOpen sourceSoftware developerPresentation of a groupOpen setLevel (video gaming)Service (economics)Client (computing)Conservation lawPlanningReading (process)View (database)State observerSoftware as a serviceDomain nameTime seriesDatabaseClient (computing)Conservation lawData structureAreaGraph (mathematics)Line (geometry)Complete metric spaceOpen sourceRange (statistics)QuicksortDefault (computer science)Operating systemService (economics)InformationOpen setWeb 2.0Web portalSoftware developerPresentation of a groupDecision theoryGoodness of fitDirection (geometry)Core dumpRight anglePoint (geometry)Focus (optics)Series (mathematics)Physical systemElectronic program guidePhysical lawDemosceneMessage passingVideo gameRevision controlConfidence intervalOcean currentWebsiteWorkstation <Musikinstrument>Bit rateRAIDComputer animation
12:26
MalwareCache (computing)Electronic mailing listUniform resource locatorBookmark (World Wide Web)WebsiteService (economics)Level (video gaming)Revision controlOpen sourceSource codeDirectory serviceBookmark (World Wide Web)InformationWebsiteWeb serviceUniform resource locatorMedical imagingElectronic mailing listPlug-in (computing)Machine visionVideo gameBit rateService (economics)Network topologyComputer animation
13:20
Web pageOpen sourceService (economics)Web serviceAddress spacePlug-in (computing)Range (statistics)Web pageService (economics)Universe (mathematics)Bit rateRight angleElectronic mailing listFamily
13:42
Archaeological field surveyPoint (geometry)Random numberOpen sourceMetropolitan area networkWeb portalWorld Wide Web ConsortiumTexture mappingLinear multistep methodClient (computing)Magnetic stripe cardOpen setMaxima and minimaArmBlack boxWeb serviceData managementArchaeological field surveyWeb pageOpen sourceSystem administratorConnected spaceMappingWebsiteResultantElectronic mailing listDialectWeb 2.0Source codeService (economics)Group actionPoint (geometry)AreaQuicksortMereologyGame controllerCASE <Informatik>Level (video gaming)Canadian Mathematical SocietyShape (magazine)Open setInformationSymbol tableVector spaceGraph coloringMetadataSoftware developerInformation retrievalSoftware testingSet (mathematics)AuthorizationLibrary catalogWordReliefMatching (graph theory)Metropolitan area networkPlanningInheritance (object-oriented programming)Musical ensembleModule (mathematics)Numbering schemeFigurate numberOrder (biology)Object (grammar)Rule of inferenceState of matterElectronic program guideRight angleBit rateFood energyVideo game consoleMathematicsComputer animation
17:13
Magnetic stripe cardClient (computing)Open sourceLevel (video gaming)Level (video gaming)Interface (computing)Software developerDigital photographyDatabaseData managementWebsiteUniform resource locatorProduct (business)Point (geometry)Web 2.0Workstation <Musikinstrument>DigitizingMedical imagingRegulator geneRight angleLine (geometry)Associative propertyFilm editingComputer animation
18:16
Library catalogInterface (computing)Level (video gaming)Uniform resource nameClient (computing)Open sourceArchaeological field surveyOpen setStandard deviationContent management systemContent (media)Rule of inferenceRight angleArtificial neural networkOpen sourceIntegrated development environmentConservation lawClient (computing)Self-organizationInterface (computing)Library catalogArchaeological field surveyResultantWebsiteMereologyMetadataCASE <Informatik>Presentation of a groupScaling (geometry)Group actionComputer animation
19:16
Execution unitMaxima and minimaMetropolitan area networkHill differential equationOpen sourceLemma (mathematics)Schmelze <Betrieb>3 (number)Wechselseitige InformationBus (computing)Network operating systemArtificial neural networkSummierbarkeitPersonal area networkConvex hullLevel (video gaming)Open setClient (computing)Point (geometry)SpeciesInterior (topology)World Wide Web ConsortiumWeb portalTexture mappingWeb browserService (economics)Magnetic stripe cardDatabaseWeb pageDistribution (mathematics)Open sourceArchaeological field surveySet (mathematics)DatabaseMultiplicationWebsiteSpeciesWeb 2.0Service (economics)Point (geometry)InformationEndliche ModelltheorieLink (knot theory)Level (video gaming)SpacetimeInformation securityComputer animation
21:03
Physical systemDatabaseAttribute grammarData managementWorld Wide Web ConsortiumTraffic reportingFunction (mathematics)Electronic program guideContent (media)Link (knot theory)Computer wormOpen sourceWeb pagePersonal area networkNumerical taxonomyTable (information)Network topologySpeciesText editorService (economics)Computer-generated imageryTranslation memoryMountain passSession Initiation ProtocolWide area networkSpecial unitary groupProbability density functionTemplate (C++)Interior (topology)Metropolitan area networkEmulationMoving averageRaw image formatLogarithmSupremumFamilyUniform resource nameValue-added networkPhysical lawEuler anglesGamma functionComputer clusterVarianceSpacetimeIntegrated development environmentData modelAerodynamicsProcess (computing)Endliche ModelltheorieProcess modelingComputing platformSoftwareUtility softwareComputer fileAsynchronous Transfer ModeInformationHierarchyNetwork topologySpeciesShape (magazine)IdentifiabilityPhysical systemDigital photographyRing (mathematics)Form (programming)Attribute grammarProcess (computing)Core dumpDatabaseEndliche ModelltheorieContent (media)Function (mathematics)CoprocessorElectronic mailing listGame theoryWhiteboardMathematicsAreaOrder (biology)Selectivity (electronic)View (database)Distribution (mathematics)SpacetimeComputer animation
22:57
Open sourceFile viewerComputer-generated imageryMathematical analysisMotion captureData managementPhysical systemElectronic mailing listHierarchySpeciesSoftware repositoryView (database)DatabaseOpen sourceSpacetimeDigital photographyComputer animation
23:22
Visualization (computer graphics)FingerprintGrass (card game)Scripting languageProduct (business)DatabaseOpen sourceSoftware developerWechselseitige InformationMoment (mathematics)Open sourceDatabaseMultiplication signDisk read-and-write headVisualization (computer graphics)Staff (military)Computer animation
Transcript: English(auto-generated)
00:00
from University of Oxford, and I will be sharing this session for you. So first of all, very welcome to our session. This is session number 13. So we will have three different presentations in this session. Each talk will last about 20 minutes, followed by five minute question session.
00:26
And can I just remind you, this session is being recorded, and you switch off your mobile phones or send them to vibrate. And I suppose this session will be delayed for about 10 minutes.
00:51
Our first talk will be given by Mr Brent Wood, who is from NIWA, which stands for the National Institute for Water and Atmospheric Research in New Zealand.
01:06
He will show us the role that open source software and standards play in NIWA's strategies and the environmental information management framework.
01:24
Hopefully you can all hear me. I can just speak fast, so if you want to hear what I'm saying, I'm going too fast, slow me down. I'm assuming everyone here doesn't know what NIWA is. New Zealand is a long way away, and it's not a big multinational.
01:43
So I'll be starting with a quick overview of NIWA, what it is, what it does. An overview of what we do with open source, how we use open source, a brief digression into an open source case study within the case study, and then look at some of the things that NIWA is doing itself in the open source arena.
02:06
So NIWA is the National Institute for Water and Atmospheric Research. I don't know what happened to the R, but it's never been there. It's a government agency, and it's called the Crown Research Institute. It is owned by the government but operates as a commercial business
02:22
having to find its own funding and do its own work. The quality is not wonderful, but it carries out environmental research in atmosphere, climate, fisheries, oceans, aquatic areas in New Zealand. It's the lead agency for doing that work.
02:44
We do that research for government agencies, utilities, industry. We also have some international roles and responsibilities in the areas like climate science for New Zealand.
03:00
And all of that means that open standard interoperability are absolutely critical to NIWA performing and doing what it needs to do. This is about FOSS4G. One of the comments I'm making here is that the 4G side can only really flourish
03:22
in a place where FOSS itself flourishes. If you don't have an environment that supports open source, open source PIS tools themselves will be harder to sort of implement, keep running, get supported in general. So there won't be a quiz. You don't need to read all that.
03:42
But it gives you an idea of the range of open source tools used throughout NIWA, both infrastructurally, software development, GIS, data management, a wide range of tools. So it's a fairly FOSS friendly place to work. I like that.
04:03
We also have some of our open source tools. We have supported existing open source tools. We funded development in QGIS, SILC, CASL, JIRIS, SPM, myCAMS, CLIDESK are all NIWA initiatives.
04:21
They are open source tools in their own specific domains in climate, fisheries, wave action, stock monitoring, photo analysis, and climate visualization tools that we provide, we write, and they're all open source tools from NIWA. We funded some developments in QGIS-GMT, Postgres Studio,
04:46
our GIR network. It wasn't altruistic. It was to help those tools provide functionality that we had a need for. So it was quite pragmatic. Our environment is deliberately very OGC centric.
05:02
We're using OGC standards to support our interoperability capabilities. We're developing an information services stack, which is based around the OGC web services. We're currently working on some integration with the Kivri services to help interoperability, particularly in the central observation service arena.
05:26
Does anyone know what these acronyms all mean? Or do I need to explain some of them? Dead silence doesn't tell me much. We're looking at sort of, because we're using that platform,
05:41
that framework, we're also trying to improve the way that metadata can be harvested from web services to make them self-documenting, and that the catalogs that we build from harvesting those can be used by clients to drill back to the original data sources, if that makes sense.
06:01
So I did say we have a digression of a case study within this case study. I think it's a nice little story, so you're stuck with it. We went to Oman for a fishery survey off the Arabian coast. That's where we went. It was divided into depth-based strata, as well as latitudinal strata.
06:30
That's our boss collecting souvenirs, and a very happy Omani. All the gear we took on that survey was planned months, years in advance.
06:43
It was very rigorous. We put it all together, shipped it to Oman in a container, and Murphy took over. We were using a couple of small shuttles as our Linux boxes. We had two redundant Linux servers that had all the database tools, all the satellite comms tools for email from the boat, et cetera,
07:03
and they were both dead on arrival when we opened up the container. Flying on route, I boarded a Linux magazine in Dubai airport. It had a Fedora DVD on the cover. The entire survey was run, from then on, from those servers,
07:22
rebuilt from that DVD. You try doing that with commercial software. It was very open-source based. The entire survey we carried out there was using open-source tools. I like mentioning libraries that sit behind the tools,
07:42
because they tend to be forgotten, and I think they deserve an awful lot of credit for the casualties they provide throughout the cost GIS environment. We had GMT scripts, so you could simply, for all the species we caught, you could run the script, access the database, and generate length frequency plots, length weight plots, distribution, catch plots, et cetera.
08:06
We catch a thousand species, so we could generate a thousand maps in two minutes, running one script. What's GMT? It's an open-source tool called Generic Mapping Tools. Thanks. I did suggest when generic, called Greenwich Mean Time, became UTC,
08:24
that they renamed Generic Mapping Tools to Universal Tools for Cartography, but they didn't do it. As you can see in the bottom graph, on the bottom line there, you can see outliers, so the ability of tools like this not only let you visualize the data, but that visualization also allows you to error-check the data.
08:48
QGIS had advanced to 0.8 at that stage, and that was our default desktop GIS tool, all the data going into PostGIS. We carried out a wide range of sampling techniques, methods,
09:03
gear being deployed, et cetera. All of that went into the PostGIS database. All of it was sort of managed quite successfully there. And because it was all open-source, we could leave the Omanis in their recent center with a complete operating system of the working database.
09:20
All the scripts, all the tools were theirs at no cost, no license issues, worked perfectly. So again, from that particular one, all the open-source developers here have contributed. Thank you for your contribution. And back to the main presentation.
09:41
In 2011, the New Zealand government issued a directive on open and transparent government that required or directed all core government agencies to make publicly funded data available to the public. It encouraged non-core agencies such as councils and CRIs to do the same.
10:05
And one of the decisions NIWA made, or I made on behalf of NIWA, was that we can release all our spatial data, set up web services, allow people to download shapefiles, and if they can't afford an ESRI license or a MapInfo license, releasing the data was not particularly useful
10:23
to an awful lot of the people we wanted to benefit from it. The solution we came up with there was to take QGIS and have it customized to facilitate access to the data we were providing.
10:40
And we contracted Sourcepol to do that, and we have, I think, a good working relationship with Sourcepol ongoing. Doing more customization. So rather than building lots of portals to sort of allow people to look at the data that's particular to domain in that portal, this tool now allows people to look at all NIWA data,
11:02
plus their own data, plus other agencies who are using OGC services in the same way. So it's a much more generic tool that allows people to make much better use of the open data that's being provided. So it's hopefully a very empowering approach to information delivery.
11:23
We're working on a SaaS client, so as well as the spatial environmental data, we will be using SaaS, it's a sensor observation service from OGC, to deliver time series data. We have SaaS services built on our hydrometric and climate databases.
11:42
The SaaS client in QGIS will allow people to see the various stations, click on a station, see what readings are available, select the interval for which they want those readings, view a graph of them, and download them. So we're moving away from the straight GIS arena with spatial data,
12:02
into time series data as well. We've had a lot of success with structured conservation planning, for managing marine protected areas in Antarctica, with the toothfish fishery, and we're working again with Sourcepol on a generic tool, for that capability being built into QGIS.
12:30
So apologies for the image quality, but essentially the NIWA plug-in here, parvests a list of OGC web services from the NIWA website. We can add NIWA and other agencies' URLs to that website.
12:44
This tool, the clicker button, will harvest those, provide you a directory tree-like structure, that allows you to click on a service, see what the layers are, add those layers to a favourites list, and you then don't have to reconnect, reload that information until you want to.
13:01
It also allows you to, for both WFS and WMS layers, once you've opened them, you can cache them locally, and use them offline. When you go back online, you can continue using the cached version, or you can refresh from the online source.
13:22
That is pretty hard to read, but that's the NIWA web page that currently has the services that are available, from a range of agencies, district councils, city councils, CRIs, universities. A wide range of agencies are now providing data using that approach.
13:44
One of the other tools we've built is a survey tool, that's being endorsed by the Ministry for Fisheries, as a survey design tool. One of the interesting things with coastal surveys, is trying to propagate statistically robust transacts, in a sensible direction, off a convoluted coastline.
14:02
We think we've got a good solution for that. We're doing some testing of the robustness of that algorithm. The idea in particular, in New Zealand, along the coast, is that Maori, the indigenous people, now have rights, customary rights over that. They also have management obligations as part of that.
14:22
This is a free tool. We've received funding from Maori groups to help develop this, so they can now run statistically robust surveys, for the coastal areas they are responsible for, and contribute to the management of those resources, regions.
14:40
And, as I've told, source pollens have been here. We do have other things in the pipeline coming as well. From a web mapping perspective, we undertook a survey, Ocean Survey 2020, as in the top website. They funded a two year program, looking off the northeast coast of New Zealand.
15:02
One of the contractual obligations from that project, was to make all the results available to the public. And, we went for a totally open source solution. We co-funded the development of an open layers administration module,
15:22
for the Silver Stripe CMS. The CMS now has a tool in it, that allows you to set up a connection to OGC web services. And, because it's within the CMS, we can have method pages that describe,
15:42
I can't show you live here, but we can have method pages that describe each of the sets of gear that are deployed. We can click on the method name in the map legend, and retrieve that information. So, we can, we set up a map layer, into some keywords.
16:03
From the map, we can instigate a search on the metadata catalog, that uses those keywords to find all the relevant entries in the catalog, that pertain to the layer on the map. So, we're able to access reference information in the CMS,
16:21
we're able to access metadata searches, all from a map centric approach. The map in this case has data coming from PostGIS, and from Esri. It's all OGC service compliant, the data source is abstracted, so it's quite portable. And, we also have some input,
16:41
where people want to be able to change the symbology for the layers, vector layers of points on the map. You can see in the middle little black box there, there's a simple pick list that allows people to use, choose the color and shape to use for each symbol on the map.
17:01
And, that's fairly unusual. Most web mapping tools do not allow users any control of symbology. That's fairly rudimentary, but it met the sort of user requirements. We collected over 20,000 CVED photos. Those photos are managed in a commercial product called Atlas,
17:21
AtlasMD, it's a digital asset management tool. We have paid the developers of that tool, to migrate it to PostGIS, as the underlying database. That means we can associate a location as a point with every image, and have a map interface so that all the images in the database,
17:41
can be retrieved via a map. We've also funded a web service, which is used by this tool, so by clicking on the point, that represents that station on the map, you can browse through the carousel here, and go through the photos from each site. The developers at Silverstripe thoroughly enjoyed this.
18:03
20,000 photos, and after about a week of them working on this tool, I was told, did I know there are three photos with bottles in them? So between them, they'd gone through all 20,000 photos. As I mentioned, there's a metadata catalog integrated with this.
18:22
Every report, every presentation, every data set, captured, derived, part of the survey is in the catalog. It's a geonetwork catalog, and the website has a simple CSW client as a front end,
18:40
so users aren't faced with a complex catalog. A very simple interface that allows them to browse it, or search it. Results will come back, and this is one of the main reasons it's used. Councils want data. Environmental conservation organizations want data. This is where most of them are getting it from.
19:03
So the whole tool itself is sitting on top of Postgres, maps, OpenLayers, Silverstripe, Geonetwork, totally open source. And a few months after we put it together,
19:20
we were asked by another government agency who were quoted what I thought was an extravagant sum, to spec out a website for New Zealand marine biosecurity data. For less than the price they were quoted for specing it, we could take our open source solution from the Bay of Islands survey
19:42
and wrap this into it. It is now the New Zealand marine biosecurity portal. Similar tools available. Note it works across 180. If you work with GIS tools, 180 is the biggest pain you will ever face.
20:02
Because this involves a lot of work with individual invasive species, we now have a species picker tool. And that tool allows you to select a species from a pick list. You can see the occurrences of that species. There's also a WMS service that shows you the modelled distribution,
20:23
or predicted distribution of that species. Racing through it. EI browser, because we're using web services, we can set up a single facility that allows people to go to that one point and access and search multiple databases across NIWA.
20:45
So we have bio data, we have climate data, we have hydrometric data, we have water quality data. The idea is this one tool allows them to enter a set of keywords and search across all those databases for relevant information.
21:05
Linked to the system, but not strictly GIS related, we have taxonomic data. We have a taxonomic reference system, which is your core taxonomic hierarchy. Linked to that, we have attribute data.
21:21
And that's used by a publishing and reporting system. Here we have some species data, which inherently has nothing spatial about it whatsoever. It's simply a species in a taxonomic hierarchy. There's the hierarchy in a tabular form.
21:42
There's a tree form. We're in the process of linking this information to our bio data distribution, the GIS, the biodiversity information, so that all the taxonomic and the species information is actually linked inherently with the spatial data.
22:05
Here's an example of species data. As I said, PRS takes that and generates output. This output has some photos and some text. And it's for those who know what oatlets are. They're the ear bones of fish. They're used for both aging fish
22:20
because they have rings like a tree. The shape is also diagnostic and in middens for archaeological research and in stomach contents of birds, fish, marine mammals, they're used to identify prey species. So we can now have a fat-sheep generator and potentially a diagnostic tool to enable it to happen. Similarly, we have freshwater pest species.
22:43
The data is in the database. We can do this and access that information. In-house tools, Niiwa's got the spatial population model that we developed in-house and has been used globally now for people working in the fisheries arena. MyCAMS is used for analysing seabed photos,
23:02
what species are we seeing, what substrates are we seeing. It uses ImageJ, an open source tool from the US National Institute of Health and it uses PostGIS as the in-line data repository. That means that you can open up in QGIS, the in-line database and view where the species are being observed.
23:24
ClidESK is for climate data visualisation. That is a tool sitting on top of a Clid database developed by the Bureau of Meteorology in Australia. This is being deployed across the Pacific at the moment. So, that's where I'm at.
23:40
I'd like to acknowledge the open source developers, the community that gives us these tools to do all these fun stuff and the people who provided the funding to keep us going.
24:05
I'm afraid we are a little bit late for the time. May I ask you to leave your question to the end of this whole session and we will ask the speakers to answer your questions altogether, please. So, we will move to our next talk.