Resource Description & Access (RDA) Vocab Infrastructure

Video thumbnail (Frame 0) Video thumbnail (Frame 4373) Video thumbnail (Frame 8553) Video thumbnail (Frame 10160) Video thumbnail (Frame 10741) Video thumbnail (Frame 11412) Video thumbnail (Frame 11926) Video thumbnail (Frame 13047) Video thumbnail (Frame 13872) Video thumbnail (Frame 14748) Video thumbnail (Frame 16057) Video thumbnail (Frame 16844) Video thumbnail (Frame 18987) Video thumbnail (Frame 19828) Video thumbnail (Frame 22492) Video thumbnail (Frame 23947)
Video in TIB AV-Portal: Resource Description & Access (RDA) Vocab Infrastructure

Formal Metadata

Resource Description & Access (RDA) Vocab Infrastructure
Title of Series
CC Attribution 3.0 Unported:
You are free to use, adapt and copy, distribute and transmit the work or content in adapted or unchanged form for any legal purpose as long as the work is attributed to the author in the manner specified by the author or licensor.
Release Date

Content Metadata

Subject Area
A recording of a presentation from Diane Hillmann (Metadata Management) for the December 2017 AVSIG meeting.
Point (geometry) Windows Registry Slide rule Standard deviation Service (economics) Multiplication sign Open set Metadata Predictability Element (mathematics) Medical imaging Word Data structure Library (computing)
Aliasing Windows Registry Stapeldatei Server (computing) Service (economics) Multiplication sign Direction (geometry) Translation (relic) Open set Mereology Metadata Web 2.0 Revision control Data management Mathematics Spreadsheet Different (Kate Ryan album) Term (mathematics) Googol Repository (publishing) Software repository output Local ring User interface Axiom of choice Information Server (computing) Video tracking Web page Computer file Metadata Bit Control flow Flow separation Windows Registry Open set Template (C++) User profile Data management Spreadsheet Software Synchronization Revision control output Quicksort
Stapeldatei Server (computing) Axiom of choice Service (economics) Server (computing) Interface (computing) Web page Computer file Mereology Template (C++) Windows Registry Number Data management User profile Spreadsheet Different (Kate Ryan album) Googol Repository (publishing) Software repository Local ring
Functional (mathematics) Presentation of a group Link (knot theory) Link (knot theory) Server (computing) Multiplication sign Arrow of time Mereology Information Mereology Windows Registry
Electronic data interchange Information View (database) Projective plane Electronic mailing list Mereology Formal language Element (mathematics) Element (mathematics) Formal language Spreadsheet Googol Set (mathematics) Computer worm Software testing
Point (geometry) Moment (mathematics) Electronic mailing list Menu (computing) Electronic mailing list Formal language Element (mathematics) Element (mathematics) Category of being Duality (mathematics) Object (grammar) Set (mathematics) Referenzmodell Drum memory Library (computing)
Trail Mathematics Group action Multiplication sign Set (mathematics) Electronic mailing list Cuboid Database transaction Electronic mailing list Statement (computer science) Mereology Element (mathematics)
Point (geometry) Aliasing Windows Registry Beta function Greatest element Information Variety (linguistics) Web page Metadata Translation (relic) Price index Electronic mailing list Mereology Element (mathematics) Formal language Number Mathematics Category of being Word Different (Kate Ryan album) Set (mathematics) Cuboid Statement (computer science)
Type theory Spreadsheet Spreadsheet Information Link (knot theory) Charge carrier Googol Charge carrier Price index Mereology Formal language Address space
Point (geometry) Group action Scheduling (computing) Link (knot theory) Decision theory Translation (relic) Set (mathematics) Function (mathematics) Formal language Element (mathematics) Revision control Data management Mathematics Profil (magazine) Software repository Subject indexing Software testing Information Building Server (computing) Image registration Semantics (computer science) Translation (relic) Cartesian coordinate system Flow separation Formal language Computer configuration Error message Ontology Software repository Revision control Interface (computing) Conditional-access module Quicksort Representation (politics) Resultant
Data management Slide rule Data mining Word Arithmetic mean Context awareness
okay so we we think of you guys as the other RDA and I'm sure you feel the same way about us but this is the RDA that I've been working with it's a library standard and my business partner John Phipps and I have been working on it since 2008 although not not paid for all that time sadly so I have a couple of let's see I have some slides 60 slides like this but I also have some some images that I can't I don't really have time to go all the way through but I I thought it might be useful for people who you know to go through them later on because there's some stuff that you can't easily get at if you're not if you're not me or John but this is kind of what our goals are we're talking primarily public vocabularies at this point although we at some point I think we will be able to manage private vocabularies as well and we have a couple of services that are associated with with what we're doing it's we we cover both um you know structural vocabularies element sets whatever you want to call them everybody has a different name and also value vocabulary so Scots vocabularies as people sometimes call them so it's a it's a slightly different kind of panoply than the then then your RDA does but it's it's all absolutely necessary for what the the this RDA or our RDA needs because we're transitioning off a world of of mark and really there was no other way to do it okay so this is kind of the we call what we're doing kind of a three-legged stool and I'll show you the picture later first one is they open metadata registry and and that one you can get by if you want to look at it while I'm talking is the it's metadata registry dot org metadata registry is the whole word and that that's really the the heart of the thing and we we started working on the open metadata registry which was at that time called the NS DL registry about 2004 something like that the the OMR is
kind of the central leg if you will although as I said we've been working on it for a long time and there it shows its age in some ways mostly with the user interface which is kind of kludgy but you know it still works and we're in the background trying to rewrite it it's it's using old old software and things like that but we've kind of built on it which is fine but but we have to kind of go back and make that part work better but one of the things we started doing which when we first began working on this was we we record detailed history of all changes and it and and the provenance you know who did it when they get it etc and this it's the it has along with it a specialized vocabulary server which kind of takes a little of the burden off the old software but it and we have a lot of services that go along with that and I'll I'll get to some of those right leg two is get in github which I assume some folks are using or familiar with but that's where our version management comes in a lot of our issues tracking and the documentation that we have for the RDA registry although not not yet with with the with the open metadata registry okay so the third part is the specialized RDA vocabulary management which is the part that it really it doesn't have a user interface it's really intended to for people to use to to to read information or to access information but it doesn't have a separate user interface because the LMR is really the only one that that is important in terms of what we're doing so it does make available downloads and all different kinds of viewable data the when it says aggregated RIF data that's from a a a bit of software that we interact with which is a very very cool little cataloging thing that allows you to work sort of in isolation we're trying to get some money together to allow it to be a web tool but we're not there yet so the lexical aliases are kind of interesting because we've we're doing a lot of stuff with translations and I'll show you kind of how that works when we only take a look at some of those so that our workflow is pretty you
know kind of goes all over the place but I have a picture after this and you can see how how it relates to all these three these three legs so it we're working primarily now with not with direct input although that's still allowable in the OMR and our major users do use that but we're we're trying to make it possible to do a lot of import/export using spreadsheets of various kinds and that's been that's been what we've been working on pretty pretty much for the last year and a half or so so we only have one user that is now enabled to do that and that's Gordon dunshire who who does all the data work for for the RDA so let's see this management workflow it kind of goes all over I'll show you what
the picture so the omr and the vocabulary server are you know from from the outside sound like different things but they're actually pretty much the same thing the vocabulary service it's behind the OM are the Omar has these or interface etc and so these numbers are for those workflow things that I did that has
numbers so you can see that that there's a fairly complex interaction between
these things because they all have slightly different functions but anyway I decided not to put any any arrows in there because I thought nobody'd be able to read that but you know this is kind of for for further reference you can take a look at that and here here are
the links that are I think the most important to look at as I'm going through the rest of these but I thought I'd give you those up front so if you want to if you want to look at some of that well while I'm talking feel free it will be on the presentation so you'll be able to see it at any time okay this is this is part of our newish
stuff which has which is the the Google sheets import and this is Ellie publishing which is our major user and so you can see that that that all of this information you know when it was exported when it was last imported when it was edited within the om are all of those have separate columns so it really requires a fair amount of setup to do this but we've since we have our one major project it's been they've been doing pretty much all the testing and all the pain with the testing too this
is a view of the OM our data so you can see the element sets it manages all of the information about the history and imports and maintain errs who's allowed to get in there and work on this stuff and we have all kinds of as you can see we're working fairly heavily on the on the language the multilingual port parts of it so it looks it's a little hard to deal with but this is the
the RDA work properties RDA is based on Ferber which is now lrm the library reference model and so there's all kinds of pieces to that that that are used to describe each element so at this point you can export that data and then work on it offline and then import it back that's that's really what we're what we're working on at the moment here's a
list of exports for one property these
are just screenshots and I'll leave you to look at them you know in a better way I think the important thing is that all
of that stuff reflects the fact that we are keeping track of every transaction on these lists of vocabularies it's a huge amount of data but it's really important so here's the here's a that this little box tells you what what happens when you look at the updated on that column and it tells you that the previous action was was added so that was the first time it appeared and this is I think this is German and the status is is no published old publish so it started out as published it didn't change its status as part of this update this is this is again from directly from
the OMR and it gives you the history information about a particular property and and who did it and in the last column says import so you know you have a numbered import that you can look for to to figure out if something went wrong all of this is kind of in so that we can we can really problem-solve a lot of this stuff which is really hard to do once you get outside of the the box so this is the
RDA registry which is that that part that is they're basically read-only we don't we don't manage the data through this but we manage the the data itself you know make the downloads available allow you to get information in a variety of different language and it gives you some some of the extended information down down at the bottom so the when you look at the left column under Curie you see RDA w and then a number that's we do all of our URIs are set up like that so that you have RDA W which means work properties and then the the number allows you to have a number that is that works for every language because that's the that's the the basic URI the languages have what we call lexical aliases in other words they also have additional URIs that are in the language that they that that translation is in and they but they all point in the data back to that original that original curie okay here's a here's one from a
value of vocabulary carrier type which of course is this gospel and these are are handled pretty much the same way you can see this one's in chinese i believe and and and there's a lot of information there in in chinese although because you have all the all the information about what its reflected you can you can see it in any language you can see under the languages on the front there there's a a red box around chinese okay this is part of an
example spreadsheet and there's a link there if you want to take a look at it so again we're working very heavily
multilingual versioning for both the element sets and the valuable cam Bueller's this is proved to be a really interesting challenge because English is the primary language so everything every translation is based on the sort of the language here but the translations are done by independent folks who are working within the within the group and they don't necessarily they don't necessarily get to a translation or they don't necessarily update it at the same schedule so we're having to deal with that that there is the version of the primary language English and then there are there are versions that are inherent in the in the edition of the different languages so we have issues about when the language of the of the of the when when the information of a particular language results in a change even if the English didn't change but we always have to kind of go back link back to the to the version that was being used in the translation anyway so it's whoa this all gets pretty crazy so we're we're working on trying to figure out if there's a better way to do it and the bottom bullet says moving the each language into a separate github repo and version them independently is one of the things we're looking at we haven't quite made a decision yet so
this is this is kind of where we're intending to go what we're trying to do where we are with it and again I've the custom application profiles is is based on how how you design your data basically and and so we have application profiles within the OMR and other people can make application profiles for their data and we're starting to get to the point where we would like to be able to store them and use them for output anyway so we're on a more on the beta version of the OMR at this point but nobody is allowed to use it except us but that because you can't get people to test stuff like that unless they know what's going on so we're we're kind of at the point where we're doing a lot of testing internally and not-ready-for-prime-time okay you may
notice this is a this is one of my favorite slides but there are two cars there with the same license plate except one is New York and the other one is Ontario so that's our that's our business logo same words in a different context and they're different license plates as well John is very proud that the Ontario license plate says yours to discover you know mine says Empire State well what does that mean