Resource Description & Access (RDA) Vocab Infrastructure
Formal Metadata
Title |
Resource Description & Access (RDA) Vocab Infrastructure
|
Title of Series | |
Author |
|
License |
CC Attribution 3.0 Unported:
You are free to use, adapt and copy, distribute and transmit the work or content in adapted or unchanged form for any legal purpose as long as the work is attributed to the author in the manner specified by the author or licensor. |
Identifiers |
|
Publisher |
|
Release Date |
2018
|
Language |
English
|
Content Metadata
Subject Area | |
Abstract |
A recording of a presentation from Diane Hillmann (Metadata Management) for the December 2017 AVSIG meeting.
|

00:00
Point (geometry)
Windows Registry
Slide rule
Standard deviation
Service (economics)
Multiplication sign
Open set
Metadata
Predictability
Element (mathematics)
Medical imaging
Word
Data structure
Library (computing)
02:55
Aliasing
Windows Registry
Stapeldatei
Server (computing)
Service (economics)
Multiplication sign
Direction (geometry)
Translation (relic)
Open set
Mereology
Metadata
Web 2.0
Revision control
Data management
Mathematics
Spreadsheet
Different (Kate Ryan album)
Term (mathematics)
Googol
Repository (publishing)
Software repository
output
Local ring
User interface
Axiom of choice
Information
Server (computing)
Video tracking
Web page
Computer file
Metadata
Bit
Control flow
Flow separation
Windows Registry
Open set
Template (C++)
User profile
Data management
Spreadsheet
Software
Synchronization
Revision control
output
Quicksort
06:46
Stapeldatei
Server (computing)
Axiom of choice
Service (economics)
Server (computing)
Interface (computing)
Web page
Computer file
Mereology
Template (C++)
Windows Registry
Number
Data management
User profile
Spreadsheet
Different (Kate Ryan album)
Googol
Repository (publishing)
Software repository
Local ring
07:15
Functional (mathematics)
Presentation of a group
Link (knot theory)
Link (knot theory)
Server (computing)
Multiplication sign
Arrow of time
Mereology
Information
Mereology
Windows Registry
07:57
Electronic data interchange
Information
View (database)
Projective plane
Electronic mailing list
Mereology
Formal language
Element (mathematics)
Element (mathematics)
Formal language
Spreadsheet
Googol
Set (mathematics)
Computer worm
Software testing
09:15
Point (geometry)
Moment (mathematics)
Electronic mailing list
Menu (computing)
Electronic mailing list
Formal language
Element (mathematics)
Element (mathematics)
Category of being
Duality (mathematics)
Object (grammar)
Set (mathematics)
Referenzmodell
Drum memory
Library (computing)
09:56
Trail
Mathematics
Group action
Multiplication sign
Set (mathematics)
Electronic mailing list
Cuboid
Database transaction
Electronic mailing list
Statement (computer science)
Mereology
Element (mathematics)
10:42
Point (geometry)
Aliasing
Windows Registry
Beta function
Greatest element
Information
Variety (linguistics)
Web page
Metadata
Translation (relic)
Price index
Electronic mailing list
Mereology
Element (mathematics)
Formal language
Number
Mathematics
Category of being
Word
Different (Kate Ryan album)
Set (mathematics)
Cuboid
Statement (computer science)
12:39
Type theory
Spreadsheet
Spreadsheet
Information
Link (knot theory)
Charge carrier
Googol
Charge carrier
Price index
Mereology
Formal language
Address space
13:18
Point (geometry)
Group action
Scheduling (computing)
Link (knot theory)
Decision theory
Translation (relic)
Set (mathematics)
Function (mathematics)
Formal language
Element (mathematics)
Revision control
Data management
Mathematics
Profil (magazine)
Software repository
Subject indexing
Software testing
Information
Building
Server (computing)
Image registration
Semantics (computer science)
Translation (relic)
Cartesian coordinate system
Flow separation
Formal language
Computer configuration
Error message
Ontology
Software repository
Revision control
Interface (computing)
Conditional-access module
Quicksort
Representation (politics)
Resultant
15:58
Data management
Slide rule
Data mining
Word
Arithmetic mean
Context awareness
00:00
okay so we we think of you guys as the other RDA and I'm sure you feel the same way about us but this is the RDA that I've been working with it's a library standard and my business partner John Phipps and I have been working on it since 2008 although not not paid for all that time sadly so I have a couple of let's see I have some slides 60 slides like this but I also have some some images that I can't I don't really have time to go all the way through but I I thought it might be useful for people who you know to go through them later on because there's some stuff that you can't easily get at if you're not if you're not me or John but this is kind of what our goals are we're talking primarily public vocabularies at this point although we at some point I think we will be able to manage private vocabularies as well and we have a couple of services that are associated with with what we're doing it's we we cover both um you know structural vocabularies element sets whatever you want to call them everybody has a different name and also value vocabulary so Scots vocabularies as people sometimes call them so it's a it's a slightly different kind of panoply than the then then your RDA does but it's it's all absolutely necessary for what the the this RDA or our RDA needs because we're transitioning off a world of of mark and really there was no other way to do it okay so this is kind of the we call what we're doing kind of a three-legged stool and I'll show you the picture later first one is they open metadata registry and and that one you can get by if you want to look at it while I'm talking is the it's metadata registry dot org metadata registry is the whole word and that that's really the the heart of the thing and we we started working on the open metadata registry which was at that time called the NS DL registry about 2004 something like that the the OMR is
02:57
kind of the central leg if you will although as I said we've been working on it for a long time and there it shows its age in some ways mostly with the user interface which is kind of kludgy but you know it still works and we're in the background trying to rewrite it it's it's using old old software and things like that but we've kind of built on it which is fine but but we have to kind of go back and make that part work better but one of the things we started doing which when we first began working on this was we we record detailed history of all changes and it and and the provenance you know who did it when they get it etc and this it's the it has along with it a specialized vocabulary server which kind of takes a little of the burden off the old software but it and we have a lot of services that go along with that and I'll I'll get to some of those right leg two is get in github which I assume some folks are using or familiar with but that's where our version management comes in a lot of our issues tracking and the documentation that we have for the RDA registry although not not yet with with the with the open metadata registry okay so the third part is the specialized RDA vocabulary management which is the part that it really it doesn't have a user interface it's really intended to for people to use to to to read information or to access information but it doesn't have a separate user interface because the LMR is really the only one that that is important in terms of what we're doing so it does make available downloads and all different kinds of viewable data the when it says aggregated RIF data that's from a a a bit of software that we interact with which is a very very cool little cataloging thing that allows you to work sort of in isolation we're trying to get some money together to allow it to be a web tool but we're not there yet so the lexical aliases are kind of interesting because we've we're doing a lot of stuff with translations and I'll show you kind of how that works when we only take a look at some of those so that our workflow is pretty you
05:45
know kind of goes all over the place but I have a picture after this and you can see how how it relates to all these three these three legs so it we're working primarily now with not with direct input although that's still allowable in the OMR and our major users do use that but we're we're trying to make it possible to do a lot of import/export using spreadsheets of various kinds and that's been that's been what we've been working on pretty pretty much for the last year and a half or so so we only have one user that is now enabled to do that and that's Gordon dunshire who who does all the data work for for the RDA so let's see this management workflow it kind of goes all over I'll show you what
06:47
the picture so the omr and the vocabulary server are you know from from the outside sound like different things but they're actually pretty much the same thing the vocabulary service it's behind the OM are the Omar has these or interface etc and so these numbers are for those workflow things that I did that has
07:12
numbers so you can see that that there's a fairly complex interaction between
07:17
these things because they all have slightly different functions but anyway I decided not to put any any arrows in there because I thought nobody'd be able to read that but you know this is kind of for for further reference you can take a look at that and here here are
07:39
the links that are I think the most important to look at as I'm going through the rest of these but I thought I'd give you those up front so if you want to if you want to look at some of that well while I'm talking feel free it will be on the presentation so you'll be able to see it at any time okay this is this is part of our newish
08:01
stuff which has which is the the Google sheets import and this is Ellie publishing which is our major user and so you can see that that that all of this information you know when it was exported when it was last imported when it was edited within the om are all of those have separate columns so it really requires a fair amount of setup to do this but we've since we have our one major project it's been they've been doing pretty much all the testing and all the pain with the testing too this
08:44
is a view of the OM our data so you can see the element sets it manages all of the information about the history and imports and maintain errs who's allowed to get in there and work on this stuff and we have all kinds of as you can see we're working fairly heavily on the on the language the multilingual port parts of it so it looks it's a little hard to deal with but this is the
09:17
the RDA work properties RDA is based on Ferber which is now lrm the library reference model and so there's all kinds of pieces to that that that are used to describe each element so at this point you can export that data and then work on it offline and then import it back that's that's really what we're what we're working on at the moment here's a
09:51
list of exports for one property these
09:56
are just screenshots and I'll leave you to look at them you know in a better way I think the important thing is that all
10:02
of that stuff reflects the fact that we are keeping track of every transaction on these lists of vocabularies it's a huge amount of data but it's really important so here's the here's a that this little box tells you what what happens when you look at the updated on that column and it tells you that the previous action was was added so that was the first time it appeared and this is I think this is German and the status is is no published old publish so it started out as published it didn't change its status as part of this update this is this is again from directly from
10:47
the OMR and it gives you the history information about a particular property and and who did it and in the last column says import so you know you have a numbered import that you can look for to to figure out if something went wrong all of this is kind of in so that we can we can really problem-solve a lot of this stuff which is really hard to do once you get outside of the the box so this is the
11:20
RDA registry which is that that part that is they're basically read-only we don't we don't manage the data through this but we manage the the data itself you know make the downloads available allow you to get information in a variety of different language and it gives you some some of the extended information down down at the bottom so the when you look at the left column under Curie you see RDA w and then a number that's we do all of our URIs are set up like that so that you have RDA W which means work properties and then the the number allows you to have a number that is that works for every language because that's the that's the the basic URI the languages have what we call lexical aliases in other words they also have additional URIs that are in the language that they that that translation is in and they but they all point in the data back to that original that original curie okay here's a here's one from a
12:42
value of vocabulary carrier type which of course is this gospel and these are are handled pretty much the same way you can see this one's in chinese i believe and and and there's a lot of information there in in chinese although because you have all the all the information about what its reflected you can you can see it in any language you can see under the languages on the front there there's a a red box around chinese okay this is part of an
13:15
example spreadsheet and there's a link there if you want to take a look at it so again we're working very heavily
13:21
multilingual versioning for both the element sets and the valuable cam Bueller's this is proved to be a really interesting challenge because English is the primary language so everything every translation is based on the sort of the language here but the translations are done by independent folks who are working within the within the group and they don't necessarily they don't necessarily get to a translation or they don't necessarily update it at the same schedule so we're having to deal with that that there is the version of the primary language English and then there are there are versions that are inherent in the in the edition of the different languages so we have issues about when the language of the of the of the when when the information of a particular language results in a change even if the English didn't change but we always have to kind of go back link back to the to the version that was being used in the translation anyway so it's whoa this all gets pretty crazy so we're we're working on trying to figure out if there's a better way to do it and the bottom bullet says moving the each language into a separate github repo and version them independently is one of the things we're looking at we haven't quite made a decision yet so
15:01
this is this is kind of where we're intending to go what we're trying to do where we are with it and again I've the custom application profiles is is based on how how you design your data basically and and so we have application profiles within the OMR and other people can make application profiles for their data and we're starting to get to the point where we would like to be able to store them and use them for output anyway so we're on a more on the beta version of the OMR at this point but nobody is allowed to use it except us but that because you can't get people to test stuff like that unless they know what's going on so we're we're kind of at the point where we're doing a lot of testing internally and not-ready-for-prime-time okay you may
15:59
notice this is a this is one of my favorite slides but there are two cars there with the same license plate except one is New York and the other one is Ontario so that's our that's our business logo same words in a different context and they're different license plates as well John is very proud that the Ontario license plate says yours to discover you know mine says Empire State well what does that mean
