Merken

An Introduction To BITS

Zitierlink des Filmsegments
Embed Code

Automatisierte Medienanalyse

Beta
Erkannte Entitäten
Sprachtranskript
OK I Martina um as said I work at the NCBI Bookshelf project and we're online archive for uh books and
reports we do cover the life sciences the we got just notes of 14 hundred times in our collection and it's growing nicely about 200 years maybe more hacks year
and we exclusively use the NCBI but the the the converting mostly PDFs to its and but also other XML and we do have would authoring program so we're not
but we do online publishing and we do know a little bit about some altering our self um most busy took my 1st slide a very brief background on the NCBI book text that room but that's OK I'm going to introduce the bits actually gonna run over some basics gonna talk about structure metadata and then introduce you see in the index of the men's which I think are particularly exciting on let's
talk about the abuse said books and journal articles they're really not all that different well DM some trivial to others maybe I looked around white people like the insignia book and you some quotes from you publishers and it's really interesting received seems like we were right back then in the the to it's particularly interesting to publishers who already have article XML and from a business perspective incredible conveniently estoppel knows it already you vendors may know it already you can share sufferer to its the um infrastructure um so really I don't think it's surprising looking over these quotes why the NCBI books DVDs sexually interesting people Beyond NCBI endgames gains quite widespread usage I think every jets gone conferences something about the book the
but so online DTD well I don't complain complaints out there the the most common complain we don't have indexes we can take serious metadata and of why certain it no Table of Contents why do you not have preface where the introduction of forward things like this of global complaint has been well just a mutual . Attica DTDs and they don't look at books in its own right and and at the nice process for the articles DTD offered a nice break to take another fresh milk and really young beyond books beyond the original purpose of broader review of the book model and make it useful for or publishers some basics is based
on ISO jets so you any at all the goodies that are included in the latest Meisel virginal might I support things like the specific use action you and it's not part of the nice so process the its generic it's for interchange should be an easier version forget and it's especially but not only for organizations already have article content injects the scope shouldn't come as a surprise since STM literature and scholarly professional books if you have a material code that I mean requires you to put a lot of effort in design and
formats may not be the right choice for you so cookbooks so travel guides is not of the kind of content that the BTB intends to cover so let's look at the book model here we have a book I don't know how
many of you are familiar with the NCBI build Texas said but if you are going to see some differences most noteworthy at right at the beginning you know there's collection that our 4 sets a series or anything that you define a collection to being then we have at fundamental I think is also interesting new elements their dedication forward preface all new also that the generic front part for anything else that doesn't fit the other elements you can type it um book value has changed it's not only a sequence of book parts again this is for you who are already familiar with the NCBI book Texas and that matter no includes an index and it includes book parts this is also new basically what you see here is a typical model it's quite close to other the the DT these fundamental body that matter but many new elements no book is very large so it must be possible to handle exchanges single component as well as for example the chapter and and that's came up with a new element the book part wrapper you can take basically any component chapter main book part on Appendix a fundamental particle maybe just the C um joined joined together with the book metaphor and record in a book part reprerent send it off to your archive or to your customer um previously the book part played that role would and which still exists again for those of you familiar the old model you don't have book made there anymore but to do a book apartment are fundamental body and their if you think about metadata some collection man as model quite similar to the middle it has specific spot you going to be able to put the time in editors of publications that publication date publishers in there it's from the perspective of the book it's the Metadata travels with the book so it looks at its collection pair and and it does not include all the other books siblings that maybe in the collection and I'm really excited about book Part I. D. this was a so complained in some of the quotes that I showed it was made difficult up to an hour to say up properly the video I or Pub Med idea of the chapter and for that we have the book Part I know and let's also newest pop history at the book and the chapter level I think it's an interesting and amend you now able to model events in the publication history history together with the dates on for hottest 1997 really anything that you'd like to model if you move do not want to do that you can still use only a series of debates in pop history accepted received biased but I think this could be very useful to marry and lowered some of the meaning that we find date they types these days and we have event descriptions OK we talk about the TOC in the Jordan matter maybe you want to use that and the eat um the it's not possible to Ted in front matter and that matter a single you see all might about might I haven't helices for example the main you see separate from this figures and you have to you see this as an element which will allow you to hold for example appendices the all chapters in Part 1 and and that the core is if you CNG element which essentially consists of a tighter than they are and cross-referenced enough pointer uh since we've seen all authors and lots of fuel contribute is also included but you can add paragraphs CleftedMaterial even abstracts to the TOC my um I would suggest the journal people look into adapting this for their certainty and whether you have about this from friends whether the you done over the TOC from the main book exonization here you the lastly
the index element is brand new and equally interesting all represent essentially the same situation as for the Aussies it's not it's not listed in the government in this group grouping together to separating the since the author indexed gifts for example if you have the alphabetic sections in the
index the and that the quality of the index entry so you can ask them for example Winston Churchill travel itself is like in print you can also redirect terms with the CNG element and this is
also typical in print there is additional here see also entries so we can redirect to prefer terms or you can redirect related terms of people should be excited about this there were complaints that you don't have an index now we do in the narrative docked to the narrative of the document itself you can the index terms this is what actually gets complicated on well you can you can include redirects in the text you can also include low level terms you can basically nestedness the nest the next terms and have also sort of secondary terms related terms preferred terms In the main narrative of the document you see in the examples of how you can tag of the the the anchors in the text so in next term ITT 1 is arrange it ends when they actually stop talking about church in the I it's really that I think is gonna be interesting others reviews that you have the option to just convert you print index but this offers really interesting possibilities or the for text mining and more complex operations uh 1 idea was that the next term actually should enable building index on the fly from the narrative of the documents and I under
time but but to sum up so you have a
lot of new stuff but we also definitely fast food all approved principles of based on jets its content that matters not form so you really want to have have publishers we use the article XML in mind and and it's just a draft I quickly put the Justice jets list you thereafter Norris introduction so please look at the library and please come and get a copy on the FTP sites and the questions so you take a few questions right now and will have a fraction of the I due to the park wrapper instead of reasonable part extending that's why we use part represents set of using book part while book part can be included in the body of the book so you have but part macro really an element that's the lower knows that's has to be at the top could you just make look part of root element in another document to say that papa's OK to use as a root element for the document as it was you had you then well 1 reason also was that if you do take a chapter and earlier chapter in 1 and the way we do have to say which but it belongs to so you do have that book matter everybody further quite uneasy about having a book manner in the chapter as opposed to next to a because we do not have the book the so yes so we're suggesting Chris is what we did in the old NCBI right book PTT and it works very well for some things and didn't work very well for chapters that may appear in more than 1 book and he had he want to ship them around if you want to send the book has a chapter on some the chapter is a as a as a part of more than 1 book that you have to read the book XML out of this chapter and insert a different book x amount nite just include different wrapper and and you interchanges could go an appendix for example the 1 that you have an opinion to go up in that matter you convert that to a book part of type dependence but the but matter in there you need a change the metadata title to book partner at uh so you you you have cop some operation there now you take the appendix erected in book that we at the book manner ship it off the you hi my you happened NCBI on about the indexes I assume that you can have the but overlapping indexes as well as multiple enact indexes as well as glossaries for the glass 3 references for the same terms right is designed to handle all of those you could have another have not generally indexing you can have a glossary but you can you also have multiple indexes you can have multiple in certain societies overlapping indexes to so for instance of time and a phrase all you may be out 2 parts of the index what and you could have overlapping and yes he can that's why the index term is separate from the range so the Rangers can overlap because they're actually all pointers point pointers they're not all actually all their milestones not all rappers so that they can overlap of King which makes it a little more complex but allows you to do that and what about multiple indexes you have the same you have the same to save space of input the altar repres where you do have in for line in x u have in this loop where he can move several indices here so people of subjects and you're embedding the index terms in the prose rather than having them separate you can identify which index a term or ranges for so you can have multiple indexes embedded as well as multiple indexes supported this up pre-assembled documents 1 of the things that what is 1 of the question all yeah the the thing I wanted to address was you said that this book model was for all i st and books on not only the idea I actually think it covers they if I'd characterize it as being for scholarly and technical books year so it's certainly not a model optimized for cookbooks or the TV Guide body it's for much of greater subject range than just him it's for scholarly and professional literature it's not only for instance and that if I get the impression no it's not the case you do social science it worked and the through the also 1 right I give rise had control of this criticism of alumina art in the working group we had discussions about what the scope should be and while the certain books if we put out of scope it's conceivable that you could do something like a high school textbook certainly I think a secondary art textbook that would work reasonably well although for example there is no specific you in a model are in DTD so that might be limitation but we are still very quick things like a 8 textbooks out so really if you're interested in using the DTD 1 of the things you should do is look at your content and then look at the DTD and see if it will map were not sitting hard boundaries because even more so with books and journals it's very difficult to set firm boundaries are but you tried out and art please please please give us feedback as to what you need if for example you really need a Q and a mild wet the Working Group now and send the samples of what you want you in a model because that's 1 of the things that we would can
Videospiel
Vorlesung/Konferenz
Projektive Ebene
Verkehrsinformation
Packprogramm
Computeranimation
Rechenschieber
Metadaten
Bit
Datenstruktur
Automatische Indexierung
Indexberechnung
Bildschirmsymbol
Datenstruktur
Baum <Mathematik>
Computeranimation
Tabelle <Informatik>
Mittelwert
Automatische Indexierung
Subtraktion
Motion Capturing
Prozess <Physik>
Metadaten
Element <Mathematik>
Stichprobe
DTD
DTD
Mixed Reality
Übergang
Ähnlichkeitsgeometrie
Computeranimation
Informationsmodellierung
Perspektive
Automatische Indexierung
Kontrollstruktur
Inhalt <Mathematik>
Tabelle <Informatik>
Umwandlungsenthalpie
Selbst organisierendes System
Gruppenoperation
sinc-Funktion
Datenmodell
Code
Computeranimation
Generizität
Selbst organisierendes System
Informationsmodellierung
Mereologie
Dateiformat
Vorlesung/Konferenz
Inhalt <Mathematik>
Elektronischer Programmführer
Auswahlaxiom
Mereologie
Gruppenkeim
Element <Mathematik>
Computeranimation
Videokonferenz
Übergang
Metadaten
Deskriptive Statistik
Bit
Figurierte Zahl
Metropolitan area network
Prinzip der gleichmäßigen Beschränktheit
Abstraktionsebene
Gebäude <Mathematik>
Reihe
Schraubenlinie
Übergang
Zeiger <Informatik>
Ereignishorizont
Arithmetisches Mittel
Texteditor
Gruppenkeim
Automatische Indexierung
Rechter Winkel
ROC-Kurve
Garbentheorie
Fitnessfunktion
Folge <Mathematik>
Subtraktion
Metadaten
Datensatz
Informationsmodellierung
Mailing-Liste
Perspektive
Datentyp
Wrapper <Programmierung>
Zusammenhängender Graph
Zeiger <Informatik>
Ereignishorizont
Meta-Tag
Autorisierung
Fundamentalsatz der Algebra
Einfache Genauigkeit
Indexberechnung
Nabel <Mathematik>
Packprogramm
Menge
Generizität
Mereologie
Speicherabzug
Term
Nichtlinearer Operator
Hochdruck
Gebäude <Mathematik>
Relativitätstheorie
Indexberechnung
Element <Mathematik>
Zeiger <Informatik>
Term
Nabel <Mathematik>
Quick-Sort
Computeranimation
Konfiguration <Informatik>
Übergang
Text Mining
Automatische Indexierung
Vorlesung/Konferenz
Inklusion <Mathematik>
Term
Punkt
Filetransferprotokoll
Mathematisierung
Gruppenkeim
DTD
Element <Mathematik>
Term
Raum-Zeit
Computeranimation
Metadaten
Loop
Spannweite <Stochastik>
Informationsmodellierung
Datentyp
Wrapper <Programmierung>
Stichprobenumfang
Inverser Limes
Vorlesung/Konferenz
Indexberechnung
Wurzel <Mathematik>
Inhalt <Mathematik>
Zeiger <Informatik>
Gerade
Nichtlinearer Operator
Bruchrechnung
Ein-Ausgabe
Randwert
Menge
Automatische Indexierung
Rechter Winkel
Programmschema
Mereologie
Gamecontroller
Makrobefehl
Instantiierung

Metadaten

Formale Metadaten

Titel An Introduction To BITS
Serientitel JATS-Con 2012
Teil 05
Anzahl der Teile 16
Autor Latterner, Martin
Lizenz CC-Namensnennung 3.0 Unported:
Sie dürfen das Werk bzw. den Inhalt zu jedem legalen Zweck nutzen, verändern und in unveränderter oder veränderter Form vervielfältigen, verbreiten und öffentlich zugänglich machen, sofern Sie den Namen des Autors/Rechteinhabers in der von ihm festgelegten Weise nennen.
DOI 10.5446/30573
Herausgeber River Valley TV
Erscheinungsjahr 2016
Sprache Englisch
Produktionsjahr 2012
Produktionsort Washington, D.C.

Inhaltliche Metadaten

Fachgebiet Informatik

Ähnliche Filme

Loading...