
Implementing XML for Japanese-language Scholarly Articles


Speech transcript (automated media analysis, beta)
Thank you very much. As mentioned, I am going to talk about how the Japanese language is different from European languages, why that is the reason we need JATS, and how we wanted it to be. I will also introduce the work, which actually started with an earlier version [unclear] that was not mentioned earlier, and how we are working with JATS 0.4 to produce Japanese scholarly journals. Today I am representing the XML Scholarly Publishing Association, which was founded just this June to promote XML for scholarly publishing in Japan. This is a new organization, and we expect to bring together publishers and any other stakeholders who want to use XML for scholarly publishing.
Let me talk a bit about the history. There were attempts to apply SGML to scholarly publishing as early as the 1990s, and those attempts were not very successful; they stopped, and the files are not in use any more. There were attempts to work with Japanese [unclear], and XML was also tried, but it was very experimental.
I have to talk about J-STAGE, which you may not be familiar with. It is an electronic journal platform of JST; JST stands for Japan Science and Technology Agency [unclear]. J-STAGE was launched in 1999, and I was personally involved in that launch; at that time I was working for JST. JST developed [unclear: apparently its own DTD], and there was a workflow that used FrameMaker+SGML to produce SGML scholarly articles. There were also some investigations of XML, but it was too early, for many reasons. As I will tell you, JATS 0.4 is now implemented at the new version of J-STAGE, which was launched this May, and I will talk more about that later.
This [slide shows] Japanese, Korean and Chinese [unclear]. They may look similar, but the languages are all different from each other. This is Chinese, [this one is] from Korea, and [this one] is Japanese. The languages are different, the grammar is different, and the writing is different. The Japanese language [is said to have] come originally from Mongolia. We have many sumo wrestlers from Mongolia [unclear]; when they first come to Japan they do not speak Japanese at all, but after a year or two they speak Japanese very fluently. Japanese also uses characters from China, so we now use characters similar to Chinese [unclear]. But anyway, all three of us [unclear] extensions of the NLM DTD.
Here are some examples of how we write Japanese, in this case how we write a person's name. This is in kanji; kanji are based on Chinese characters. This is hiragana, and this is katakana [unclear]. Kana are mostly used to show pronunciation, because kanji characters can be read in many ways. And at the end, this is of course the romanized form.
The name order is also different. [unclear] in Japanese, Chinese and Korean we put the family name first, so we know that in this case this is the family name. This is expressed in JATS with name-style "eastern"; eastern means that you put the surname first.
We also use vertical, right-to-left writing. It is traditional, but it is not used any more in China and probably not in Korea either. It is still very prominent in Japan, where it is used in newspapers, magazines and many other places. Western style means horizontal and left to right. Horizontal writing is newer, but it is used exclusively in business writing and in government documentation, and it is also used in science and technology publications in Japanese.
This is a Japanese book; I think it is the book about Steve Jobs, which I am told sold tremendously well in Japan too. Anyway, it is vertical and right to left.
When we write sentences vertically, we have to rotate some characters. For example, this parenthesis has to be rotated 90 degrees. This is another example. These are typesetting matters, a kind of rendering issue, but it has to be done this way.
We also use ruby. It is used to express pronunciation. Historically it was used to teach children how to read kanji characters, but it is also needed for personal names, even today. For example, this is my name written in Japanese. It can be pronounced [in another way], and that is not correct, so I put the ruby here so that it can be pronounced correctly [unclear]. This is how ruby looks in Japanese sentences, and it is used very often, like here and here and everywhere.
We also use emphasis marks. We do not use italics [unclear], so we have to use emphasis marks. This is a dot, or, I do not know how to say it in English, [unclear] stroke. Or it may be just a line [unclear].
[unclear] a short note set in two lines of small characters within the text. But this is very special, and we do not use it [unclear] today.
We also have non-Gregorian years [unclear]. In Japan these are sometimes called emperor years. They are still prominently used in government documentation, where they more or less refuse to use [Western years], but in scientific writing we mostly use Western years.
As for the size of Japanese scientific publishing, this is data that I investigated personally. In 2008 we had almost 3,000 journal titles in science, technology, medicine and other fields. About 2,600 of them were Japanese-language titles, and [figures unclear] were English-language titles. E-journals are now very active for English-language titles, but there are not so many for Japanese-language journals.
The structures of these journals are basically the same [as Western ones]. There are some subtle differences, but mostly they are close to Western ones, so we do not really need any [structural] extensions of the NLM DTD. But Japanese journals have some characteristics. Article titles, author names and affiliations are always in Japanese and English; for names, English means romanized. So we always have both. Abstracts are mostly in two languages. Captions are sometimes in Japanese but often in English, because authors want to reuse the tables and figures elsewhere, in English journals. And the references are in either Japanese or English. In STM publishing we do not use emphasis marks [unclear].
For example, in this article from [unclear], the red circles are all Japanese metadata and the blue circles are the English metadata. This is the typical bilingual [layout] of STM articles.
Citations are sometimes in Japanese. In this case, reference 7 is a Japanese article written in Japanese, but this one is in English [unclear], so there is a mixture.
In the humanities and social sciences, however, vertical writing is dominant, especially in the humanities. [unclear] social science articles are often vertical, and sometimes they use emphasis marks and ruby, especially in the humanities when dealing with really traditional Japanese culture. And they [often] do not have [English] article metadata.
This is a typical one. There is emphasis here; it is not very clear and it is hard to see. But anyway, this is a typical humanities article.
E-journals today are mostly in PDF, and adding value to the journals is very [difficult] with PDF alone, so we need XML. This is probably a must for English-language journals, but it is also true for Japanese journals [unclear]: we need Japanese-language full-text XML. We do not want to invent our own; we want to use an XML DTD that is a standard, and it has to be multilingual.
[unclear: the NLM DTD at that time], as mentioned earlier, did not allow xml:lang on [many elements], so some people used name-style to designate whether a name was Japanese or English. But as mentioned earlier, name-style is not intended for this purpose [unclear], so this is not the right use. [unclear] annotation in the NLM DTD [unclear].
Also, not many elements were repeatable. If you have a publisher name, for example, we were unable to describe those entries in both languages. [unclear] It was also difficult to indicate that two names, or two institution names, belong to the same entity.
We started the Scholarly Publishing Japan working group in 2010. We studied the NLM DTD; at that time the chair of the [NLM DTD] working group was [name unclear; transcribed as "Rosenbaum"]. Our working group consisted of publishers, typesetting companies, JST, [unclear] and NII, and we made several proposals, like these. I will not go through them in detail, but if you have questions I can certainly answer them.
I think I will just skip those.
What was the result of [our work with] the NLM DTD working group? In September 2010 they released the version 3.1 draft, which included most of the multilingual features we wanted. This was really great news, and it became [the basis for the next version] [unclear].
The details are in the article and in my presentation last year, so you may want to take a look [unclear].
What was achieved: xml:lang is now usable on almost all elements, and it is inherited down the XML document tree. Both language and script may be recorded in it. As I mentioned earlier, we have three scripts for the Japanese language, kanji, hiragana and katakana, and we can distinguish them. Also, most elements are now repeatable, and wrapping elements have been introduced, especially name alternatives and [affiliation] alternatives.
[unclear] This is "ja-Jpan", the kanji script; here [unclear] is the hiragana script; "ja-Kana" is the katakana script; and at the end is the romanized form. You do not have to use the scripts if they are unnecessary, but you can if you want.
This is an example of a repeatable element, [apparently the keyword group]. The English [keywords] are here and the Japanese counterparts are here; xml:lang is "en" here and "ja" here. That is not true for article titles, though. The article title is unique, so you have only one article title; if you have a title in another language, you use the translated title [unclear].
Name alternatives contain, in this case, the "ja-Jpan" name here, the [kana] name here and the [romanized] name here, and they are all wrapped in the name-alternatives wrapper.
This is also useful for affiliation alternatives, like this.
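As an illustration of the name-alternatives wrapper described above, here is a minimal Python sketch; it is not taken from the talk or its slides. It assumes a JATS-style `contrib` fragment in which each `name` carries `xml:lang` and `name-style`. The sample name (Yamada Taro), the function name `display_names`, and the choice of language tags are hypothetical; the tag "en" for the romanized form follows the usage described in the talk.

```python
import xml.etree.ElementTree as ET

# The xml: prefix is always bound to this namespace.
XML_NS = "{http://www.w3.org/XML/1998/namespace}"

# Hypothetical sample: one author recorded in kanji, katakana and romanized form.
SAMPLE = """
<contrib contrib-type="author">
  <name-alternatives>
    <name name-style="eastern" xml:lang="ja-Jpan">
      <surname>山田</surname>
      <given-names>太郎</given-names>
    </name>
    <name name-style="eastern" xml:lang="ja-Kana">
      <surname>ヤマダ</surname>
      <given-names>タロウ</given-names>
    </name>
    <name name-style="western" xml:lang="en">
      <surname>Yamada</surname>
      <given-names>Taro</given-names>
    </name>
  </name-alternatives>
</contrib>
"""


def display_names(contrib_xml):
    """Return (xml:lang, display string) for every name in the fragment."""
    root = ET.fromstring(contrib_xml)
    result = []
    for name in root.iter("name"):
        lang = name.get(XML_NS + "lang")
        surname = name.findtext("surname", default="")
        given = name.findtext("given-names", default="")
        # name-style="eastern" means the surname is written first.
        if name.get("name-style") == "eastern":
            text = f"{surname} {given}"
        else:
            text = f"{given} {surname}"
        result.append((lang, text))
    return result


if __name__ == "__main__":
    for lang, text in display_names(SAMPLE):
        print(lang, text)
```

Because all three forms sit inside one wrapper, a consumer can tell that they name the same person and pick the form that matches the language of the page it is rendering.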
There are some things that were left unimplemented in JATS 0.4 [unclear]. One of these is ruby. [unclear] told me that there are several ways of coding ruby, and we do not have any consensus on which one to implement, so this is left for future implementation.
[unclear]
As I told you earlier, the new J-STAGE, the Japanese e-journal platform, was launched this May, and it supports JATS 0.4. It has about 2.3 million articles and 1,600 journal titles. More than half are historical titles and articles scanned from print, but we also have many current journals and articles. So far several journals [are already supplying] JATS XML. So this is one of the first implementations of JATS in the world.
This is JATS [XML] for an article in Japanese; it is a medical article.
J-STAGE has XML guidelines. The character encoding has to be [apparently UTF-8]. Some entity references are allowed; in particular, these entities may be used [unclear].
But there are some funny things about the XML guidelines of J-STAGE. journal-id is OK. But [unclear] publisher-id should be the J-STAGE object identifier [unclear]. I am puzzled, because as I understand it publisher-id is the publisher's own ID, so this is probably a misuse. They also decided that contrib-group must exist, and this has created some confusion among publishers. [The guidelines also call for] name-alternatives and aff-alternatives [unclear], but these are not necessary for English-language articles; they are necessary only for Japanese or multi-language articles. So these decisions are, as I understand it, problematic.
J-STAGE supports [unclear] for formulas, in-text graphics [unclear] and TeX [unclear]; this is also a problem for some publishers. Figures and tables should appear where they are mentioned, but some publishers just put all the figures and tables at the end of the article in the article XML, so this is another problem. Also, [apparently the OASIS CALS table model] is not supported; J-STAGE does not support [these] elements for now.
Now I would like to show you how people produce XML for J-STAGE.
In Japan this work is mostly done by typesetters. At this point we do not use offshore vendors, for many reasons.
One typesetter converts MS Word [files] to JATS XML with its own program, and the XML then goes to [unclear] and to PDF. Any corrections, both [author's] corrections and [editorial] corrections, are made on the XML [unclear].
Another typesetter converts text to JATS XML, again with a proprietary program. It also receives MS Word articles, which are converted to XML using [apparently eXtyles]. But they say that XML for Japanese articles needs a lot of human editing, because the conversion is not completely automatic. The XML is then fed to [unclear], which produces JATS XML and PDF, and any corrections are made [there].
Another one still uses FrameMaker. They simply paste text into FrameMaker, and the [tagged] texts are processed [unclear] into JATS XML. They say that FrameMaker will be replaced by InDesign.
Another typesetter converts MS Word to XML using [apparently eXtyles], but [for] Japanese text this is actually difficult. The XML is then fed to InDesign, and the InDesign XML is converted to JATS XML using a style sheet. Any corrections are made in InDesign.
This is the workflow for a chemistry journal [name unclear; transcribed as "computational chemistry Japan"]. They have two flows. [In] the Japanese-language flow [unclear] InDesign, InDesign outputs well-formed XML, and this is [converted] to JATS [XML] [unclear].
[unclear] All of these use the J-STAGE guidelines, but there are [the problems] I have already mentioned. I hope they will fix them in the meantime.
Finally, I would like to say thank you to [name unclear], who has the same [family] name as me but is a different person, to [those who] gave me information about their XML processing, and to the [unclear] members. Thank you very much.
Any questions?
Audience member (Kevin Hawkins, MPublishing, University of Michigan Library): If I understood correctly, you said you proposed an element [transcribed as "some body", possibly sub-body] inside of body, and this element has the xml:lang attribute on it. Is that right?
Speaker: [unclear] That was in my earlier proceedings paper; here I skipped some [of the proposals]. That was one of our proposals to [the JATS working group]. It is not very common, but there are a few, one or two, journals that publish the article in two languages, like Canadian [journals], in Japanese and in English. So we proposed [that element] [unclear], but that was one of the proposals which [unclear].
Audience member (Kevin Hawkins): I understand. I guess I was curious why you did not instead propose having the body element repeat and adding xml:lang to that; that is another way of [doing it].
Speaker: At that time, that was one of the other proposals. But at that time we did not know that xml:lang can be [used on those] elements.
Audience member (Kevin Hawkins): OK, good, thank you.
Audience member: I noticed that you use xml:lang equal to English for romanizations of Japanese names. But that is not really English, is it? That is an alternative script, and the language is still Japanese. Were there any problems or [unclear] issues?
Speaker: [unclear] I do not see any problem calling it English [unclear]. I would rather ask the European colleagues [unclear].
Audience member (David Shotton, Oxford): I do not think you should conflate the language and the script in which it is written. If you think of the Greek language transliterated into Roman script, it is still the Greek language. So you have to be able to distinguish both the language and the script it is written in, and if JATS cannot do that now, it needs to. [unclear]
Audience member: I agree with that, but I also think it has to do with the W3C spec for xml:lang itself. One thing that JATS might consider is unlinking the script from the language: give us a global attribute to describe the script, so that we can treat the script separately rather than conflated, and use xml:lang exclusively for language. I know that this is widely abused on the web, but that is [unclear].
Audience member (Debbie Lapeyre, Mulberry Technologies): That was actually one of the suggestions the Japanese publishing group made, and the JATS committee did not go with it, because we follow the W3C recommendations, which say not to split them but to use the language code for both.
Audience member: Just out of interest, do they give a rationale for this?
Audience member (Debbie Lapeyre): No. But the IETF and all of those people agree with them, so we followed the standards. What can I say?
Audience member: [unclear] It is an IETF standard, and I think they do have a lot of variation [unclear]; there are a lot of ways to handle both languages and scripts. So I think you are right, it is not [a restriction of the] xml:lang value.
Chair: We can take this offline.
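The point made in the discussion, that one xml:lang value carries both the language and the script, can be sketched in Python as follows. This is a simplified illustration and not a full BCP 47 parser: it assumes well-formed tags, treats the first four-letter alphabetic subtag as the script, and stops at a singleton subtag such as "x". The function name `split_lang_tag` is hypothetical.

```python
def split_lang_tag(tag):
    """Split a language tag such as 'ja-Latn' into (language, script).

    Simplified: the first subtag is taken as the language, and the first
    later subtag made of exactly four letters is taken as the script.
    Scanning stops at a single-character subtag (extension or private use).
    """
    parts = tag.split("-")
    language = parts[0].lower()
    script = None
    for sub in parts[1:]:
        if len(sub) == 1:
            break  # extension or private-use section begins here
        if len(sub) == 4 and sub.isalpha():
            script = sub.title()  # conventional casing, for example 'Latn'
            break
    return language, script


if __name__ == "__main__":
    # Romanized Japanese keeps the language 'ja' and records the script 'Latn'.
    for tag in ["ja-Jpan", "ja-Kana", "ja-Latn", "el-Latn", "en"]:
        print(tag, split_lang_tag(tag))
```

With such a split, a romanized Japanese name tagged "ja-Latn" is still treated as Japanese, just as transliterated Greek tagged "el-Latn" is still Greek, without needing a separate script attribute.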

Metadata

Formal metadata

Title Implementing XML for Japanese-language Scholarly Articles
Series title JATS-Con 2012
Part 03
Number of parts 16
Author Soichi Tokizane
License CC Attribution 3.0 Unported:
You may use and modify the work or its content for any legal purpose, and reproduce, distribute and make it publicly available in unchanged or modified form, provided that you credit the author/rights holder in the manner they specify.
DOI 10.5446/30582
Publisher River Valley TV
Release year 2016
Language English
Production year 2012
Production place Washington, D.C.

Content metadata

Subject area Computer science
Abstract Scholarly publishers and typesetting service providers in Japan, as well as the J-STAGE e-journal platform, got together to devise a way to implement XML for Japanese-language scholarly articles, and worked with the JATS working group to develop a multi-language extension of JATS. It is now implemented at the new J-STAGE platform, which was launched in May 2012.
