Snippet - Multimedia: as a data source

Video thumbnail (Frame 0) Video thumbnail (Frame 442) Video thumbnail (Frame 874) Video thumbnail (Frame 3104) Video thumbnail (Frame 4572) Video thumbnail (Frame 5503) Video thumbnail (Frame 7030) Video thumbnail (Frame 11397) Video thumbnail (Frame 11867) Video thumbnail (Frame 12761) Video thumbnail (Frame 13556) Video thumbnail (Frame 14885) Video thumbnail (Frame 15443) Video thumbnail (Frame 17650) Video thumbnail (Frame 19076) Video thumbnail (Frame 20362) Video thumbnail (Frame 23251) Video thumbnail (Frame 31912)
Video in TIB AV-Portal: Snippet - Multimedia: as a data source

Formal Metadata

Title
Snippet - Multimedia: as a data source
Author
License
CC Attribution 3.0 Unported:
You are free to use, adapt and copy, distribute and transmit the work or content in adapted or unchanged form for any legal purpose as long as the work is attributed to the author in the manner specified by the author or licensor.
Identifiers
Publisher
Release Date
2014
Language
English

Content Metadata

Subject Area
Abstract
Multimedia is used and generated as a part of research across all domains. Multimedia can function as an information and a data source. The management of multimedia, as research material, needs to be shaped by the intent for use and the research methods being applied to the material. For example, it is possible to read or mine a text or an image and be informed by both processes of investigation. So the critical questions for data managers is: what does the researcher need and want to do with this type of research material. How are support services and systems designed and provided to enable that.
Source code Service (economics) Hypermedia Source code Expert system Multimedia Multimedia Data management
Point (geometry) Slide rule Group action Observational study Computer file View (database) Connectivity (graph theory) Computer-generated imagery File format Digital library Mereology Disk read-and-write head Mixture model Natural number Different (Kate Ryan album) Hypermedia Atomic number Multimedia Source code Metadata Bit Mereology Term (mathematics) Digital library Type theory Exploratory data analysis Hypermedia Universe (mathematics) Multimedia Quicksort
Web page Satellite Point (geometry) Source code Group action Service (economics) Observational study Image resolution Markup language Temporal logic Computer-generated imagery Water vapor Medical imaging Mixture model Integrated development environment Cube Multimedia Multimedia Domain name
Frame problem Slide rule Observational study Natural language Source code Time series Water vapor Online help Perspective (visual) Product (business) Neuroinformatik Medical imaging Different (Kate Ryan album) Natural number Multimedia Process (computing) Information Data conversion Descriptive statistics Area Arm Information Channel capacity Prisoner's dilemma Computer Analytic set Bit Type theory Word Process (computing) Interpreter (computing) Iteration Natural language Object (grammar) Identical particles
Medical imaging Slide rule Information Virtual machine Shape (magazine) Belegleser Disk read-and-write head
Discrete group Medical imaging Information Connectivity (graph theory) Ultraviolet photoelectron spectroscopy Combinational logic Multimedia Flickr Line (geometry) Mereology Flow separation
Slide rule Information Multiplication sign Projective plane Mathematical analysis
Point (geometry) Slide rule Optical character recognition Chemical equation View (database) Computer-generated imagery Source code Digital library Mereology Shift operator Independence (probability theory) Medical imaging Mixture model Radio-frequency identification Multimedia Information Web page Projective plane Interactive television Word Process (computing) Personal digital assistant Mixed reality Physical system Reading (process) Row (database)
Point (geometry) Group action Context awareness Multiplication sign Connectivity (graph theory) View (database) Computer-generated imagery Source code File format Mathematical analysis Digital library Front and back ends Data management Medical imaging Heegaard splitting Personal digital assistant Data mining Energy level Multimedia Information Descriptive statistics Source code Pairwise comparison Email Information Computer file Projective plane Mathematical analysis Database Cartesian coordinate system Vector potential Data mining Data management Personal digital assistant File archiver Multimedia Natural language Pattern language Data management Row (database)
Point (geometry) Web page Discrete group Group action Computer file Multiplication sign Connectivity (graph theory) View (database) Source code Combinational logic Digital library Function (mathematics) Mereology Metadata Medical imaging Different (Kate Ryan album) Hypermedia Term (mathematics) Multimedia Endliche Modelltheorie Category of being Source code Service (economics) Information Mapping Computer file Projective plane Content (media) Cartesian coordinate system Digital library Flow separation Category of being Cache (computing) Type theory Word Film editing Document management system OAIS Hypermedia Personal digital assistant File archiver Volumenvisualisierung Website Natural language Object (grammar) Fiber bundle Exception handling Library (computing)
today I'm going to talk a bit about
multimedia as a data source particularly about data management and supportive research where multimedia is a data source and as the research material and
I guess I need to confess right up front I'm not a new media expert I'm what
could be referred to as a digital librarian so I come at this with a very different kind of understanding potentially an exploratory approach and I'm very interested in the kind of questions and comments or thoughts those listening and do they have what I wanted
to start off today with was too I guess have a bit of a think about what multimedia is because I have to say that this is what went through my head when I was thinking about offering up this webinar I also wish to acknowledge colleagues over at edith cowan university who triggered the idea for this webinar because they were interested in looking at different types of data particularly the sort of data that is generated in support of performance studies and so that is where this idea came from but I really wanted to look at how multimedia was defined to get a bit of an idea about the way that it's referred to by different groups of people and also what it is from a technical point of view so you can see there on the slide I've I've given given a mixture of definitions of what multimedia is and I think the second one in there is the one that really holds my attention probably the most is that it's about different types of data that's actually are contained in a file and there are different ways of referring to those components so I've picked some of those out in the third point I was quite intrigued to discover that there are our chunks and atoms and parts and perhaps because I don't have a new media background I come up this as someone who's looking at the material nature in some ways of multimedia I wanted to have
a think about where multimedia appears in the research environment and their I've listed films as an example of multimedia and web pages where you can have a mixture of moving image and sound possibly some graphics digitized documents that can have both image and annotations markup and transcripts and also satellite images which is something quite new to me we had the great benefit of a lecture within the Australian national data service recently from our Stuart mentioned from Geoscience Australia and he opened up my eyes to water satellite images and all the layers that actually exists in a satellite image that was really interesting he gave a great talk on the data cube but they've developed but that's a whole other topic I just thought I'd pause at this point to see if any of those in the group have a background in multimedia or have some questions about the definitions of multimedia at this point so I guess
looking into where multimedia appears in the research domains I've got me thinking about the fact that this topic could come up through those working with people who undertake performance studies so I listed some of the research domains there where multimedia is generated to help me kind of unpack who's actually using or creating multimedia and where that's happening and why they're using multimedia to get a better understanding of how it's created and also perhaps used so I guess I'm looking at this from
a cultural production perspective and I started to look at the methods that researchers were using that they use to create or process data or information and I found that a very kind of iterative process I kept bouncing across from the idea of something as information and something as data and having a bit of trouble identifying what was what and I keep circling back to asking myself what is the researcher doing and I think it really helps to to look at research methods to understand when a digital object is being treated as a piece of information and when it's being treated as a source of data and I'm sure that there could be quite an exhaustive conversation about this but I really thought it helped the purpose to which this material was being put and how it was being used and whether there was a human looking at it or whether it was a computer looking at at the multimedia and whether that was a useful distinction or not where I got to
we're looking at multimedia was trying to understand what happens to a digital object and that's what I refer to something as a piece of digital material and so I tackled annotation as an area of research I guess interpretation or a process in which information is applied or data is applied to data and it became very tangled so I picked up four areas where I could see the word annotation being used and the first was genetics which was really interesting i'm quite fascinated by the idea of automated annotation and how that actually operates in genetics and it's more out of curiosity than ever having a desire to be someone who studies genetics but it was really interesting to understand the capacity for generating very large amounts of automated annotation and then I moved into Geoscience to look at what happens when people edit annotate geospatial information and the kinds of terminology that sexually used who understand what's actually happening and with our data is being applied to data or whether informations being applied to information or data has been applied to information I really don't have the answers to this but i wanted to unpack what was actually going on to get a better understanding of how multimedia arm was being used and enabling research so i moved on to linguistics which was again quite fascinating to discover the different types of annotations that are applied to two languages and I've listed them there descriptive analytic time sequence and text so that was I guess really interesting for me to understand to pick apart say a descriptive annotation from a time sequence annotation and try and understand what's data and information that helped kind of distinguish different patience but it certainly didn't help me answer the data and information kind of dilemma what's what's data and wants information but I recognized that material was becoming a while it seemed to be becoming increasingly multi media and nature if it hadn't started off that way in the first place so the last area I looked at was biomedicine and I've got some images following this slide where where researchers they edit 8 images and they do that in different ways by drawing and adding notes and marking areas and I thought this was really fascinating and it made the idea of simplifying managing multimedia into waters data and what is information kind of meaningless in a way because it might be a theoretical concept rather than something which actually helps the researcher to do their research so I thought I'd put some examples in front
of us here and this is a biomedical slide and it's been annotated and you can see that it's been annotated with a-line shape and also with some words and that there's a scanned image underneath ok the next image I've got is
this is this wonderful gene annotation image that I looked at and really couldn't make head nor tail but it may be pretty interested in understanding how geneticists actually managed their data and what information they derive from that data really really complex and the I think I've mentioned the fact that machines actually generate these annotations made me want to understand where those annotations are actually put and how they're linked to the gene sequence but I think that's a whole investigation unto itself and the next
image I've got which is slightly more familiar for many people is is a Google Earth image which has been a settler image which has got Street markings and bubble pop ups and line tracing it made me want to understand a little more about how the hell that information was being captured to know how to support researchers who want to manage the data effectively and be able to potentially make it available on to cite it or to present it as part of their research this is the last one
which I hope those of you who have ever been to Portland enjoy I found this on flickr and it's a graphic image in the background and on top of that it looks like there are letters that are very carefully placed in alignment with what's called a spectrogram which is someone saying it rains a lot in Portland and I thought this is really interesting I wanted to understand a whole lot more about how these discrete pieces of data were actually brought together and whether you capture that all as one thing or whether you capture that separately and if the researcher uses those separate components as part of that multimedia but it made me understand that where the annotations or the combinations occur may be very critical and supporting some of the research findings I thought what I do is
introduce a project that's happening here in Australia that kind of for me emphasizes this idea of what's information and what's data and what other researchers looking at and it's a project based upon Griffith but I think with people dotted around Australia i live by mike for name called the prosecution project it's a center of
excellence policing and security and they're looking at criminal trolls over time and they've been digitizing archival materials and transcribing them and you'll see there on the slide a nice
kind of slashed image there that Mike's applied to kind of give you a view of the digitized image which to me is information but also on the lower part of the image is where the data entry occurs for transcription what if what I found interesting in the exchange with Mark about this project and I met him through an interaction with Alana Piper recently up in Brisbane is that that they're really looking at making the absolute most of this digitized material looking at it from an informational point of view to look at being able to read the records of these cases are criminal cases here in Australia and also looking at what the data underlying that information can tell them it's been a pretty interesting process to get to grips with what it is that they're doing and I hope that this offers some insight are to perhaps why it's important to understand what the research is trying to do and that there are interested in using whatever method and whatever our feature of multimedia to enable them to do their research so mike has emphasized
here and the outcomes that they're looking at a mixture of research methods both quantitative and qualitative he's sent me an article and I will pop the Lincoln to these slides so that others can have a chance to to go and have a read of it but the qualitative aspect of it was something that was a little more familiar to meet a quantitative aspect of it was something quite different and it made me realize that perhaps looking at mixed research methods was also a way of understanding how multimedia is operating as both in information source and the data source but it's the data side of it which I'm finding I guess enlightening as the word to use and that there are getting that data are through transcription human transcription but in other cases of digitization it can be character recognition so this is where I
kind of got to as multimedia as a data source I got got to the point where I decided that it could be both information and data at the same time because it's the way that the researcher is using it and building whatever they learn from that multimedia whether it's being looked at as a piece of information or as a data source to do their research and so reading the cases or reading the court records and also doing text analysis or data mining as enabling this researcher this research or that research group to do their research which I think a pretty incredible potential from one source of digitized material and I think that's quite an exciting prospect so from a
point of view of management it made me think about how how they were going to approach managing that and Marcus can't being kind enough to give me a description of how the backend are to the prosecution project is going to work they've got archival materials as digital images they're going to transcribe those images images into an SQL database and that supports them doing quantitative analysis of longitudinal and comparison patterns this is an email that he sent over the last week they're looking to extend that database by accessing or linking other data sources and he's mentioned the trove archive and possibly other projects or other digitized material like the police Gazette's to enable qualitative what he's referring to as case level as well as well as quantitative analysis and they're looking also to enrich the data by accessing interest transcribing the trial transcripts and other text archives so I guess what I understood from this was that my notions of splitting something into information and data were helping me to understand what it is that was going to enable this research group to do their research but also i needed to dig even deeper into what sits underneath this application to understand how they're storing the digitized images and how they're storing the transcriptions and where they're wanting to store the linkages between those two things and I realized that multimedia in this context is very complex and that all that language that I introduced at the beginning about layers and components are as important to I think inform how we support the management of this material so that's where I got to with the prosecution
project last least I decided to have a look at three applications that enable a person to manipulate multimedia so I picked three that seemed to be reasonably familiar to me and just wanted to have a look at how they how they enable material to be brought in and how they enable information or data to be applied and what happens in these three applications and these are I guess reasonably ubiquitous applications Final Cut Pro and arcgis and WordPress they're certainly not the very domain-specific and yeah I guess less commonly used applications that you might find in biomedicine or genetics more specifically so I had a look at Final Cut Pro to just try and understand some of the language that's used to understand what's actually happening when you use Final Cut Pro and if you've got any competent users in the group today it would be great if you are offered some advice but I just wanted to look at what final cut pro does to digital material and from what I can understand is that it technically consists of separate files there's something called a project file a media source file and render or cache files and to me that gave me an understanding that the multimedia was being captured in different ways and potentially for different purposes and I have to confess I haven't ever used Final Cut Pro and I think this is an interesting way for us to understand how multimedia is either brought into an application and where it is saved but also to try and understand what happens when you want to try and get of that material out of the application and how you do store that and whether you store that as a combined object or whether they're separate objects the open archives information system model which is used in the digital archive in the world has been interpreted different in different way to give you an example a long time ago when I was working on the National Digital heritage archive in New Zealand we decided to be very clear that we would capture metadata separately to capturing material that we were hoping to keep and I think it was the dutch national library i decided to go a different way they decided to build the digital object with both the metadata and also the object that was being collected and to me that is two very simple ways of approaching capturing multimedia is to separate concerns if you like or different types of digital information or to actually build it into a bundle but it made me realize that if I was trying to get material out of final cut pro I would want to understand how this material could be linked back together again in case I ever wanted to work on that multimedia material again I won't carry on with that one but I did I did look at this and wonder how the output of Final Cut Pro is captured and how the components are captured and I don't have the answer to that today so I could GIS there's another tool that I haven't used but I went in to have a look at how material was viewed in that application and what happened to it when it was being used and what I can understand from this is that it's possible to pull in images geospatial images and it's possible pullin geospatial data if you like let long that kind of that kind of data and to build up layers within this application and again it made me think about being able to maintain those components separately but also to maintain the final output which may be a combination of those components could be critical to a researcher it might not but when you're dealing with different parts of material how is that used by the researcher from a point of view of looking at a map it's from a human point of view we can read it but is that an important aspect to the researcher or is it the annotations on the map that aren't more important I cut out to that but I guess in terms of being able to support researchers who use or create multimedia it's important to ask what it is that they want to do with it and whether they want to deconstruct and reconstruct from those original components the last one is WordPress and this is one that I suspect many more people have experience with I've always wondered how people get the content out of WordPress so I went and had a look to get an understanding of what happens if you've had a website up using the word press application and you want to suck all the content out so you can capture it and perhaps put it into a different application and they may it may be very important to keep discrete narrative that's in posts or pages or comments separate from categories and tags what I can glean I think you can get that out as separate pieces of data but it made me wonder how a researcher might actually use that material whether they would just want to reinforce it into another application or whether they actually want to process those tags or categories to see how much contents being given those categories or tags thanks again Ingrid and thanks very much
to everyone for attending today and for your questions and comments
Loading...
Feedback
hidden