Challenges for Scientific Databases
Formal Metadata
Number of Parts | 9
License | CC Attribution - NoDerivatives 4.0 International: You are free to use, copy, distribute and transmit the work or content in unchanged form for any legal purpose as long as the work is attributed to the author in the manner specified by the author or licensor.
Identifiers | 10.5446/50283 (DOI)
Publisher | 05jdrrw50 (ROR)
Transcript: English (auto-generated)
00:05
Frédérique Lisacek from Geneva. You are from the Swiss Institute of Bioinformatics. I guess that Geneva is a very nice place for hosting big data, as CERN and the SIB are doing. So do you have an idea why Geneva is such a great location
00:28
for that? Well, you're putting me in a difficult situation, because I cannot say it's only Geneva; it's Geneva and Lausanne. And there's real cooperation, especially at
00:43
the level of SIB, where there's a certain division of labour. There's a tradition in Geneva of looking at protein data and in Lausanne of looking at DNA data. And the big data at the moment is rather on the DNA side, so there's
01:06
a lot of activity happening in Lausanne as well, on DNA. And because Swiss-Prot was born in Geneva, there is this sort of tradition, yes, of having a specialty
01:21
in protein information, annotation and curation. So was it born by coincidence in Geneva, or was there any plan behind that? How would I know? No, I think it was that Amos Bairoch was in Geneva; his family decided
01:40
to settle in Geneva, and so it happened there. Yeah, I see. So what would you say are the largest challenges in maintaining and hosting data collections, let's say in general? Infrastructure is a challenge for national agencies that fund research, as
02:02
if you didn't need the means to actually complete your research. Are we really talking about just the infrastructure? I mean, infrastructure is hardware. Yeah, but it's not only that. It's hardware, but you have to fill in the content. I mean, database information curation is part of it, and actually it's
02:23
halfway between research and infrastructure, and it doesn't belong to either, in a way. And this is why it was so much of a challenge to actually finance this aspect. But data is not all, I guess. I mean, if you just collect data, let's say raw data,
02:44
you also need something additional. Let's say the descriptors of these data. Yeah, so this is what I call curation and annotation, and this is all thought through by the people who design databases. I mean, if I take again
03:02
the example of Swiss-Prot, the schema of the database was certainly designed by someone who knew; I mean, it was Amos in collaboration with a few people, but there was actually some exchange between the people who design the schema and the people who have the
03:22
notion of the content of the data. So communication is always the only way to actually solve problems, as far as I'm concerned. And we saw that in the presentations just now. I mean, communication now is helped by a number of media that are adding
03:41
to science. So what about standards? Standards, yes, this is one aspect. Thank you for helping me go through the items on the list. So this is essential, but I mean, it's again part of the exchange between
04:03
people who have the scientific knowledge and people who have the technical knowledge. So how can you actually exchange and frame the data in such a way that it's not going to be too tightly framed, but there's still going to be some flexibility?
04:22
I still think that the coexistence of several standards is a good thing because you cannot actually capture everything in one box. So we speak different languages, you speak German natively, I speak French, and yet we use English. So let's have a few languages
04:43
and a few standards and a few different means coexist, but let's narrow them down so that they're not scattered all over the place.