Looking for Patterns: understanding the role of the Data Librarian - 27th May 2013
Formal Metadata
Number of Parts | 13
License | CC Attribution 3.0 Unported: You are free to use, adapt and copy, distribute and transmit the work or content in adapted or unchanged form for any legal purpose as long as the work is attributed to the author in the manner specified by the author or licensor.
Identifiers | 10.5446/35956 (DOI)
Research Data Librarians (1 / 13)
Transcript: English (auto-generated)
00:00
As part of my role as a data librarian at the Institute of Juan March in Madrid, Spain, I help researchers in the creation of their datasets. The researcher that I'm working with is looking at how the identification of Catalonians in Spain is affected by the news in the newspapers.
00:22
The challenge for the researcher here is to have all those front pages from the newspapers so that he can start doing the coding. And what I'm doing to help is actually scripting a data scraper that will actually download all those PDFs for the dates required for the front pages of those newspapers
00:44
so that the researcher can actually use them without having to download them one by one. In order to do this, I use a web tool called ScraperWiki. And that allows me to look for patterns in the websites that actually hold all those newspapers
01:02
and identify the way in which they've actually coded their website and almost reverse-engineer that so that I can figure out the way in which they store the PDFs so that I can actually download them all.
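[Editor's illustration] The approach described here, finding the pattern a site uses to store its front-page PDFs and then fetching one file per date, can be sketched roughly as follows. This is a minimal sketch and not the speaker's actual script: the URL template, the `example.org` host, the output folder, and the function names `front_page_urls` and `download_front_pages` are all hypothetical, and a real newspaper site will use its own scheme.

```python
from datetime import date, timedelta
from pathlib import Path
from urllib.error import URLError
from urllib.request import urlretrieve

# Hypothetical URL pattern (assumption); replace with the pattern found on the real site.
URL_TEMPLATE = "https://example.org/frontpages/{year:04d}/{month:02d}/{day:02d}/frontpage.pdf"


def front_page_urls(start, end, template=URL_TEMPLATE):
    """Yield (day, url) for every date from start to end, inclusive."""
    day = start
    while day <= end:
        yield day, template.format(year=day.year, month=day.month, day=day.day)
        day += timedelta(days=1)


def download_front_pages(start, end, out_dir="front_pages", template=URL_TEMPLATE):
    """Download one PDF per date and return the paths that are available locally."""
    target = Path(out_dir)
    target.mkdir(parents=True, exist_ok=True)
    saved = []
    for day, url in front_page_urls(start, end, template):
        path = target / f"{day.isoformat()}.pdf"
        if not path.exists():
            try:
                urlretrieve(url, str(path))
            except URLError:
                # No front page for this date (or the request failed); move on.
                continue
        saved.append(path)
    return saved
```

Calling `download_front_pages(date(2013, 5, 1), date(2013, 5, 31))` would then attempt one download per day of that month. Keeping the URL generation separate from the download step makes it easy to check the pattern against a few known front pages before fetching the full date range.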
01:20
And I guess the researcher is trying to see whether when there's a lot of news with a negative connotation about the relationship between Spain and Catalonia whether that actually has an effect on those opinion polls
01:41
and that particular question that he's asking those opinion polls about the feeling of being Catalan in Spain.