We're sorry but this page doesn't work properly without JavaScript enabled. Please enable it to continue.
Feedback

Looking for Patterns: understanding the role of the Data Librarian - 27th May 2013

00:00

Formal Metadata

Title
Looking for Patterns: understanding the role of the Data Librarian - 27th May 2013
Title of Series
Number of Parts
13
Author
Contributors
License
CC Attribution 3.0 Unported:
You are free to use, adapt and copy, distribute and transmit the work or content in adapted or unchanged form for any legal purpose as long as the work is attributed to the author in the manner specified by the author or licensor.
Identifiers
Publisher
Release Date
Language

Content Metadata

Subject Area
Genre
Abstract
Luis Martinez-Uribe explains the role of a Data Librarian through a use case where the Data Librarian recognizes patterns that allow for the development of scraping scripts to return PDFs of newspaper articles.
Service (economics)Observational studyMereologySet (mathematics)JSONXMLUML
System identificationMeeting/Interview
Duality (mathematics)VacuumMusical ensembleComa BerenicesWeb pageLemma (mathematics)Uniform resource nameCAN busQuantumRankingGamma functionComputer configurationVorwärtsfehlerkorrekturWeb pageCodeComputer animation
MiniDiscExecution unitMenu (computing)Bookmark (World Wide Web)Computer animation
Convex hullLie groupWeb pageComputer animation
Wechselseitige InformationOvalOrder (biology)Computer animation
ParsingSource codeVideo game consoleUniform resource namePattern languageWikiWebsiteOrder (biology)Web 2.0Computer animation
Broadcast programmingText editorWeb pageSource codeView (database)Numbering schemeWebsiteComputer animation
Computer configurationMaxima and minimaComputer animation
Computer animation
MaizeComputer animation
Drop (liquid)AirfoilVideo gameWeb pageClique-widthMenu (computing)HookingOvalSound effectComputer animation
Meeting/Interview
VideoconferencingJSON
Transcript: English(auto-generated)
As part of my role as a data librarian at the Institute of Juan March in Madrid, Spain, I help researchers in the creation of their datasets. The research that I'm working with is looking at how the identification of Catalonians in Spain is affected by the news in the newspapers.
The challenge for the researcher here is to have all those front pages from the newspapers so that he can start doing the coding. And what I'm doing to help is actually scripting a data scrapper that will actually download all those PDFs for the dates required for the front pages of those newspapers
so that the researcher can actually use them without having to download them one by one. In order to do this, I use a web tool called Scrapper Wiki. And that allows me to look for patterns in the websites that actually hold all those newspapers
and identify the way in which they've actually coded their website and almost de-engineer that so that I can figure out the way in which they store the PDFs so that I can actually download them all.
And I guess the researcher is trying to see whether when there's a lot of news with a negative connotation about the relationship between Spain and Catalonia whether that actually has an effect on those opinion pools
and that particular question that he's asking those opinion pools about the feeling of being Catalan in Spain.