Looking for Patterns: understanding the role of the Data Librarian - 27th May 2013

Video in TIB AV-Portal: Looking for Patterns: understanding the role of the Data Librarian - 27th May 2013

Formal Metadata

Title: Looking for Patterns: understanding the role of the Data Librarian - 27th May 2013
License: CC Attribution 3.0 Unported: You are free to use, adapt and copy, distribute and transmit the work or content in adapted or unchanged form for any legal purpose as long as the work is attributed to the author in the manner specified by the author or licensor.

Content Metadata

Subject Area
Luis Martinez-Uribe explains the role of a Data Librarian through a use case where the Data Librarian recognizes patterns that allow for the development of scraping scripts to return PDFs of newspaper articles.
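As a rough illustration of the pattern-based approach described here, the sketch below assumes the newspaper archive stores each front page at a URL that differs only by date. The URL pattern, the host name, and the function names are all hypothetical; the talk does not name the sites or show the actual script, so this is only one way such a scraper might look.

```python
from datetime import date, timedelta
from pathlib import Path
from urllib.request import urlretrieve

# Hypothetical URL pattern (an assumption): the real archives are not named in the talk.
URL_PATTERN = "https://archive.example.org/frontpages/{year}/{month:02d}/{day:02d}/front.pdf"


def date_range(start, end):
    """Yield every date from start to end, inclusive."""
    current = start
    while current <= end:
        yield current
        current += timedelta(days=1)


def front_page_urls(start, end, pattern=URL_PATTERN):
    """Build one (date, URL) pair per day by filling the date into the pattern."""
    return [
        (d, pattern.format(year=d.year, month=d.month, day=d.day))
        for d in date_range(start, end)
    ]


def download_front_pages(start, end, out_dir="front_pages"):
    """Fetch each PDF in turn, skipping files that are already on disk."""
    target = Path(out_dir)
    target.mkdir(parents=True, exist_ok=True)
    for d, url in front_page_urls(start, end):
        destination = target / f"{d.isoformat()}.pdf"
        if not destination.exists():
            urlretrieve(url, str(destination))
```

Calling `front_page_urls(date(2013, 5, 27), date(2013, 5, 29))` would list the three URLs to fetch without touching the network, which makes it easy to check the guessed pattern against a handful of known pages before running `download_front_pages` over the full date range.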
As part of my role as a data librarian at the Juan March Institute in Madrid, Spain, I help researchers in the creation of their data sets. One of the researchers that I'm working with is looking at how the identification of Catalonians in Spain is affected by the news in the newspapers. The challenge for the researcher here is to have all those front pages from the newspapers so that he can start doing the coding, and what I'm doing to help is scripting a data scraper that will download all those PDFs of the front pages of those newspapers for the dates required, so that the researcher can use them without having to download them one by one.

In order to do this I use a web tool called ScraperWiki, which allows me to look for patterns in the websites that hold all those newspapers, identify the way in which they've coded the website, and almost reverse-engineer that, so that I can figure out the way in which they store the PDFs and download them all.

The researcher is trying to see whether, when there is a lot of news with a negative connotation about the relationship between Spain and Catalonia, that actually has an effect on those opinion polls, and on the particular question those opinion polls ask about the feeling of being Catalan in