Logo TIB AV-Portal Logo TIB AV-Portal

Wikidata for (Data) Journalists

Video in TIB AV-Portal: Wikidata for (Data) Journalists

Formal Metadata

Title
Wikidata for (Data) Journalists
Title of Series
Author
License
CC Attribution 4.0 International:
You are free to use, adapt and copy, distribute and transmit the work or content in adapted or unchanged form for any legal purpose as long as the work is attributed to the author in the manner specified by the author or licensor.
Identifiers
Publisher
Release Date
2020
Language
English

Content Metadata

Subject Area
Abstract
Data journalists work similarly to scientists: they formulate (research) hypotheses, analyze data and sometimes even collect data themselves. For this, open data ressources are crucial. They can can reveal unfamiliar correlations, lead to new questions and stories. Wikidata is the largest linked open data ressource on the internet. But what exactly is stored in it and how can journalists use it for their work? We will have a look at the way Wikidata works and how it can be used a.k.a. queried for stories and research.
Keywords WikiPaka 2020 rc3-wikipaka
point complex Free services time sources visualization machine similar Databases loss analysis storage open subsets XML events wikis mathematics regular Correlation causal hypermedia different visualization queue man systems fehlenden Daten services key Development point projects storage analysis statistical correlations Databases Semantic Web connections similar sample visualization pattern table report
wikis Location functionality record geometric
choice complex presentation building Actions states time decision views survey open subsets Part programs information wikis web different hypermedia information category descriptions classes systems services link screens compression share instance connections category sample system cycle Results point aliases services files link study real analysis events specific inclusion reduce Representation statements platforms form organization key information Gender projects experts categorical statistical correlations Databases Location words visualization Software Query statements genetics key family libraries
area Actions scale services link key bits meetings Stream Part programs Twitter Query hypermedia string touch series lies Board write
Chaos
julio long. the artists by always a but his loss.
to sulk. a genocide today is gently have a look and key points of data journalism. we look quickly explain what the key data is what tools you can use inside of wiki data for data visualisation what other third party tools are there for your research and then we have a look at critical research done witwicky data and finally we. have a critical look under the care itself. the. the key points of data journalism are that you want to interview a data set so you want to find connections correlations and causalities behind to data also you want to visualize the data and a compelling way. and you want to write your own story you want to find a new spin. in a new look on at the facts and all of these things you can do we keep data. the. vicky media doesn't we want to support evidence based reporting that's why i want to support you and using the key data. also data journalists and helps you to tailor your story to the users or your readers the data journalism helps you to create visual storytelling instead of walls of text and this again helps you to convey facts faster. and way more easy and dad makes your story way more inclusive. it. so how do you get to restore you would be key data you want to find and recognize patterns and the dead us that you can search for a geographical data you can search for similarities and differences in the data and you can also search for missing data because that also exists and. he did. and you can visualize your findings with the tools that you find in the uk every service and what's most important as you can connect to the key data community and find people who are working on a similar subject or have a similar research research question to the one that you have. i so included this visualisation to show you dead. data is only the beginning of your story. and the path that you'll take a we want you to use today in which the data for to create a compelling story and therefore contribute value and your idea about what's in the data. the. because data is a lot but it's not everything as we've seen and the last month many people aren't convinced by facts are also there's a lack of time and there's a lack of dead or list their data literacy in our society. it's not always easy to understand the complexity of historical events and develop man's to understand the complexity of medical data or demographic changes so it is important to have a story telling aspect to your data. have good visualizations and and easy to understand approach to convey the significance of your data and your story and finally it is important to remain transparent and clear about the use and analysis of data. so what is the key data. with the dead as a free linked database that can be read and edited by both humans and machines so it is a database of flink open data. that means that data to doesn't just sit there and tables it can be connected and combined with other data find on which it. but that it is a realisation of the semantic web as. dreamt by timber earnestly and also we care won a prize for its realisation off the semantic web. we just celebrated we could at davis eight percent a it currently holds ninety million items and has forty four thousand active users and contributors which makes it don't most edited the queue media project. it was initially used to or thought of to support to projects of the other projects have to be cute media ecosystem. and seen as a central storage for the structured data of the system projects like the key voyage wiki source and the most a famous the uk media project wikipedia. he said.
but it also has another function which means which is to provide free and open data to the internet and that became really huge has already set we now have more than eighteen and nineteen million dead items on wiki data a colliery. of mind created to snap and you can see here. the geo location data dead is in the key data and we're very proud that it's distribute it all over the world but it's also we also take it with a grain of salt because as you can see. it's very bright and europe and on the east and west coasts of the u.s. but they're very dark spots where we can't. a record knowledge in the same way as we do in our western societies and that brings us to the question of what his knowledge equity and how can we actually best serve everybody in our global society.
so how does it work.
the key data of items which are real things are concepts and a real words like berlin barack obama helium and these items are identified with an id to queue id so q seventy six or. q. i am i can return on it. so these items have labels description aliases and filings labels that means. it's described in all of the languages that vicky data holes currently those are around three hundred. the descriptions are forms to describe what the item holds. and aliases sometimes one item has several names its cetera et cetera. and item also has properties. those are used to label today likes a person is born somewhere its date of birth or death or the location of a specific building. statements hold information's in properties so p forty seven shares the border with another like a country or the population. statements also have qualifiers to expand the information and and also they have references which is very important because. for scientific to fit research you want to have those references. so here we see again our item berlin q sixty four. the property is a population of three point seven million. a. so what's new about research with the key data is that you can ask your own questions. before you would go to a library and some the librarians librarians are awesome but they would give you books with specific fact and you would consumed and and tried to the use them for your research at the key data you can ask very specific. questions that nobody else came up with before. so for your research you want to do your own wiki day at queries that's why we have to vacate acree service for. the good news is that you don't have to learn pipeline or are or become us data scientists but you want to learn a bit of sparkle. the. we included a few resources here in this presentation and and there's also going to be a talk given by my colleagues you class on the twenty ninth on how to create the key data with sparkle. we also have a guided tour on which the data on a web side which i can recommend. ok so as set once you create your data you can visualize your results. for more compelling story telling and there are several ways of doing this and i'm going to show you some of us just to give you an idea. i. you could for instance after the crease service to show you airports that are named after person and color code them are according to their gender. gender of the person that their of c o two. you can ask the queries surveys show me everything connected to the item berlin. you can ask it to show you the population of the countries that are bordering germany and how it developed. you can also asked the choir he serve is to show you the most common cause of death among noble people. we are here it shows you and historical overview of space probes. or all of the children and grandchildren of things can. i. if every. and so we had a look and the visualisations inside of the key data screen the service that they're also tools said use the data state. for their own visualisations and i'm going to show you some of them now. so years his to pedia which makes time beams of historical events using data from the key data. this will have on terror. basically it let's do some create your own private library and tanned uses the day that. from the key data to describe the publications. i. here is ask me anything. that's than by different researchers in europe and it lets you pose questions the natural language to be key data so you don't have to use to cruise service. and that's a way that to use we could added it's also used by a lot of voices cisterns like scenery and alexa it. i. and here you have scalia which is basically a platform for scientific publications that are published under open access and collect it and it can.
answer your questions like who published what paper with whom who and when or who wrote the first paper on covert when was published its cetera. and here we have some awful paintings. basically it's a database that creates all of the paintings in the world and mr mehta data as so you can combine and indoor own specific way. so i showed you a couple of recent of examples what you could do and i want. he and other researchers who did great stuff with the key data and used for very cool storytelling. i. if my slights work as well. so. women's representation and voice and media coverage of the coronavirus crisis that's a that's a study done by researcher called laura jones their regarding the representation of female experts within the coverage of coronavirus. it uses evel you actions of wikipedia and wiki data. to show. to show how much representation was there a female expert. and as we see it's not a lot. finally there's another great example i want to tell you about it's a project called enslaved to our it's a link to open data platform based on wiki base which is the software behind the key data. and it basically shows or it collects and connects data are related to the transatlantic slave trade. so people who suffered under the slave trade and to record stood were done by the people active and in this slave trade. allstate art is collected and it has been collected in several databases and enslaved build won large database to connect them and rebuilt the stories. which i think it's a really great idea to or really great way to humanize people who've been dehumanised with data. like you can see here. they collect start they collect data are from newspapers and. from the slave holders to recount the story of individuals. so finally i also want to talk to you about one thing and wiki dead at dead is always on our minds which is that we could data is not perfect. i highly recommend the top by all skis question in the key data in which it isn't clear explained that all classification systems are inherently dangerous and we could add as a large and cycle p.d.q. red cross if occasion system which makes choices. ethical and political choices about what is notable about how to. categorize information and these choices. dave reduce complexity and david hughes also as specific forms of. of history like oral history this reduction has consequences i am as he's you know which data is used by many. programs apps voices systems and what what and how we store information and we keep data really matters. so we ask ourselves what is and say competing knowledge. the and how can we are going nice to know what more inclusive whey encyclopedic knowledge as a western concept and we can and must do better than just use our own western view to organize the world. but then also to the key principle applies we have a huge community behind the key data and that helps us to make to see these decisions and you can also become a part of this by sea researching the key data using it for your work. and also contributing your research. i. so once again i want to tell you you can use the key data as a tool for your story telling the data can help you find connections between data we could add a can help you find. can help you build visualisation in its crew the service you can ask questions about historical data are cruel asians. more critically then you could. then you could before. and but they're also downsize to a downside saluki data because it isn't and sekou peak way of organizing western knowledge. so this was only a start and looking forward to our q. and a session now and that if you have further questions concerns are have ideas you can contact me and my colleagues and you can also contact me individually. thank you.
i.
around working stories about her. thank you very much way interesting car that was the very great introduction. i guess things for having me i'm happy that i was able to talk a bit about the key data and. how you can do storytelling with it. i wanted to add dead obviously you can ask me questions now but also i want to hint at the great introduction of the key data as one of my colleagues scale said yesterday two of my colleagues which is already are lie and tomorrow i will be. clearly service or shop where you can learn a bit more intimate as to how to create a wiki data. carrots and there's actually this to questions in the chat right now worse one s i or slights going to be published because people are interested in the innings the area for years yes. that was the i asked before i think the talk will be published and just lights. sarah a key part of our board where it can put it another way i mean i can also put a link on our twitter account which media deutschland. and yeah thing which have been our probably be the best idea actually have to check on the park a bond but we will actually know where you can find everything i put it on to become your daughter and twitter it's at w m d. i think they will also retreat. it's obviously you will find it a profit. there's another question what resources which you recommend assess studying the writing of queries or korea we cannot keep tabs on. in one. i put some things in the industry i mean there is yet we have like the few to charles on the key day there was also a couple of months ago a very nice and very easy the atari a published by the commedia israel. and i so we didn't do it but i can recommended it's a very low key introduction to your first series pam where world also published that somehow and. i have a question for you as well they're mention that we could at us like a great way for a meeting of the people that are working on similar topics so is there some kind of like greater community of journalists using wiki de janeiro. so far from the community is mostly research faintest steps also why we won a tony chowdhury. i would recommend getting in touch with. the community on their first. regarding the research topics let you have and you can also get in touch with us and we connect u.. i haven't moist in my ear but i hope it's only me. what i don't have read somewhere that might. eradicate the like there might be a good also an echo on the string of people on the chat are saying. slow i don't have any other questions and the shed and since there seems to be an echo on the stream i don't want your noir people any further so i would suggest for everyone rest for the questions to you and that you can meet and our big group wants and needs a room that i will. the coasting the trek not and and we were continue a program here at two twenty with another talk about flattered by the one with the brain. so i'm saying by from now. i. to all o.e.c.d..
Feedback