Mathematical World Knowledge Contained in the Multilingual Wikipedia Project

Technische Universität Braunschweig

Halbach, Dennis T.

Formal Metadata

Title

Mathematical World Knowledge Contained in the Multilingual Wikipedia Project

Title of Series

International Congress on Mathematical Software (ICMS) 2020

Number of Parts

Author

Halbach, Dennis T.

0000-0003-1316-6416 (ORCID)

License

CC Attribution 3.0 Germany:
You are free to use, adapt and copy, distribute and transmit the work or content in adapted or unchanged form for any legal purpose as long as the work is attributed to the author in the manner specified by the author or licensor.

Identifiers

10.5446/48938 (DOI)

Publisher

Technische Universität Braunschweig

Release Date

2020

Language

English

Content Metadata

Subject Area

Computer Science

Genre

Conference/Talk

Abstract

The purpose of this project is to test and evaluate an approach to Formula Concept Discovery (FCD). FCD aims to retrieve a formula concept (in the form of a Wikidata item) together with its defining formula from documents, in this case 100 English Wikipedia articles. To identify the defining formula of a Wikipedia article, the approach searches for formulae shared across the versions of that article available in different languages. The formula shared by the most language versions is then assumed to be the defining formula. The results show that neither this approach alone nor its combination with an existing approach that considers the order of the formulae within an article leads to satisfactory results. It is thus concluded that, with the current approach, the number of language versions of an article that share a formula is not a good indicator of the defining formula. Consequently, several ideas for further research that could improve the results are proposed.
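
The selection heuristic described in the abstract can be illustrated with a minimal Python sketch. It is not the author's implementation: the function names, the whitespace-stripping normalization, the matching by exact string equality, and the sample data are all assumptions made for illustration, since the abstract does not say how formulae are extracted or compared.

```python
from collections import Counter


def normalize(formula: str) -> str:
    """Crude normalization (an assumption): remove all whitespace."""
    return "".join(formula.split())


def pick_defining_formula(formulae_by_language):
    """Return (formula, language_count) for the most widely shared formula.

    `formulae_by_language` maps a language code to the list of formula
    strings found in that language's version of the article. Each formula
    is counted at most once per language. Ties go to the formula seen
    first; None is returned when no formula is present at all.
    """
    counts = Counter()
    for formulae in formulae_by_language.values():
        seen = set()
        for raw in formulae:
            key = normalize(raw)
            if key not in seen:
                seen.add(key)
                counts[key] += 1
    if not counts:
        return None
    # max() returns the first maximal item in insertion order.
    return max(counts.items(), key=lambda item: item[1])


# Hypothetical input: formulae from three language versions of one article.
example = {
    "en": ["E = mc^2", "p = mv"],
    "de": ["E=mc^2"],
    "fr": ["E = m c^2", "E_0 = mc^2"],
}
result = pick_defining_formula(example)
```

In this made-up input the same formula appears in all three language versions, so it is selected. As the abstract reports, on real articles this count alone did not identify the defining formula reliably.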

Keywords

Formula Concept Discovery

Wikidata

Wikipedia